Just a thought, but...
could the pattern recognition techniques used in anti-spam software be adapted to pattern recognition of localised (down to company level) methods of:
writing telephone numbers
writing addresses
writing dates
etc.?
Chris
Just a thought, but...
could the pattern recognition techniques used in anti-spam software be adapted to pattern recognition of localised (down to company level) methods of:
writing telephone numbers
writing addresses
writing dates
etc.?
Chris
We have a mechanism called Zimlets for doing stuff like that. One possible feature of a Zimlet is recognizing patterns in content and highlightling them as objects that can be interacted with. For example, our date Zimlet recognizes several common date formats and will let you create an appointment. Other Zimlets recognize URLs, phone numbers, email addresses, tracking numbers, etc.
-Conrad
The zimlet system as it is requires the patterns to be already identified; but people have different habits and styles in terms of how they write addresses and telephone numbers, etc.. What I'm suggesting is a Zimlet (maybe core code would be better?) that could learn these variations.
The current shipping Zimlets don't recognise the way that UK phone numbers are commonly notated, nor addresses. This is presumably true for many other countries around the world - we can either create tens or hundreds of Zimlets to deal with this problem, or we can add a pattern learning layer (like Spam Assassin) which the Zimlets get their cues from.
That way a telephone number Zimlet for integration with Asterisk needs to deal only with acting on the recognised pattern, not recognising the pattern itself.
Companies, like ours (or Zimbra!), who operate internationally would then gain from individual local users teaching their installation the patterns they need and use, rather than starting from a base point which recognises only North American style telephone numbers and addresses (and dates) and does not progress from there without having to create and install additional Zimlets; Zimlets which are essentially identical in functionality to previously installed ones.
Chris
I actually work for a company that does analytics. The problem that you are suggesting is actually quite a bit harder then you might suspect. That being said, it shouldn't be too hard to extend the existing systems to recognize local variations via REGEX and JavaScript, but contextual learning is a difficult problem.
Just worthwhile...
![]()
There are currently 1 users browsing this thread. (0 members and 1 guests)