#1
08-13-2007, 04:53 AM
Active Member
Posts: 28
The silver lining of spam?

Just a thought, but...

could the pattern recognition techniques used in anti-spam software be adapted to pattern recognition of localised (down to company level) methods of:

writing telephone numbers
writing addresses
writing dates

etc.?

Chris
#2
08-20-2007, 05:58 PM
Zimbra Employee
Posts: 52

We have a mechanism called Zimlets for doing things like that. One feature a Zimlet can provide is recognizing patterns in content and highlighting them as objects that can be interacted with. For example, our date Zimlet recognizes several common date formats and lets you create an appointment. Other Zimlets recognize URLs, phone numbers, email addresses, tracking numbers, etc.
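
To make the idea concrete, here is a minimal Python sketch of regex-driven object recognition of the kind described above. It is not Zimbra's Zimlet code: the `PATTERNS` table, the `Match` class, and `find_objects` are hypothetical names, and the three expressions are deliberately simple, North American-style assumptions.

```python
from __future__ import annotations

import re
from dataclasses import dataclass

# Illustrative patterns only (assumptions, not the expressions any shipping Zimlet uses).
PATTERNS = {
    "date": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),   # e.g. 08/20/2007
    "url": re.compile(r"https?://[^\s<>\"]+"),           # e.g. http://example.com/x
    "phone": re.compile(r"\(\d{3}\) \d{3}-\d{4}"),       # e.g. (555) 123-4567
}


@dataclass(frozen=True)
class Match:
    kind: str    # which pattern fired
    text: str    # the matched substring
    start: int   # offset of the first matched character
    end: int     # offset just past the last matched character


def find_objects(content: str) -> list[Match]:
    """Return every pattern hit in content, ordered by position."""
    hits = [
        Match(kind, m.group(0), m.start(), m.end())
        for kind, pattern in PATTERNS.items()
        for m in pattern.finditer(content)
    ]
    return sorted(hits, key=lambda hit: hit.start)
```

With these patterns, a UK-style number such as 01632 960123 produces no match at all, because every pattern has to be written down in advance.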

-Conrad
#3
08-21-2007, 02:07 AM
Active Member
Posts: 28
A learning Zimlet

The Zimlet system as it stands requires the patterns to be identified in advance, but people have different habits and styles in how they write addresses, telephone numbers, etc. What I'm suggesting is a Zimlet (maybe core code would be better?) that could learn these variations.

The currently shipping Zimlets don't recognise the way that UK phone numbers are commonly written, nor addresses. This is presumably true for many other countries around the world. We can either create tens or hundreds of Zimlets to deal with this problem, or we can add a pattern-learning layer (like SpamAssassin) from which the Zimlets take their cues.

That way a telephone number Zimlet for integration with Asterisk needs to deal only with acting on the recognised pattern, not recognising the pattern itself.
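
The separation described above, with recognition living in one shared layer and action Zimlets only reacting to matches, might be sketched as follows. This is a Python illustration under assumptions: `PatternRegistry`, `add_pattern`, `on`, and `process` are made-up names rather than any Zimbra or Asterisk API, and a handler here just returns a string where a real integration would place a call.

```python
import re
from collections import defaultdict


class PatternRegistry:
    """Hypothetical shared layer: patterns are stored per kind, handlers subscribe per kind."""

    def __init__(self):
        self._patterns = defaultdict(list)   # kind -> list of compiled regexes
        self._handlers = defaultdict(list)   # kind -> list of callables taking the matched text

    def add_pattern(self, kind, regex):
        """Teach the layer a new notation for an existing kind of object."""
        self._patterns[kind].append(re.compile(regex))

    def on(self, kind, handler):
        """Subscribe an action to every match of the given kind."""
        self._handlers[kind].append(handler)

    def process(self, content):
        """Run every known pattern over content and collect the handlers' results."""
        results = []
        for kind, patterns in self._patterns.items():
            for pattern in patterns:
                for m in pattern.finditer(content):
                    for handler in self._handlers.get(kind, []):
                        results.append(handler(m.group(0)))
        return results
```

A handler registered once with `registry.on("phone", ...)` keeps working unchanged when a UK notation is later added through `registry.add_pattern("phone", r"\b0\d{4} \d{6}\b")`; only the recognition layer grows.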

Companies like ours (or Zimbra!) that operate internationally would then gain from individual local users teaching their installation the patterns they need and use. The alternative is starting from a base that recognises only North American-style telephone numbers and addresses (and dates), and that does not progress from there without creating and installing additional Zimlets which are essentially identical in functionality to those already installed.
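
One very naive way for users to "teach" an installation is to turn each example they flag into a digit-shape regex. The Python sketch below does only that. It is a stand-in for real learning, not the statistical approach of a spam filter, and `shape_of`, `PatternLearner`, `teach`, and `recognise` are hypothetical names.

```python
import re


def shape_of(example):
    """Generalise one example: each run of n digits becomes \\d{n}, everything else stays literal."""
    parts = []
    for m in re.finditer(r"(\d+)|(\D+)", example):
        if m.group(1) is not None:
            parts.append(r"\d{%d}" % len(m.group(1)))
        else:
            parts.append(re.escape(m.group(2)))
    return "".join(parts)


class PatternLearner:
    """Hypothetical store of shapes learned from user-flagged examples."""

    def __init__(self):
        self._shapes = {}   # kind -> set of regex strings

    def teach(self, kind, example):
        """Record the shape of an example the user marked as being of this kind."""
        self._shapes.setdefault(kind, set()).add(shape_of(example))

    def recognise(self, kind, content):
        """Return substrings of content that fit any shape taught for this kind."""
        found = []
        for shape in sorted(self._shapes.get(kind, ())):
            # The lookarounds stop a shape from matching inside a longer run of digits.
            pattern = r"(?<!\d)" + shape + r"(?!\d)"
            found.extend(m.group(0) for m in re.finditer(pattern, content))
        return found
```

After `teach("phone", "020 7946 0018")`, any other number written as three, four, and four digits separated by single spaces is recognised. Anything that varies the spacing or punctuation is not, which hints at why a context-aware learner is a much bigger job.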

Chris
#4
08-24-2007, 06:29 PM
Zimlet Guru & Moderator
Posts: 467

I actually work for a company that does analytics. The problem you are describing is quite a bit harder than you might suspect. That said, it shouldn't be too hard to extend the existing systems to recognize local variations via regex and JavaScript, but contextual learning is a difficult problem.
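
As a rough idea of what a hand-written local variation could look like, here is one possible regex for common UK phone notations. It is shown in Python to stay self-contained, though a Zimlet would use JavaScript. The expression is an assumption that covers only a national `0` or `+44` prefix followed by nine or ten digits with optional single spaces; `UK_PHONE` and `normalise` are hypothetical names.

```python
import re

# Loose, illustrative match for UK numbers such as "020 7946 0018" or "+44 7700 900123".
UK_PHONE = re.compile(
    r"""
    (?<![\d+])        # not in the middle of a longer number
    (?:\+44\s?|0)     # international prefix or national trunk digit
    \d(?:\s?\d){8,9}  # nine or ten more digits, each optionally preceded by a space
    (?!\d)            # and no further digit directly after
    """,
    re.VERBOSE,
)


def normalise(number):
    """Strip whitespace and rewrite a leading national 0 as +44."""
    compact = re.sub(r"\s", "", number)
    return compact if compact.startswith("+44") else "+44" + compact[1:]
```

Run over a sentence containing "020 7946 0018", "+44 7700 900123", and "(555) 123-4567", this picks up the two UK numbers and ignores the North American one. It still knows nothing about context, so any string of digits in the right shape will match.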
#5
08-25-2007, 01:00 AM
Active Member
Posts: 28
I never said it would be easy

Just worthwhile...
