Xerox Scientists Invent Software that Automatically Indexes, Categorizes & Route Electronic Documents5 Mar, 2004
Xerox Scientists Invent Software that Automatically Indexes, Categorizes & Route Electronic Documents
Xerox Scientists Invent Software the Automatically Indexes, Categorizes & Route Electronic Documents
Scientists at Xerox Corporation have introduced software that is can "read" an electronic document, decide how it should be classified by subject, then route it to the right person's e-mail address or online document management system all completely automatically.
The software, which is a categorizing tool, is intended to help businesses keep their e-document collections orderly and easily accessible, and it is available for licensing from Xerox.
Categorizing tools currently available in the market treat each subject category independently of each other and are considered "flat." For example, although it might seem obvious to humans that biochemistry and biophysics are related categories of information, a flat categorization system wouldn't make the connection. But the Xerox system, based on patented technologies, uses a hierarchical model that is able to understand the dependency between those two categories and therefore make a more informed decision when classifying a document.
According to data gathered from a pilot test of the software, people found the right documents more often and faster because the software understood relationships between documents and categories.
Three integrated functions make the Xerox categorization technology unique:
- The system can start right away. Using advanced machine-learning techniques, with only a few examples it quickly learns by itself how to hierarchically classify documents in existing categories.
- The technology is easy to use and helps people create a comprehensive way to turn unorganized e-files into cleanly labeled document collections.
- The system can learn entirely new categories on its own. The categorization technology detects new or emerging topics and dynamically suggests new categories to the people who are using the system.