9. Duplicated Documents
About1. Introduction2. Overview3. GUI4. Image Signatures5. Unsupervised Filters6. Results & Analysis7. BioFilters8. NeuralFilters9. Duplicated Documents10. Face Recognition11. Auto Part Recognition12. Dynamic Library13. NeuralNet Filter14. Segment Variation15. TV Advertisements16. Counting & Tracking17. Image PreProcessing18. Image Processing19. Batch Job20. Parameters21. Input Option22. Application Developers23. Reference Manual24. Support Services25. Readme.txt

9.1 Why? 
9.2 Data 
9.3 Parameters 
9.4 Signature 
9.5 Training 
9.6 Matching 
9.7 Analysis 
[Home][9. Duplicated Documents]

 

9.   Finding Duplicated Documents

Click menu item �Example/Special Example/Document Duplication�; then click �Batch/Run�, this chapter is done. Now, we will walk through the Document duplication example.

Recent development in scanner technology has made it very easy to convert paper documents into digital documents. A $1000 scanner, for example Fujitsu 4120c, can scan and save 50 pages in a single click.  The scanner creates image names via auto-numbers you have specified. More expensive scanners can scan and save 1,000 pages in a single click.

This chapter attempts to solve a particular problem: to retrieve duplicated Document images. Assume you have a million pages of documents already converted into digital form, and you want to retrieve documents that meet some specified constraints. A typical Document retrieval system should have several components:

    1.  Text;

    2.  Image;

    3.  1-D barcode; and

    4.  2-D barcode.

Each component addresses a particular area of retrieval and their functions generally do not overlap. A complete solution should use all of the above options. This software deals with the image matching only.

Chapter contents include:

 

[Home][About][1. Introduction][2. Overview][3. GUI][4. Image Signatures][5. Unsupervised Filters][6. Results & Analysis][7. BioFilters][8. NeuralFilters][9. Duplicated Documents][10. Face Recognition][11. Auto Part Recognition][12. Dynamic Library][13. NeuralNet Filter][14. Segment Variation][15. TV Advertisements][16. Counting & Tracking][17. Image PreProcessing][18. Image Processing][19. Batch Job][20. Parameters][21. Input Option][22. Application Developers][23. Reference Manual][24. Support Services][25. Readme.txt]

Copyright (c) 1998 - 2006 Attrasoft, Inc. All rights reserved.

gina@attrasoft.com