Abstract
Automatic understanding of document images is a hard problem. Here we consider a sub-problem, automatically extracting content from filled form images. Without pre-selected templates or sophisticated structural/semantic analysis, we propose a novel approach based on clustering the component-block-projection-vectors. By combining spectral clustering and minimal spanning tree clustering, we generate highly accurate clusters, from which the adaptive templates are constructed to extract the filled-in content. Our experiments show this approach is effective for a set of 1040 US IRS tax form images belonging to 208 types.
| Original language | English |
|---|---|
| Pages (from-to) | 204-212 |
| Number of pages | 9 |
| Journal | Proceedings of SPIE - The International Society for Optical Engineering |
| Volume | 5296 |
| DOIs | |
| State | Published - 2004 |
| Externally published | Yes |
| Event | Document Recognition and Retrieval XI - San Jose, CA, United States Duration: 21 Jan 2004 → 22 Jan 2004 |
Keywords
- Clustering
- Document analysis
- Form processing
- Image classification
- Image understanding