David Doermann
Research Scientist Emeritus
Education:
Ph.D., University of Maryland (Computer Science)
Biography:
David Doermann is a research scientist emeritus in the University of Maryland Institute for Advanced Computer Studies.
He is a leading researcher in document image analysis and recognition, focused on improving how we process and understand visual information. Doermann’s work has a global impact, enhancing the use of documents in driving business processes in the digital age.
Publications
2007
2007. Hierarchical Part-Template Matching for Human Detection and Segmentation. Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on. :1-8.
2007. Object detection using a shape codebook. British Machine Vision Conference. 4
2007. Learning Higher-order Transition Models in Medium-scale Camera Networks. Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on. :1-8.
2007. Simultaneous appearance modeling and segmentation for matching people under occlusion. Proceedings of the 8th Asian conference on Computer vision - Volume Part II. :404-413.
2006
2006. Detecting Text Line in Handwritten Documents. ICPR'06. :1030-1033.
2006. Robust Point Matching for Nonrigid Shapes By Preserving Local Neighborhood Structures. IEEE Transactions on Pattern Analysis and Machine Intelligence. 28(4):643-649.
2006. A Robust Stamp Detection Framework on Degraded Documents. International Conference on Document Recognition and Retrieval XIII. :1-9.
2006. Morphology Induction from Limited Noisy Data Using Approximate String Matching. Proceedings of the Eighth Meeting of the ACL Special Interest Group in Computational Phonology (SIGPHON 2006). :60-68.
2006. Mosaicing of Camera-captured Documents Without Pose Restriction. Computer Vision and Image Understanding.
2006. Script-Independent Text Line Segmentation in Freestyle Handwritten Documents. LAMP-TR-136, CS-TR-4836, UMIACS-TR-2006-51, CFAR-TR-1017
2006. DOCLIB: a Software Library for Document Processing. International Conference on Document Recognition and Retrieval XIII. :1-9.
2006. Video Retrieval of Near-Duplicates using k-Nearest Neighbor Retrieval of Spatio-Temporal Descriptors. Multimedia Tools and Applications (MTAP). 30
2006. Imaging as an Alternative Data Channel for Camera Phones. ACM International Conference Proceeding Series; Proceedings of the 5th International Conference on Mobile and Ubiquitous Multimedia. :No.5.
2006. A New Algorithm for Detecting Text Line in Handwritten Documents. 10th International Workshop on Frontiers in Handwriting Recognition. :35-40.
2006. SOFTCBIR: Object Searching in Videos Combining Keypoint Matching and Graduated Assignment. LAMP-TR-132, CAR-TR-1013, CS-TR-4804, UMIACS-TR-2006-25
2006. Camera-Based Document Image Mosaicing. ICPR'06. :476-479.
2005
2005. Model of Object-Based Coding for Surveillance Video. Proceedings in the ICASSP'04 IEEE International Conference on Computer Vision. :693-696.
2005. Fast Camera Motion Estimation for Hand-Devices and Applications. 4th International Conference on Mobile and Ubiquitous Multimedia. :103-108.
2005. Identifying Script on Word-Level with Informational Confidence. 8th Int. Conf. on Document Analysis and Recognition. :416-420.
2005. Robust Point Matching for Two-Dimensional Nonrigid Shapes. Proceedings in the ICASSP'04 IEEE International Conference on Computer Vision. :1561-1566.
2005. DOCLIB: A document processing research tool. Symposium on Document Image Understanding Technology. :159-163.
2005. Flattening curved documents in images. Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. 2:338-345.
2005. Font identification using the grating cell texture operator. SPIE Conference on Document Recognition and Retrieval XXII. :148-156.
2005. Adaptive OCR with Limited User Feedback. 8th Int. Conf. on Document Analysis and Recognition (ICDAR'05). :814-818.
2005. Document Ranking by Layout Relevance. ICDAR. :362-366.
2005. Handwriting matching and its application to handwriting synthesis. Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on. :861-865 (Vol. 2).
2005. A Parallel-Line Detection Algorithm Based on HMM Decoding. IEEE Transactions on Pattern Analysis and Machine Intelligence. 27(5):777-792.
2005. Unwarping Images of Curved Documents Using Global Shape Optimization. Proc. First International Workshop on Camera-based Document Analysis and Recognition. :25-29.
2005. Camera-Based Analysis of Text and Documents: A Survey. International Journal on Document Analysis and Recognition. 7(2+3):83-104.
2005. Using Computer Vision to Detect Web Browser Display Errors. 3rd Web Document Analysis Workshop (on ICDAR 2005). :5-9.
2004
2004. Mining tool for surveillance video. SPIE 16th International Symposium on Electronic Imaging, Storage and Retrieval Methods and Applications for Multimedia. 5307:259-270.
2004. Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Transactions on Speech and Audio Processing, Special Issue on Spontaneous Speech Processing. 12(4):420-435.
2004. Adaptive Hindi OCR Using Generalized Hausdorff Image Comparison. ACM Transactions on Asian Language Information Processing. 26(2):198-213.
2004. Machine printed text and handwriting identification in noisy document images. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 26(3):337-353.
2004. Product approximation by minimizing the upper bound of Bayes error rate for Bayesian combination of classifiers. Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on. 1:252-255.
2004. Shot boundary detection based on Image correlation features in video. TREC-VID: Text Retrieval and Evaluation Conference.
2004. An Appearance-based Approach for Consistent Labeling of Humans and Objects in Video. Pattern Analysis and Applications. :1433-7541.
2004. Word Level Script Identification on Scanned Document Images. SPIE Conference on Document Recognition and Retrieval. :124-135.
2004. Mining Tools for Surveillance Video. Proceedings in SPIE 16th International Symposium on Electronic Imaging. 5307:259-270.
2004. Representation and Recognition of Events in Surveillance Video Using Petri Nets. Second IEEE Workshop on Event Mining 2004, CVPR 2004. :112-112.
2004. Robust Point Matching for Non-Rigid Shapes: A Relaxation Labeling Based Approach. LAMP-TR-117, CAR-TR-1005, CS-TR-4633, UMIACS-TR-2004-75
2004. Building an information retrieval test collection for spontaneous conversational speech. Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. :41-48.
2003
2003. Video Analysis for Pervasive Environments. ICME. 2:161-164.
2003. Parsing and Tagging of Bilingual Dictionaries. TAL Traitement Automatique Des Langues. 44(2):125-150.
2003. Video Mining.
2003. Acquisition of Bilingual MT Lexicons from OCRed Dictionaries. Proceedings of the Ninth Machine Translation Summit. :208-215.
2003. Adaptive Hindi OCR Using Generalized Hausdorff Image Comparison. LAMP-TR-105, CFAR-TR-987, CS-TR-4519, UMIACS-TR-2003-87
2003. Tagging and Parsing of Bilingual Dictionary. LAMP-TR-106, CFAR-TR-991, CS-TR-4529, UMIACS-TR-2003-97
2003. Video Retrieval using Spatio-Temporal Descriptors. ACM Multimedia '03. :508-517.
2003. Gabor Filter Based Multi-class Classifier for Scanned Document Images. 7th International Conference on Document Analysis and Recognition (ICDAR). :968-972.
2003. Content Features for Logical Document Labeling. Proc. SPIE Conference on Document Recognition and Retrieval X. :189-196.
2003. Machine Printed Text and Handwriting Identification in Noisy Document Images. LAMP-TR-107, CFAR-TR-992, CS-TR-4531, UMIACS-TR-2003-99
2003. A Model-based Line Detection Algorithm in Documents. ICDAR. :44-48.
2003. An appearance based approach for human and object tracking. Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. 2:II-85-8 vol.3.
2003. Groundtruth Image Generation from Electronic Text (Demonstration). Symposium on Document Image Understanding Technology. :309-312.
2003. Measuring Structural Similarity of Document Pages for Searching Document Image Databases. 5th IASTED International Conference on Signal and Image Processing. :320-325.
2003. Acquisition of bilingual MT lexicons from OCRed dictionaries. Proceedings of the 9th MT Summit. :208-215.
2003. A Parallel Line Detection Algorithm Based on HMM Decoding. LAMP-TR-109, CAR-TR-994, CS-TR-4545, UMIACS-TR-2003-1113
2003. Desperately seeking Cebuano. Third Conference on Human Language Technologies.
2003. Text Identification in Noisy Document Images Using Markov Random Field. ICDAR. :599-603.
2003. Sports Video Classification Using HMM. ICME. 2:345-348.
2003. Use of OCR for Rapid Construction of Bilingual Lexicons. LAMP-TR-104, CFAR-TR-986, CS-TR-4510, UMIACS-TR-2003-78
2002
2002. Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning. IAPR Conference on Document Analysis System. :212-223.
2002. Video analysis applications for pervasive environments. 1st International Conference on Mobile and Ubiquitous Multimedia. :48-55.
2002. Performance evaluation of object detection algorithms. Pattern Recognition, 2002. Proceedings. 16th International Conference on. 3:965-969.
2002. Segmentation and Identification of Handwriting in Noisy Documents. IAPR Conference on Document Analysis System. :95-105.
2002. Temporal Color Correlograms in Video Retrieval. International Conference on Pattern Recognition. :267-270.
2002. Beam Search for Feature Selection in Automatic SVM Defect Classification. International Conference on Pattern Recognition. :212-215.
2002. Binarization of low quality text using a Markov random field model. Pattern Recognition, 2002. Proceedings. 16th International Conference on. 3:160-163.
2002. Lexicon Acquisition from Bilingual Dictionaries. SPIE Photonic West Electronic Imaging Conference. :37-48.