David Doermann

Research Scientist Emeritus

Education:

Ph.D., University of Maryland (Computer Science)

Biography:

David Doermann is a research scientist emeritus in the University of Maryland Institute for Advanced Computer Studies.

He is a leading researcher in document image analysis and recognition, focused on improving how we process and understand visual information. Doermann’s work has a global impact, enhancing the use of documents in driving business processes in the digital age.

Publications

2007

Lin Z, Davis LS, Doermann D, DeMenthon D. 2007. Hierarchical Part-Template Matching for Human Detection and Segmentation. Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on. :1-8.

Yu X, Yi L, Fermüller C, Doermann D. 2007. Object detection using a shape codebook. British Machine Vision Conference. 4

Farrell R, Doermann D, Davis LS. 2007. Learning Higher-order Transition Models in Medium-scale Camera Networks. Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on. :1-8.

Lin Z, Davis LS, Doermann D, DeMenthon D. 2007. Simultaneous appearance modeling and segmentation for matching people under occlusion. Proceedings of the 8th Asian conference on Computer vision - Volume Part II. :404-413.

2006

Li Y, Zheng Y, Doermann D. 2006. Detecting Text Line in Handwritten Documents. ICPR'06. :1030-1033.

Zheng Y, Doermann D. 2006. Robust Point Matching for Nonrigid Shapes By Preserving Local Neighborhood Structures. IEEE Transactions on Pattern Analysis and Machine Intelligence. 28(4):643-649.

Zhu G, Jaeger S, Doermann D. 2006. A Robust Stamp Detection Framework on Degraded Documents. International Conference on Document Recognition and Retrieval XIII. :1-9.

Karagol-Ayan B, Doermann D, Weinberg A. 2006. Morphology Induction from Limited Noisy Data Using Approximate String Matching. Proceedings of the Eighth Meeting of the ACL Special Interest Group in Computational Phonology (SIGPHON 2006). :60-68.

Liang J, DeMenthon D, Doermann D. 2006. Mosaicing of Camera-captured Documents Without Pose Restriction. Computer Vision and Image Understanding.

Li Y, Zheng Y, Doermann D, Jaeger S. 2006. Script-Independent Text Line Segmentation in Freestyle Handwritten Documents. LAMP-TR-136, CS-TR-4836, UMIACS-TR-2006-51, CFAR-TR-1017

Jaeger S, Zhu G, Doermann D, Chen K, Sampat S. 2006. DOCLIB: a Software Library for Document Processing. International Conference on Document Recognition and Retrieval XIII. :1-9.

DeMenthon D, Doermann D. 2006. Video Retrieval of Near-Duplicates using k-Nearest Neighbor Retrieval of Spatio-Temporal Descriptors. Multimedia Tools and Applications (MTAP). 30

Liu X, Li H, Doermann D. 2006. Imaging as an Alternative Data Channel for Camera Phones. ACM International Conference Proceeding Series; Proceedings of the 5th International Conference on Mobile and Ubiquitous Multimedia. :No. 5.

Li Y, Zheng Y, Doermann D, Jaeger S. 2006. A New Algorithm for Detecting Text Line in Handwritten Documents. 10th International Workshop on Frontiers in Handwriting Recognition. :35-40.

Luo M, DeMenthon D, Yu X, Doermann D. 2006. SOFTCBIR: Object Searching in Videos Combining Keypoint Matching and Graduated Assignment. LAMP-TR-132,CAR-TR-1013,CS-TR-4804,UMIACS-TR-2006-25

Liang J, DeMenthon D, Doermann D. 2006. Camera-Based Document Image Mosaicing. ICPR'06. :476-479.

2005

Yu Y, Doermann D. 2005. Model of Object-Based Coding for Surveillance Video. Proceedings in the ICASSP'04 IEEE International Conference on Computer Vision. :693-696.

Liu X, Doermann D, Li H. 2005. Fast Camera Motion Estimation for Hand-Devices and Applications. 4th International Conference on Mobile and Ubiquitous Multimedia. :103-108.

Jaeger S, Ma H, Doermann D. 2005. Identifying Script on Word-Level with Informational Confidence. 8th Int. Conf. on Document Analysis and Recognition. :416-420.

Zheng Y, Doermann D. 2005. Robust Point Matching for Two-Dimensional Nonrigid Shapes. Proceedings in the ICASSP'04 IEEE International Conference on Computer Vision. :1561-1566.

Chen K, Jaeger S, Zhu G, Doermann D. 2005. DOCLIB: A document processing research tool. Symposium on Document Image Understanding Technology. :159-163.

Liang J, DeMenthon D, Doermann D. 2005. Flattening curved documents in images. Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. 2:338-345.

Ma H, Doermann D. 2005. Font identification using the grating cell texture operator. SPIE Conference on Document Recognition and Retrieval XXII. :148-156.

Ma H, Doermann D. 2005. Adaptive OCR with Limited User Feedback. 8th Int. Conf. on Document Analysis and Recognition (ICDAR'05). :814-818.

Huang M, DeMenthon D, Doermann D, Golebiowski L. 2005. Document Ranking by Layout Relevance. ICDAR. :362-366.

Zheng Y, Doermann D. 2005. Handwriting matching and its application to handwriting synthesis. Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on. 2:861-865.

Zheng Y, Li H, Doermann D. 2005. A Parallel-Line Detection Algorithm Based on HMM Decoding. IEEE Transactions on Pattern Analysis and Machine Intelligence. 27(5):777-792.

Liang J, DeMenthon D, Doermann D. 2005. Unwarping Images of Curved Documents Using Global Shape Optimization. Proc. First International Workshop on Camera-based Document Analysis and Recognition. :25-29.

Liang J, Doermann D, Li H. 2005. Camera-Based Analysis of Text and Documents: A Survey. International Journal on Document Analysis and Recognition. 7(2+3):83-104.

Liu X, Doermann D. 2005. Using Computer Vision to Detect Web Browser Display Errors. 3rd Web Document Analysis Workshop (on ICDAR 2005). :5-9.

2004

Ghanem NM, Doermann D, Davis LS, DeMenthon D. 2004. Mining tool for surveillance video. SPIE 16th International Symposium on Electronic Imaging, Storage and Retrieval Methods and Applications for Multimedia. 5307:259-270.

Byrne W, Doermann D, Franz M, Gustman S, Hajic J, Oard D, Picheny M, Psutka J, Ramabhadran B. 2004. Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Transactions on Speech and Audio Processing, Special Issue on Spontaneous Speech Processing. 12(4):420-435.

Ma H, Doermann D. 2004. Adaptive Hindi OCR Using Generalized Hausdorff Image Comparison. ACM Transactions on Asian Language Information Processing. 26(2):198-213.

Zheng Y, Li H, Doermann D. 2004. Machine printed text and handwriting identification in noisy document images. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 26(3):337-353.

Kang H-J, Doermann D. 2004. Product approximation by minimizing the upper bound of Bayes error rate for Bayesian combination of classifiers. Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on. 1:252-255.

Luo M, DeMenthon D, Doermann D. 2004. Shot boundary detection based on Image correlation features in video. TREC-VID: Text Retrieval and Evaluation Conference.

Balcells-Capellades M, DeMenthon D, Doermann D. 2004. An Appearance-based Approach for Consistent Labeling of Humans and Objects in Video. Pattern Analysis and Applications. :1433-7541.

Ma H, Doermann D. 2004. Word Level Script Identification on Scanned Document Images. SPIE Conference on Document Recognition and Retrieval. :124-135.

Ghanem N, Doermann D, Davis LS, DeMenthon D. 2004. Mining Tools for Surveillance Video. Proceedings in SPIE 16th International Symposium on Electronic Imaging. 5307:259-270.

Ghanem N, DeMenthon D, Doermann D, Davis LS. 2004. Representation and Recognition of Events in Surveillance Video Using Petri Nets. Second IEEE Workshop on Event Mining 2004, CVPR 2004. :112-112.

Zheng Y, Doermann D. 2004. Robust Point Matching for Non-Rigid Shapes: A Relaxation Labeling Based Approach. LAMP-TR-117,CAR-TR-1005,CS-TR-4633,UMIACS-TR-2004-75

Oard D, Soergel D, Doermann D, Huang X, Murray CG, Wang J, Ramabhadran B, Franz M, Gustman S, Mayfield J et al. 2004. Building an information retrieval test collection for spontaneous conversational speech. Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. :41-48.

2003

Doermann D, Karunanidhi A. 2003. Video Analysis for Pervasive Environments. ICME. 2:161-164.

Ma H, Karagol-Ayan B, Doermann D, Oard D, Wang J. 2003. Parsing and Tagging of Bilingual Dictionaries. TAL Traitement Automatique Des Langues. 44(2):125-150.

Rosenfeld A, Doermann D, DeMenthon D. 2003. Video Mining.

Karagol-Ayan B, Doermann D, Dorr BJ. 2003. Acquisition of Bilingual MT Lexicons from OCRed Dictionaries. Proceedings of the Ninth Machine Translation Summit. :208-215.

Ma H, Doermann D. 2003. Adaptive Hindi OCR Using Generalized Hausdorff Image Comparison. LAMP-TR-105,CFAR-TR-987,CS-TR-4519,UMIACS-TR-2003-87

Ma H, Karagol-Ayan B, Doermann D, Oard D, Wang J. 2003. Tagging and Parsing of Bilingual Dictionary. LAMP-TR-106,CFAR-TR-991,CS-TR-4529,UMIACS-TR-2003-97

DeMenthon D, Doermann D. 2003. Video Retrieval using Spatio-Temporal Descriptors. ACM Multimedia '03. :508-517.

Ma H, Doermann D. 2003. Gabor Filter Based Multi-class Classifier for Scanned Document Images. 7th International Conference on Document Analysis and Recognition (ICDAR). :968-972.

Liang J, Doermann D. 2003. Content Features for Logical Document Labeling. Proc. SPIE Conference on Document Recognition and Retrieval X. :189-196.

Zheng Y, Li H, Doermann D. 2003. Machine Printed Text and Handwriting Identification in Noisy Document Images. LAMP-TR-107,CFAR-TR-992,CS-TR-4531,UMIACS-TR-2003-99

Zheng Y, Li H, Doermann D. 2003. A Model-based Line Detection Algorithm in Documents. ICDAR. :44-48.

Capellades MB, Doermann D, DeMenthon D, Chellappa R. 2003. An appearance based approach for human and object tracking. Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. 2:II-85-8.

Doermann D, Zi G. 2003. Groundtruth Image Generation from Electronic Text (Demonstration). Symposium on Document Image Understanding Technology. :309-312.

Shin C, Doermann D, Rosenfeld A. 2003. Measuring Structural Similarity of Document Pages for Searching Document Image Databases. 5th IASTED International Conference on Signal and Image Processing. :320-325.

Karagol-Ayan B, Doermann D, Dorr BJ. 2003. Acquisition of bilingual MT lexicons from OCRed dictionaries. Proceedings of the 9th MT Summit. :208-215.

Zheng Y, Li H, Doermann D. 2003. A Parallel Line Detection Algorithm Based on HMM Decoding. LAMP-TR-109,CAR-TR-994,CS-TR-4545,UMIACS-TR-2003-1113

Oard D, Doermann D, Dorr BJ, He D, Resnik P, Weinberg A, Byrne W, Khudanpur S, Yarowsky D, Leuski A et al. 2003. Desperately seeking Cebuano. Third Conference on Human Language Technologies.

Zheng Y, Li H, Doermann D. 2003. Text Identification in Noisy Document Images Using Markov Random Field. ICDAR. :599-603.

Gilbert X, Li H, Doermann D. 2003. Sports Video Classification Using HMM. ICME. 2:345-348.

Karagol-Ayan B, Doermann D, Dorr BJ. 2003. Use of OCR for Rapid Construction of Bilingual Lexicons. LAMP-TR-104,CFAR-TR-986,CS-TR-4510,UMIACS-TR-2003-78

2002

Liang J, Doermann D. 2002. Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning. IAPR Conference on Document Analysis System. :212-223.

Karunanidhi A, Doermann D, Parekh N, Rautio V. 2002. Video analysis applications for pervasive environments. 1st International Conference on Mobile and Ubiquitous Multimedia. :48-55.

Mariano VY, Min J, Park J-H, Kasturi R, Mihalcik D, Li H, Doermann D, Drayer T. 2002. Performance evaluation of object detection algorithms. Pattern Recognition, 2002. Proceedings. 16th International Conference on. 3:965-969.

Zheng Y, Li H, Doermann D. 2002. Segmentation and Identification of Handwriting in Noisy Documents. IAPR Conference on Document Analysis System. :95-105.

Rautiainen M, Doermann D. 2002. Temporal Color Correlograms in Video Retrieval. International Conference on Pattern Recognition. :267-270.

Gupta P, Doermann D, DeMenthon D. 2002. Beam Search for Feature Selection in Automatic SVM Defect Classification. International Conference on Pattern Recognition. :212-215.

Wolf C, Doermann D. 2002. Binarization of low quality text using a Markov random field model. Pattern Recognition, 2002. Proceedings. 16th International Conference on. 3:160-163.

Doermann D, Ma H, Karagol-Ayan B, Oard D. 2002. Lexicon Acquisition from Bilingual Dictionaries. SPIE Photonic West Electronic Imaging Conference. :37-48.