free simple site templates

Document and Pattern Recognition Lab

We research systems that recognize and retrieve information in documents, images, and videos.

News (2019)

  • (Dec) We will have three papers at ECIR 2020: one full paper on accelerated formula search, a demo paper on our new MathDeck interface, and a short paper on the ARQMath lab at CLEF 2020. Congratulations to everyone on the project who contributed to these papers (details available through the Publications link).
  • (Dec) The lab welcomes Jessica Diehl, an MSc Computer Science studnet who will be working on formula indexing in Spring 2020. We are also happy to announce that Yancarlos Diaz will be rejoining us to work on the MathDeck interface for Spring 2020 as well.
  • (Dec) The dprl is looking for a new PhD student to start in Fall 2020, to work on formula extraction, recognition, and indexing in large document collections. Details may be found online here. Please contact Prof. Zanibbi if you are interested (
  • (Nov) The ARQMath lab has begun! Please visit the lab web page online, and join the ARQMath forum if you are interested in the lab discussions. 
  • (Oct) Congratulations to Abishai Dmello, who has successfully defended his thesis, Representing Mathematical Concepts Associated with Formulas using Math Entity Cards.
  • (Oct) A number of sotware projects from the MathSeer project are now available online (formula detector (ScanSSD), formula recognition (LPGA), Tangent-CFT formula retrieval model) along with demos for the MathSeer interface. Code is available through the dprl software page as well.
  • (Oct) Prof. Zanibbi gave a talk on MathSeer at the University of Waterloo in Canada.
  • (Sep) Prof. Zanibbi gave a talk about the MathSeer project at RMIT in Melbourne, Australia.
  • (Aug) Congratulations to Parag Mali, who has successfully defended his MSc thesis, Scanning Single Shot Detector for Math in Document Images.
  • (Aug) Puneeth Kukkadapu has accepted a new job at ACV Auctions, where he will be doing work on handwriting recognition.
  • (Aug) The lab welcomes Jennifer Liu. Jennifer is a BSc student who will be working as a full-time research programmer in Fall 2019.
  • (June) Behrooz Mansouri's paper on applying text embeddings to formula retrieval has been accepted for publication at ICTIR 2019. The techinque produces much stronger isolated formula search results for partially similar formulas than previous systems, and state-of-the-art results when combined with Wei Zhong's Approach0 path-based formula search engine.
  • (June)  Mahshad Mahdavi's ICDAR 2019 paper on recognition of math formulas in images has been accepted for Oral presentation at the conference in Sydney this September. Mahshad will also be presenting a paper at the GREC workshop on the new evaluation model used for formula recognition in CROHME 2019.

The 2019 CROHME + TFD Competition sucessfully concluded in June. A paper summarizing the competition outcome will appear in the proceedings of ICDAR 2019. Congratulations to Mahshad Mahdavi, who was the lead organizer for the competition.

  • (June) Behrooz Mansouri's JCDL paper on Characterizing Searches for Mathematical Concepts was nominated for a Best Paper award. 
  • (May) The lab welcomes Thomas Lazore, who will be working as an intern in the lab during Summer 2019.
  • (May) Congratulations to Mahshad Mahdavi and Kwon-Young Choi, who have both had papers accepted at ICDAR 2019! Mahshad's paper concerns formula recognition, while Kwon-Young's is concerned with detecting accidentals in scans of printed music.
  • (April) Congratulations to Wei Zhong, who received the Best Applications Paper Award at ECIR 2019! His paper. "Structural Similarity Search for Formulas using Leaf-Root Paths in Operator Subtrees" can be found here.
  • (April) Congratulations to Puneeth Kukkadapu, who has received the TSO Logic Master's Student Scholarship from the Department of Computer Science.
  • (April) The ECIR 2019 version of Tangent-v for visual formula search has been released. Results from Kenny and Ritvik's ECIR 2019 paper are included in the package.
  • (April) The SymbolScraper Apache PDFBox extension for obtaining precise symbol locations and identities has been released!
  • (Mar) The MathSeer project web pages are now live.
  • (Mar) Congratulations to Behrooz Mansouri, whose paper "Toward Math-Enabled Digital Libraries: Characterizing Searches for Mathematical Concepts" has been accepted for publication at JCDL 2019, which will be held at the University of Illinois Urbana-Champaign. The paper presents the first search log analysis of math search behaviors in a general-purpose search engine (Parsijoo, from Iran).
  • (Mar) Congratulations to Parag Mali, who has secured a full-time Software Engineer position with the Microsoft AI and Ink group in Seattle!
  • (Jan) Congratulations to Wei Zhong and Kenny Davila, who have both had full papers on math formula search engines accepted to ECIR 2019 in Cologne, Germany this April. Wei's paper is "Structural Similarity Search for Formulas using Leaf-Root Paths in Operator Subtrees," while Kenny's paper is "Tangent-V: Math Formula Image Search Using Line-of-Sight Graphs."
  • (Jan) Congratulations to Mahshad Mahdavi, who has secured a Machine Learning summer internship with Apple in Cupertino!
  • (Jan) The lab welcomes two new research assistants, Gavin Nishizawa and Yancarlos Diaz. They are working on developing a new math-aware search interface for the MathSeer project.

Room 70-3500, GCCIS
Dept. Computer Science
Rochester Inst. Technology
Rochester, NY, 14623-5608


Phone: +1 (585) 475-4536
Fax: +1 (585) 475-4935