Room 70-3500 (Golisano College)
Department of Computer Science
Rochester Institute of Technology
Phone: (585) 475-4536

[ Home ] [ News ] [ Members ] [ Projects ] [ Publications ] [ Software ] [ Support ]

CROHMELib and LgEval: Documentation

For CROHME 2013, we (R. Zanibbi and H. Mouchère) have created libraries for use by participants, and for others interested in using the CROHME data and tools for their research in the future. This year, CROHME participants may create symbol layout trees in a label graph file, and then have these automatically converted to CROHME .inkml and 'normalized' label graph formats (details are provided below). We are providing two libraries:


Contents


Installation

To install CROHMELib and LgEval on your system, follow the steps below.

  1. Make sure that you have python (recommmended: 2.6 or 2.7) and perl installed on your system, and install the TXL programming language. All three languages are available for a wide variety of platforms, and python and perl are normally installed by default on Unix-based systems (including Linux, and MacOS X). Under Windows, Cygwin provides similar command-line facilities.

  2. Check that GraphViz) is installed on your system.

  3. Download the .zip archives

    • CROHMELib (0.1.12 (updated May 3 - small correction in evalInkML script to handle COMMAs.)
    • LgEval (0.2.11 (updated June 5, 2013*)

    Uncompress these on your system.

    To be able to use the scripts on the command line in bash shell without providing paths, assuming that you are using bash shell, add the following to the .bashrc file in your home directory:

    export CROHMELibDir=[path_to_CROHMELib]
    export LgEvalDir=[path_to_LgEval]
    export PATH=$PATH:$CROHMELibDir/bin:$LgEvalDir/bin

CROHME InkML File Format

The CROHME .inkml file format is XML-based, and has the following sections:

  1. Header: identifies inkml format, and annotation (<annotation>, <traceFormat>) tags provide a unique identifier for each expression along with the ground truth (i.e. target recognition results) in LaTeX format. Additional optional annotations may be provided, e.g. to describe the format used for pen samples, demographic information on the writer, etc.

  2. MathML: this represents the structure of the math expression in Presentation MathML format (this represents symbol layout in a manner akin to LaTeX). MathML can be rendered directly by most web browsers (e.g. as used in the CROHME InkML Viewer). The MathML used for CROHME includes annotations that refer to symbols in the symbol list at the end of the file (given as xml:id attributes in symbol tags).

  3. Pen Strokes: tags provide an identifier for each stroke, along with a series of (x,y) pen coordinates. Note!! expressions have been collected from different labs, on different devices, and in different countries - a variety of coordinate representations are used, including negative, floating point and integer coordinates. CROHME participants will want to carefully consider how to represent pen strokes across these representations.

  4. Symbol List: At the end of a CROHME .inkml file is a list of segments (i.e. symbols), given by a symbol label and list of pen strokes (<traceView> elements). Each symbol has an identifier that is used in the MathML structure representation (given by tags). There is an outer <traceGroup> that contains a <traceGroup> for each symbol.

Label Graph File Format

A Label Graph is a labeled adjacency matrix representation for a graph (for more on the representation, please see our paper Evaluating structural pattern recognition for handwritten math via primitive label graphs from DRR 2013). For example, consider the expression '2+2,' written using one stroke for each 2, and two strokes for the '+' (one vertical stroke, and one horizontal stroke). To describe the layout of symbols in this expression from the set of strokes, we need to define:

  1. How strokes are grouped into symbols (i.e. stroke segmentation), and
  2. Which class each symbol has, and
  3. How symbols are spatially arranged.

In a label graph, we use adjacency matrices to represent relationships between all stroke pairs:

  1. Segmentation: an asterisk (*) label is defined between all pairs of strokes belonging to a symbol. This defines an undirected edge between all strokes in a symbol.
  2. Classification: a symbol's class is associated with each of its strokes, represented by a self-edge in the adjacency matrix (i.e. along the main diagonal). We represent symbol classes as node rather than edge labels (see example below).
  3. Structure (Symbol relationships): Are represented by a relationship from each stroke in a symbol to every other stroke in a symbol. Note that for math notation, spatial relationships are directed (i.e. hierarchical). In our example, all strokes of the '+' have an at-right (R) label associated with the stroke for the 2 on the right.

In LgEval, an 'undefined' label (e.g. no class, or no relationship between a pair of strokes) is represented by an underscore ('_'). Using stroke identifiers s1-s4, the label graph and associated adjacency matrix ('label matrix') for our '2+2' example looks like this:

\( \left[\begin{array}{cccc} 2 & R & R & R \\ \_ & + & * & R \\ \_ & * & + & R \\ \_ & \_ & \_ & 2\\ \end{array}\right] \)

Inherited Relationships: you may be wondering about the 'R' edge between the leftmost and righmost '2.' Symbol layout in math expressions is often represented in the form of a tree (e.g. in LaTeX or MathML). So 2 + 2 can be represented by '\(2 \rightarrow + \rightarrow 2\),' the left-to-right ordering of symbols along the baseline. Note that this graph represents the fact that both the '+' and righmost '2' are right of the leftmost 2, i.e. the relationship is inherited along the tree. By inheriting spatial relationship, this allows us, for example, to identify that if we mis-recognize \(2^{a_i}\) as \(2^{ai}\), that while the relationship between 'a' and 'i' is incorrect, 'i' is still correctly within the superscript region of the 2. You do not need to create inherited relationships in your .lg outputs for the CROHME competition (see below, under 'Tools to the rescue!').

For vertical structures such as nested fractions, the dominant operator of the sub-expression must possess the incoming and outgoing right-of edges to sub-expressions adjacent on the baseline; this is identical to the representation used for LaTeX or MathML. As an example, below is a fraction and its associated layout tree. Note: for simplicity, here the primitives (nodes) are connected components (one node per symbol). Relationships are labeled by (A)bove, (B)elow, and at (R)ight.

\(\Huge \frac{\frac{a}{2} + \frac{b}{3}}{c}\)

In the (final) label graph representation, each symbol in the symbol layout tree above will inherit all relationships from its ancestors; for example, all symbols above the widest fraction line inherit the Above ('A') relationship between the widest fraction line and the leftmost fraction line of the numerator, with an edge labeled 'A' between the widest line and all the strokes in symbols above it.

CSV Format (.lg files): Here is the CSV file, 2p2.lg for our '2+2' example. The file defines the labeled nodes and edges in the label graph shown above. Stroke/node labels are identified by N, and edges between strokes by E.

N, s1, 2
N, s2, +
N, s3, +
N, s4, 2

E, s2, s3, *
E, s3, s2, *

E, s1, s2, R
E, s1, s3, R
E, s1, s4, R
E, s2, s4, R
E, s3, s4, R

Labels not specified in the file are automatically interpreted as undefined by LgEval. By default LgEval will generate label weights for all nodes and edges of value '1.0,' but you do not need to provide label weights in your .lg files.

Tools to the rescue! Participants can produce label graphs defining a layout trees similar to that for the fraction example above (with additional nodes for multi-stroke symbols), and then use tools in CROHMELib (specifically, mergeLgCrohme and convertLgCrohme) that will convert an .lg file containing a layout tree into a CROHME .inkml file, and another .lg file where all spatial relationships have been inherited.


Tools

Below is a brief summary of the tools available from LgEval and CROHMELib. The tools were created to make it easier to produce recognition output and view recognition results in the CROHME InkML Viewer, allow new stroke-level as well as object level metrics to be computed from label graphs, and to provide a toolset for converting between CROHME InkML, label graph, and other (e.g. from MathBrush) file formats. Additional documentation may be found by calling the scripts shown without arguments, and in the README files provided with the two libraries.

Note: if you use label graphs and/or LgEval for projects other than the CROHME 2013 competition, we would appreciate it if you would cite our paper published at Document Recognition and Retrieval XX (2013), Evaluating structural pattern recognition for handwritten math via primitive label graphs.

LgEval

LgEval is python-based. The programs in the library are described in the README included in the LgEval download.

CROHMELib

Each of the programs below may be found in the bin/ directory after uncompressing the LgEval archive. Additional details are provided in a README.


Symbol Classes and Relationships for CROHME 2013

Spatial Relationships

CROHME 2013's training and test sets include single expressions that do not include matrices or other tabular/grid structures. The spatial relationships include the following (the labels to be used for label graphs/LgEval are shown in parentheses):

  1. (R)ight
  2. (A)bove
  3. (B)elow
  4. (I)nside (for square root)
  5. (Sup)erscript
  6. (Sub)script

Symbols

Participants should use the LaTeX names for symbols as given below. There are 102 symbol classes in total (Note: this has changed from the first release). Symbols were rendered from their LaTeX codes below using MathJax.

Note: fraction lines and subtraction signs are both represented by '-'; \times and X will also be treated as synonymns. For the function names given below, all strokes belonging to the function name should be grouped into a single 'symbol' labeled by the function name. COMMA is used instead of ',' to avoid conflicts with separators in CSV files.

Expression Grammar

A text file containing the list of symbols along with the grammar defining legal symbol layout trees for the CROHME competition is included in CROHMELib, and is also available through the CROHME 2013 web pages. All training and test expressions will be consistent with the grammar provided.


Contact

If you have questions about CROHMELib or LgEval (or find bugs!), please contact Richard Zanibbi (rlaz@cs.rit.edu, Rochester Institute of Technology, USA) or Harold Mouchère (harold.mouchere@univ-nantes.fr, University of Nantes, France).


Last Updated: March 26, 2013