
arXiv: 1407.0167 · v1 · pith:NHOHDPSUnew · submitted 2014-07-01 · cs.DL · cs.CL · cs.IR

Mathematical Language Processing Project

classification: cs.DL · cs.CL · cs.IR
keywords: approach, identifiers, language, mathematical, meaning, definitions, identifier-definition, processing

In natural language, words and phrases themselves imply the semantics. In contrast, the meaning of identifiers in mathematical formulae is undefined. Thus, scientists must study the context to decode the meaning. The Mathematical Language Processing (MLP) project aims to support that process. In this paper, we compare two approaches to discovering identifier-definition tuples. First, we use a simple pattern-matching approach. Second, we present the MLP approach, which uses part-of-speech-tag-based distances as well as sentence positions to calculate identifier-definition probabilities. The evaluation of our prototype system, applied to the Wikipedia text corpus, shows that our approach substantially improves the user experience: when the user hovers over an identifier in a formula, a tooltip with the most probable definitions appears. Tests with random samples show that the displayed definitions match the actual meaning of the identifiers well.
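
To make the two approaches concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions, not the authors' implementation: the regular expressions in `PATTERNS`, the function names `extract_tuples` and `rank_candidates`, and the `1 / (1 + distance)` score are all hypothetical. The abstract does not list the patterns used, and the actual MLP scoring combines part-of-speech-based distances with sentence positions in a way not specified here. The candidate nouns are passed in by hand where a real system would obtain them from a part-of-speech tagger.

```python
import re

# Hypothetical patterns for illustration; the abstract does not say which
# patterns the authors actually used.
PATTERNS = [
    # "E is the energy", "m denotes a mass"
    re.compile(r"\b(?P<id>[A-Za-z])\s+(?:is|denotes)\s+(?:the|a|an)\s+(?P<def>[a-z]+)"),
    # "let v be the velocity"
    re.compile(r"\b[Ll]et\s+(?P<id>[A-Za-z])\s+be\s+(?:the|a|an)\s+(?P<def>[a-z]+)"),
]


def extract_tuples(sentence):
    """Pattern-matching sketch: return (identifier, definition) tuples."""
    found = []
    for pattern in PATTERNS:
        for match in pattern.finditer(sentence):
            found.append((match.group("id"), match.group("def")))
    return found


def rank_candidates(tokens, identifier, candidate_nouns):
    """Distance-based sketch: rank candidate nouns by token distance.

    The score 1 / (1 + distance) is an assumed stand-in for the
    identifier-definition probability described in the abstract.
    """
    positions = [i for i, tok in enumerate(tokens) if tok == identifier]
    if not positions:
        return []
    scores = {}
    for i, tok in enumerate(tokens):
        if tok in candidate_nouns:
            distance = min(abs(i - p) for p in positions)
            score = 1.0 / (1.0 + distance)
            # Keep the best score if a noun occurs more than once.
            scores[tok] = max(scores.get(tok, 0.0), score)
    return sorted(scores.items(), key=lambda item: -item[1])


if __name__ == "__main__":
    print(extract_tuples("where E is the energy and m is the mass"))
    tokens = "the energy E of a body with mass m".split()
    print(rank_candidates(tokens, "E", {"energy", "body", "mass"}))
```

In this sketch the pattern matcher only fires on fixed phrasings, while the distance ranking can still propose a definition when no pattern applies, which mirrors the motivation for comparing the two approaches.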
