
arxiv: cs/9905010 · v1 · submitted 1999-05-19 · 💻 cs.CL · cs.LG

Statistical Inference and Probabilistic Modelling for Constraint-Based NLP

classification 💻 cs.CL cs.LG
keywords: data, incomplete, probabilistic, estimation, grammars, parameters, algorithm, constraint-based

We present a probabilistic model for constraint-based grammars and a method for estimating the parameters of such models from incomplete (i.e., unparsed) data. Methods exist for estimating the parameters of probabilistic context-free grammars from incomplete data (Baum 1970), but for probabilistic grammars involving context dependencies, only techniques for estimation from complete (i.e., fully parsed) data have been presented so far (Abney 1997). Complete-data estimation, however, requires labor-intensive, error-prone, and grammar-specific hand annotation of large language corpora. We present a log-linear probability model for constraint logic programming and a general algorithm for estimating the parameters of such models from incomplete data, obtained by extending the estimation algorithm of Della-Pietra, Della-Pietra, and Lafferty (1997) to the incomplete-data setting.
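The abstract does not spell out the model or the algorithm, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes the standard log-linear form, in which an analysis x with feature vector f(x) has probability proportional to exp(λ · f(x)), and it treats an unparsed sentence y as observed while its analysis is hidden, so the likelihood of y sums over all analyses in X(y). The toy data, the feature vectors, and every name below are hypothetical. The update is plain gradient ascent on the incomplete-data log-likelihood, not the paper's extension of the Della-Pietra, Della-Pietra, and Lafferty algorithm.

```python
import math

# Hypothetical toy data: each observed sentence maps to the feature vectors
# of its possible analyses. "s1" is ambiguous, "s2" is not.
ANALYSES = {
    "s1": [(1.0, 0.0), (0.0, 1.0)],
    "s2": [(1.0, 0.0)],
}
# Unparsed corpus: only the sentences are observed, never the analyses.
CORPUS = ["s1", "s2", "s2"]


def score(lam, f):
    """Unnormalized log-linear weight exp(lam . f)."""
    return math.exp(sum(l * v for l, v in zip(lam, f)))


def all_analyses():
    """Every analysis in the (toy) sample space."""
    return [f for fs in ANALYSES.values() for f in fs]


def log_likelihood(lam):
    """Incomplete-data log-likelihood: sum over y of log sum_{x in X(y)} p(x)."""
    z = sum(score(lam, f) for f in all_analyses())
    return sum(
        math.log(sum(score(lam, f) for f in ANALYSES[y]) / z) for y in CORPUS
    )


def gradient(lam):
    """Per sentence: E[f | y] under the model minus the unconditional E[f]."""
    z = sum(score(lam, f) for f in all_analyses())
    dims = range(len(lam))
    model_exp = [sum(score(lam, f) * f[i] for f in all_analyses()) / z for i in dims]
    grad = [0.0 for _ in dims]
    for y in CORPUS:
        zy = sum(score(lam, f) for f in ANALYSES[y])
        for i in dims:
            cond_exp = sum(score(lam, f) * f[i] for f in ANALYSES[y]) / zy
            grad[i] += cond_exp - model_exp[i]
    return grad


def fit(steps=200, rate=0.1):
    """Plain gradient ascent from lam = 0 (an assumption, not the paper's method)."""
    lam = [0.0, 0.0]
    for _ in range(steps):
        g = gradient(lam)
        lam = [l + rate * gi for l, gi in zip(lam, g)]
    return lam


if __name__ == "__main__":
    lam = fit()
    print(lam, log_likelihood(lam))
```

In this toy corpus the unambiguous sentence "s2" is seen more often, so the fit raises the weight of the feature its analysis carries; that in turn shifts probability toward the matching analysis of the ambiguous sentence "s1", even though no parse was ever annotated.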
