
arxiv: 1605.04359 · v1 · submitted 2016-05-14 · 💻 cs.CL


Occurrence Statistics of Entities, Relations and Types on the Web

keywords entities · occurrence · report · statistics · along · build · cannot · case

This report starts from the problem of collecting reliable estimates of how often entities occur on the open web. Models learned for tagging entities cannot be expected to perform well when deployed on the web, because the distribution of such entities on the web differs severely from their distribution in the comparatively small training data. In this report, we build the case for using maximum mean discrepancy to estimate occurrence statistics of entities on the web, reviewing named entity disambiguation techniques and related concepts along the way.
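To make the idea of using maximum mean discrepancy (MMD) for occurrence estimation concrete, here is a minimal sketch. It is not taken from the report: it assumes one-dimensional features, a Gaussian kernel with a hand-picked `gamma`, two classes, and a simple grid search, and the function names (`mmd2`, `estimate_positive_rate`) are hypothetical. The sketch picks the mixing proportion for which a mixture of the labeled class samples lies closest, in kernel mean embedding, to the unlabeled target sample.

```python
import math


def rbf(x, y, gamma=1.0):
    """Gaussian (RBF) kernel on scalar features (an assumption of this sketch)."""
    return math.exp(-gamma * (x - y) ** 2)


def mean_kernel(xs, ys, gamma=1.0):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(rbf(x, y, gamma) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd2(xs, ys, gamma=1.0):
    """Biased estimate of the squared MMD between two samples."""
    return (
        mean_kernel(xs, xs, gamma)
        + mean_kernel(ys, ys, gamma)
        - 2.0 * mean_kernel(xs, ys, gamma)
    )


def estimate_positive_rate(pos, neg, target, gamma=1.0, steps=100):
    """Grid-search the proportion theta in [0, 1] that minimizes
    || theta * mu_pos + (1 - theta) * mu_neg - mu_target ||^2,
    where mu_* are empirical kernel mean embeddings.
    The term that depends only on the target sample is constant in theta
    and is therefore omitted from the objective.
    """
    kpp = mean_kernel(pos, pos, gamma)
    knn = mean_kernel(neg, neg, gamma)
    kpn = mean_kernel(pos, neg, gamma)
    kpt = mean_kernel(pos, target, gamma)
    knt = mean_kernel(neg, target, gamma)

    best_theta, best_obj = 0.0, float("inf")
    for i in range(steps + 1):
        theta = i / steps
        obj = (
            theta ** 2 * kpp
            + (1.0 - theta) ** 2 * knn
            + 2.0 * theta * (1.0 - theta) * kpn
            - 2.0 * theta * kpt
            - 2.0 * (1.0 - theta) * knt
        )
        if obj < best_obj:
            best_theta, best_obj = theta, obj
    return best_theta
```

Given labeled samples `pos` and `neg` from a small training set and an unlabeled `target` sample drawn from the deployment distribution, `estimate_positive_rate(pos, neg, target)` returns an estimate of the fraction of positive items in `target`. The grid search is chosen only for readability; the objective is quadratic in `theta`, so a closed-form or quadratic-programming solution would be the natural choice with more than two classes.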
