Exploring the importance of context and embeddings in neural NER models for task-oriented dialogue systems

Aman Srivastava; Chirag Jain; Pratik Jayarao

Exploring the importance of context and embeddings in neural NER models for task-oriented dialogue systems

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1812.02370 v1 pith:L64G65UD submitted 2018-12-06 cs.CL

Exploring the importance of context and embeddings in neural NER models for task-oriented dialogue systems

Pratik Jayarao , Chirag Jain , Aman Srivastava This is my paper

classification cs.CL

keywords systemsneuraltask-orientedadditionalconversationaldatasetdialoguedifferent

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

Named Entity Recognition (NER), a classic sequence labelling task, is an essential component of natural language understanding (NLU) systems in task-oriented dialog systems for slot filling. For well over a decade, different methods from lookup using gazetteers and domain ontology, classifiers over handcrafted features to end-to-end systems involving neural network architectures have been evaluated mostly in language-independent non-conversational settings. In this paper, we evaluate a modified version of the recent state of the art neural architecture in a conversational setting where messages are often short and noisy. We perform an array of experiments with different combinations of including the previous utterance in the dialogue as a source of additional features and using word and character level embeddings trained on a larger external corpus. All methods are evaluated on a combined dataset formed from two public English task-oriented conversational datasets belonging to travel and restaurant domains respectively. For additional evaluation, we also repeat some of our experiments after adding automatically translated and transliterated (from translated) versions to the English only dataset.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Code Mixologist : A Practitioner's Guide to Building Code-Mixed LLMs
cs.CL 2026-01 unverdicted novelty 5.0

A survey that unifies prior code-switching research for LLMs into a taxonomy of data, modeling, and evaluation and distills it into actionable recommendations for practitioners.