
arxiv: 1508.01991 · v1 · pith:QX4NG7LVnew · submitted 2015-08-09 · 💻 cs.CL

Bidirectional LSTM-CRF Models for Sequence Tagging

classification 💻 cs.CL
keywords: lstm, bidirectional, bi-lstm-crf, layer, model, models, sequence, tagging

In this paper, we propose a variety of Long Short-Term Memory (LSTM) based models for sequence tagging. These models include LSTM networks, bidirectional LSTM (BI-LSTM) networks, LSTM with a Conditional Random Field (CRF) layer (LSTM-CRF) and bidirectional LSTM with a CRF layer (BI-LSTM-CRF). Our work is the first to apply a bidirectional LSTM CRF (denoted as BI-LSTM-CRF) model to NLP benchmark sequence tagging data sets. We show that the BI-LSTM-CRF model can efficiently use both past and future input features thanks to a bidirectional LSTM component. It can also use sentence level tag information thanks to a CRF layer. The BI-LSTM-CRF model can produce state of the art (or close to) accuracy on POS, chunking and NER data sets. In addition, it is robust and has less dependence on word embedding as compared to previous observations.
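The abstract's point about the CRF layer using sentence-level tag information can be illustrated with a small decoding sketch. This is not the paper's implementation: it is a minimal, pure-Python Viterbi decoder under the assumption that a BI-LSTM has already produced per-token tag scores (`emissions`) and that the CRF layer contributes a tag-to-tag score matrix (`transitions`). The function name and both parameter names are hypothetical.

```python
from typing import List, Sequence


def viterbi_decode(
    emissions: Sequence[Sequence[float]],
    transitions: Sequence[Sequence[float]],
) -> List[int]:
    """Return the highest-scoring tag sequence (hypothetical helper).

    emissions[t][j]   : score of tag j at position t (assumed BI-LSTM output)
    transitions[i][j] : score of moving from tag i to tag j (assumed CRF parameters)
    """
    if not emissions:
        return []
    num_tags = len(emissions[0])
    # Best score of any path ending in each tag at the first position.
    scores = list(emissions[0])
    backpointers: List[List[int]] = []
    for t in range(1, len(emissions)):
        new_scores = []
        pointers = []
        for j in range(num_tags):
            # Pick the previous tag that maximizes path score into tag j.
            best_i = max(range(num_tags), key=lambda i: scores[i] + transitions[i][j])
            new_scores.append(scores[best_i] + transitions[best_i][j] + emissions[t][j])
            pointers.append(best_i)
        scores = new_scores
        backpointers.append(pointers)
    # Trace the best path backwards from the best final tag.
    best_last = max(range(num_tags), key=lambda j: scores[j])
    path = [best_last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Toy example with two tags and three tokens (made-up scores).
emissions = [[3.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
no_transitions = [[0.0, 0.0], [0.0, 0.0]]
penalize_0_to_1 = [[0.0, -10.0], [0.0, 0.0]]

independent_path = viterbi_decode(emissions, no_transitions)
crf_path = viterbi_decode(emissions, penalize_0_to_1)
```

With all-zero transitions, each token simply takes its best-scoring tag. Once the transition from tag 0 to tag 1 is penalized, the decoder prefers a sequence that avoids that transition, which is the sense in which a CRF layer lets tag decisions depend on the rest of the sentence.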



Forward citations

Cited by 18 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. From Text to Voice: A Reproducible and Verifiable Framework for Evaluating Tool Calling LLM Agents

    cs.CL 2026-05 unverdicted novelty 7.0

    A dataset-agnostic framework converts text tool-calling benchmarks to paired audio versions via TTS and noise, showing model-dependent performance with small text-to-voice gaps of 1.8-4.8 points on Confetti and When2Call.

  2. A Convolutional Neural Network-Derived Catalog of Solar Flares from Soft X-Ray Observations

    astro-ph.SR 2026-04 unverdicted novelty 7.0

    The CNN-derived catalog detects over seven times more solar flares than the GOES catalog and extends the power-law distribution of flare peak fluxes to smaller sizes.

  3. Utility-Preserving De-Identification for Math Tutoring: Investigating Numeric Ambiguity in the MathEd-PII Benchmark Dataset

    cs.CL 2026-02 unverdicted novelty 7.0

    The MathEd-PII benchmark shows that math-aware and segment-aware LLM prompting raises PII detection F1 from 0.379 to 0.821 while cutting false redactions of instructional numbers.

  4. From Text to Voice: A Reproducible and Verifiable Framework for Evaluating Tool Calling LLM Agents

    cs.CL 2026-05 unverdicted novelty 6.0

    A dataset-agnostic framework converts text tool-calling benchmarks to paired audio evaluations via TTS, speaker variation and noise, then evaluates seven omni-modal models showing model- and task-dependent performance...

  5. Optimizing Chlorination in Water Distribution Systems via Surrogate-assisted Neuroevolution

    cs.NE 2026-02 unverdicted novelty 6.0

    Surrogate-assisted neuroevolution produces Pareto-optimal chlorine dosing policies for water distribution systems that outperform PPO on four practical objectives.

  6. A Survey on Vision-Language-Action Models for Embodied AI

    cs.RO 2024-05 unverdicted novelty 6.0

    This is the first survey on vision-language-action models, providing a taxonomy across three lines, plus summaries of datasets, simulators, benchmarks, challenges, and future directions in embodied AI.

  7. TabTransformer: Tabular Data Modeling Using Contextual Embeddings

    cs.LG 2020-12 unverdicted novelty 6.0

    TabTransformer uses Transformer self-attention to generate contextual embeddings from categorical features in tabular data, outperforming prior deep learning methods by at least 1% mean AUC and matching tree-based ens...

  8. Approximate Inference in Structured Instances with Noisy Categorical Observations

    cs.LG 2019-06 unverdicted novelty 6.0

    Approximate algorithm for categorical structured inference with noisy observations achieves Hamming error logarithmic in the number of categories, generalizing prior binary-label results.

  9. Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models

    cs.IR 2026-04 unverdicted novelty 5.0

    Prompt chaining with off-the-shelf LLMs outperforms in-context learning and BERT for 1st- and 2nd-level classification on the ORKG taxonomy using the FORC dataset, but struggles at the 3rd level.

  10. A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents

    cs.CL 2026-04 unverdicted novelty 5.0

    MODEE is a multimodal system that integrates graphs with LLM embeddings to outperform prior open-domain event extraction methods on large datasets.

  11. TabEmb: Joint Semantic-Structure Embedding for Table Annotation

    cs.LG 2026-04 unverdicted novelty 5.0

    TabEmb decouples LLM-based semantic column embeddings from graph-based structural modeling to produce joint representations that improve table annotation tasks.

  12. A Multi-head Attention Fusion Network for Industrial Prognostics under Discrete Operational Conditions

    cs.LG 2026-04 unverdicted novelty 5.0

    A multi-head attention fusion network integrates monotonic degradation trends, discrete operating state embeddings from clustering, and residual noise using BiLSTM and attention mechanisms to improve prognostic accura...

  13. Utilizing Pre-trained and Large Language Models for 10-K Items Segmentation

    q-fin.GN 2025-02 unverdicted novelty 5.0

    BERT4ItemSeg reaches macro-F1 of 0.9825 on core 10-K items across 3,737 annotated reports, outperforming GPT4ItemSeg (0.9567) and baselines.

  14. Eliciting Knowledge from Experts: Automatic Transcript Parsing for Cognitive Task Analysis

    cs.CL 2019-06 unverdicted novelty 5.0

    Introduces a weakly-supervised framework partitioning CTA transcript parsing into sequence labeling and text span-pair relation extraction using distant supervision from protocols and neighbor sentences for long-range...

  15. Beyond the Basics: Leveraging Large Language Model for Fine-Grained Medical Entity Recognition

    cs.AI 2026-04 conditional novelty 4.0

    Fine-tuned LLaMA3 with LoRA reaches 81.24% F1 on 18-category fine-grained medical entity recognition, beating zero-shot by 63.11% and few-shot by 35.63%.

  16. A Multi-modal Fusion Network for Star-Galaxy Classification from CSST Simulated Datasets

    astro-ph.IM 2026-04 unverdicted novelty 4.0

    A ResNet-50 and BiLSTM multi-modal fusion network achieves 99.81% galaxy recall and 99.66% star recall on a CSST simulated dataset of 125,896 objects.

  17. Short Text Conversation Based on Deep Neural Network and Analysis on Evaluation Measures

    cs.CL 2019-07 unverdicted novelty 4.0

    Hierarchical DNN models with BERT outperform prior models on DQ and ND subtasks using non-traditional metrics NMD, RSNOD, JSD, and RNSS, plus analysis of traditional metrics.

  18. Rare Disease Detection by Sequence Modeling with Generative Adversarial Networks

    cs.LG 2019-07 unverdicted novelty 4.0

    A GAN-boosted RNN model reaches 0.56 PR-AUC for rare EPI detection on 1.8 million patients and outperforms benchmarks.