LLM-Powered Grapheme-to- Phoneme Conversion: Benchmark and Case Study

Mahta Fetrat Qharabagh, Zahra Dehghanian, Hamid R · 2024 · arXiv 2409.08554

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

How Tokenization Limits Phonological Knowledge Representation in Language Models and How to Improve Them

cs.CL · 2026-04-18 · unverdicted · novelty 7.0

Subword tokenization impairs phonological knowledge encoding in LMs, but an IPA-based fine-tuning method restores it with minimal impact on other capabilities.

OLaPh: Optimal Language Phonemizer

cs.CL · 2025-09-24 · conditional · novelty 6.0

Hybrid OLaPh framework outperforms prior G2P baselines on WikiPron while enabling synthetic data for an LLM that generalizes well on out-of-vocabulary terms.

citing papers explorer

Showing 2 of 2 citing papers.

How Tokenization Limits Phonological Knowledge Representation in Language Models and How to Improve Them cs.CL · 2026-04-18 · unverdicted · none · ref 38
Subword tokenization impairs phonological knowledge encoding in LMs, but an IPA-based fine-tuning method restores it with minimal impact on other capabilities.
OLaPh: Optimal Language Phonemizer cs.CL · 2025-09-24 · conditional · none · ref 21
Hybrid OLaPh framework outperforms prior G2P baselines on WikiPron while enabling synthetic data for an LLM that generalizes well on out-of-vocabulary terms.

LLM-Powered Grapheme-to- Phoneme Conversion: Benchmark and Case Study

fields

years

verdicts

representative citing papers

citing papers explorer