Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation

Yang, Yuan, Xiong, Siheng, Payani, Ali, Shareghi, Ehsan, Fekri, Faramarz · 2024 · DOI 10.18653/v1/2024.acl-long.375

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

cs.CL · 2026-05-19 · accept · novelty 7.0

LLMEval-Logic is a solver-verified Chinese logical reasoning benchmark with 246 base and 190 hard items that shows frontier LLMs reach only 37.5% hard-item accuracy and 60.16% joint formalization score.

GDPR Auto-Formalization with AI Agents and Human Verification

cs.AI · 2026-04-16 · unverdicted · novelty 6.0

Multi-agent LLMs with human verification can generate formal representations of GDPR provisions, but structured oversight is required to handle legal nuances effectively.

citing papers explorer

Showing 2 of 2 citing papers.

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

cs.CL · 2026-05-19 · accept · none · ref 7

GDPR Auto-Formalization with AI Agents and Human Verification

cs.AI · 2026-04-16 · unverdicted · none · ref 29
