Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation

Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri , editor = · 2024 · DOI 10.18653/v1/2024.acl-long.375

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Fixing FOLIO and MALLS: Verified Annotations and an LLM-assisted Framework to Focus Human Relabeling

cs.CL · 2026-06-01 · unverdicted · novelty 7.0

Audit finds 36-39% incorrect FOL labels in FOLIO and MALLS; corrections raise LLM accuracy 9-22 points and an LLM-guided review framework achieves 90% dataset quality after checking fewer than 24% of examples.

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

cs.CL · 2026-05-19 · accept · novelty 7.0

LLMEval-Logic is a solver-verified Chinese logical reasoning benchmark with 246 base and 190 hard items that shows frontier LLMs reach only 37.5% hard-item accuracy and 60.16% joint formalization score.

GDPR Auto-Formalization with AI Agents and Human Verification

cs.AI · 2026-04-16 · unverdicted · novelty 6.0

Multi-agent LLMs with human verification can generate formal representations of GDPR provisions, but structured oversight is required to handle legal nuances effectively.

citing papers explorer

Showing 1 of 1 citing paper after filters.

GDPR Auto-Formalization with AI Agents and Human Verification cs.AI · 2026-04-16 · unverdicted · none · ref 29
Multi-agent LLMs with human verification can generate formal representations of GDPR provisions, but structured oversight is required to handle legal nuances effectively.

Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation

fields

years

verdicts

representative citing papers

citing papers explorer