Title resolution pending

Wang, Wenxuan; Juluan, Shi; Ling, Zixuan; Chan, Yuk-Kit; Wang, Chaozheng; Lee, Cheryl · 2025 · DOI 10.18653/v1/2025.emnlp-main.1104

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes

cs.AI · 2026-06-01 · unverdicted · novelty 6.0

SMH-Bench supplies 1,100 stratified tasks in a verifiable smart-home simulator to measure LLM performance on explicit control, scheduling, ambiguity, and personalization as environment complexity grows.

Position: Anthropomorphic Misalignment Research Needs Stronger Evidence

cs.CY · 2026-05-29 · unverdicted · novelty 3.0

Position paper calling for stronger evidentiary standards and a diagnostic checklist in anthropomorphic misalignment research.

citing papers explorer

Showing 2 of 2 citing papers.

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes · cs.AI · 2026-06-01 · unverdicted · none · ref 51
Position: Anthropomorphic Misalignment Research Needs Stronger Evidence · cs.CY · 2026-05-29 · unverdicted · none · ref 110
