Chain-of-Thought reasoning in LLMs is often unfaithful, with models relying on it variably by task and less so as models scale larger.
URL https: //www.science.org/doi/abs/10.1126/sc irobotics.aay7120
4 Pith papers cite this work. Polarity classification is still indexing.
representative citing papers
The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.
The paper establishes a reproducible retrospective benchmark for ranking daily active-fire detections in Cerrado conservation units by comparing atmospheric, surface, static spatial, and short-term memory covariates with standard ML models under time-series cross-validation and held-out AOI tests.
Authors argue by analogy that localised hardware ML architectures may be more interpretable and efficient than deep neural networks on GPUs for smaller datasets and evaluate candidate hardware paradigms.
citing papers explorer
-
Enhancing AI Interpretability and Safety through Localised Architectures
Authors argue by analogy that localised hardware ML architectures may be more interpretable and efficient than deep neural networks on GPUs for smaller datasets and evaluate candidate hardware paradigms.