pith. machine review for the scientific record. sign in

hub

Trustllm: Trustworthiness in large language models.arXiv preprint arXiv:2401.05561

16 Pith papers cite this work. Polarity classification is still indexing.

16 Pith papers citing it

hub tools

citation-role summary

background 2 dataset 1 other 1

citation-polarity summary

representative citing papers

A Multi-Dimensional Audit of Politically Aligned Large Language Models

cs.CL · 2026-04-27 · unverdicted · novelty 4.0

A multi-dimensional audit framework for politically aligned LLMs finds consistent trade-offs: larger models are more effective and truthful but less fair with higher bias, while fine-tuned models reduce bias but increase hallucinations and reasoning decline, and all tested models show deficiencies.

Large Language Models: A Survey

cs.CL · 2024-02-09 · accept · novelty 3.0

The paper surveys key large language models, their training methods, datasets, evaluation benchmarks, and future research directions in the field.

citing papers explorer

Showing 16 of 16 citing papers.