hub Canonical reference

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Canonical reference. 83% of citing Pith papers cite this work as background.

12 Pith papers citing it
Background 83% of classified citations

citation-role summary

background 6

citation-polarity summary

background 5 · support 1

representative citing papers

BEAVER: An Efficient Deterministic LLM Verifier

cs.AI · 2025-12-05 · unverdicted · novelty 7.0

BEAVER is the first practical deterministic verifier that maintains sound probability bounds on LLM safety properties using token tries and frontier data structures, finding 2-3x more violations than sampling at 1/10 the compute.

TrustLLM: Trustworthiness in Large Language Models

cs.CL · 2024-01-10 · unverdicted · novelty 5.0

TrustLLM defines eight trustworthiness principles, creates a six-dimension benchmark, and evaluates 16 LLMs, showing that proprietary models generally lead, that some open-source models come close, and that over-calibration can hurt utility.
