pith. sign in

hub Mixed citations

Datacomp: In search of the next generation of multimodal datasets

Mixed citation behavior. Most common role is background (40%).

19 Pith papers citing it
Background 40% of classified citations

hub tools

citation-role summary

background 3 baseline 1 other 1

citation-polarity summary

clear filters

representative citing papers

Objaverse-XL: A Universe of 10M+ 3D Objects

cs.CV · 2023-07-11 · accept · novelty 7.0

Objaverse-XL supplies over 10 million diverse 3D objects that, when used to render 100 million views, improve zero-shot novel-view synthesis in models such as Zero123.

Prior-Aligned Data Cleaning for Tabular Foundation Models

cs.LG · 2026-04-28 · unverdicted · novelty 6.0

L2C2 is a deep RL framework that learns to clean tabular data by aligning it to the synthetic prior of tabular foundation models, yielding higher accuracy on some benchmarks and cross-dataset policy transfer.

QuiLL: An LLM-Based Vulnerability Assessment Framework for the Wild

cs.CR · 2025-10-05 · unverdicted · novelty 6.0

QuiLL is a new evaluation pipeline that uses optimized LLM prompts, dynamic in-context learning from an NVD vector store, and a novel accuracy-plus-reasoning metric to benchmark vulnerability detection in real code.

TuringViT: Making SOTA Vision Transformers Accessible to All

cs.CV · 2026-06-23 · unverdicted · novelty 5.0

TuringViT claims a new ViT design with linear attention and curated data that matches SOTA performance using 10% of typical pretraining data while supporting dynamic resolutions and improving VLM integration.

Instrumented data for causal scientific machine learning

cs.LG · 2026-06-05 · unverdicted · novelty 5.0

Instrumented data augments observations with mechanistic models, uncertainty, and counterfactuals to enable causal interventions via Pearl's do-operator in scientific machine learning.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Prior-Aligned Data Cleaning for Tabular Foundation Models cs.LG · 2026-04-28 · unverdicted · none · ref 11

    L2C2 is a deep RL framework that learns to clean tabular data by aligning it to the synthetic prior of tabular foundation models, yielding higher accuracy on some benchmarks and cross-dataset policy transfer.