James Zou

Identifiers

name variant James Zou 0.60 · backfill

Papers (81)

Benchmarking AI Agents for Addressing Scientific Challenges Across Scales cs.AI · 2026 · author #31
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories cs.CV · 2026 · author #6
Harnessing the Collective Intelligence of AI Agents in the Wild for New Discoveries cs.CL · 2026 · author #4
On the Relationship Between Activation Outliers and Feature Death in Sparse Autoencoders cs.LG · 2026 · author #3
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction cs.CV · 2026 · author #13
ReasonOps: Operator Segmentation for LLM Reasoning Traces cs.AI · 2026 · author #3
Automated Benchmark Auditing for AI Agents and Large Language Models cs.CL · 2026 · author #7
Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis cs.LG · 2026 · author #3
Evaluating Commercial AI Chatbots as News Intermediaries cs.CL · 2026 · author #8
Forecasting Scientific Progress with Artificial Intelligence cs.AI · 2026 · author #9
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration cs.AI · 2026 · author #32
Voice "Cloning" is Style Transfer cs.SD · 2026 · author #6
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders cs.LG · 2026 · author #12
TERMS-Bench: Diagnosing LLM Negotiation Agents Beyond Deal Rate cs.GT · 2026 · author #8
Unlocking LLM Creativity in Science through Analogical Reasoning cs.AI · 2026 · author #3
A Versatile AI Agent for Rare Disease Diagnosis and Risk Gene Prioritization cs.AI · 2026 · author #12
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction cs.IR · 2026 · author #14
Recursive Multi-Agent Systems cs.AI · 2026 · author #12
Evaluation-driven Scaling for Scientific Discovery cs.LG · 2026 · author #24
Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration cs.AI · 2026 · author #6
Introspective Diffusion Language Models cs.AI · 2026 · author #13
Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution cs.AI · 2026 · author #18
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents cs.AI · 2026 · author #11
The Price Reversal Phenomenon: When Cheaper Reasoning Models Cost More cs.CL · 2026 · author #6
Test-Time Optimization of Physical Query Plans with LLMs cs.DB · 2026 · author #6
Multi-Agent Teams Hold Experts Back cs.MA · 2026 · author #7
Sparse Reward Subsystem in Large Language Models cs.CL · 2026 · author #3
Textual Equilibrium Propagation for Deep Compound AI Systems cs.LG · 2026 · author #3
Learning to Discover at Test Time cs.LG · 2026 · author #9
Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice cs.LG · 2025 · author #4
Latent Collaboration in Multi-Agent Systems cs.CL · 2025 · author #11
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models cs.LG · 2025 · author #12
Impatient Users Confuse AI Agents: High-fidelity Simulations of Human Traits for Testing Agents cs.AI · 2025 · author #5
ACT: Agentic Classification Tree cs.LG · 2025 · author #5
Advancing AI Research Assistants with Expert-Involved Learning cs.AI · 2025 · author #29
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning cs.LG · 2025 · author #6
D-Flow: Multi-modality Flow Matching for D-peptide Design cs.CE · 2024 · author #6
TextGrad: Automatic "Differentiation" via Text cs.CL · 2024 · author #7
Mixture-of-Agents Enhances Large Language Model Capabilities cs.CL · 2024 · author #5
TrustLLM: Trustworthiness in Large Language Models cs.CL · 2024 · author #32
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance cs.LG · 2023 · author #3
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #176
Gradio: Hassle-Free Sharing and Testing of ML Models in the Wild cs.LG · 2019 · author #6
Discovering Conditionally Salient Features with Statistical Guarantees stat.ML · 2019 · author #2
A Knowledge Graph-based Approach for Exploring the U.S. Opioid Epidemic cs.CY · 2019 · author #6
Data Shapley: Equitable Valuation of Data for Machine Learning stat.ML · 2019 · author #2
Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings cs.CL · 2019 · author #4
Contrastive Variational Autoencoder Enhances Salient Features cs.LG · 2019 · author #2
Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits stat.ME · 2019 · author #2
Concrete Autoencoders for Differentiable Feature Selection and Reconstruction cs.LG · 2019 · author #3
Large-scale Generative Modeling to Improve Automated Veterinary Disease Coding cs.CL · 2018 · author #3
Minimizing Close-k Aggregate Loss Improves Classification cs.LG · 2018 · author #2
Contrastive Multivariate Singular Spectrum Analysis stat.ML · 2018 · author #3
Improving the Stability of the Knockoff Procedure: Multiple Simultaneous Knockoffs and Entropy Maximization stat.ML · 2018 · author #2
Autowarp: Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoders cs.LG · 2018 · author #2
Knockoffs for the mass: new feature importance statistics with false discovery guarantees stat.ML · 2018 · author #3
DeepTag: inferring all-cause diagnoses from clinical notes in under-resourced medical domain cs.CL · 2018 · author #7
Multiaccuracy: Black-Box Post-Processing for Fairness in Classification cs.LG · 2018 · author #3
Feedback GAN (FBGAN) for DNA: a Novel Feedback-Loop Architecture for Optimizing Protein Functions q-bio.GN · 2018 · author #2
Stochastic EM for Shuffled Linear Regression stat.ML · 2018 · author #2
CoVeR: Learning Covariate-Specific Vector Representations with Tensor Decompositions cs.CL · 2018 · author #3
Word Embeddings Quantify 100 Years of Gender and Ethnic Stereotypes cs.CL · 2017 · author #4
NeuralFDR: Learning Discovery Thresholds from Hypothesis Features stat.ME · 2017 · author #3
Interpretation of Neural Networks is Fragile stat.ML · 2017 · author #3
The Effects of Memory Replay in Reinforcement Learning cs.AI · 2017 · author #2
Contrastive Principal Component Analysis stat.ML · 2017 · author #4
Why Adaptively Collected Data Have Negative Bias and How to Correct for It stat.ML · 2017 · author #4
Estimating the unseen from multiple populations cs.LG · 2017 · author #3
Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context cs.CL · 2017 · author #5
Linear Regression with Shuffled Labels stat.ML · 2017 · author #3
Signal to noise in matching markets cs.GT · 2016 · author #2
Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings cs.CL · 2016 · author #3
Contingent Payment Mechanisms for Resource Utilization cs.GT · 2016 · author #4
Quantifying and Reducing Stereotypes in Word Embeddings cs.CL · 2016 · author #3
Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation stat.ML · 2016 · author #2
Quantifying the accuracy of approximate diffusions and Markov chains math.ST · 2016 · author #2
Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation stat.ML · 2016 · author #2
Rich Component Analysis cs.LG · 2015 · author #2
Incentive-Compatible Experimental Design stat.ME · 2015 · author #4
Intersecting Faces: Non-negative Matrix Factorization With New Guarantees cs.LG · 2015 · author #2
Mechanism Design for Time Critical and Cost Critical Task Execution via Crowdsourcing cs.GT · 2012 · author #5

Mentions

2606.12736 #31 · arxiv_oai · confidence 0.70 James Zou
2606.11176 #6 · arxiv_oai · confidence 0.70 James Zou
2606.10402 #4 · arxiv_oai · confidence 0.70 James Zou
2510.04491 #5 · arxiv_oai · confidence 0.70 James Zou
1507.03867 #2 · backfill · confidence 0.70 James Zou
1507.03063 #4 · backfill · confidence 0.70 James Zou
1507.02189 #2 · backfill · confidence 0.70 James Zou
2602.10387 #6 · arxiv_oai · confidence 0.70 James Zou
2511.20639 #11 · arxiv_oai · confidence 0.70 James Zou
2605.31518 #3 · arxiv_oai · confidence 0.70 James Zou
2602.01011 #7 · arxiv_oai · confidence 0.70 James Zou
2605.29341 #13 · arxiv_oai · confidence 0.70 James Zou
2605.29192 #3 · arxiv_oai · confidence 0.70 James Zou
2603.23971 #6 · arxiv_oai · confidence 0.70 James Zou
2605.26079 #7 · arxiv_oai · confidence 0.70 James Zou
2605.24162 #3 · arxiv_oai · confidence 0.70 James Zou
1208.1676 #5 · backfill · confidence 0.70 James Zou
2605.22785 #8 · arxiv_oai · confidence 0.70 James Zou
2605.22681 #9 · arxiv_oai · confidence 0.70 James Zou
2605.20025 #32 · arxiv_oai · confidence 0.70 James Zou
2605.16578 #6 · arxiv_oai · confidence 0.70 James Zou
2605.13930 #12 · arxiv_oai · confidence 0.70 James Zou
2401.05561 #32 · arxiv_oai · confidence 0.70 James Zou
2406.04692 #5 · arxiv_oai · confidence 0.70 James Zou
2601.16175 #9 · arxiv_oai · confidence 0.70 James Zou

Frequent Coauthors

Abubakar Abid 10 shared papers
Federico Bianchi 7 shared papers
Pan Lu 7 shared papers
Amirata Ghorbani 4 shared papers
Kai-Wei Chang 4 shared papers
Yejin Choi 4 shared papers
Adam Kalai 3 shared papers
Allen Nie 3 shared papers
Aneesh Pappu 3 shared papers
Ben Athiwaratkun 3 shared papers
Dan Jurafsky 3 shared papers
Hongyu Zhao 3 shared papers
Hua Xu 3 shared papers
Jaime Roquero Gimenez 3 shared papers
Martin J. Zhang 3 shared papers
Mert Yuksekgonul 3 shared papers
Rahul Thapa 3 shared papers
Sheng Liu 3 shared papers
Tianyu Liu 3 shared papers
Yongchan Kwon 3 shared papers