Yi: Open foundation models by 01.ai

author Brown, T · 2020 · arXiv 5724.349588

7 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Single-Sample Black-Box Membership Inference Attack against Vision-Language Models via Cross-modal Semantic Alignment

cs.CV · 2026-05-17 · unverdicted · novelty 7.0

A cross-modal alignment attack achieves AUC 0.821 for single-sample black-box membership inference on VLMs such as LLaVA-1.5 by quantifying image-generated caption similarity.

A Technical Typology of AI Systems in Public Administration

cs.CY · 2026-06-30 · unverdicted · novelty 6.0

The paper defines five AI system categories for public administration and reports that 55% of 91 recent papers leave the system type underspecified while 31% study one type but motivate with another.

A Tabular Schedule Abstraction for Communication-Aware Evaluation of Pipeline-Parallel LLM Training

cs.DC · 2026-05-19 · unverdicted · novelty 6.0

A new tabular abstraction for pipeline schedules shows that communication can reverse the rankings obtained from bubble analysis alone; GPipe and 1F1B are equivalent in runtime, but 1F1B uses less activation memory.

Task Vectors, Learned Not Extracted: Performance Gains and Mechanistic Insight

cs.CL · 2025-09-29 · unverdicted · novelty 6.0

Task vectors learned directly through training outperform extracted task vectors for in-context learning, and the analysis adds mechanistic insight into linear propagation and key attention circuits.

Localizing Task Recognition and Task Learning in In-Context Learning via Attention Head Analysis

cs.CL · 2025-09-29 · unverdicted · novelty 6.0

A new framework using Task Subspace Logit Attribution localizes attention heads specialized for task recognition and task learning in in-context learning, showing they align and rotate hidden states within a task subspace.

MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images

cs.CV · 2025-12-29 · unverdicted · novelty 4.0

Fine-tuned MedGemma outperforms untuned GPT-4 in zero-shot medical image disease classification, achieving 80.37% versus 69.58% mean test accuracy with higher sensitivity for cancer and pneumonia.

Artificial Intelligence for Power-Converter-Rich Electrical Systems: A Review

eess.SY · 2026-06-14 · unverdicted · novelty 2.0

Review of AI applications in power-converter-rich systems across design, control, operations, and governance, highlighting deployment gaps.

citing papers explorer

Showing 7 of 7 citing papers.

Single-Sample Black-Box Membership Inference Attack against Vision-Language Models via Cross-modal Semantic Alignment

cs.CV · 2026-05-17 · unverdicted · none · ref 5

A Technical Typology of AI Systems in Public Administration

cs.CY · 2026-06-30 · unverdicted · none · ref 23

A Tabular Schedule Abstraction for Communication-Aware Evaluation of Pipeline-Parallel LLM Training

cs.DC · 2026-05-19 · unverdicted · none · ref 1

Task Vectors, Learned Not Extracted: Performance Gains and Mechanistic Insight

cs.CL · 2025-09-29 · unverdicted · none · ref 1

Localizing Task Recognition and Task Learning in In-Context Learning via Attention Head Analysis

cs.CL · 2025-09-29 · unverdicted · none · ref 3

MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images

cs.CV · 2025-12-29 · unverdicted · none · ref 14

Artificial Intelligence for Power-Converter-Rich Electrical Systems: A Review

eess.SY · 2026-06-14 · unverdicted · none · ref 45
