AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

Qingyun Wu et al. · 2023 · DOI 10.48550/arxiv.2308

9 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background: 3 · dataset: 1

citation-polarity summary

background: 3 · use dataset: 1

representative citing papers

SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models

cs.SE · 2026-05-07 · unverdicted · novelty 7.0

SiblingRepair uses LLMs with semantic sibling detection and simultaneous/iterative repair strategies to outperform prior multi-hunk APR tools like Hercules on Defects4J and GHRB benchmarks.

AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe

cs.MM · 2026-04-22 · unverdicted · novelty 7.0

AttentionBender applies 2D transforms to cross-attention maps in video diffusion transformers, producing distributed distortions and glitch aesthetics that reveal entangled attention mechanisms while serving as both an XAI probe and creative tool.

An Agentic Evaluation Architecture for Historical Bias Detection in Educational Textbooks

cs.AI · 2026-04-09 · unverdicted · novelty 7.0

An agentic architecture with multimodal screening, a five-agent jury, meta-synthesis, and source attribution protocol detects biases in Romanian history textbooks more accurately than zero-shot baselines, achieving 83.3% acceptable excerpts and human preference in 64.8% of blind comparisons.

Compass vs Railway Tracks: Unpacking User Mental Models for Communicating Long-Horizon Work to Humans vs. AI

cs.HC · 2026-01-17 · unverdicted · novelty 7.0

Users treat human delegation for long tasks as a flexible compass but AI delegation as rigid railway tracks due to perceived AI limitations in inference and judgment.

Code-Centric Detection of Vulnerability-Fixing Commits: A Unified Benchmark and Empirical Study

cs.SE · 2026-05-13 · accept · novelty 6.0

Code language models show no transferable security understanding from code diffs alone, rely on commit messages, miss over 93% of fixes at 0.5% false positive rate, and suffer large drops under group or temporal splits.

High resolution large working distance scanning helium microscopy

physics.optics · 2026-05-19 · accept · novelty 5.0

Sub-micron resolution (340 nm beamwidth) achieved in large-working-distance pinhole scanning helium microscopy through constrained optimization of atom optics, redesigned pinhole plate, smaller pinhole, increased source distance, and larger detector aperture.

A Demonstration of SQLyzr: A Platform for Fine-Grained Text-to-SQL Evaluation and Analysis

cs.DB · 2026-04-23 · unverdicted · novelty 5.0

SQLyzr is a new evaluation platform that adds diverse metrics, realistic settings, query classification, and analysis features to overcome the single-score limitations of existing text-to-SQL benchmarks.

AI-Enabled Serious Games: Integrating Intelligence and Adaptivity in Training Systems

cs.AI · 2026-05-21 · unverdicted · novelty 3.0

The chapter synthesizes the history of adaptive learning systems and examines how AI can provide instructional intelligence and real-time adaptivity in serious games while highlighting challenges such as explainability and limited long-term outcome data.

Escaping Mode Collapse in LLM Generation via Geometric Regulation

cs.CL · 2026-05-01

citing papers explorer

Showing 9 of 9 citing papers.

SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models · cs.SE · 2026-05-07 · unverdicted · none · ref 53
AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe · cs.MM · 2026-04-22 · unverdicted · none · ref 11
An Agentic Evaluation Architecture for Historical Bias Detection in Educational Textbooks · cs.AI · 2026-04-09 · unverdicted · none · ref 3
Compass vs Railway Tracks: Unpacking User Mental Models for Communicating Long-Horizon Work to Humans vs. AI · cs.HC · 2026-01-17 · unverdicted · none · ref 23
Code-Centric Detection of Vulnerability-Fixing Commits: A Unified Benchmark and Empirical Study · cs.SE · 2026-05-13 · accept · none · ref 47
High resolution large working distance scanning helium microscopy · physics.optics · 2026-05-19 · accept · none · ref 42
A Demonstration of SQLyzr: A Platform for Fine-Grained Text-to-SQL Evaluation and Analysis · cs.DB · 2026-04-23 · unverdicted · none · ref 6
AI-Enabled Serious Games: Integrating Intelligence and Adaptivity in Training Systems · cs.AI · 2026-05-21 · unverdicted · none · ref 101
Escaping Mode Collapse in LLM Generation via Geometric Regulation · cs.CL · 2026-05-01 · unreviewed · ref 9
