hub

Language models are few-shot learners

· 1901

27 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 4

citation-polarity summary

background 4

representative citing papers

QLAM: A Quantum Long-Attention Memory Approach to Long-Sequence Token Modeling

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

QLAM extends state-space models with quantum superposition in the hidden state for linear-time long-sequence modeling and reports consistent gains over RNN and transformer baselines on sequential image tasks.

LoRM: Learning the Language of Rotating Machinery for Self-Supervised Condition Monitoring

cs.CL · 2026-04-07 · unverdicted · novelty 7.0

LoRM is a self-supervised framework that models multi-modal rotating machinery signals as token sequences for prediction with fine-tuned language models, using prediction errors to monitor machine health in real time.

Building Deep Graph Predictors with Graph Imitation Learning

cs.CV · 2026-01-21 · unverdicted · novelty 7.0

GRAIL trains graph predictors via imitation learning by modeling generation as sequential decisions on partial graph embeddings, matching or exceeding prior methods on 18 benchmarks.

Debate-Enhanced Pseudo Labeling and Frequency-Aware Progressive Debiasing for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations

cs.CV · 2025-12-23 · unverdicted · novelty 7.0

D³ETOR combines debate-enhanced pseudo labeling from SAM with frequency-aware progressive debiasing in FADeNet to achieve state-of-the-art weakly-supervised camouflaged object detection using scribbles.

BELIEF: Structured Evidence Modeling and Uncertainty-Aware Fusion for Biomedical Question Answering

cs.CL · 2026-05-17 · unverdicted · novelty 6.0

BELIEF improves closed-set biomedical QA by converting documents to structured evidence objects and fusing D-S symbolic belief estimation with LLM inference through reliability-aware arbitration.

PrivScope: Task-scoped Disclosure Control for Hybrid Agentic Systems

cs.CR · 2026-05-15 · unverdicted · novelty 6.0

PrivScope enforces task-scoped disclosure at the local-cloud boundary in hybrid agents, eliminating profile leakage and halving re-identification risk on medical workflows while preserving task success.

FusionCIM: Accelerating LLM Inference with Fusion-Driven Computing-in-Memory Architecture

cs.AR · 2026-04-28 · unverdicted · novelty 6.0

FusionCIM is a fusion-driven CIM accelerator for LLM inference that maps QK^T to IP-CIM and PV to OP-CIM, uses QO-stationary dataflow, and applies pattern-aware online softmax, delivering up to 3.86x energy savings and 1.98x speedup on LLaMA-3 at 29.4 TOPS/W.

Decoupled Travel Planning with Behavior Forest

cs.LG · 2026-04-23 · unverdicted · novelty 6.0

Behavior Forest decouples multi-constraint travel planning into parallel behavior trees with LLM nodes and global coordination, yielding 6.67% and 11.82% gains over prior methods on two benchmarks.

PARM: Pipeline-Adapted Reward Model

cs.AI · 2026-04-20 · unverdicted · novelty 6.0

PARM adapts reward models to multi-stage LLM pipelines via pipeline data and direct preference optimization, improving execution rate and solving accuracy on optimization benchmarks and showing transfer to GSM8K.

Training Time Prediction for Mixed Precision-based Distributed Training

cs.LG · 2026-04-17 · unverdicted · novelty 6.0

A precision-aware predictor for distributed training time achieves 9.8% MAPE across precision settings, compared to errors up to 147.85% when precision is ignored.

PlanGuard: Defending Agents against Indirect Prompt Injection via Planning-based Consistency Verification

cs.CR · 2026-04-11 · unverdicted · novelty 6.0

PlanGuard cuts indirect prompt injection attack success rate to 0% on the InjecAgent benchmark by verifying agent actions against a user-instruction-only plan while keeping false positives at 1.49%.

SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models

cs.LG · 2026-04-06 · unverdicted · novelty 6.0

SLaB compresses LLM weights via sparse-lowrank-binary decomposition guided by activation-aware scores, achieving up to 36% lower perplexity than prior methods at 50% compression on Llama models.

Hierarchical, Interpretable, Label-Free Concept Bottleneck Model

cs.CV · 2026-04-02 · unverdicted · novelty 6.0

HIL-CBM is a hierarchical label-free concept bottleneck model that improves classification accuracy and explanation quality over prior single-level CBMs using a visual consistency loss and dual heads.

One Prompt, Many Sounds: Modeling Listener Variability in LLM-Based Equalization

cs.SD · 2026-01-14 · unverdicted · novelty 6.0

LLMs using in-context learning and fine-tuning on listener experiment data generate equalization settings that align better with population preferences than random sampling or static presets.

Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models

eess.AS · 2026-01-09 · unverdicted · novelty 6.0

A hybrid two-stage framework pairs a discriminative front-end for interference suppression with a generative decoder-only LM back-end to improve perceptual quality and speaker consistency in target speaker extraction and speech enhancement.

The Command Line GUIde: Graphical Interfaces from Man Pages via AI

cs.HC · 2025-10-01 · unverdicted · novelty 6.0

GUIde uses AI to translate man pages into graphical interface specifications for command line tools, evaluated on a corpus of real commands.

DeRelayL: Sustainable Decentralized Relay Learning

cs.LG · 2026-04-30 · unverdicted · novelty 5.0

DeRelayL is a proposed sustainable decentralized learning paradigm where permissionless participants relay-train and share models via designed incentives, backed by theoretical analysis and simulations.

Budget-Constrained Online Retrieval-Augmented Generation: The Chunk-as-a-Service Model

cs.IR · 2026-04-28 · unverdicted · novelty 5.0

Chunk-as-a-Service with the UCOSA online algorithm enables budget-constrained selection of prompts for chunk enrichment in RAG, outperforming random selection by 52% on a combined performance metric and delivering higher performance-to-budget ratios than standard RaaS.

On the Generalization Properties of Selective State-Space Models for Filtering Tasks for Unknown Systems

eess.SY · 2026-04-26 · unverdicted · novelty 5.0

Selective state-space models achieve online filtering for unknown systems from the same class with generalization bounds derived under appropriate assumptions.

Train the Trainers -- An Agentic AI Framework for Peer-Based Mental Health Support in Battlefield Environments

cs.HC · 2026-03-31 · unverdicted · novelty 5.0

The paper introduces an agentic AI platform to train and support recovered soldiers as peer facilitators providing mental health triage and interventions in austere battlefield environments.

Chinese Short-Form Creative Content Generation via Explanation-Oriented Multi-Objective Optimization

cs.CL · 2025-11-19 · unverdicted · novelty 5.0

MAGIC-HMO is a multi-agent framework that treats Chinese short-form creative NLG as heterogeneous multi-objective optimization over personalized constraints plus explanation reliability and outperforms baselines on a baby-naming benchmark.

LLM as Attention-Informed NTM and Topic Modeling as long-input Generation: Interpretability and long-Context Capability

cs.CL · 2025-10-03 · unverdicted · novelty 5.0

LLMs recover interpretable topic structures via attention and achieve competitive topic modeling performance as long-context generators.

Plausible but Wrong: A case study on Agentic Failures in Astrophysical Workflows

cs.AI · 2026-04-28 · unverdicted · novelty 4.0

CMBAgent achieves high accuracy on well-specified astrophysical tasks with context but generates silent, plausible-yet-incorrect outputs on reasoning-challenging problems, with no self-diagnosis of inconsistencies.

AICCE: AI Driven Compliance Checker Engine

cs.CR · 2026-04-03 · unverdicted · novelty 4.0

AICCE combines RAG-based retrieval of protocol specs with dual LLM pipelines for debate-driven explanations or fast script execution, reporting up to 99% accuracy on IPv6 samples.

citing papers explorer

Showing 27 of 27 citing papers.

QLAM: A Quantum Long-Attention Memory Approach to Long-Sequence Token Modeling cs.LG · 2026-05-13 · unverdicted · none · ref 18
QLAM extends state-space models with quantum superposition in the hidden state for linear-time long-sequence modeling and reports consistent gains over RNN and transformer baselines on sequential image tasks.
LoRM: Learning the Language of Rotating Machinery for Self-Supervised Condition Monitoring cs.CL · 2026-04-07 · unverdicted · none · ref 14
LoRM is a self-supervised framework that models multi-modal rotating machinery signals as token sequences for prediction with fine-tuned language models, using prediction errors to monitor machine health in real time.
Building Deep Graph Predictors with Graph Imitation Learning cs.CV · 2026-01-21 · unverdicted · none · ref 27
GRAIL trains graph predictors via imitation learning by modeling generation as sequential decisions on partial graph embeddings, matching or exceeding prior methods on 18 benchmarks.
Debate-Enhanced Pseudo Labeling and Frequency-Aware Progressive Debiasing for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations cs.CV · 2025-12-23 · unverdicted · none · ref 28
D³ETOR combines debate-enhanced pseudo labeling from SAM with frequency-aware progressive debiasing in FADeNet to achieve state-of-the-art weakly-supervised camouflaged object detection using scribbles.
BELIEF: Structured Evidence Modeling and Uncertainty-Aware Fusion for Biomedical Question Answering cs.CL · 2026-05-17 · unverdicted · none · ref 1
BELIEF improves closed-set biomedical QA by converting documents to structured evidence objects and fusing D-S symbolic belief estimation with LLM inference through reliability-aware arbitration.
PrivScope: Task-scoped Disclosure Control for Hybrid Agentic Systems cs.CR · 2026-05-15 · unverdicted · none · ref 1
PrivScope enforces task-scoped disclosure at the local-cloud boundary in hybrid agents, eliminating profile leakage and halving re-identification risk on medical workflows while preserving task success.
FusionCIM: Accelerating LLM Inference with Fusion-Driven Computing-in-Memory Architecture cs.AR · 2026-04-28 · unverdicted · none · ref 1
FusionCIM is a fusion-driven CIM accelerator for LLM inference that maps QK^T to IP-CIM and PV to OP-CIM, uses QO-stationary dataflow, and applies pattern-aware online softmax, delivering up to 3.86x energy savings and 1.98x speedup on LLaMA-3 at 29.4 TOPS/W.
Decoupled Travel Planning with Behavior Forest cs.LG · 2026-04-23 · unverdicted · none · ref 30
Behavior Forest decouples multi-constraint travel planning into parallel behavior trees with LLM nodes and global coordination, yielding 6.67% and 11.82% gains over prior methods on two benchmarks.
PARM: Pipeline-Adapted Reward Model cs.AI · 2026-04-20 · unverdicted · none · ref 28
PARM adapts reward models to multi-stage LLM pipelines via pipeline data and direct preference optimization, improving execution rate and solving accuracy on optimization benchmarks and showing transfer to GSM8K.
Training Time Prediction for Mixed Precision-based Distributed Training cs.LG · 2026-04-17 · unverdicted · none · ref 14
A precision-aware predictor for distributed training time achieves 9.8% MAPE across precision settings, compared to errors up to 147.85% when precision is ignored.
PlanGuard: Defending Agents against Indirect Prompt Injection via Planning-based Consistency Verification cs.CR · 2026-04-11 · unverdicted · none · ref 1
PlanGuard cuts indirect prompt injection attack success rate to 0% on the InjecAgent benchmark by verifying agent actions against a user-instruction-only plan while keeping false positives at 1.49%.
SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models cs.LG · 2026-04-06 · unverdicted · none · ref 2
SLaB compresses LLM weights via sparse-lowrank-binary decomposition guided by activation-aware scores, achieving up to 36% lower perplexity than prior methods at 50% compression on Llama models.
Hierarchical, Interpretable, Label-Free Concept Bottleneck Model cs.CV · 2026-04-02 · unverdicted · none · ref 13
HIL-CBM is a hierarchical label-free concept bottleneck model that improves classification accuracy and explanation quality over prior single-level CBMs using a visual consistency loss and dual heads.
One Prompt, Many Sounds: Modeling Listener Variability in LLM-Based Equalization cs.SD · 2026-01-14 · unverdicted · none · ref 24
LLMs using in-context learning and fine-tuning on listener experiment data generate equalization settings that align better with population preferences than random sampling or static presets.
Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models eess.AS · 2026-01-09 · unverdicted · none · ref 74
A hybrid two-stage framework pairs a discriminative front-end for interference suppression with a generative decoder-only LM back-end to improve perceptual quality and speaker consistency in target speaker extraction and speech enhancement.
The Command Line GUIde: Graphical Interfaces from Man Pages via AI cs.HC · 2025-10-01 · unverdicted · none · ref 23
GUIde uses AI to translate man pages into graphical interface specifications for command line tools, evaluated on a corpus of real commands.
DeRelayL: Sustainable Decentralized Relay Learning cs.LG · 2026-04-30 · unverdicted · none · ref 54
DeRelayL is a proposed sustainable decentralized learning paradigm where permissionless participants relay-train and share models via designed incentives, backed by theoretical analysis and simulations.
Budget-Constrained Online Retrieval-Augmented Generation: The Chunk-as-a-Service Model cs.IR · 2026-04-28 · unverdicted · none · ref 1
Chunk-as-a-Service with the UCOSA online algorithm enables budget-constrained selection of prompts for chunk enrichment in RAG, outperforming random selection by 52% on a combined performance metric and delivering higher performance-to-budget ratios than standard RaaS.
On the Generalization Properties of Selective State-Space Models for Filtering Tasks for Unknown Systems eess.SY · 2026-04-26 · unverdicted · none · ref 7
Selective state-space models achieve online filtering for unknown systems from the same class with generalization bounds derived under appropriate assumptions.
Train the Trainers -- An Agentic AI Framework for Peer-Based Mental Health Support in Battlefield Environments cs.HC · 2026-03-31 · unverdicted · none · ref 16
The paper introduces an agentic AI platform to train and support recovered soldiers as peer facilitators providing mental health triage and interventions in austere battlefield environments.
Chinese Short-Form Creative Content Generation via Explanation-Oriented Multi-Objective Optimization cs.CL · 2025-11-19 · unverdicted · none · ref 78
MAGIC-HMO is a multi-agent framework that treats Chinese short-form creative NLG as heterogeneous multi-objective optimization over personalized constraints plus explanation reliability and outperforms baselines on a baby-naming benchmark.
LLM as Attention-Informed NTM and Topic Modeling as long-input Generation: Interpretability and long-Context Capability cs.CL · 2025-10-03 · unverdicted · none · ref 13
LLMs recover interpretable topic structures via attention and achieve competitive topic modeling performance as long-context generators.
Plausible but Wrong: A case study on Agentic Failures in Astrophysical Workflows cs.AI · 2026-04-28 · unverdicted · none · ref 1
CMBAgent achieves high accuracy on well-specified astrophysical tasks with context but generates silent, plausible-yet-incorrect outputs on reasoning-challenging problems, with no self-diagnosis of inconsistencies.
AICCE: AI Driven Compliance Checker Engine cs.CR · 2026-04-03 · unverdicted · none · ref 12
AICCE combines RAG-based retrieval of protocol specs with dual LLM pipelines for debate-driven explanations or fast script execution, reporting up to 99% accuracy on IPv6 samples.
VeriInteresting: An Empirical Study of Model Prompt Interactions in Verilog Code Generation cs.AR · 2026-02-04 · unverdicted · none · ref 3
Empirical study identifies patterns in how model classes respond to structured prompts, optimization, and other techniques across two Verilog benchmarks.
Threat Modelling using Domain-Adapted Language Models: Empirical Evaluation and Insights cs.CR · 2026-05-11 · unverdicted · none · ref 35
Domain-adapted LLMs and SLMs do not consistently outperform general models on STRIDE threat classification for 5G, with decoding strategies and model scale affecting validity but gains remaining insufficient for reliable use.
Redefining End-of-Life: Intelligent Automation for Electronics Remanufacturing Systems eess.SY · 2026-04-03 · unverdicted · none · ref 152
A literature review of intelligent automation approaches using robotics, AI, and control for disassembly, inspection, sorting, and reprocessing of end-of-life electronics.
