Title resolution pending

Devlin, Jacob, Chang, Ming-Wei, Lee, Kenton, Toutanova, Kristina , booktitle=

19 Pith papers cite this work. Polarity classification is still indexing.

19 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

cs.CL · 2022-02-25 · accept · novelty 8.0

Randomly replacing labels in in-context demonstrations barely hurts performance, showing that label space, input distribution, and sequence format drive in-context learning more than ground-truth labels.

Beyond Square Roots: Explicit Memory-Efficient Factorization for Multi-Epoch Private Learning

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

γ-BIFR unifies DP-λCGD and BISR into a single banded inverse factorization that improves RMSE and theoretical guarantees in the low-bandwidth regime for multi-epoch private learning.

NeuralBench: A Unifying Framework to Benchmark NeuroAI Models

cs.LG · 2026-05-08 · conditional · novelty 7.0

NeuralBench is a new benchmarking framework for neuroAI models on EEG data that finds foundation models only marginally outperform task-specific ones while many tasks like cognitive decoding stay highly challenging.

Rethinking the Rank Threshold for LoRA Fine-Tuning

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

For binary classification in the NTK regime, LoRA rank r=1 suffices and is often optimal under cross-entropy loss, reducing the prior sufficient condition from r>=12.

StyleShield: Exposing the Fragility of AIGC Detectors through Continuous Controllable Style Transfer

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

StyleShield uses flow matching in continuous token embeddings with a DiT backbone to achieve 94.6% evasion on trained detectors and over 99% on unseen ones in Chinese benchmarks, with 0.928 semantic similarity, plus a RateAudit method to arbitrarily control detection rates.

STELLAR: Scaling 3D Perception Large Models for Autonomous Driving

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

STELLAR trains up to 500M-parameter multi-modal models on 50M driving scenes and reports empirical scaling trends plus new state-of-the-art results on the Waymo Open Dataset.

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

cs.AI · 2026-05-15 · unverdicted · novelty 6.0

Multi-agent LLM systems discover new Transformer and hybrid architectures that outperform Llama 3.2 at 1B scale and approach human SOTA on long-range benchmarks.

Is Data Shapley Not Better than Random in Data Selection? Ask NASH

cs.LG · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

NASH decomposes the validation utility into Shapley-informative component functions and aggregates them non-linearly to make Data Shapley-based data selection consistently effective.

Power Distribution Bridges Sampling, Self-Reward RL, and Self-Distillation

cs.LG · 2026-05-06 · unverdicted · novelty 6.0

The power distribution is the target of power sampling, the closed-form solution to self-reward KL-regularized RL, and the basis for power self-distillation that matches sampling performance at lower cost.

Structure-guided molecular design with contrastive 3D protein-ligand learning

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

An SE(3)-equivariant transformer encodes 3D protein-ligand interactions via contrastive learning for zero-shot virtual screening, and these embeddings condition a multimodal chemical language model to autoregressively generate target-specific molecules with favorable predicted binding properties.

DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models

cs.LG · 2023-09-25 · accept · novelty 6.0

DeepSpeed-Ulysses keeps communication volume constant for sequence-parallel attention when sequence length and device count scale together, delivering 2.5x faster training on 4x longer sequences than prior SOTA.

Monetary Policy in the Media Spotlight: Sentiments, Signals, and Economic Impact

econ.EM · 2026-05-14 · unverdicted · novelty 5.0

Media sentiment indicators from Canadian news, when added to a New Keynesian model with endogenous central-bank response, improve out-of-sample forecasts and account for part of monetary-policy propagation to output and prices.

MaskTab: Scalable Masked Tabular Pretraining with Scaling Laws and Distillation for Industrial Classification

cs.LG · 2026-05-12 · unverdicted · novelty 5.0

MaskTab is a masked pretraining method for industrial tabular data that delivers measurable gains in classification AUC and KS metrics while enabling effective distillation to smaller models.

Emergent Semantic Role Understanding in Language Models

cs.AI · 2026-05-09 · unverdicted · novelty 5.0

Semantic role understanding partially emerges during language model pre-training, with linear probes on frozen representations achieving substantial performance that improves with scale but does not match fine-tuned models, and representations shifting toward more distributed forms at larger scales.

Benchmarking Wireless Representations: High-Dimensional vs. Compressed Embeddings for Efficiency and Robustness

eess.SP · 2026-05-03 · unverdicted · novelty 5.0

High-dimensional embeddings excel in few-shot regimes for some wireless tasks but carry high latency and parameter costs, whereas compressed autoencoder representations provide better noise robustness, stability, and efficiency.

Accurate, Efficient, and Explainable Deep Learning Approaches for Environmental Science Problems

cs.LG · 2026-05-19 · unverdicted · novelty 4.0

The work introduces WaLeF/FIDLAr for flood forecasting, CoDiCast for probabilistic weather, and Hypercube-RAG for explainable environmental QA, claiming superior accuracy, efficiency, and interpretability over baselines.

Read, Extract, Classify: A Tool for Smarter Requirements Engineering

cs.SE · 2026-05-11 · unverdicted · novelty 3.0

ReXCL automates extraction of requirements into a schema and their classification via adaptive fine-tuning of encoder models to improve efficiency and accuracy in software development.

Multilingual and Multimodal LLMs in the Wild: Building for Low-Resource Languages

cs.CL · 2026-05-16 · unverdicted · novelty 2.0

A tutorial synthesizing foundations, recent models such as PALO and Maya, and low-cost methods for tri-modal multilingual AI in resource-constrained settings.

PSK@EEUCA 2026: Fine-Tuning Large Language Models with Synthetic Data Augmentation for Multi-Class Toxicity Detection in Gaming Chat

cs.CL · 2026-05-08 · unverdicted · novelty 2.0

Llama 3.1 8B fine-tuned with calibrated 5% synthetic data augmentation reaches 0.6234 F1-macro on multi-class toxicity detection in gaming chat and places fourth among 35 teams.

citing papers explorer

Showing 19 of 19 citing papers.

Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? cs.CL · 2022-02-25 · accept · none · ref 171
Randomly replacing labels in in-context demonstrations barely hurts performance, showing that label space, input distribution, and sequence format drive in-context learning more than ground-truth labels.
Beyond Square Roots: Explicit Memory-Efficient Factorization for Multi-Epoch Private Learning cs.LG · 2026-05-18 · unverdicted · none · ref 78
γ-BIFR unifies DP-λCGD and BISR into a single banded inverse factorization that improves RMSE and theoretical guarantees in the low-bandwidth regime for multi-epoch private learning.
NeuralBench: A Unifying Framework to Benchmark NeuroAI Models cs.LG · 2026-05-08 · conditional · none · ref 201
NeuralBench is a new benchmarking framework for neuroAI models on EEG data that finds foundation models only marginally outperform task-specific ones while many tasks like cognitive decoding stay highly challenging.
Rethinking the Rank Threshold for LoRA Fine-Tuning cs.LG · 2026-05-05 · unverdicted · none · ref 23
For binary classification in the NTK regime, LoRA rank r=1 suffices and is often optimal under cross-entropy loss, reducing the prior sufficient condition from r>=12.
StyleShield: Exposing the Fragility of AIGC Detectors through Continuous Controllable Style Transfer cs.LG · 2026-04-30 · unverdicted · none · ref 13
StyleShield uses flow matching in continuous token embeddings with a DiT backbone to achieve 94.6% evasion on trained detectors and over 99% on unseen ones in Chinese benchmarks, with 0.928 semantic similarity, plus a RateAudit method to arbitrarily control detection rates.
STELLAR: Scaling 3D Perception Large Models for Autonomous Driving cs.CV · 2026-05-19 · unverdicted · none · ref 25
STELLAR trains up to 500M-parameter multi-modal models on 50M driving scenes and reports empirical scaling trends plus new state-of-the-art results on the Waymo Open Dataset.
Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design cs.AI · 2026-05-15 · unverdicted · none · ref 45
Multi-agent LLM systems discover new Transformer and hybrid architectures that outperform Llama 3.2 at 1B scale and approach human SOTA on long-range benchmarks.
Is Data Shapley Not Better than Random in Data Selection? Ask NASH cs.LG · 2026-05-11 · unverdicted · none · ref 16 · 2 links
NASH decomposes the validation utility into Shapley-informative component functions and aggregates them non-linearly to make Data Shapley-based data selection consistently effective.
Power Distribution Bridges Sampling, Self-Reward RL, and Self-Distillation cs.LG · 2026-05-06 · unverdicted · none · ref 101
The power distribution is the target of power sampling, the closed-form solution to self-reward KL-regularized RL, and the basis for power self-distillation that matches sampling performance at lower cost.
Structure-guided molecular design with contrastive 3D protein-ligand learning cs.LG · 2026-04-21 · unverdicted · none · ref 40
An SE(3)-equivariant transformer encodes 3D protein-ligand interactions via contrastive learning for zero-shot virtual screening, and these embeddings condition a multimodal chemical language model to autoregressively generate target-specific molecules with favorable predicted binding properties.
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models cs.LG · 2023-09-25 · accept · none · ref 138
DeepSpeed-Ulysses keeps communication volume constant for sequence-parallel attention when sequence length and device count scale together, delivering 2.5x faster training on 4x longer sequences than prior SOTA.
Monetary Policy in the Media Spotlight: Sentiments, Signals, and Economic Impact econ.EM · 2026-05-14 · unverdicted · none · ref 72
Media sentiment indicators from Canadian news, when added to a New Keynesian model with endogenous central-bank response, improve out-of-sample forecasts and account for part of monetary-policy propagation to output and prices.
MaskTab: Scalable Masked Tabular Pretraining with Scaling Laws and Distillation for Industrial Classification cs.LG · 2026-05-12 · unverdicted · none · ref 29
MaskTab is a masked pretraining method for industrial tabular data that delivers measurable gains in classification AUC and KS metrics while enabling effective distillation to smaller models.
Emergent Semantic Role Understanding in Language Models cs.AI · 2026-05-09 · unverdicted · none · ref 26
Semantic role understanding partially emerges during language model pre-training, with linear probes on frozen representations achieving substantial performance that improves with scale but does not match fine-tuned models, and representations shifting toward more distributed forms at larger scales.
Benchmarking Wireless Representations: High-Dimensional vs. Compressed Embeddings for Efficiency and Robustness eess.SP · 2026-05-03 · unverdicted · none · ref 1
High-dimensional embeddings excel in few-shot regimes for some wireless tasks but carry high latency and parameter costs, whereas compressed autoencoder representations provide better noise robustness, stability, and efficiency.
Accurate, Efficient, and Explainable Deep Learning Approaches for Environmental Science Problems cs.LG · 2026-05-19 · unverdicted · none · ref 237
The work introduces WaLeF/FIDLAr for flood forecasting, CoDiCast for probabilistic weather, and Hypercube-RAG for explainable environmental QA, claiming superior accuracy, efficiency, and interpretability over baselines.
Read, Extract, Classify: A Tool for Smarter Requirements Engineering cs.SE · 2026-05-11 · unverdicted · none · ref 4
ReXCL automates extraction of requirements into a schema and their classification via adaptive fine-tuning of encoder models to improve efficiency and accuracy in software development.
Multilingual and Multimodal LLMs in the Wild: Building for Low-Resource Languages cs.CL · 2026-05-16 · unverdicted · none · ref 151
A tutorial synthesizing foundations, recent models such as PALO and Maya, and low-cost methods for tri-modal multilingual AI in resource-constrained settings.
PSK@EEUCA 2026: Fine-Tuning Large Language Models with Synthetic Data Augmentation for Multi-Class Toxicity Detection in Gaming Chat cs.CL · 2026-05-08 · unverdicted · none · ref 7
Llama 3.1 8B fine-tuned with calibrated 5% synthetic data augmentation reaches 0.6234 F1-macro on multi-class toxicity detection in gaming chat and places fourth among 35 teams.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer