hub

UniXcoder: Unified cross-modal pre-training for code representation

Daya Guo, Shuai Lu, Nan Duan, Yanlin Wang, Ming Zhou, Jian Yin · 2022 · Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) · DOI 10.18653/v1/2022.acl-long.499

18 Pith papers cite this work, alongside 459 external citations. Polarity classification is still indexing.

18 Pith papers citing it

459 external citations · Crossref

open at publisher browse 18 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 3 baseline 1

citation-polarity summary

background 2 baseline 1 support 1

representative citing papers

Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks

cs.SE · 2026-04-06 · conditional · novelty 8.0

The two main benchmarks for LLM instructed code editing over-represent Python, miss common real-world domains and edit types, and have test coverage issues that limit what they measure.

BioDefect: The First Dataset for Defect Detection in Bioinformatics Software

cs.SE · 2026-05-20 · unverdicted · novelty 7.0

BioDefect is a new dataset for defect detection in bioinformatics software that improves average F1-scores by 29.61% to 38.04% over existing datasets when evaluated on nine language models.

Deep Graph-Language Fusion for Structure-Aware Code Generation

cs.SE · 2026-05-05 · unverdicted · novelty 7.0

CGFuse enables deep token-level fusion of graph-derived structural features into language models, yielding 10-16% BLEU and 6-11% CodeBLEU gains on code generation tasks.

Parallel-SFT: Improving Zero-Shot Cross-Programming-Language Transfer for Code RL

cs.CL · 2026-04-22 · unverdicted · novelty 7.0

Parallel-SFT mixes parallel programs across languages during SFT to produce more transferable RL initializations, yielding better zero-shot generalization to unseen programming languages.

RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems

cs.CL · 2023-06-05 · unverdicted · novelty 7.0

RepoBench is a new benchmark with retrieval, completion, and pipeline tasks to evaluate code auto-completion systems on entire repositories instead of single files.

Do not copy and paste! Rewriting strategies for code retrieval

cs.SE · 2026-05-08 · conditional · novelty 6.0

Full natural-language rewriting of code and queries boosts retrieval on code benchmarks while corpus-only rewriting often hurts, with token entropy difference serving as a cheap predictor of gains.

SAGE: Signal-Amplified Guided Embeddings for LLM-based Vulnerability Detection

cs.CR · 2026-04-21 · unverdicted · novelty 6.0

SAGE uses sparse autoencoders to boost vulnerability signals in LLMs, raising internal SNR 12.7x and delivering up to 318% MCC gains on vulnerability detection benchmarks.

UntrustVul: An Automated Approach for Identifying Untrustworthy Alerts in Vulnerability Detection Models

cs.SE · 2025-03-19 · unverdicted · novelty 6.0

UntrustVul identifies untrustworthy vulnerability predictions by marking lines that neither match historical vulnerability patterns nor influence vulnerable lines through dependencies, reporting AUC 70-88% and F1 82-94% on 115K predictions.

Bash-Commenter: Leveraging Syntax-Aware Preference Optimization to Reinforce Large Language Model for Bash Code Comment Generation

cs.SE · 2026-06-29 · unverdicted · novelty 5.0

Bash-Commenter applies CPT, SFT, and Syntax-Aware Preference Optimization (SAPO) via AST atomic operations to LLaMA-3.1-8B, reporting higher BLEU-4/METEOR/ROUGE-L scores than baselines on single-line and multi-line Bash comment generation tasks.

UNICS: Multilingual Code Search via Unified Pseudocode and Contrastive Transfer Learning

cs.SE · 2026-06-26 · unverdicted · novelty 5.0

UNICS pre-trains on a pseudocode dataset for cross-lingual logic then applies multi-task transfer learning with hard-positive mining and dynamic hard-negative sampling to reach claimed SOTA on multilingual code-search benchmarks.

Token Optimization Strategies for LLM-Based Oracle-to-PostgreSQL Migration

cs.LO · 2026-05-27 · unverdicted · novelty 5.0

Evaluation of twelve token optimization strategies for LLM-based Oracle-to-PostgreSQL migration finds that adaptive routing reduces tokens with minimal semantic loss while aggressive schema distillation harms quality.

MARGIN: Margin-Aware Regularized Geometry for Imbalanced Vulnerability Detection

cs.SE · 2026-05-11 · unverdicted · novelty 5.0 · 2 refs

MARGIN uses von Mises-Fisher concentration to dynamically adjust geometric regularization, aligning embedding distributions with Voronoi cells for more stable decision boundaries in imbalanced vulnerability detection.

Standing on the Shoulders of Giants: Stabilized Knowledge Distillation for Cross--Language Code Clone Detection

cs.AI · 2026-05-04 · unverdicted · novelty 5.0

Reasoning-oriented knowledge distillation from DeepSeek-R1 plus response stabilization improves reliability and often performance of compact models for cross-language code clone detection on pairs like Python-Java and Rust-Java.

How Code Representation Shapes False-Positive Dynamics in Cross-Language LLM Vulnerability Detection

cs.CR · 2026-04-30 · unverdicted · novelty 5.0

Text fine-tuning of 8B LLMs on C/C++ vulnerability data inflates cross-language false-positive rates through surface-cue memorization, which an AST inference probe can partially reverse while direct AST fine-tuning cannot.

Towards General Text Embeddings with Multi-stage Contrastive Learning

cs.CL · 2023-08-07 · unverdicted · novelty 5.0

GTE_base is a compact text embedding model using multi-stage contrastive learning on diverse data that outperforms OpenAI's API and 10x larger models on massive benchmarks and works for code as text.

Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models

cs.SE · 2026-04-28 · unverdicted · novelty 4.0

CTT is a compression pipeline for LLMs that achieves up to 49x memory reduction, 10x faster inference, 81% lower CO2 emissions, and retains 68-98% accuracy on code clone detection, summarization, and generation tasks.

LoRA-MME: Multi-Model Ensemble of LoRA-Tuned Encoders for Code Comment Classification

cs.SE · 2026-03-04 · conditional · novelty 4.0

LoRA-MME ensembles LoRA-adapted UniXcoder, CodeBERT, GraphCodeBERT, and CodeBERTa with learned weights to reach 0.7906 weighted F1 and 0.6867 macro F1 on code comment classification.

MultiMend: Multilingual Program Repair with Context Augmentation and Multi-Hunk Patch Generation

cs.SE · 2025-01-27 · unverdicted · novelty 4.0

MultiMend augments buggy function context via retrieval and generates multi-hunk patches, fixing 2,227 of 5,501 bugs across six benchmarks in four languages.

citing papers explorer

Showing 18 of 18 citing papers.

Edit, But Verify: An Empirical Audit of Instructed Code-Editing Benchmarks cs.SE · 2026-04-06 · conditional · none · ref 9
The two main benchmarks for LLM instructed code editing over-represent Python, miss common real-world domains and edit types, and have test coverage issues that limit what they measure.
BioDefect: The First Dataset for Defect Detection in Bioinformatics Software cs.SE · 2026-05-20 · unverdicted · none · ref 53
BioDefect is a new dataset for defect detection in bioinformatics software that improves average F1-scores by 29.61% to 38.04% over existing datasets when evaluated on nine language models.
Deep Graph-Language Fusion for Structure-Aware Code Generation cs.SE · 2026-05-05 · unverdicted · none · ref 10
CGFuse enables deep token-level fusion of graph-derived structural features into language models, yielding 10-16% BLEU and 6-11% CodeBLEU gains on code generation tasks.
Parallel-SFT: Improving Zero-Shot Cross-Programming-Language Transfer for Code RL cs.CL · 2026-04-22 · unverdicted · none · ref 37
Parallel-SFT mixes parallel programs across languages during SFT to produce more transferable RL initializations, yielding better zero-shot generalization to unseen programming languages.
RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems cs.CL · 2023-06-05 · unverdicted · none · ref 20
RepoBench is a new benchmark with retrieval, completion, and pipeline tasks to evaluate code auto-completion systems on entire repositories instead of single files.
Do not copy and paste! Rewriting strategies for code retrieval cs.SE · 2026-05-08 · conditional · none · ref 17
Full natural-language rewriting of code and queries boosts retrieval on code benchmarks while corpus-only rewriting often hurts, with token entropy difference serving as a cheap predictor of gains.
SAGE: Signal-Amplified Guided Embeddings for LLM-based Vulnerability Detection cs.CR · 2026-04-21 · unverdicted · none · ref 21
SAGE uses sparse autoencoders to boost vulnerability signals in LLMs, raising internal SNR 12.7x and delivering up to 318% MCC gains on vulnerability detection benchmarks.
UntrustVul: An Automated Approach for Identifying Untrustworthy Alerts in Vulnerability Detection Models cs.SE · 2025-03-19 · unverdicted · none · ref 18
UntrustVul identifies untrustworthy vulnerability predictions by marking lines that neither match historical vulnerability patterns nor influence vulnerable lines through dependencies, reporting AUC 70-88% and F1 82-94% on 115K predictions.
Bash-Commenter: Leveraging Syntax-Aware Preference Optimization to Reinforce Large Language Model for Bash Code Comment Generation cs.SE · 2026-06-29 · unverdicted · none · ref 14
Bash-Commenter applies CPT, SFT, and Syntax-Aware Preference Optimization (SAPO) via AST atomic operations to LLaMA-3.1-8B, reporting higher BLEU-4/METEOR/ROUGE-L scores than baselines on single-line and multi-line Bash comment generation tasks.
UNICS: Multilingual Code Search via Unified Pseudocode and Contrastive Transfer Learning cs.SE · 2026-06-26 · unverdicted · none · ref 20
UNICS pre-trains on a pseudocode dataset for cross-lingual logic then applies multi-task transfer learning with hard-positive mining and dynamic hard-negative sampling to reach claimed SOTA on multilingual code-search benchmarks.
Token Optimization Strategies for LLM-Based Oracle-to-PostgreSQL Migration cs.LO · 2026-05-27 · unverdicted · none · ref 9
Evaluation of twelve token optimization strategies for LLM-based Oracle-to-PostgreSQL migration finds that adaptive routing reduces tokens with minimal semantic loss while aggressive schema distillation harms quality.
MARGIN: Margin-Aware Regularized Geometry for Imbalanced Vulnerability Detection cs.SE · 2026-05-11 · unverdicted · none · ref 31 · 2 links
MARGIN uses von Mises-Fisher concentration to dynamically adjust geometric regularization, aligning embedding distributions with Voronoi cells for more stable decision boundaries in imbalanced vulnerability detection.
Standing on the Shoulders of Giants: Stabilized Knowledge Distillation for Cross--Language Code Clone Detection cs.AI · 2026-05-04 · unverdicted · none · ref 12
Reasoning-oriented knowledge distillation from DeepSeek-R1 plus response stabilization improves reliability and often performance of compact models for cross-language code clone detection on pairs like Python-Java and Rust-Java.
How Code Representation Shapes False-Positive Dynamics in Cross-Language LLM Vulnerability Detection cs.CR · 2026-04-30 · unverdicted · none · ref 10
Text fine-tuning of 8B LLMs on C/C++ vulnerability data inflates cross-language false-positive rates through surface-cue memorization, which an AST inference probe can partially reverse while direct AST fine-tuning cannot.
Towards General Text Embeddings with Multi-stage Contrastive Learning cs.CL · 2023-08-07 · unverdicted · none · ref 124
GTE_base is a compact text embedding model using multi-stage contrastive learning on diverse data that outperforms OpenAI's API and 10x larger models on massive benchmarks and works for code as text.
Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models cs.SE · 2026-04-28 · unverdicted · none · ref 17
CTT is a compression pipeline for LLMs that achieves up to 49x memory reduction, 10x faster inference, 81% lower CO2 emissions, and retains 68-98% accuracy on code clone detection, summarization, and generation tasks.
LoRA-MME: Multi-Model Ensemble of LoRA-Tuned Encoders for Code Comment Classification cs.SE · 2026-03-04 · conditional · none · ref 3
LoRA-MME ensembles LoRA-adapted UniXcoder, CodeBERT, GraphCodeBERT, and CodeBERTa with learned weights to reach 0.7906 weighted F1 and 0.6867 macro F1 on code comment classification.
MultiMend: Multilingual Program Repair with Context Augmentation and Multi-Hunk Patch Generation cs.SE · 2025-01-27 · unverdicted · none · ref 4
MultiMend augments buggy function context via retrieval and generates multi-hunk patches, fixing 2,227 of 5,501 bugs across six benchmarks in four languages.

UniXcoder: Unified cross-modal pre-training for code representation

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer