Black-box Prompt Learning for Pre-trained Language Models

Ruijia Xu; Shizhe Diao; Tong Zhang; Xiao Zhou; Xuechun Li; Yong Lin; Zhichao Huang

arxiv: 2201.08531 · v3 · pith:MZ2JECIUnew · submitted 2022-01-21 · 💻 cs.CL

Black-box Prompt Learning for Pre-trained Language Models

Shizhe Diao , Zhichao Huang , Ruijia Xu , Xuechun Li , Yong Lin , Xiao Zhou , Tong Zhang This is my paper

classification 💻 cs.CL

keywords promptblack-boxclouddiscretelearningmodelsparametersplms

0 comments

read the original abstract

The increasing scale of general-purpose Pre-trained Language Models (PLMs) necessitates the study of more efficient adaptation across different downstream tasks. In this paper, we establish a Black-box Discrete Prompt Learning (BDPL) to resonate with pragmatic interactions between the cloud infrastructure and edge devices. Particularly, instead of fine-tuning the model in the cloud, we adapt PLMs by prompt learning, which efficiently optimizes only a few parameters of the discrete prompts. Moreover, we consider the scenario that we do not have access to the parameters and gradients of the pre-trained models, except for its outputs given inputs. This black-box setting secures the cloud infrastructure from potential attack and misuse to cause a single-point failure, which is preferable to the white-box counterpart by current infrastructures. Under this black-box constraint, we apply a variance-reduced policy gradient algorithm to estimate the gradients of parameters in the categorical distribution of each discrete prompt. In light of our method, the user devices can efficiently tune their tasks by querying the PLMs bounded by a range of API calls. Our experiments on RoBERTa and GPT-3 demonstrate that the proposed algorithm achieves significant improvement on eight benchmarks in a cloud-device collaboration manner. Finally, we conduct in-depth case studies to comprehensively analyze our method in terms of various data sizes, prompt lengths, training budgets, optimization objectives, prompt transferability, and explanations of the learned prompts. Our code will be available at https://github.com/shizhediao/Black-Box-Prompt-Learning.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Towards Spec Learning: Inference-Time Alignment from Preference Pairs
cs.CL 2026-06 unverdicted novelty 6.0

Proposes compiling preference pairs into readable natural-language specifications for inference-time LLM alignment, claiming outperformance over DPO on dense-preference domains.
Black-Box Continual Learning for Vision-Language Models
cs.CV 2026-06 unverdicted novelty 6.0

Introduces Black-CL black-box benchmark and BETA textual-prototype method that matches or exceeds white-box continual learning performance on ten datasets using 0.05M parameters.
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
cs.CL 2023-05 conditional novelty 6.0

UltraChat supplies 1.5 million high-quality multi-turn dialogues that, when used to fine-tune LLaMA, produce UltraLLaMA, which outperforms prior open-source chat models including Vicuna.
Towards Spec Learning: Inference-Time Alignment from Preference Pairs
cs.CL 2026-06 unverdicted novelty 5.0

Spec learning compiles brief instructions and preference pairs into readable natural-language specifications that condition LLMs at inference time and can outperform DPO on domains with dense preference signals.