Contexting as Recommendation: Evolutionary Collaborative Filtering for Context Engineering
Pith reviewed 2026-05-20 19:01 UTC · model grok-4.3
The pith
Context engineering as recommendation enables matching each input with its optimal context.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We propose a paradigm shift by formulating context engineering as a recommendation problem. We introduce Neural Collaborative Context Engineering (NCCE), a framework that transitions optimization from a static global search to dynamic, instance-wise routing. NCCE first bootstraps a diverse catalog of anchor contexts and then employs a novel Context-CF Co-Evolution mechanism. This stage establishes a synergistic feedback loop: a lightweight Neural Collaborative Filtering (NCF) model learns instance-context preferences to guide the generation of specialized context variants, while the newly evaluated contexts continuously refine the NCF model's understanding of latent preferences. At inference time, the trained NCF model acts as a context router, dynamically assigning the most suitable context strategy to each unseen instance.
What carries the argument
The Context-CF Co-Evolution mechanism: a feedback loop in which a Neural Collaborative Filtering model learns instance-context preferences, and those preferences steer the generation of context variants used for instance-specific routing.
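The feedback loop can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: instances and contexts are plain numbers, `evaluate` stands in for running the LLM, a score table stands in for the NCF model, and the mutation rule is invented. It is not the paper's algorithm, only the shape of a loop in which preferences guide generation and new evaluations update the preferences.

```python
import random


def evaluate(instance, context):
    """Toy stand-in for an LLM evaluation: higher when the context fits the instance."""
    return 1.0 - abs(instance - context)


def mean_best_score(instances, catalog):
    """Average, over instances, of the best score any catalog context achieves."""
    return sum(max(evaluate(x, c) for c in catalog) for x in instances) / len(instances)


def co_evolve(instances, anchors, rounds=4, seed=0):
    """Hypothetical co-evolution loop; a score table stands in for the NCF model."""
    rng = random.Random(seed)
    catalog = list(anchors)
    # Interaction table: (instance index, context index) -> observed score.
    table = {(i, j): evaluate(x, c)
             for i, x in enumerate(instances)
             for j, c in enumerate(catalog)}
    for _ in range(rounds):
        # Preference step: each instance's preferred context according to the table.
        preferred = {i: max(range(len(catalog)), key=lambda j: table[(i, j)])
                     for i in range(len(instances))}
        # Guidance step: target the instance that its preferred context serves worst.
        target = min(preferred, key=lambda i: table[(i, preferred[i])])
        parent = catalog[preferred[target]]
        # Generation step: nudge the parent toward the target instance (toy mutation).
        child = parent + 0.5 * (instances[target] - parent) + rng.uniform(-0.02, 0.02)
        catalog.append(child)
        # Refinement step: evaluate the new context on every instance and extend the table.
        new_index = len(catalog) - 1
        for i, x in enumerate(instances):
            table[(i, new_index)] = evaluate(x, child)
    return catalog
```

Because the catalog only grows, the per-instance best score in this toy cannot decrease; whether a learned router actually finds those per-instance bests on unseen inputs is a separate question that the sketch does not address.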
If this is right
- Instance-wise context routing captures performance gains missed by global optimization.
- The NCF model enables efficient dynamic assignment at inference without repeated searches.
- Personalization in context engineering is shown to be critical for LLM task accuracy.
- The co-evolution process refines both the preference model and the context catalog over iterations.
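Routing at inference can be pictured as one scoring pass over the catalog followed by an argmax. The sketch below assumes that each instance and each context already has an embedding vector, and it scores pairs with a plain dot product; an NCF model would typically pass the embeddings through learned layers instead. The function names are hypothetical.

```python
def score(instance_vec, context_vec):
    """Predicted preference as a dot product (an assumption, simpler than an NCF scorer)."""
    return sum(a * b for a, b in zip(instance_vec, context_vec))


def route(instance_vec, context_vecs):
    """Return the index of the context with the highest predicted preference."""
    return max(range(len(context_vecs)),
               key=lambda j: score(instance_vec, context_vecs[j]))


def route_batch(instance_vecs, context_vecs):
    """Route every instance independently; no search over new contexts is run."""
    return [route(x, context_vecs) for x in instance_vecs]
```

For example, with `context_vecs = [[1.0, 0.0], [0.0, 1.0]]`, the instance `[0.9, 0.1]` is routed to context 0 and `[0.2, 0.7]` to context 1. The cost per instance is one score per catalog entry, which is the sense in which routing avoids repeated searches.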
Where Pith is reading between the lines
- This framing could be applied to selecting few-shot examples or other prompt components on a per-input basis.
- Testing on a wider range of tasks would reveal how much the gains depend on the diversity of the initial anchor catalog.
Load-bearing premise
A diverse catalog of anchor contexts can be bootstrapped such that the subsequent Context-CF Co-Evolution loop produces genuinely instance-specific improvements rather than simply rediscovering a few strong global contexts.
What would settle it
An experiment showing no accuracy improvement when using the NCF-routed contexts compared to the best single global context or the initial anchor set on new inputs would indicate the claim does not hold.
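Such a comparison could be computed from a held-out score matrix along the following lines. This is a minimal sketch under assumptions: `scores[i][c]` is the measured score of context `c` on held-out instance `i`, `routed[i]` is the context index the router chose, and both names are hypothetical. The global baseline here is picked on the same held-out data, which flatters the baseline and so makes the comparison conservative for the router.

```python
def compare_routing(scores, routed):
    """Compare routed accuracy with the best single context and the per-instance oracle."""
    n = len(scores)
    k = len(scores[0])
    # Mean score of each context when it is used for every instance.
    global_means = [sum(row[c] for row in scores) / n for c in range(k)]
    best_global = max(global_means)
    # Mean score when each instance uses the context the router picked for it.
    routed_acc = sum(scores[i][routed[i]] for i in range(n)) / n
    # Upper bound: each instance uses its own best context.
    oracle = sum(max(row) for row in scores) / n
    return {
        "best_global": best_global,
        "routed": routed_acc,
        "oracle": oracle,
        "gain": routed_acc - best_global,
    }
```

A `gain` at or below zero on new inputs would be the negative outcome described above; the gap between `routed` and `oracle` indicates how much headroom the router leaves.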
Original abstract
Large Language Models (LLMs) are highly sensitive to their input contexts, motivating the development of automated context engineering. However, existing methods predominantly treat this as a global search problem, seeking a single context strategy that maximizes average performance across a dataset. This restrictive assumption overlooks the fact that different inputs often require distinct guidance, leaving substantial instance-level performance gains untapped. In this paper, we propose a paradigm shift by formulating context engineering as a recommendation problem. We introduce Neural Collaborative Context Engineering (NCCE), a framework that transitions optimization from a static global search to dynamic, instance-wise routing. NCCE first bootstraps a diverse catalog of anchor contexts and then employs a novel Context-CF Co-Evolution mechanism. This stage establishes a synergistic feedback loop: a lightweight Neural Collaborative Filtering (NCF) model learns instance-context preferences to guide the generation of specialized context variants, while the newly evaluated contexts continuously refine the NCF model's understanding of latent preferences. At inference time, the trained NCF model acts as a context router, dynamically assigning the most suitable context strategy to each unseen instance. Theoretical Proofs and comprehensive experiments demonstrate that by matching individual inputs with their optimal contexts, NCCE significantly improves task accuracy, highlighting the critical importance of personalization in LLM context engineering.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Neural Collaborative Context Engineering (NCCE) to reframe LLM context engineering as an instance-wise recommendation task rather than global search. It bootstraps a catalog of anchor contexts and introduces a Context-CF Co-Evolution loop in which a Neural Collaborative Filtering (NCF) model learns preferences to guide generation of context variants; at inference the NCF routes each input to its preferred context. The abstract states that theoretical proofs and comprehensive experiments show significant accuracy gains from this personalization.
Significance. If the results hold and the method demonstrably routes inputs to distinct, instance-specific contexts rather than rediscovering a few strong global ones, the work would be significant for shifting context engineering from dataset-level optimization to per-instance routing, with potential gains on heterogeneous tasks.
major comments (2)
- Abstract: the claim that 'Theoretical Proofs and comprehensive experiments demonstrate' significant improvements is unsupported in the provided manuscript, which contains no quantitative results, error bars, baseline comparisons, or description of controls against post-hoc context selection; this is load-bearing for the central accuracy claim.
- Context-CF Co-Evolution mechanism (described in the abstract and introduction): the feedback loop between NCF preference learning and variant generation lacks any stated mechanism or metric (e.g., entropy of context assignments, per-instance context diversity, or ablation showing routing variation) to ensure convergence produces genuinely instance-specific contexts rather than a small set of dominant global winners; without such evidence the personalization paradigm cannot be distinguished from improved global search.
minor comments (2)
- Notation: 'NCF' and 'NCCE' should be expanded on first use; the distinction between 'anchor contexts' and 'specialized context variants' is not made explicit.
- The title 'Contexting as Recommendation' would benefit from a brief clarification of how the evolutionary loop differs from standard collaborative filtering pipelines.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive report. The comments highlight important areas where the current manuscript requires strengthening to support its central claims. We address each major comment below and outline the revisions we will make.
Point-by-point responses
- Referee: Abstract: the claim that 'Theoretical Proofs and comprehensive experiments demonstrate' significant improvements is unsupported in the provided manuscript, which contains no quantitative results, error bars, baseline comparisons, or description of controls against post-hoc context selection; this is load-bearing for the central accuracy claim.
  Authors: We agree that the abstract claim is currently unsupported, as the submitted manuscript does not yet include the quantitative results, error bars, baseline comparisons, or explicit controls against post-hoc selection. This phrasing was carried over from an earlier outline and does not reflect the present state of the document. In the revised version we will remove the unsupported claim from the abstract and, if the experiments are completed in time, replace it with a more qualified statement that points to the specific results and controls that will be added to the experimental section.
  Revision: yes
- Referee: Context-CF Co-Evolution mechanism (described in the abstract and introduction): the feedback loop between NCF preference learning and variant generation lacks any stated mechanism or metric (e.g., entropy of context assignments, per-instance context diversity, or ablation showing routing variation) to ensure convergence produces genuinely instance-specific contexts rather than a small set of dominant global winners; without such evidence the personalization paradigm cannot be distinguished from improved global search.
  Authors: We acknowledge that the current description of the Context-CF Co-Evolution loop does not supply the requested metrics or ablations. While the manuscript outlines the iterative feedback between the NCF router and context variant generation, it does not report assignment entropy, per-instance diversity statistics, or controlled ablations that would demonstrate routing variation. We will add these analyses in the revision, including entropy of context assignments across the test set and an ablation that compares instance-specific routing against a global-search baseline, to provide the necessary evidence that the method produces genuinely personalized contexts rather than converging on a few dominant ones.
  Revision: yes
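The entropy of context assignments mentioned in this exchange is straightforward to compute once the router's choices on a test set are recorded. The sketch below assumes `assignments` is a list with one chosen context identifier per test instance; the function names are hypothetical and this is one possible formulation, not the authors' planned analysis.

```python
import math
from collections import Counter


def assignment_entropy(assignments):
    """Shannon entropy (in bits) of the distribution of routed contexts."""
    if not assignments:
        raise ValueError("assignments must be non-empty")
    n = len(assignments)
    probs = [count / n for count in Counter(assignments).values()]
    return sum(-p * math.log2(p) for p in probs)


def effective_contexts(assignments):
    """Perplexity-style count: how many contexts are 'effectively' in use."""
    return 2 ** assignment_entropy(assignments)
```

An entropy near zero means almost every instance is sent to the same context, which would be hard to distinguish from global search; an entropy near `log2` of the catalog size means the router spreads instances across the whole catalog. High entropy alone does not show that the spread improves accuracy, so it would need to be read together with a routed-versus-global comparison.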
Circularity Check
No significant circularity; standard supervised training-inference separation
The NCCE framework bootstraps an initial catalog of anchor contexts, evaluates them to create training data for the NCF model, then uses the trained NCF to route contexts for new instances. This follows a conventional supervised learning loop where parameters are fit on observed instance-context preference data and applied to unseen inputs. No equation or step equates a claimed prediction to its own inputs by construction, no uniqueness theorem is imported via self-citation, and the co-evolution is described as iterative refinement rather than a closed definitional loop. The central claim of instance-specific routing rests on empirical generalization rather than tautology.
Axiom & Free-Parameter Ledger
free parameters (1)
- number of anchor contexts
axioms (1)
- domain assumption: Different inputs require distinct guidance that can be captured by a low-rank preference matrix.
invented entities (1)
- Context-CF Co-Evolution mechanism (no independent evidence)
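The low-rank assumption listed in the ledger can be written out as follows. The notation is assumed here for illustration and is not taken from the paper: R is the instance-by-context score matrix, and U and V hold latent instance and context factors.

```latex
% Assumed notation (not from the paper):
% R_{ic} is the expected task score of context c on instance i,
% for n instances and m contexts.
R \approx U V^{\top}, \qquad
U \in \mathbb{R}^{n \times d},\;
V \in \mathbb{R}^{m \times d},\;
d \ll \min(n, m)

% Routing picks the context with the highest predicted score:
c^{*}(i) = \arg\max_{c} \; u_i^{\top} v_c

% Degenerate case d = 1 with u_i > 0 for all i:
\arg\max_{c} \; u_i v_c = \arg\max_{c} \; v_c \quad \text{for every } i
```

The last line makes the stakes concrete: if the preference matrix is effectively rank one with positive instance factors, every instance prefers the same context and routing collapses to picking a single global winner. Instance-specific gains therefore require at least a second latent dimension along which instances disagree.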
Lean theorems connected to this paper
- IndisputableMonolith/Foundation/RealityFromDistinction.lean, reality_from_one_distinction (tag: unclear)
  Relation between the paper passage and the cited Recognition theorem.
  Passage: "We propose a paradigm shift by formulating context engineering as a recommendation problem... Neural Collaborative Filtering (NCF) model learns instance-context preferences"
What do these tags mean?
- matches: The paper's claim is directly supported by a theorem in the formal canon.
- supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses: The paper appears to rely on the theorem as machinery.
- contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
- unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.