Human-LLM Compound System for Scientific Ideation through Facet Recombination and Novelty Evaluation
Pith reviewed 2026-05-23 20:55 UTC · model grok-4.3
The pith
Scideator improves scientific ideation by letting users recombine extracted purposes, mechanisms, and evaluations from papers through a human-LLM system.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Scideator extracts key facets (purposes, mechanisms, and evaluations) from papers, enables human-in-the-loop recombination to synthesize ideas by finding analogies, surfaces papers at varying distances via controlled retrieval, and verifies novelty with a facet-grounded retrieve-then-rerank method, resulting in greater creativity support than a plain LLM baseline in a user study with computer science researchers.
What carries the argument
Facet recombination, in which users select purposes, mechanisms, and evaluations extracted by the LLM from papers and the system generates ideas by identifying analogies across those facets.
If this is right
- Users gain a spectrum of ideation directions from same-topic to distant papers through distance-controlled retrieval.
- Facet-based retrieve-then-rerank surfaces more relevant papers for novelty checking than standard retrieval methods.
- A facet-grounded novelty classifier outperforms classifiers that reason over unstructured text of ideas and papers.
- The system particularly boosts idea exploration and expressiveness compared to using the backbone LLM without facet modules.
Where Pith is reading between the lines
- This facet approach could extend to domains outside computer science if the extraction accuracy holds across fields.
- Researchers might spend less time manually scanning literature if facet extraction becomes reliable enough for direct use in recombination.
- Future tests could check whether the creativity gains persist when users start from fewer seed papers or when the system is applied to hypothesis generation rather than full idea synthesis.
Load-bearing premise
LLM-extracted facets from papers are accurate and complete enough that recombining them produces scientifically coherent and novel ideas that users can productively evaluate.
What would settle it
An experiment in which independent experts rate ideas generated via Scideator as no more novel or coherent than those from the plain-LLM baseline, or in which many recombined ideas are later found to already exist in the literature.
Figures
read the original abstract
The scientific ideation process often involves blending facets of existing papers to create new ideas. We contribute Scideator, the first human-LLM system for facet-based scientific ideation. Starting from user-provided papers, Scideator extracts key facets -- purposes, mechanisms, and evaluations -- from these and related papers, allowing users to interactively recombine facets to synthesize ideas. Scideator is driven by three design choices: (1) human-in-the-loop facet recombination, in which users select facets from retrieved papers and the system generates ideas by finding analogies across them via the Faceted Idea Generator module; (2) distance-controlled retrieval via the Analogous Paper Facet Finder module, which surfaces papers ranging from the same topic to entirely different areas to provide a spectrum of directions; and (3) facet-based novelty verification via the Idea Novelty Checker module, a retrieve-then-rerank pipeline that helps users to evaluate idea originality using facets. In a user study with computer science researchers, Scideator provided significantly more creativity support than a baseline using the same backbone LLM without our facet-based modules, particularly in idea exploration and expressiveness. Ablations further show that the facets benefit the novelty checker: facet-based retrieve-then-rerank surfaces more relevant papers than standard retrieval and re-ranking, and a facet-grounded novelty classifier outperforms classifiers that reason over unstructured ideas and papers.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces Scideator, a human-LLM compound system for scientific ideation. It extracts facets (purposes, mechanisms, evaluations) from user-provided and retrieved papers, supports interactive facet recombination via analogy in the Faceted Idea Generator, uses distance-controlled retrieval in the Analogous Paper Facet Finder to surface papers from similar to distant domains, and includes a retrieve-then-rerank Idea Novelty Checker grounded in facets. A user study with computer science researchers reports that Scideator provides significantly more creativity support than a baseline LLM without the facet modules, especially for idea exploration and expressiveness; ablations indicate that facet-based retrieval and classification improve novelty checking over unstructured baselines.
Significance. If the user-study results hold under fuller reporting, the work offers a structured, facet-driven alternative to unstructured LLM ideation that integrates human control with retrieval and verification modules. The explicit ablations on the novelty checker (facet-based retrieve-then-rerank vs. standard methods, facet-grounded classifier vs. unstructured) and the human-in-the-loop design are concrete strengths that allow partial isolation of the contribution. The distance-controlled retrieval spectrum is a useful design choice for controlling exploration breadth.
major comments (2)
- [User Study] User Study section (and abstract): the central claim of 'significantly more creativity support' is reported without participant count (n), statistical tests performed, blinding procedures, exact system prompts, or raw score distributions. These omissions make it impossible to assess effect size, power, or reproducibility of the reported difference, which is load-bearing for the empirical contribution.
- [Idea Novelty Checker] § on Idea Novelty Checker: while ablations are presented, the paper does not report inter-annotator agreement or error analysis on the LLM-extracted facets themselves; if facet extraction accuracy is low, the downstream recombination and novelty signals rest on an unquantified foundation.
minor comments (2)
- [System Overview] Notation for facets (purposes, mechanisms, evaluations) is introduced without an explicit formal definition or example table showing extraction output for a sample paper.
- [Figures] Figure captions for the system pipeline and retrieval spectrum could more explicitly label the three modules (Faceted Idea Generator, Analogous Paper Facet Finder, Idea Novelty Checker) to aid navigation.
Simulated Author's Rebuttal
We thank the referee for their constructive review and for recognizing the value of the human-in-the-loop design, distance-controlled retrieval, and the ablations on the novelty checker. We address each major comment below.
read point-by-point responses
-
Referee: [User Study] User Study section (and abstract): the central claim of 'significantly more creativity support' is reported without participant count (n), statistical tests performed, blinding procedures, exact system prompts, or raw score distributions. These omissions make it impossible to assess effect size, power, or reproducibility of the reported difference, which is load-bearing for the empirical contribution.
Authors: We agree that these details are required to evaluate the strength and reproducibility of the user-study results. The current manuscript does not report the participant count, the statistical tests performed, blinding procedures, exact system prompts, or raw score distributions. In the revised manuscript we will expand the User Study section (and update the abstract) to include the number of participants, the specific statistical tests and results, blinding procedures, the exact prompts, and raw score distributions or summary visualizations. revision: yes
-
Referee: [Idea Novelty Checker] § on Idea Novelty Checker: while ablations are presented, the paper does not report inter-annotator agreement or error analysis on the LLM-extracted facets themselves; if facet extraction accuracy is low, the downstream recombination and novelty signals rest on an unquantified foundation.
Authors: We agree that an explicit quantification of facet-extraction reliability would strengthen the claims about the downstream modules. The manuscript does not currently report inter-annotator agreement or an error analysis on the LLM-extracted facets. In the revision we will add a targeted analysis of facet-extraction quality, for example by reporting agreement or error rates on a sampled set of extractions, to provide a quantified basis for the recombination and novelty-checking components. revision: yes
Circularity Check
No significant circularity
full rationale
The paper describes an empirical human-LLM system (Scideator) whose central claims rest on a comparative user study with computer science researchers and ablations of its modules. No equations, fitted parameters, or derivation chain appear in the provided material. The evaluation compares the full facet-based system to a baseline using the same backbone LLM, making the reported superiority an external, falsifiable outcome rather than a quantity that reduces to the system's own inputs by construction. No self-citation load-bearing steps, self-definitional relations, or renamed known results are present.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Large language models can reliably extract purposes, mechanisms, and evaluations from scientific papers and perform analogy-based recombination across them.
Forward citations
Cited by 12 Pith papers
-
Evolving Idea Graphs with Learnable Edits-and-Commits for Multi-Agent Scientific Ideation
EIG represents research ideas as evolving graphs with nodes for claims and edges for relations, using a learned controller for edits and commits to produce higher-quality scientific proposals than text-only multi-agen...
-
ResearchCube: Multi-Dimensional Trade-off Exploration for Research Ideation
ResearchCube provides a 3D spatial interface with bipolar trade-off dimensions and direct-manipulation interactions to support multi-dimensional research ideation, shown helpful in a study with 11 researchers for exte...
-
LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape
LitPivot introduces literature-initiated pivots where engagement with dynamically retrieved papers prompts revisions to a developing research idea.
-
The Alien Space of Science: Sampling Coherent but Cognitively Unavailable Research Directions
A framework decomposes LLM papers into idea atoms, trains coherence and availability models over the resulting vocabulary, and samples atom combinations that are coherent yet unlikely under existing author communities.
-
IDRBench: Understanding the Capability of Large Language Models on Interdisciplinary Research
IDRBench is presented as the first benchmark framework consisting of datasets and three evaluation tasks to measure LLMs' ability to perform interdisciplinary research.
-
CHIMERA: A Knowledge Base of Scientific Idea Recombinations for Research Analysis and Ideation
CHIMERA is the first large-scale mined KB of concept recombinations from scientific literature, created via a new IE task and LLM extraction, with demonstrated uses in pattern analysis and hypothesis generation.
-
When AI reviews science: Can we trust the referee?
AI peer review systems are vulnerable to prompt injections, prestige biases, assertion strength effects, and contextual poisoning, as demonstrated by a new attack taxonomy and causal experiments on real conference sub...
-
Attribution Gradients: Incrementally Unfolding Citations for Critical Examination of Attributed AI Answers
Attribution gradients consolidate citation evidence and enable incremental unfolding of secondary sources, leading to deeper engagement in a lab study of critical reading tasks for AI answers.
-
"Like Taking the Path of Least Resistance": Exploring the Impact of LLM Interaction on the Creative Process of Programming
LLM assistance shortens idea-generation periods and reduces creative moments during programming tasks while yielding solutions with comparable idea counts and greater functional correctness.
-
AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery
A survey organizing AI-powered research automation into five workflow stages, defining AutoResearch and Vibe Research, and proposing five evaluation dimensions while noting domain-conditioned limits on autonomy.
-
Omakase: proactive assistance with actionable suggestions for evolving scientific research projects
Omakase monitors project documents to infer timely queries and distills research reports into actionable suggestions that users rated significantly more useful than raw reports.
-
Evolving Roles of LLMs in Scientific Innovation: Assistant, Collaborator, Scientist, and Evaluator
The paper proposes a four-role framework for LLMs in scientific innovation and reviews methods, benchmarks, and limitations across Assistant, Collaborator, Scientist, and Evaluator roles.
Reference graph
Works this paper leans on
-
[1]
Abdelrahman Abdallah, Bhawna Piryani, Jamshid Mozafari, Mohammed Ali, and Adam Jatowt. 2025. Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation. https://api.semanticscholar. org/CorpusID:276107364
work page 2025
- [2]
-
[3]
Shm Garanganao Almeda, JD Zamfirescu-Pereira, Kyu Won Kim, Pradeep Mani Rathnam, and Bjoern Hartmann. 2024. Prompting for discovery: Flex- ible sense-making for ai art-making with dreamsheets. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems . 1–17
work page 2024
- [4]
- [5]
-
[6]
Lutz Bornmann and Rüdiger Mutz. 2015. Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references. Journal of the association for information science and technology 66, 11 (2015), 2215–2222
work page 2015
-
[7]
Virginia Braun and Victoria Clarke. 2006. Using thematic analysis in psychology. Qualitative research in psychology 3, 2 (2006), 77–101
work page 2006
-
[8]
Joel Chan, Joseph Chee Chang, Tom Hope, Dafna Shahaf, and Aniket Kittur. 2018. Solvent: A mixed initiative system for finding analogies between research papers. Proceedings of the ACM on Human-Computer Interaction 2, CSCW (2018), 1–21
work page 2018
-
[9]
Liuqing Chen, Yuan Zhang, Ji Han, Lingyun Sun, Peter Childs, and Boheng Wang. 2024. A foundation model enhanced approach for generative design in combinational creativity. Journal of Engineering Design 35, 11 (2024), 1394–1420
work page 2024
-
[10]
Alan Y Cheng, Meng Guo, Melissa Ran, Arpit Ranasaria, Arjun Sharma, Anthony Xie, Khuyen N Le, Bala Vinaithirthan, Shihe Luan, David Thomas Henry Wright, et al. 2024. Scientific and fantastical: Creating immersive, culturally relevant learning experiences with augmented reality and large language models. In Proceedings of the 2024 CHI Conference on Human F...
work page 2024
-
[11]
Erin Cherry and Celine Latulipe. 2014. Quantifying the creativity support of digital tools through the creativity support index.ACM Transactions on Computer- Human Interaction (TOCHI) 21, 4 (2014), 1–25
work page 2014
-
[12]
DaEun Choi, Sumin Hong, Jeongeon Park, John Joon Young Chung, and Juho Kim. 2024. CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI. In Proceedings of the CHI Conference on Human Factors in Computing Systems . 1–25
work page 2024
-
[13]
Seulgi Choi, Hyewon Lee, Yoonjoo Lee, and Juho Kim. 2024. VIVID: Human-AI Collaborative Authoring of Vicarious Dialogues from Lecture Videos. In Pro- ceedings of the 2024 CHI Conference on Human Factors in Computing Systems . 1–26
work page 2024
-
[14]
Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey, and Daniel S. Weld
-
[15]
SPECTER: Document-level Representation Learning using Citation- informed Transformers. ArXiv abs/2004.07180 (2020). https://api.semanticscholar. org/CorpusID:215768677
-
[16]
Arthur Cropley. 2006. In praise of convergent thinking. Creativity research journal 18, 3 (2006), 391–404
work page 2006
- [17]
-
[18]
Douglas L Dean, Jill Hender, Tom Rodgers, and Eric Santanen. 2006. Identifying good ideas: constructs and scales for idea evaluation. Journal of Association for Information Systems 7, 10 (2006), 646–699
work page 2006
-
[19]
Karl Duncker and Lynne S Lees. 1945. On problem-solving. Psychological mono- graphs 58, 5 (1945), i
work page 1945
- [20]
-
[21]
Jingtong Gao, Bo Chen, Xiangyu Zhao, Weiwen Liu, Xiangyang Li, Yichao Wang, Zijian Zhang, Wanyu Wang, Yuyang Ye, Shanru Lin, Huifeng Guo, and Ruim- ing Tang. 2024. LLM-enhanced Reranking in Recommender Systems. ArXiv abs/2406.12433 (2024). https://api.semanticscholar.org/CorpusID:270562015
-
[22]
Juraj Gottweis, Wei-Hung Weng, Alexander Daryin, Tao Tu, Anil Palepu, Petar Sirkovic, Artiom Myaskovsky, Felix Weissenberger, Keran Rong, Ryutaro Tanno, et al. 2025. Towards an AI co-scientist. arXiv preprint arXiv:2502.18864 (2025)
work page internal anchor Pith review Pith/arXiv arXiv 2025
- [23]
- [24]
-
[25]
Hua Guo and David H Laidlaw. 2018. Topic-based exploration and embedded visualizations for research idea generation. IEEE transactions on visualization and computer graphics 26, 3 (2018), 1592–1607
work page 2018
- [26]
-
[27]
1996.Mental leaps: Analogy in creative thought
Keith J Holyoak and Paul Thagard. 1996.Mental leaps: Analogy in creative thought. MIT press
work page 1996
-
[28]
Tom Hope, Joel Chan, Aniket Kittur, and Dafna Shahaf. 2017. Accelerating innovation through analogy mining. In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining . 235–243
work page 2017
-
[29]
Tom Hope, Doug Downey, Daniel S Weld, Oren Etzioni, and Eric Horvitz. 2023. A computational inflection for scientific discovery. Commun. ACM 66, 8 (2023), 62–73
work page 2023
-
[30]
Tom Hope, Ronen Tamari, Daniel Hershcovich, Hyeonsu B Kang, Joel Chan, Aniket Kittur, and Dafna Shahaf. 2022. Scaling creative inspiration with fine- grained functional aspects of ideas. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems . 1–15. Scideator: Human-LLM Scientific Idea Generation and Novelty Evaluation Grounded in...
work page 2022
-
[31]
Peter Jansen, Oyvind Tafjord, Marissa Radensky, Pao Siangliulue, Tom Hope, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder, Daniel S Weld, and Peter Clark. 2025. CodeScientist: End-to-End Semi-Automated Scientific Discovery with Code-based Experimentation. arXiv preprint arXiv:2503.22708 (2025)
-
[32]
Arif E Jinha. 2010. Article 50 million: an estimate of the number of scholarly articles in existence. Learned publishing 23, 3 (2010), 258–263
work page 2010
-
[33]
Hyeonsu B Kang, David Chuan-En Lin, Nikolas Martelaro, Aniket Kittur, Yan- Ying Chen, and Matthew K Hong. 2024. BioSpark: An End-to-End Generative System for Biological-Analogical Inspirations and Ideation. In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems . 1–13
work page 2024
-
[34]
Hyeonsu B Kang, Xin Qian, Tom Hope, Dafna Shahaf, Joel Chan, and Aniket Kittur. 2022. Augmenting scientific creativity with an analogical search engine. ACM Transactions on Computer-Human Interaction 29, 6 (2022), 1–36
work page 2022
-
[35]
James C Kaufman and Robert J Sternberg. 2010. The Cambridge handbook of creativity. Cambridge University Press
work page 2010
-
[36]
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, and Christopher Potts. 2023. DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines. arXiv preprint arXiv:2310.03714 (2023)
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[37]
Rodney Michael Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chan- drasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David W. Graham, F.Q. Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, ...
-
[38]
Dan Lahav, Jon Saad Falcon, Bailey Kuehl, Sophie Johnson, Sravanthi Parasa, Noam Shomron, Duen Horng Chau, Diyi Yang, Eric Horvitz, Daniel S Weld, et al
-
[39]
In Proceedings of the AAAI Conference on Artificial Intelligence, Vol
A search engine for discovery of scientific challenges and directions. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 11982–11990
-
[40]
Pier Luca Lanzi and Daniele Loiacono. 2023. Chatgpt and other large language models as evolutionary engines for online interactive collaborative game design. In Proceedings of the Genetic and Evolutionary Computation Conference . 1383– 1390
work page 2023
-
[41]
Joanne Leong, Pat Pataranutaporn, Valdemar Danry, Florian Perteneder, Yaoli Mao, and Pattie Maes. 2024. Putting things into context: Generative AI-enabled context personalization for vocabulary learning improves learning motivation. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems . 1–15
work page 2024
-
[42]
Weixin Liang, Yuhui Zhang, Hancheng Cao, Binglu Wang, Daisy Yi Ding, Xinyu Yang, Kailas Vodrahalli, Siyu He, Daniel Scott Smith, Yian Yin, et al. 2024. Can large language models provide useful feedback on research papers? A large-scale empirical analysis. NEJM AI 1, 8 (2024), AIoa2400196
work page 2024
-
[43]
Yiren Liu, Si Chen, Haocong Cheng, Mengxia Yu, Xiao Ran, Andrew Mo, Yiliu Tang, and Yun Huang. 2024. How ai processing delays foster creativity: Exploring research question co-creation with an llm-based agent. In Proceedings of the CHI Conference on Human Factors in Computing Systems . 1–25
work page 2024
-
[44]
Yiren Liu, Pranav Sharma, Mehul Jitendra Oswal, Haijun Xia, and Yun Huang
-
[45]
arXiv preprint arXiv:2409.12538 (2024)
Personaflow: Boosting research ideation with llm-simulated expert per- sonas. arXiv preprint arXiv:2409.12538 (2024)
-
[46]
Yiren Liu, Mengxia Yu, Meng Jiang, and Yun Huang. 2023. Creative Research Question Generation for Human-Computer Interaction Research.. In IUI Work- shops. 58–66
work page 2023
-
[47]
Chris Lu, Cong Lu, Robert Tjarko Lange, Jakob Foerster, Jeff Clune, and David Ha
-
[49]
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Chris Lu, Cong Lu, Robert Tjarko Lange, Jakob N. Foerster, Jeff Clune, and David Ha. 2024. The AI Scientist: Towards Fully Automated Open-Ended Scientific Dis- covery. ArXiv abs/2408.06292 (2024). https://api.semanticscholar.org/CorpusID: 271854887
work page internal anchor Pith review Pith/arXiv arXiv 2024
- [50]
-
[51]
Louie Meyer, Johanne Engel Aaen, Anitamalina Regitse Tranberg, Peter Kun, Matthias Freiberger, Sebastian Risi, and Anders Sundnes Løvlie. 2024. Algorithmic ways of seeing: Using object detection to facilitate art exploration. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems . 1–18
work page 2024
-
[52]
Sheshera Mysore, Arman Cohan, and Tom Hope. 2022. Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity. InProceed- ings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 4453–4470
work page 2022
-
[53]
Sheshera Mysore, Tim O’Gorman, Andrew McCallum, and Hamed Zamani. [n. d.]. CSFCube–A Test Collection of Computer Science Research Articles for Faceted Query by Example. ([n. d.])
- [54]
-
[55]
Harshit Nigam, Manasi Patwardhan, Lovekesh Vig, and Gautam Shroff. 2024. An Interactive Co-Pilot for Accelerated Research Ideation. In Proceedings of the Third Workshop on Bridging Human–Computer Interaction and Natural Language Processing. 60–73
work page 2024
- [56]
-
[57]
Jeongseok Oh, Seungju Kim, and Seungjun Kim. 2024. LumiMood: A Creativity Support Tool for Designing the Mood of a 3D Scene. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems . 1–21
work page 2024
-
[58]
OpenReview. [n. d.]. OpenReview. https://openreview.net/
-
[59]
Jason Portenoy, Marissa Radensky, Jevin D West, Eric Horvitz, Daniel S Weld, and Tom Hope. 2022. Bursting scientific filter bubbles: Boosting innovation via novel author discovery. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–13
work page 2022
-
[60]
Kevin Pu, KJ Feng, Tovi Grossman, Tom Hope, Bhavana Dalvi Mishra, Matt Latzke, Jonathan Bragg, Joseph Chee Chang, and Pao Siangliulue. 2024. IdeaSynth: Itera- tive Research Idea Development Through Evolving and Composing Idea Facets with Literature-Grounded Feedback. arXiv preprint arXiv:2410.04025 (2024)
-
[61]
A Terry Purcell and John S Gero. 1996. Design and other types of fixation.Design studies 17, 4 (1996), 363–383
work page 1996
- [62]
-
[63]
Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Conference on Empirical Methods in Natural Language Processing. https://api.semanticscholar.org/CorpusID:201646309
work page 2019
-
[64]
Mark A Runco et al . 2010. Divergent thinking, creativity, and ideation. The Cambridge handbook of creativity 413 (2010), 446
work page 2010
- [66]
-
[67]
Dean Keith Simonton. 2021. Scientific Creativity: Discovery and Invention as Combinatorial. Frontiers in Psychology 12 (2021). https://api.semanticscholar. org/CorpusID:237262181
work page 2021
-
[68]
Arvind Srinivasan and Joel Chan. 2024. Improving Selection of Analogical Inspirations through Chunking and Recombination. In Proceedings of the 16th Conference on Creativity & Cognition . 374–397
work page 2024
-
[69]
Sangho Suh, Meng Chen, Bryan Min, Toby Jia-Jun Li, and Haijun Xia. 2024. Luminate: Structured Generation and Exploration of Design Space with Large Language Models for Human-AI Co-Creation. InProceedings of the CHI Conference on Human Factors in Computing Systems . 1–26
work page 2024
-
[70]
Lu Sun, Aaron Chan, Yun Seo Chang, and Steven P Dow. 2024. ReviewFlow: Intelligent Scaffolding to Support Academic Peer Reviewing. In Proceedings of the 29th International Conference on Intelligent User Interfaces . 120–137
work page 2024
- [71]
-
[72]
Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, and Zhaochun Ren. 2023. Is ChatGPT Good at Search? Investi- gating Large Language Models as Re-Ranking Agents. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing . 14918–14937
work page 2023
-
[73]
P Thagard. 2012. The cognitive science of science: Explanation, discovery, and conceptual change. The MIT Press
work page 2012
-
[74]
Jianyou Wang, Kaicheng Wang, Xiaoyue Wang, Prudhviraj Naidu, Leon Bergen, and Ramamohan Paturi. 2023. DORIS-MAE: scientific document retrieval using multi-level aspect-based queries. In Proceedings of the 37th International Confer- ence on Neural Information Processing Systems . 38404–38419
work page 2023
- [75]
- [76]
-
[77]
Hongji Yang, Delin Jing, and Lu Zhang. 2016. Creative Computing: an approach to knowledge combination for creativity?. In 2016 IEEE Symposium on Service- Oriented System Engineering (SOSE) . IEEE, 407–414. , , Radensky et al
work page 2016
-
[78]
TextGrad: Automatic "Differentiation" via Text
Mert Yuksekgonul, Federico Bianchi, Joseph Boen, Sheng Liu, Zhi Huang, Carlos Guestrin, and James Zou. 2024. TextGrad: Automatic" Differentiation" via Text. arXiv preprint arXiv:2406.07496 (2024)
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[79]
Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, and Helen Meng. 2024. Self-alignment for factuality: Mitigating hallucinations in llms via self-evaluation. arXiv preprint arXiv:2402.09267 (2024). A PROMPTS FOR ANALOGOUS PAPER FACET FINDER A.1 Prompt to extract facets from a paper title/abstract. def promptTextToPur...
-
[86]
No referencing the purpose in the evaluation facet. Examples of bad vs good purposes: - bad (too specific): to generate creative writing activities for third-grade English lessons --> good: to support elementary creative writing↩→ - bad (too broad): to support healthcare --> good: to provide clinical decision support↩→ - bad (more than one purpose that ar...
-
[91]
Do NOT reuse the words already in the facet. Examples of bad vs good definitions: - facet: longitudinal study. bad: a study that evaluates the tool Toolio over the course of a year --> good: a study that takes place over a long period of time extending at least multiple days ↩→ ↩→ - facet: Toolio for creative writing. bad: Toolio implements SLM for genera...
-
[92]
Specific enough to be helpful in coming up with research ideas
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.