pith. sign in

arxiv: 2606.28070 · v2 · pith:ARTSCHSJnew · submitted 2026-06-26 · 💻 cs.AI

JD Oxygen AI Item Center (Oxygen AIIC) V1: An Industrial-Scale LLM/VLM-Centric Solution for Item Understanding, Management, and Applications

Pith reviewed 2026-06-30 09:41 UTC · model grok-4.3

classification 💻 cs.AI
keywords item understandingLLMVLMe-commerceontology engineeringknowledge productionsemantic searchindustrial AI system
0
0 comments X

The pith

Oxygen AIIC uses self-evolving LLMs and VLMs to produce item knowledge for tens of billions of SKUs at 94.2% precision and 82.8% recall.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Oxygen AIIC as an industrial platform that applies LLMs and VLMs to the production and management of structured item knowledge inside a catalog of tens of billions of SKUs. It confronts three scale-related problems: fast-emerging concepts, high-volume knowledge creation, and varied downstream uses, by organizing the work around four pillars of ontology engineering with human-AI loops, a Semantic Search then Discrimination identification method, self-evolving models, and a unified data-service tunnel. The deployed system now covers tens of thousands of categories and processes hundreds of millions of item updates each day. Reported outcomes include 94.2% precision and 82.8% recall in knowledge production together with 80.4% search-traffic coverage and a 37% drop in item-information quality issues. These results are offered as evidence that controllable self-evolution of the models can sustain high-quality output under continuous large-scale operation.

Core claim

The central claim is that self-evolving item-understanding LLMs and VLMs, when paired with human-AI ontology engineering and the S2D knowledge identification architecture, enable scalable production of high-quality structured item knowledge for tens of billions of SKUs, delivering 94.2% precision, 82.8% recall, and measurable operational gains while running at hundreds of millions of daily updates.

What carries the argument

Self-evolving item-understanding LLMs/VLMs together with the Semantic Search then Discrimination (S2D) architecture, which together drive stable model improvement and high-throughput knowledge identification across the item catalog.

If this is right

  • Search-traffic coverage reaches 80.4% in core business scenarios.
  • Item-information quality issues drop by 37%.
  • Automated fill rate of core attributes during item listing exceeds 80%.
  • The platform accumulates hundreds of billions of item-knowledge assets while operating on Huawei Ascend NPUs.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The self-evolving loop may reduce the fraction of human effort required per new ontology entry as the catalog grows.
  • The S2D architecture could be adapted to other large-scale structured-data domains that combine text and image signals.
  • A unified item tunnel may lower the engineering cost of connecting the knowledge base to additional downstream services such as dynamic pricing or inventory forecasting.

Load-bearing premise

The reported precision, recall, and business-impact numbers measured on current production data will remain stable as the ontology expands to millions of entries and daily update volume stays at hundreds of millions.

What would settle it

A new evaluation set drawn from categories added after the reported measurements shows precision or recall falling materially below 94.2% and 82.8%.

Figures

Figures reproduced from arXiv: 2606.28070 by Chan Long, Chaofan Chen, Chaohui Dong, Chao Liu, Chunyuan Guo, Danping Liu, Debin Liu, Deping Xiang, Fulai Xu, Guangyue Liu, Hao Li, Huichun Hu, Jianan Wang, Jianbo Zhao, Jian Yang, Jiaoyang Li, Jiaxing Wang, Jinglong Li, Jinjin Guo, Jun Fang, Jun Liu, Kai Zhou, Lili Gao, Li Wang, Liying Chen, Luning Yang, Mengdi Zhou, Oxygen AIIC, Pengzhang Liu, Qianyun Wang, Qi Lv, Qixia Jiang, Ruyue Li, Shimu Liang, Shuxing Wang, Sijie Zhang, Siqi Li, Tianhao Gao, Wang Ke, Weihu Huang, Wencan Lai, Wenjie Zhang, Xiaohui Zhang, Xiaojing Dong, Ya Liu, Yifeng Zhang, Yixiang Wang, Yongtai Zhang, Yongyi Liao, Zhaoru Chen, Zhen Chen, Zhiyong Ma, Zhiyuan Liu, Zhongwei Liu, Ziyan Xing.

Figure 1
Figure 1. Figure 1: Typical failure cases in traditional item knowledge systems across the demand, supply, and operations sides. efficiency, and experience” its core strategic priorities. As e-commerce has grown rapidly, traditional item knowledge systems can no longer support this strategy effectively, giving rise to three industrial￾scale bottlenecks across the demand, supply, and operations sides, as illustrated in [PITH_… view at source ↗
Figure 2
Figure 2. Figure 2: Overview of Oxygen AIIC across the item lifecycle. Ontology, and AI Item Library jointly support category planning, merchant workflows, user understanding, search, recommendation, and platform operations. These efforts confirm the feasibility of large models for intelligent item understanding. However, deploying them at JD, a platform that spans virtually every retail category and manages tens of billions … view at source ↗
Figure 3
Figure 3. Figure 3: Overall architecture of JD Oxygen AI Item Center. Oxygen AIIC integrates ontology engi￾neering, AI Item Library, the item understanding LLMs/VLMs, the item tunnel, and the application matrix into a closed-loop industrial system. Item-Understanding LLMs/VLMs The item-understanding LLMs/VLMs support both ontology construction and AI Item Library production, serving as the foundation for continuous improvemen… view at source ↗
Figure 4
Figure 4. Figure 4: Human–AI collaborative ontology engineering. Human experts establish the fundamental ontology backbone, while an automated pipeline dynamically discovers, fuses, and validates emerg￾ing concepts from multi-source heterogeneous data. 3.2.2 Algorithm-driven ontology growth (bottom-up) Building upon the expert-defined ontology backbone and continuously incorporating signals from user behavior and industry tre… view at source ↗
Figure 5
Figure 5. Figure 5: Production architecture of the AI Item Library. Taking item data and a dynamically evolv￾ing ontology as input, the pipeline first mitigates computational redundancy across the SKU and at￾tribute dimensions, and then performs precise item-to-ontology recognition through a two-stage “Se￾mantic Search then Discrimination” (S 2D) engine, powered by the item understanding LLMs/VLMs. 4.2 Item Knowledge Recognit… view at source ↗
Figure 6
Figure 6. Figure 6: Overview of the Oxygen AIIC framework for the item understanding LLMs/VLMs. Con￾structed upon a unified multi-task item understanding foundation model, the framework supports incremental capability expansion, incorporates instruction-following knowledge representation, and implements a closed-loop model self-evolution mechanism to continuously enhance model perfor￾mance and data quality. its continuous evo… view at source ↗
Figure 7
Figure 7. Figure 7: Incremental adaptation based on LoRAM experts and adaptive expert composition. A frozen SFT backbone is combined with multiple lightweight expert updates, and GRPO optimizes expert composition via task feedback. under limited business data, even when the learning rate is increased. Consequently, we introduce LoRAM initialization based on the Magnitude Principle (Zhang et al., 2026b). By directly construct￾… view at source ↗
Figure 8
Figure 8. Figure 8: Instruction-following knowledge representation training. The framework transfers reason￾ing capability through latent chain-of-thought (Latent CoT) distillation and enhances representational robustness via adaptive feature-space perturbation. In e-commerce, representation models must extract knowledge signals from comprehensive item information. Traditional embeddings are susceptible to interference from l… view at source ↗
Figure 9
Figure 9. Figure 9: provides a system-level view of the self-evolution loop; the following paragraphs delineate its four modules. Module 1: Data Evaluation Hardcase Set Consistency Confidence Stability Badcase Set Module 2: Data Analysis Item Understanding LLMs/VLMs Boundary Confusion Hallucination Expression Deviation Module 3: Data Synthesis Factual Constraints Boundary Calibration Fact Alignment Module 4: Data Selection Sy… view at source ↗
read the original abstract

JD$.$com, one of the world's largest e-commerce platforms, serves over 700 million active users and millions of merchants, with a catalog of tens of billions of SKUs. At this scale, high-quality, structured item knowledge underpins a better consumer experience, lower management costs, and higher operational efficiency-yet producing and serving it poses three industrial-scale challenges: fast-emerging concepts, high-quality knowledge production for massive SKUs, and diverse downstream requirements. To address these challenges, we present the JD Oxygen AI Item Center (Oxygen AIIC), an industrial-scale platform built on LLMs/VLMs for item-knowledge production and service. Oxygen AIIC is built around four core pillars: (i) ontology engineering driven by efficient human-AI collaboration, which supports the dynamic evolution and agile expansion of an ontology with millions of entries; (ii) a "Semantic Search then Discrimination"(S2D) knowledge identification architecture that, combined with throughput improvement strategies, enables scalable, extensible, and high-throughput AI Item Library production for tens of billions of SKUs; (iii) self-evolving item-understanding LLMs/VLMs that improve in a stable and controllable manner, enabling knowledge production with 94.2% precision and 82.8% recall; and (iv) a unified item tunnel that serves as the data and service hub. Oxygen AIIC now covers tens of thousands of JD categories and processes hundreds of millions of item updates per day on Huawei Ascend NPUs. It has accumulated hundreds of billions of item-knowledge assets. Deployed across core business scenarios-including search, recommendation, operations, category planning-Oxygen AIIC has delivered measurable gains at scale. Search-traffic coverage reaches 80.4%, item-information quality issues drop by 37%, the automated fill rate of core attributes during item listing exceeds 80%.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The manuscript describes the JD Oxygen AI Item Center (Oxygen AIIC), an industrial-scale platform at JD.com for item-knowledge production and service using LLMs/VLMs. It outlines four pillars: (i) ontology engineering via human-AI collaboration supporting millions of entries, (ii) a Semantic Search then Discrimination (S2D) architecture for high-throughput processing of tens of billions of SKUs, (iii) self-evolving item-understanding models claimed to achieve 94.2% precision and 82.8% recall, and (iv) a unified item tunnel. The system reportedly processes hundreds of millions of daily updates, has accumulated hundreds of billions of knowledge assets, and delivers business gains including 80.4% search-traffic coverage and a 37% reduction in item-information quality issues across search, recommendation, and operations scenarios.

Significance. If the performance and impact claims are substantiated with rigorous evaluation, the work would represent a significant case study of LLM/VLM deployment at extreme industrial scale, demonstrating practical solutions for ontology evolution, scalable knowledge extraction, and integration with e-commerce workflows. The emphasis on self-evolving models and throughput strategies could inform similar large-scale systems, provided the metrics prove stable and generalizable.

major comments (1)
  1. [Abstract] Abstract (and any results/evaluation sections): The central claims of production-grade performance rest on the stated 94.2% precision and 82.8% recall for self-evolving LLMs/VLMs, plus downstream metrics (80.4% search coverage, 37% quality-issue reduction). No evaluation protocol, test-set sampling method from the tens-of-billions SKU catalog, ground-truth construction or validation process, ontology-subset representativeness, or non-self-evolving baseline comparisons are provided. This absence makes it impossible to assess whether the numbers support the claims of stable, controllable improvement under the stated daily update volume.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful reading and the constructive comment on evaluation rigor. We address the point below and will revise the manuscript to strengthen the presentation of results.

read point-by-point responses
  1. Referee: [Abstract] Abstract (and any results/evaluation sections): The central claims of production-grade performance rest on the stated 94.2% precision and 82.8% recall for self-evolving LLMs/VLMs, plus downstream metrics (80.4% search coverage, 37% quality-issue reduction). No evaluation protocol, test-set sampling method from the tens-of-billions SKU catalog, ground-truth construction or validation process, ontology-subset representativeness, or non-self-evolving baseline comparisons are provided. This absence makes it impossible to assess whether the numbers support the claims of stable, controllable improvement under the stated daily update volume.

    Authors: We agree that the manuscript would benefit from an explicit evaluation section. In the revision we will add a dedicated subsection that describes: (1) the stratified sampling procedure used to construct the test sets from the full SKU catalog while preserving category and ontology coverage; (2) the multi-stage ground-truth construction process (human annotation guidelines, inter-annotator agreement, and adjudication); (3) how ontology subsets were chosen for representativeness; and (4) the non-self-evolving baseline models against which the reported gains were measured. We will also clarify the temporal split used to simulate the daily-update regime. Because the underlying data are proprietary, we will present the protocol at a level that allows reproducibility of the evaluation design without releasing raw item records. revision: yes

Circularity Check

0 steps flagged

No circularity: descriptive system paper with no derivations or load-bearing self-citations

full rationale

The manuscript is a high-level engineering description of an industrial platform (four pillars: ontology engineering, S2D architecture, self-evolving LLMs/VLMs, unified tunnel) rather than a theoretical argument containing equations, fitted parameters, or first-principles derivations. Performance figures (94.2% precision, 82.8% recall) are presented as observed outcomes of the deployed system without any reduction to prior inputs by construction, and no self-citations, uniqueness theorems, or ansatzes are invoked to justify core claims. The derivation chain is therefore self-contained as a factual report of architecture and metrics.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no equations, fitted constants, or postulated entities. No free parameters, axioms, or invented entities are identifiable from the provided text.

pith-pipeline@v0.9.1-grok · 6095 in / 1278 out tokens · 36289 ms · 2026-06-30T09:41:04.550753+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

117 extracted references · 27 canonical work pages · 7 internal anchors

  1. [1]

    Bert: Pre-training of deep bidirectional transformers for language understanding , author=. Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) , pages=

  2. [2]

    Proceedings of the 31st International Conference on Computational Linguistics , pages=

    LLaMA-E: Empowering E-commerce authoring with object-interleaved instruction following , author=. Proceedings of the 31st International Conference on Computational Linguistics , pages=

  3. [3]

    Bo Peng and Xinyi Ling and Ziru Chen and Huan Sun and Xia Ning , booktitle=. eCe. 2024 , url=

  4. [4]

    arXiv preprint arXiv:2406.12023 , year=

    LiLiuM: eBay's Large Language Models for e-commerce , author=. arXiv preprint arXiv:2406.12023 , year=

  5. [5]

    Proceedings of the web conference 2020 , pages=

    TaxoExpan: Self-supervised taxonomy expansion with position-enhanced graph neural network , author=. Proceedings of the web conference 2020 , pages=

  6. [6]

    Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

    Octet: Online catalog taxonomy enrichment with self-supervision , author=. Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

  7. [7]

    Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=

    KATIE: a system for key attributes identification in product knowledge graph construction , author=. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=

  8. [8]

    Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

    Opentag: Open attribute value extraction from product profiles , author=. Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

  9. [9]

    Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

    Learning to extract attribute value from product via question answering: A multi-task approach , author=. Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining , pages=

  10. [10]

    CatalogRAG: Retrieval-guided LLM prediction for multilingual e-commerce product attributes , author=

  11. [11]

    Proceedings of the 2020 ACM SIGMOD international conference on management of data , pages=

    Alicoco: Alibaba e-commerce cognitive concept net , author=. Proceedings of the 2020 ACM SIGMOD international conference on management of data , pages=

  12. [12]

    Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages=

    Alicoco2: Commonsense knowledge extraction, representation and application in e-commerce , author=. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages=

  13. [13]

    Proceedings of the 27th ACM SIGKDD Conference on knowledge discovery & data mining , pages=

    All you need to know to build a product knowledge graph , author=. Proceedings of the 27th ACM SIGKDD Conference on knowledge discovery & data mining , pages=

  14. [14]

    Proceedings of the 13th International Conference on Web Search and Data Mining , pages=

    Product Knowledge Graph Embedding for E-commerce , author=. Proceedings of the 13th International Conference on Web Search and Data Mining , pages=. 2020 , doi=

  15. [15]

    arXiv preprint arXiv:2412.01837 , year=

    Enabling Explainable Recommendation in E-commerce with LLM-powered Product Knowledge Graph , author=. arXiv preprint arXiv:2412.01837 , year=

  16. [16]

    Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

    TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering , author=. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=. 2018 , doi=

  17. [17]

    Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

    HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion , author=. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=. 2018 , doi=

  18. [18]

    Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

    STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths , author=. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=. 2020 , doi=

  19. [19]

    Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics , pages=

    Scaling up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Product Title , author=. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics , pages=. 2019 , doi=

  20. [20]

    Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining , pages=

    MAVE: A Product Dataset for Multi-source Attribute Value Extraction , author=. Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining , pages=. 2022 , doi=

  21. [21]

    Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) , pages=

    Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product , author=. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) , pages=. 2020 , doi=

  22. [22]

    Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages=

    PAM: Understanding Product Images in Cross Product Category Attribute Extraction , author=. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages=. 2021 , doi=

  23. [23]

    Findings of the Association for Computational Linguistics: EMNLP 2022 , pages=

    SMARTAVE: Structured Multimodal Transformer for Product Attribute Value Extraction , author=. Findings of the Association for Computational Linguistics: EMNLP 2022 , pages=. 2022 , doi=

  24. [24]

    Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track) , pages=

    Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes , author=. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track) , pages=. 2023 , doi=

  25. [25]

    Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

    Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data , author=. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=. 2020 , doi=

  26. [26]

    Advances in Databases and Information Systems , series=

    Using LLMs for the Extraction and Normalization of Product Attribute Values , author=. Advances in Databases and Information Systems , series=. 2024 , publisher=

  27. [27]

    Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=

    Using Brand Knowledge Bases and LLM Agents to Enhance E-commerce Retailers' Catalog Quality , author=. Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=. 2026 , doi=

  28. [28]

    Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

    Open-World Attribute Mining for E-Commerce Products with Multimodal Self-Correction Instruction Tuning , author=. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=. 2025 , doi=

  29. [29]

    Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track) , pages=

    Domain Adaptation of Foundation LLMs for e-Commerce , author=. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track) , pages=. 2025 , doi=

  30. [30]

    arXiv preprint arXiv:2509.09121 , year=

    Compass-v3: Scaling Domain-Specific LLMs for Multilingual E-Commerce in Southeast Asia , author=. arXiv preprint arXiv:2509.09121 , year=

  31. [31]

    Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=

    MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding , author=. Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining , pages=. 2026 , doi=

  32. [32]

    arXiv preprint arXiv:2511.12449 , year=

    MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding , author=. arXiv preprint arXiv:2511.12449 , year=

  33. [33]

    Psychology of learning and motivation , volume=

    Catastrophic interference in connectionist networks: The sequential learning problem , author=. Psychology of learning and motivation , volume=. 1989 , publisher=

  34. [34]

    Proceedings of the national academy of sciences , volume=

    Overcoming catastrophic forgetting in neural networks , author=. Proceedings of the national academy of sciences , volume=. 2017 , publisher=

  35. [35]

    International Conference on Learning Representations , year=

    Finetuned Language Models are Zero-Shot Learners , author=. International Conference on Learning Representations , year=

  36. [36]

    Journal of Machine Learning Research , volume=

    Scaling instruction-finetuned language models , author=. Journal of Machine Learning Research , volume=

  37. [37]

    Edward J Hu and yelong shen and Phillip Wallis and Zeyuan Allen-Zhu and Yuanzhi Li and Shean Wang and Lu Wang and Weizhu Chen , booktitle=. Lo. 2022 , url=

  38. [38]

    Advances in Neural Information Processing Systems , volume=

    The primacy of magnitude in low-rank adaptation , author=. Advances in Neural Information Processing Systems , volume=

  39. [39]

    Mixture of Lo

    Xun Wu and Shaohan Huang and Furu Wei , booktitle=. Mixture of Lo. 2024 , url=

  40. [40]

    arXiv preprint arXiv:2405.00361 , year=

    Adamole: Fine-tuning large language models with adaptive mixture of low-rank adaptation experts , author=. arXiv preprint arXiv:2405.00361 , year=

  41. [41]

    DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

    Deepseekmath: Pushing the limits of mathematical reasoning in open language models , author=. arXiv preprint arXiv:2402.03300 , year=

  42. [42]

    arXiv preprint arXiv:2311.01460 , year =

    Implicit chain of thought reasoning via knowledge distillation , author=. arXiv preprint arXiv:2311.01460 , year=

  43. [43]

    Proceedings of the AAAI Conference on Artificial Intelligence , volume=

    FANoise: Singular Value-Adaptive Noise Modulation for Robust Multimodal Representation Learning , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=

  44. [44]

    Advances in neural information processing systems , volume=

    Judging llm-as-a-judge with mt-bench and chatbot arena , author=. Advances in neural information processing systems , volume=

  45. [45]

    Advances in Neural Information Processing Systems , volume=

    Doremi: Optimizing data mixtures speeds up language model pretraining , author=. Advances in Neural Information Processing Systems , volume=

  46. [46]

    Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

    Semantic product search , author=. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

  47. [47]

    Language Models are Few-Shot Learners , volume =

    Brown, Tom and Mann, Benjamin and Ryder, Nick and Subbiah, Melanie and Kaplan, Jared D and Dhariwal, Prafulla and Neelakantan, Arvind and Shyam, Pranav and Sastry, Girish and Askell, Amanda and Agarwal, Sandhini and Herbert-Voss, Ariel and Krueger, Gretchen and Henighan, Tom and Child, Rewon and Ramesh, Aditya and Ziegler, Daniel and Wu, Jeffrey and Winte...

  48. [48]

    Advances in neural information processing systems , volume=

    Training language models to follow instructions with human feedback , author=. Advances in neural information processing systems , volume=

  49. [49]

    International conference on machine learning , pages=

    Learning transferable visual models from natural language supervision , author=. International conference on machine learning , pages=. 2021 , organization=

  50. [50]

    Advances in neural information processing systems , volume=

    Visual instruction tuning , author=. Advances in neural information processing systems , volume=

  51. [51]

    Advances in neural information processing systems , volume=

    Flashattention: Fast and memory-efficient exact attention with io-awareness , author=. Advances in neural information processing systems , volume=

  52. [52]

    Proceedings of the 29th symposium on operating systems principles , pages=

    Efficient memory management for large language model serving with pagedattention , author=. Proceedings of the 29th symposium on operating systems principles , pages=

  53. [53]

    Proceedings of the 58th annual meeting of the association for computational linguistics , pages=

    Don’t stop pretraining: Adapt language models to domains and tasks , author=. Proceedings of the 58th annual meeting of the association for computational linguistics , pages=

  54. [54]

    Sentence-bert: Sentence embeddings using siamese bert-networks , author=. Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP) , pages=

  55. [55]

    Proceedings of the 2021 conference on empirical methods in natural language processing , pages=

    Simcse: Simple contrastive learning of sentence embeddings , author=. Proceedings of the 2021 conference on empirical methods in natural language processing , pages=

  56. [56]

    An Overview of Multi-Task Learning in Deep Neural Networks

    An overview of multi-task learning in deep neural networks , author=. arXiv preprint arXiv:1706.05098 , year=

  57. [57]

    Journal of machine learning research , volume=

    Exploring the limits of transfer learning with a unified text-to-text transformer , author=. Journal of machine learning research , volume=

  58. [58]

    International conference on machine learning , pages=

    Parameter-efficient transfer learning for NLP , author=. International conference on machine learning , pages=. 2019 , organization=

  59. [59]

    Proceedings of the 28th international conference on computational linguistics , pages=

    Continual lifelong learning in natural language processing: A survey , author=. Proceedings of the 28th international conference on computational linguistics , pages=

  60. [60]

    Advances in neural information processing systems , volume=

    Chain-of-thought prompting elicits reasoning in large language models , author=. Advances in neural information processing systems , volume=

  61. [61]

    Distilling the Knowledge in a Neural Network

    Distilling the knowledge in a neural network , author=. arXiv preprint arXiv:1503.02531 , year=

  62. [62]

    Advances in Neural Information Processing Systems , volume=

    Star: Bootstrapping reasoning with reasoning , author=. Advances in Neural Information Processing Systems , volume=

  63. [63]

    Advances in neural information processing systems , volume=

    Self-refine: Iterative refinement with self-feedback , author=. Advances in neural information processing systems , volume=

  64. [64]

    Proceedings of the 61st annual meeting of the association for computational linguistics (volume 1: long papers) , pages=

    Self-instruct: Aligning language models with self-generated instructions , author=. Proceedings of the 61st annual meeting of the association for computational linguistics (volume 1: long papers) , pages=

  65. [65]

    Companion of the 2024 International Conference on Management of Data , pages =

    Yu, Changlong and Liu, Xin and Maia, Jefferson and Li, Yang and Cao, Tianyu and Gao, Yifan and Song, Yangqiu and Goutam, Rahul and Zhang, Haiyang and Yin, Bing and Li, Zheng , title =. Companion of the 2024 International Conference on Management of Data , pages =

  66. [66]

    Findings of the Association for Computational Linguistics: ACL 2023 , pages =

    Yu, Changlong and Wang, Weiqi and Liu, Xin and Bai, Jiaxin and Song, Yangqiu and Li, Zheng and Gao, Yifan and Cao, Tianyu and Yin, Bing , title =. Findings of the Association for Computational Linguistics: ACL 2023 , pages =

  67. [67]

    Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

    Autoknow: Self-driving knowledge collection for products of thousands of types , author=. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , pages=

  68. [68]

    Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track , pages=

    AttributeForge: An Agentic LLM Framework for Automated Product Schema Modeling , author=. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track , pages=

  69. [69]

    European Semantic Web Conference , pages=

    Ontogenia: Ontology generation with metacognitive prompting in large language models , author=. European Semantic Web Conference , pages=

  70. [70]

    European semantic web conference , pages=

    Navigating ontology development with large language models , author=. European semantic web conference , pages=

  71. [71]

    European Semantic Web Conference , pages=

    Ontology generation using large language models , author=. European Semantic Web Conference , pages=

  72. [72]

    International semantic web conference , pages=

    LLMs4OL: Large language models for ontology learning , author=. International semantic web conference , pages=

  73. [73]

    2023 25th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC) , pages=

    Ontology engineering with large language models , author=. 2023 25th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC) , pages=

  74. [74]

    European Semantic Web Conference , pages=

    Neon-gpt: a large language model-powered pipeline for ontology learning , author=. European Semantic Web Conference , pages=

  75. [75]

    arXiv preprint arXiv:2412.02035 , year=

    Llms4life: Large language models for ontology learning in life sciences , author=. arXiv preprint arXiv:2412.02035 , year=

  76. [76]

    arXiv preprint arXiv:2505.24163 , year=

    LKD-KGC: Domain-specific KG construction via LLM-driven knowledge dependency parsing , author=. arXiv preprint arXiv:2505.24163 , year=

  77. [77]

    From Local to Global: A Graph RAG Approach to Query-Focused Summarization

    From local to global: A graph rag approach to query-focused summarization , author=. arXiv preprint arXiv:2404.16130 , year=

  78. [78]

    arXiv preprint arXiv:2506.00664 , year=

    OntoRAG: Enhancing question-answering through automated ontology derivation from unstructured knowledge bases , author=. arXiv preprint arXiv:2506.00664 , year=

  79. [79]

    Proceedings of the 2024 conference on empirical methods in natural language processing , pages=

    Extract, define, canonicalize: An llm-based framework for knowledge graph construction , author=. Proceedings of the 2024 conference on empirical methods in natural language processing , pages=

  80. [80]

    Findings of the Association for Computational Linguistics: EMNLP 2023 , pages=

    Schema-adaptable knowledge graph construction , author=. Findings of the Association for Computational Linguistics: EMNLP 2023 , pages=

Showing first 80 references.