ECLASS-Augmented Semantic Product Search for Electronic Components

Jan Henze; Markus Lange-Hegermann; Nico Baumgart

arxiv: 2604.19664 · v1 · submitted 2026-04-21 · 💻 cs.IR

ECLASS-Augmented Semantic Product Search for Electronic Components

Nico Baumgart , Markus Lange-Hegermann , Jan Henze This is my paper

Pith reviewed 2026-05-10 01:21 UTC · model grok-4.3

classification 💻 cs.IR

keywords semantic product searchdense retrievalECLASSelectronic componentsindustrial catalogsLLM retrievalre-ranking

0 comments

The pith

Integrating ECLASS hierarchical semantics into dense retrieval bridges natural-language queries to electronic component catalogs effectively.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests LLM-assisted dense retrieval for locating industrial electronic parts when users describe needs in plain language rather than attribute lists. It finds that dense retrieval followed by re-ranking recovers far more relevant items than lexical baselines like BM25 or general web foundation models. Adding structured hierarchical information from the ECLASS standard to the product embeddings produces steady accuracy lifts across different setups by linking user intent to sparse catalog entries. This matters for factory automation and LLM agent systems that must locate suitable components quickly from large, structured inventories.

Core claim

Dense retrieval combined with re-ranking substantially outperforms classical lexical methods and foundation model baselines for semantic product search on industrial electronic components. Augmenting product representations with ECLASS semantics yields consistent performance gains across configurations by supplying a semantic bridge between user intent and sparse product descriptions.

What carries the argument

ECLASS-augmented dense retrieval, which embeds product data and adds hierarchical standard metadata to align natural-language queries with attribute-centric catalog entries.

If this is right

Dense retrieval with re-ranking recovers relevant components at much higher rates than BM25 on expert queries.
ECLASS augmentation improves results consistently across different retrieval configurations.
The method runs more effectively and efficiently than foundation model baselines for this industrial task.
Better component identification supports factory automation and LLM-based agent workflows.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same augmentation pattern could apply to other standardized industrial taxonomies outside electronics.
More accurate retrieval might support agent-driven tasks such as automated bill-of-materials generation or supplier matching.
Performance on casual non-expert queries would test whether the semantic bridge holds for broader user populations.

Load-bearing premise

The expert queries and product dataset represent typical real industrial searches, and ECLASS metadata acts as a reliable general bridge without heavy domain-specific adjustments.

What would settle it

Running the same retrieval methods on a fresh collection of electronic components paired with queries whose wording diverges from ECLASS category structures.

Figures

Figures reproduced from arXiv: 2604.19664 by Jan Henze, Markus Lange-Hegermann, Nico Baumgart.

**Figure 1.** Figure 1: Overview of the proposed LLM-assisted semantic product search pipeline for industrial component retrieval, following the current state of the art in [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: Visualization of the ECLASS hierarchy and the corresponding product [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 4.** Figure 4: Search performance with and without re-ranking over different values of top k and the corresponding mean latency on the combined dataset with manual assessment of the top candidates (see Section III-D). For these experiments, CL 1, basic product data level, embeddings size 4096, and no query rewriting. B. Baseline comparison and practical trade-offs To contextualize the main findings, we compare our best d… view at source ↗

**Figure 3.** Figure 3: Search performance over different CLs and embedding sizes with and [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

read the original abstract

Efficient semantic access to industrial product data is a key enabler for factory automation and emerging LLM-based agent workflows, where both human engineers and autonomous agents must identify suitable components from highly structured catalogs. However, the vocabulary mismatch between natural-language queries and attribute-centric product descriptions limits the effectiveness of traditional retrieval approaches, e.g., BM25. In this work, we present a systematic evaluation of LLM-assisted dense retrieval for semantic product search on industrial electronic components, and investigate the integration of hierarchical semantics from the ECLASS standard into embedding-based retrieval. Our results show that dense retrieval combined with re-ranking substantially outperforms classical lexical methods and foundation model web-search baselines. In particular, the proposed approach achieves a Hit_Rate@5 of 94.3 %, compared to 31.4 % for BM25 on expert queries, while also exceeding foundation model baselines in both effectiveness and efficiency. Furthermore, augmenting product representations with ECLASS semantics yields consistent performance gains across configurations, demonstrating that standardized hierarchical metadata provides a crucial semantic bridge between user intent and sparse product descriptions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows solid gains from dense retrieval and ECLASS on component search, but missing dataset details make the headline numbers hard to judge.

read the letter

The main takeaway is that dense retrieval with re-ranking and ECLASS augmentation reaches 94.3% Hit@5 on expert queries for electronic components, compared to 31.4% for BM25. That gap, plus the consistent lift from the ECLASS metadata, is the concrete result worth noting. They also report beating some foundation model baselines on both accuracy and speed. The work applies established dense retrieval techniques to a structured industrial catalog and shows how hierarchical standard metadata helps close the vocabulary gap between queries and attribute-based descriptions. That part is useful for anyone dealing with product search in manufacturing or automation settings. The evaluation setup is the soft spot. The abstract gives no numbers on catalog size, number of queries, or how the expert queries were constructed. Without those, it is difficult to tell whether the high hit rate reflects a genuinely hard task or a smaller, more aligned test set. The stress-test note is correct on this point. If the catalog is modest or the queries closely match product attributes, the reported advantage could shrink in messier real-world use. The paper is aimed at researchers and practitioners in industrial information retrieval and semantic search for catalogs. A reader working on LLM agents for factory parts or similar retrieval problems would find the numbers and the ECLASS integration worth seeing. It deserves peer review because the domain is specific, the gains are large enough to check, and the approach is straightforward. Referees can request the missing dataset statistics and error analysis. I would send it to review rather than desk reject.

Referee Report

3 major / 1 minor

Summary. The paper evaluates LLM-assisted dense retrieval augmented with ECLASS hierarchical semantics for semantic search over industrial electronic component catalogs. It claims that the proposed approach achieves a Hit_Rate@5 of 94.3% on expert queries (versus 31.4% for BM25) while also outperforming foundation-model baselines in both effectiveness and efficiency, with consistent gains from ECLASS augmentation across configurations.

Significance. If the evaluation proves robust and representative, the work would be significant for industrial information retrieval: it provides concrete evidence that standardized hierarchical metadata (ECLASS) can serve as an effective semantic bridge for vocabulary mismatch in attribute-centric product data, with potential applicability to factory automation and LLM-agent workflows.

major comments (3)

[Abstract / Evaluation] Abstract and experimental evaluation: the manuscript reports a Hit_Rate@5 of 94.3% versus 31.4% for BM25 but provides no statistics on catalog size, number of products, or number of expert queries. Without these figures it is impossible to determine whether the performance delta reflects a genuine advance on realistically large and challenging industrial data or is inflated by a small or easy test set.
[Abstract / Evaluation] Abstract and experimental evaluation: details on how the expert queries were constructed or sampled are absent. This information is load-bearing for the claim that the results generalize to real industrial use cases and that ECLASS supplies a 'general semantic bridge' without extensive domain tuning.
[Abstract] Abstract: the claim that the approach 'exceeds foundation model baselines in both effectiveness and efficiency' is stated without naming the specific models, their retrieval configurations, or the concrete efficiency metrics (latency, throughput, or index size) used in the comparison.

minor comments (1)

[Abstract] The abstract would benefit from a brief parenthetical note on the scale of the evaluation (e.g., 'on a catalog of N products with M expert queries') to allow readers to immediately contextualize the reported percentages.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback. We have revised the manuscript to address all major comments by adding the requested details on dataset statistics, query construction, and baseline specifications to the abstract and evaluation sections.

read point-by-point responses

Referee: [Abstract / Evaluation] Abstract and experimental evaluation: the manuscript reports a Hit_Rate@5 of 94.3% versus 31.4% for BM25 but provides no statistics on catalog size, number of products, or number of expert queries. Without these figures it is impossible to determine whether the performance delta reflects a genuine advance on realistically large and challenging industrial data or is inflated by a small or easy test set.

Authors: We agree that these statistics are necessary to properly contextualize the results. In the revised manuscript we have added the catalog size, total number of products, and number of expert queries to both the abstract and the experimental evaluation section. revision: yes
Referee: [Abstract / Evaluation] Abstract and experimental evaluation: details on how the expert queries were constructed or sampled are absent. This information is load-bearing for the claim that the results generalize to real industrial use cases and that ECLASS supplies a 'general semantic bridge' without extensive domain tuning.

Authors: We acknowledge the importance of this information for assessing generalizability. The revised manuscript now includes a detailed account of how the expert queries were constructed and sampled, including the role of domain experts and the sampling approach used to reflect industrial scenarios. revision: yes
Referee: [Abstract] Abstract: the claim that the approach 'exceeds foundation model baselines in both effectiveness and efficiency' is stated without naming the specific models, their retrieval configurations, or the concrete efficiency metrics (latency, throughput, or index size) used in the comparison.

Authors: We agree that the abstract should be more specific. We have updated the abstract to name the foundation-model baselines, describe their retrieval configurations, and report the concrete efficiency metrics (latency, throughput, and index size) employed in the comparisons. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical evaluation with external metrics

full rationale

The paper reports experimental results on dense retrieval for product search, comparing Hit_Rate@5 and other metrics against BM25 and foundation-model baselines, with gains from ECLASS augmentation. No equations, derivations, fitted parameters renamed as predictions, or self-citation chains appear in the provided abstract or described content. All claims rest on direct measurement against independent baselines and standard IR metrics, with no reduction of outputs to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The work rests on two standard domain assumptions in information retrieval and no free parameters or invented entities.

axioms (2)

domain assumption Dense retrieval embeddings capture semantic similarity between natural-language queries and attribute-centric product descriptions.
Invoked when claiming superiority of LLM-assisted dense retrieval over lexical methods.
domain assumption ECLASS hierarchical metadata supplies a semantic bridge that reduces vocabulary mismatch.
Central premise for the augmentation experiments and performance gains.

pith-pipeline@v0.9.0 · 5479 in / 1205 out tokens · 32399 ms · 2026-05-10T01:21:38.236826+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

28 extracted references · 28 canonical work pages · 2 internal anchors

[1]

Ten years of asset administration shell: Developments, research opportunities, and adoption challenges,

L. Sakurada, F. de la Prieta, and P. Leitao, “Ten years of asset administration shell: Developments, research opportunities, and adoption challenges,”IEEE Access, vol. 13, pp. 127 721–127 741, 2025

work page 2025
[2]

(2026) References & cooperations

ECLASS. (2026) References & cooperations. [Online]. Available: https://eclass.eu/en/about-us/references-cooperations

work page 2026
[3]

(2026) Technical specification

——. (2026) Technical specification. [Online]. Available: https: //eclass.eu/support/technical-specification

work page 2026
[4]

AI agents and agentic AI–navigating a plethora of concepts for future manufacturing,

Y . Ren, Y . Liu, T. Ji, and X. Xu, “AI agents and agentic AI–navigating a plethora of concepts for future manufacturing,”Journal of Manufac- turing Systems, vol. 83, pp. 126–133, 2025

work page 2025
[5]

STaRK: Benchmarking LLM retrieval on textual and relational knowledge bases,

S. Wu, S. Zhao, M. Yasunaga, K. Huang, K. Cao, Q. Huang, V . N. Ioannidis, K. Subbian, J. Zou, and J. Leskovec, “STaRK: Benchmarking LLM retrieval on textual and relational knowledge bases,”Advances in Neural Information Processing Systems, vol. 37, pp. 127 129–127 153, Dec. 2024

work page 2024
[6]

Okapi at TREC,

S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, M. Gat- fordet al., “Okapi at TREC,” 1994

work page 1994
[8]

ACM Trans

Y . Zhu, H. Yuan, S. Wang, J. Liu, W. Liu, C. Deng, H. Chen, Z. Liu, Z. Dou, and J.-R. Wen, “Large language models for information retrieval: A survey,”ACM Transactions on Information Systems, vol. 44, no. 1, pp. 12:1–12:54, 2025. [Online]. Available: https://dl.acm.org/doi/10.1145/3748304

work page doi:10.1145/3748304 2025
[9]

Automated extraction of conditional causal rules from control narratives using logic programming and large language models,

F. C. Kunze, G. Manca, and A. Fay, “Automated extraction of conditional causal rules from control narratives using logic programming and large language models,” in2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETF A), 2025, pp. 1–4

work page 2025
[10]

Leveraging LLMs towards assistant-based support for industrial threat models,

E. Fregnan, C. G ¨ottel, B. Maag, A. Dawoud, and G. Nakas, “Leveraging LLMs towards assistant-based support for industrial threat models,” in 2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETF A), 2025, pp. 1–8

work page 2025
[11]

Leveraging large language models for robust maintenance rule extrac- tion in industrial settings,

N. Tamascelli, N. Bhattacharya, C. Song, R. Borrison, and R. Gitzel, “Leveraging large language models for robust maintenance rule extrac- tion in industrial settings,” in2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETF A), 2025, pp. 1–7

work page 2025
[12]

Why asset administration shells: A survey on uses and challenges,

A. Alexopoulos, G. Kalogeras, K. Koutras, and A. Kalogeras, “Why asset administration shells: A survey on uses and challenges,”IEEE Access, vol. 13, pp. 126 582–126 609, 2025

work page 2025
[13]

Dual data mapping with fine-tuned large language models and asset administration shells to- ward interoperable knowledge representation,

D. Shi, O. Meyer, M. Oberle, and T. Bauernhansl, “Dual data mapping with fine-tuned large language models and asset administration shells to- ward interoperable knowledge representation,”Robotics and Computer- Integrated Manufacturing, vol. 91, p. 102837, 2025

work page 2025
[14]

Generation of asset administration shell with large language model agents: Toward semantic interoperability in digital twins in the context of industry 4.0,

Y . Xia, Z. Xiao, N. Jazdi, and M. Weyrich, “Generation of asset administration shell with large language model agents: Toward semantic interoperability in digital twins in the context of industry 4.0,”IEEE Access, vol. 12, pp. 84 863–84 877, 2024

work page 2024
[15]

Interoperable information modelling leveraging asset administration shell and large language model for quality control toward zero defect manufacturing,

D. Shi, P. Liedl, and T. Bauernhansl, “Interoperable information modelling leveraging asset administration shell and large language model for quality control toward zero defect manufacturing,”Journal of Manufacturing Systems, vol. 77, pp. 678–696, 2024. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0278612524002395

work page 2024
[16]

Generalized embedding models for industry 4.0 applications,

C. Constantinides, S. Lin, and D. C. Patel, “Generalized embedding models for industry 4.0 applications,” inProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, S. Potdar, L. Rojas-Barahona, and S. Montella, Eds. Suzhou (China): Association for Computational Linguistics, Nov. 2025, pp. 2234–2251. [Online]...

work page 2025
[17]

LLMs as sparse retrievers: A framework for first-stage product search,

H. Song, Y . an Liu, R. Zhang, J. Guo, M. de Rijke, S. Li, W. Peng, F. Lv, and X. Cheng, “LLMs as sparse retrievers: A framework for first-stage product search,” 2025. [Online]. Available: https://arxiv.org/abs/2510.18527

work page arXiv 2025
[18]

Hierarchical multi-field representations for two-stage e-commerce retrieval,

N. Freymuth, D. Liu, T. Ricatte, and S. Mansour, “Hierarchical multi-field representations for two-stage e-commerce retrieval,” 2025. [Online]. Available: https://arxiv.org/abs/2501.18707

work page arXiv 2025
[19]

Mod- elling the semantics of data of an asset administration shell with elements of eclass,

A. Belyaev, C. Block, B. Boss, C. Diedrich, P. Juhel, W. Hartmann, O. Hillermeier, N. Ondracek, S. Pfeifer, F. Scherenschlichet al., “Mod- elling the semantics of data of an asset administration shell with elements of eclass,”Joint white paper Plattform Industrie 4.0 and ECLASS, 2021

work page 2021
[20]

(2026) clipx ENGINEER software

Phoenix Contact. (2026) clipx ENGINEER software. [On- line]. Available: https://www.phoenixcontact.com/en-pc/products/ software-clipx-engineer-1272241

work page 2026
[21]

Someone Hid It!

W. X. Zhao, J. Liu, R. Ren, and J.-R. Wen, “Dense text retrieval based on pretrained language models: A survey,”ACM Transactions on Information Systems, vol. 42, no. 4, pp. 89:1–89:60, 2024. [Online]. Available: https://dl.acm.org/doi/10.1145/3637870

work page doi:10.1145/3637870 2024
[22]

Qwen3 technical report,

A. Yang, A. Li, B. Yang, B. Zhang, B. Hui, B. Zheng, B. Yu, C. Gao, C. Huang, C. Lv, C. Zheng, D. Liu, F. Zhou, F. Huang, F. Hu, H. Ge, H. Wei, H. Lin, J. Tang, J. Yang, J. Tu, J. Zhang, J. Yang, J. Yang, J. Zhou, J. Zhou, J. Lin, K. Dang, K. Bao, K. Yang, L. Yu, L. Deng, M. Li, M. Xue, M. Li, P. Zhang, P. Wang, Q. Zhu, R. Men, R. Gao, S. Liu, S. Luo, T. ...

work page
[23]

Qwen3 Technical Report

[Online]. Available: https://arxiv.org/abs/2505.09388

work page internal anchor Pith review Pith/arXiv arXiv
[24]

Qwen3 embedding: Advancing text embedding and reranking through foundation models,

Y . Zhang, M. Li, D. Long, X. Zhang, H. Lin, B. Yang, P. Xie, A. Yang, D. Liu, J. Lin, F. Huang, and J. Zhou, “Qwen3 embedding: Advancing text embedding and reranking through foundation models,”

work page
[25]

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

[Online]. Available: http://arxiv.org/abs/2506.05176

work page internal anchor Pith review arXiv
[26]

(2026) Neo4j graph database

Neo4j, Inc. (2026) Neo4j graph database. [Online]. Available: https://neo4j.com/product/neo4j-graph-database/

work page 2026
[27]

(2026) System card: Claude sonnet 4.6

Anthropic. (2026) System card: Claude sonnet 4.6. [Online]. Available: https://www.anthropic.com/claude-sonnet-4-6-system-card

work page 2026
[28]

(2025) Introducing gpt-4.1 in the api

OpenAI. (2025) Introducing gpt-4.1 in the api. [Online]. Available: https://openai.com/index/gpt-4-1/

work page 2025
[29]

(2025) Update to gpt-5 system card: Gpt-5.2

——. (2025) Update to gpt-5 system card: Gpt-5.2. [Online]. Available: https://deploymentsafety.openai.com/gpt-5-2

work page 2025

[1] [1]

Ten years of asset administration shell: Developments, research opportunities, and adoption challenges,

L. Sakurada, F. de la Prieta, and P. Leitao, “Ten years of asset administration shell: Developments, research opportunities, and adoption challenges,”IEEE Access, vol. 13, pp. 127 721–127 741, 2025

work page 2025

[2] [2]

(2026) References & cooperations

ECLASS. (2026) References & cooperations. [Online]. Available: https://eclass.eu/en/about-us/references-cooperations

work page 2026

[3] [3]

(2026) Technical specification

——. (2026) Technical specification. [Online]. Available: https: //eclass.eu/support/technical-specification

work page 2026

[4] [4]

AI agents and agentic AI–navigating a plethora of concepts for future manufacturing,

Y . Ren, Y . Liu, T. Ji, and X. Xu, “AI agents and agentic AI–navigating a plethora of concepts for future manufacturing,”Journal of Manufac- turing Systems, vol. 83, pp. 126–133, 2025

work page 2025

[5] [5]

STaRK: Benchmarking LLM retrieval on textual and relational knowledge bases,

S. Wu, S. Zhao, M. Yasunaga, K. Huang, K. Cao, Q. Huang, V . N. Ioannidis, K. Subbian, J. Zou, and J. Leskovec, “STaRK: Benchmarking LLM retrieval on textual and relational knowledge bases,”Advances in Neural Information Processing Systems, vol. 37, pp. 127 129–127 153, Dec. 2024

work page 2024

[6] [6]

Okapi at TREC,

S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, M. Gat- fordet al., “Okapi at TREC,” 1994

work page 1994

[7] [8]

ACM Trans

Y . Zhu, H. Yuan, S. Wang, J. Liu, W. Liu, C. Deng, H. Chen, Z. Liu, Z. Dou, and J.-R. Wen, “Large language models for information retrieval: A survey,”ACM Transactions on Information Systems, vol. 44, no. 1, pp. 12:1–12:54, 2025. [Online]. Available: https://dl.acm.org/doi/10.1145/3748304

work page doi:10.1145/3748304 2025

[8] [9]

Automated extraction of conditional causal rules from control narratives using logic programming and large language models,

F. C. Kunze, G. Manca, and A. Fay, “Automated extraction of conditional causal rules from control narratives using logic programming and large language models,” in2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETF A), 2025, pp. 1–4

work page 2025

[9] [10]

Leveraging LLMs towards assistant-based support for industrial threat models,

E. Fregnan, C. G ¨ottel, B. Maag, A. Dawoud, and G. Nakas, “Leveraging LLMs towards assistant-based support for industrial threat models,” in 2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETF A), 2025, pp. 1–8

work page 2025

[10] [11]

Leveraging large language models for robust maintenance rule extrac- tion in industrial settings,

N. Tamascelli, N. Bhattacharya, C. Song, R. Borrison, and R. Gitzel, “Leveraging large language models for robust maintenance rule extrac- tion in industrial settings,” in2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETF A), 2025, pp. 1–7

work page 2025

[11] [12]

Why asset administration shells: A survey on uses and challenges,

A. Alexopoulos, G. Kalogeras, K. Koutras, and A. Kalogeras, “Why asset administration shells: A survey on uses and challenges,”IEEE Access, vol. 13, pp. 126 582–126 609, 2025

work page 2025

[12] [13]

Dual data mapping with fine-tuned large language models and asset administration shells to- ward interoperable knowledge representation,

D. Shi, O. Meyer, M. Oberle, and T. Bauernhansl, “Dual data mapping with fine-tuned large language models and asset administration shells to- ward interoperable knowledge representation,”Robotics and Computer- Integrated Manufacturing, vol. 91, p. 102837, 2025

work page 2025

[13] [14]

Generation of asset administration shell with large language model agents: Toward semantic interoperability in digital twins in the context of industry 4.0,

Y . Xia, Z. Xiao, N. Jazdi, and M. Weyrich, “Generation of asset administration shell with large language model agents: Toward semantic interoperability in digital twins in the context of industry 4.0,”IEEE Access, vol. 12, pp. 84 863–84 877, 2024

work page 2024

[14] [15]

Interoperable information modelling leveraging asset administration shell and large language model for quality control toward zero defect manufacturing,

D. Shi, P. Liedl, and T. Bauernhansl, “Interoperable information modelling leveraging asset administration shell and large language model for quality control toward zero defect manufacturing,”Journal of Manufacturing Systems, vol. 77, pp. 678–696, 2024. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0278612524002395

work page 2024

[15] [16]

Generalized embedding models for industry 4.0 applications,

C. Constantinides, S. Lin, and D. C. Patel, “Generalized embedding models for industry 4.0 applications,” inProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, S. Potdar, L. Rojas-Barahona, and S. Montella, Eds. Suzhou (China): Association for Computational Linguistics, Nov. 2025, pp. 2234–2251. [Online]...

work page 2025

[16] [17]

LLMs as sparse retrievers: A framework for first-stage product search,

H. Song, Y . an Liu, R. Zhang, J. Guo, M. de Rijke, S. Li, W. Peng, F. Lv, and X. Cheng, “LLMs as sparse retrievers: A framework for first-stage product search,” 2025. [Online]. Available: https://arxiv.org/abs/2510.18527

work page arXiv 2025

[17] [18]

Hierarchical multi-field representations for two-stage e-commerce retrieval,

N. Freymuth, D. Liu, T. Ricatte, and S. Mansour, “Hierarchical multi-field representations for two-stage e-commerce retrieval,” 2025. [Online]. Available: https://arxiv.org/abs/2501.18707

work page arXiv 2025

[18] [19]

Mod- elling the semantics of data of an asset administration shell with elements of eclass,

A. Belyaev, C. Block, B. Boss, C. Diedrich, P. Juhel, W. Hartmann, O. Hillermeier, N. Ondracek, S. Pfeifer, F. Scherenschlichet al., “Mod- elling the semantics of data of an asset administration shell with elements of eclass,”Joint white paper Plattform Industrie 4.0 and ECLASS, 2021

work page 2021

[19] [20]

(2026) clipx ENGINEER software

Phoenix Contact. (2026) clipx ENGINEER software. [On- line]. Available: https://www.phoenixcontact.com/en-pc/products/ software-clipx-engineer-1272241

work page 2026

[20] [21]

Someone Hid It!

W. X. Zhao, J. Liu, R. Ren, and J.-R. Wen, “Dense text retrieval based on pretrained language models: A survey,”ACM Transactions on Information Systems, vol. 42, no. 4, pp. 89:1–89:60, 2024. [Online]. Available: https://dl.acm.org/doi/10.1145/3637870

work page doi:10.1145/3637870 2024

[21] [22]

Qwen3 technical report,

A. Yang, A. Li, B. Yang, B. Zhang, B. Hui, B. Zheng, B. Yu, C. Gao, C. Huang, C. Lv, C. Zheng, D. Liu, F. Zhou, F. Huang, F. Hu, H. Ge, H. Wei, H. Lin, J. Tang, J. Yang, J. Tu, J. Zhang, J. Yang, J. Yang, J. Zhou, J. Zhou, J. Lin, K. Dang, K. Bao, K. Yang, L. Yu, L. Deng, M. Li, M. Xue, M. Li, P. Zhang, P. Wang, Q. Zhu, R. Men, R. Gao, S. Liu, S. Luo, T. ...

work page

[22] [23]

Qwen3 Technical Report

[Online]. Available: https://arxiv.org/abs/2505.09388

work page internal anchor Pith review Pith/arXiv arXiv

[23] [24]

Qwen3 embedding: Advancing text embedding and reranking through foundation models,

Y . Zhang, M. Li, D. Long, X. Zhang, H. Lin, B. Yang, P. Xie, A. Yang, D. Liu, J. Lin, F. Huang, and J. Zhou, “Qwen3 embedding: Advancing text embedding and reranking through foundation models,”

work page

[24] [25]

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

[Online]. Available: http://arxiv.org/abs/2506.05176

work page internal anchor Pith review arXiv

[25] [26]

(2026) Neo4j graph database

Neo4j, Inc. (2026) Neo4j graph database. [Online]. Available: https://neo4j.com/product/neo4j-graph-database/

work page 2026

[26] [27]

(2026) System card: Claude sonnet 4.6

Anthropic. (2026) System card: Claude sonnet 4.6. [Online]. Available: https://www.anthropic.com/claude-sonnet-4-6-system-card

work page 2026

[27] [28]

(2025) Introducing gpt-4.1 in the api

OpenAI. (2025) Introducing gpt-4.1 in the api. [Online]. Available: https://openai.com/index/gpt-4-1/

work page 2025

[28] [29]

(2025) Update to gpt-5 system card: Gpt-5.2

——. (2025) Update to gpt-5 system card: Gpt-5.2. [Online]. Available: https://deploymentsafety.openai.com/gpt-5-2

work page 2025