Granite Embedding Models

Aashka Trivedi; Abraham Daniels; Arafat Sultan; Avirup Sil; Bhavani Iyer; David Cox; Gabe Goodhart; Jaydeep Sen; Kate Soule; Luis Lastras

arxiv: 2502.20204 · v1 · pith:NBMNLDYJnew · submitted 2025-02-27 · 💻 cs.IR · cs.CL

Granite Embedding Models

Parul Awasthy , Aashka Trivedi , Yulong Li , Mihaela Bornea , David Cox , Abraham Daniels , Martin Franz , Gabe Goodhart

show 14 more authors

Bhavani Iyer Vishwajeet Kumar Luis Lastras Scott McCarley Rudra Murthy Vignesh P Sara Rosenthal Salim Roukos Jaydeep Sen Sukriti Sharma Avirup Sil Kate Soule Arafat Sultan Radu Florian

This is my paper

classification 💻 cs.IR cs.CL

keywords modelsembeddingretrievalgranitelayerpubliclytasksallowing

0 comments

read the original abstract

We introduce the Granite Embedding models, a family of encoder-based embedding models designed for retrieval tasks, spanning dense-retrieval and sparse retrieval architectures, with both English and Multilingual capabilities. This report provides the technical details of training these highly effective 12 layer embedding models, along with their efficient 6 layer distilled counterparts. Extensive evaluations show that the models, developed with techniques like retrieval oriented pretraining, contrastive finetuning, knowledge distillation, and model merging significantly outperform publicly available models of similar sizes on both internal IBM retrieval and search tasks, and have equivalent performance on widely used information retrieval benchmarks, while being trained on high-quality data suitable for enterprise use. We publicly release all our Granite Embedding models under the Apache 2.0 license, allowing both research and commercial use at https://huggingface.co/collections/ibm-granite.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Structure Retention in Embedding Spaces as a Predictor of Benchmark Performance
cs.CL 2026-05 unverdicted novelty 6.0

Embedding model performance on MTEB tasks correlates strongly with nearest-neighbor overlap and ICA magnitude differences in their embedding spaces.
Search-R3: Unifying Reasoning and Embedding in Large Language Models
cs.CL 2025-10 unverdicted novelty 5.0

Search-R3 trains LLMs to output search embeddings as a direct product of step-by-step reasoning via supervised pre-training and a specialized RL environment that avoids full corpus re-encoding.