MALM: Mask Augmentation based Local Matching for Food-Recipe Retrieval

Bhanu Prakash Voutharoja, Peng Wang, Lei Wang, Vivienne Guan · 2023 · arXiv 2305.11327

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

SIMMER: Cross-Modal Food Image--Recipe Retrieval via MLLM-Based Embedding

cs.CV · 2026-04-17 · unverdicted · novelty 6.0

SIMMER uses a single multimodal LLM (VLM2Vec) with custom prompts and partial-recipe augmentation to embed food images and recipes, achieving new state-of-the-art retrieval accuracy on Recipe1M.

citing papers explorer

Showing 1 of 1 citing paper.

SIMMER: Cross-Modal Food Image--Recipe Retrieval via MLLM-Based Embedding

cs.CV · 2026-04-17 · unverdicted · none · ref 46
