GuideCAD generates 3D CAD models from text-image pairs by using a mapping network to feed prefix embeddings into a pretrained LLM, achieving quality comparable to fine-tuning with roughly 4x fewer parameters and 2x the training efficiency.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
2 Pith papers cite this work. Polarity classification is still indexing.
Bridging Short Videos and Live Streams: Reasoning-Guided Multimodal LLMs for Cross-Domain Representation Learning
RGCD-Rep distills cross-domain reasoning from a frozen MLLM teacher and learns decomposed transferable item representations via two-stage training, yielding gains in offline experiments and production A/B tests on a live streaming platform.