DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo; Ruiming Tang; Xiuqiang He; Yunming Ye; Zhenguo Li

arxiv: 1703.04247 · v1 · pith:IVP24ROLnew · submitted 2017-03-13 · 💻 cs.IR · cs.CL

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Huifeng Guo , Ruiming Tang , Yunming Ye , Zhenguo Li , Xiuqiang He This is my paper

classification 💻 cs.IR cs.CL

keywords featuredeepfmlearningdeepinteractionsmodeldataengineering

0 comments

read the original abstract

Learning sophisticated feature interactions behind user behaviors is critical in maximizing CTR for recommender systems. Despite great progress, existing methods seem to have a strong bias towards low- or high-order interactions, or require expertise feature engineering. In this paper, we show that it is possible to derive an end-to-end learning model that emphasizes both low- and high-order feature interactions. The proposed model, DeepFM, combines the power of factorization machines for recommendation and deep learning for feature learning in a new neural network architecture. Compared to the latest Wide \& Deep model from Google, DeepFM has a shared input to its "wide" and "deep" parts, with no need of feature engineering besides raw features. Comprehensive experiments are conducted to demonstrate the effectiveness and efficiency of DeepFM over the existing models for CTR prediction, on both benchmark data and commercial data.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 23 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Next-Scale Generative Reranking: A Tree-based Generative Rerank Method at Meituan
cs.IR 2026-04 unverdicted novelty 7.0

NSGR is a tree-structured generative reranker that progressively generates optimal lists via next-scale expansion and multi-scale neighbor loss to balance perspectives and align training signals.
Generative Long-term User Interest Modeling for Click-Through Rate Prediction
cs.IR 2026-05 unverdicted novelty 6.0

GenLI generates diverse target-independent interest distributions via an IGM, retrieves behaviors with O(1) lookup in BRM, and fuses via IFM gating to balance accuracy and efficiency in CTR prediction.
Understanding DNNs in Feature Interaction Models: A Dimensional Collapse Perspective
cs.LG 2026-04 unverdicted novelty 6.0

DNNs mitigate dimensional collapse of embeddings in feature interaction models, shown via parallel and stacked experiments plus gradient analysis.
Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations
cs.IR 2026-04 unverdicted novelty 6.0

LLMs exhibit mid-layer representation advantage for recommendations; MARC compresses representations modularly to reduce costs while improving performance, as shown in a large-scale online advertising deployment.
MBGR: Multi-Business Prediction for Generative Recommendation at Meituan
cs.IR 2026-04 unverdicted novelty 6.0

MBGR is a new generative recommendation framework using business-aware semantic IDs, multi-business prediction, and label dynamic routing to handle multiple businesses without seesaw effects or representation confusio...
Mixture of Sequence: Theme-Aware Mixture-of-Experts for Long-Sequence Recommendation
cs.IR 2026-03 unverdicted novelty 6.0

MoS applies theme-aware routing to extract multi-scale theme-specific subsequences from noisy long user sequences, achieving state-of-the-art recommendation performance with fewer FLOPs than comparable MoE models.
Performance-Driven QUBO for Recommender Systems on Quantum Annealers
cs.IR 2024-10 unverdicted novelty 6.0

PDQUBO is a new performance-driven QUBO method for feature selection in recommender systems that incorporates counterfactual performance impacts of features and pairs, is model-agnostic, and outperforms prior quantum ...
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents
cs.LG 2026-05 unverdicted novelty 5.0

PACEvolve++ uses a phase-adaptive reinforcement learning advisor to decouple hypothesis selection from execution in LLM-driven evolutionary search, delivering faster convergence than prior frameworks on load balancing...
Light-FMP: Lightweight Feature and Model Pruning for Enhanced Deep Recommender Systems
cs.IR 2026-05 unverdicted novelty 5.0

Light-FMP prunes features and model parameters in deep recommender systems by pretraining a hard-concrete masking layer on data subsets, then retraining the reduced model to improve both efficiency and accuracy over p...
Beyond Dense Connectivity: Explicit Sparsity for Scalable Recommendation
cs.IR 2026-04 unverdicted novelty 5.0

SSR uses static random filters and iterative competitive sparse mechanisms to explicitly enforce sparsity in recommendation models, outperforming dense baselines on public and billion-scale industrial datasets.
Revisiting Content-Based Music Recommendation: Efficient Feature Aggregation from Large-Scale Music Models
cs.IR 2026-02 unverdicted novelty 5.0

TASTE dataset and MuQ-token aggregation enable effective use of audio features from large music models to improve content-based music recommendations over collaborative filtering alone.
Decoupled Multimodal Fusion for User Interest Modeling in Click-Through Rate Prediction
cs.IR 2025-10 conditional novelty 5.0

DMF adds target-aware bridging features and an inference-optimized decoupled attention layer to combine modality-centric and modality-enriched user interest modeling for CTR prediction.
OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment
cs.IR 2025-02 unverdicted novelty 5.0

OneRec unifies retrieval and ranking in a generative recommender using session-wise decoding and iterative DPO-based preference alignment, achieving real-world gains on Kuaishou.
Retrieval-Augmented Generation with Graphs (GraphRAG)
cs.IR 2024-12 unverdicted novelty 5.0

A survey proposing a holistic GraphRAG framework with components including query processor, retriever, organizer, generator, and data source, plus domain-tailored reviews, challenges, and future directions.
CCNETS: A Modular Causal Learning Framework for Pattern Recognition in Imbalanced Datasets
cs.LG 2024-01 unverdicted novelty 5.0

CCNETS is a new modular causal framework using three cooperative modules and a Zoint mechanism to align synthetic data generation with classifier needs on imbalanced pattern recognition tasks.
Cloudless-Training: A Framework to Improve Efficiency of Geo-Distributed ML Training
cs.DC 2023-03 unverdicted novelty 5.0

Cloudless-Training proposes a two-layer serverless framework with elastic scheduling and two new synchronization strategies (ASGD-GA and inter-PS model averaging) that reports 9.2-24% cost reduction and up to 1.7x spe...
Building a privacy-preserving Federated Recommender system for mobile devices
cs.LG 2026-05 unverdicted novelty 4.0

Presents a two-stage federated recommendation pipeline that runs collaborative filtering on non-sensitive data in the cloud and re-ranks candidates on-device using sensitive mobile signals.
RecGPT-Mobile: On-Device Large Language Models for User Intent Understanding in Taobao Feed Recommendation
cs.IR 2026-05 unverdicted novelty 4.0

RecGPT-Mobile runs a compact LLM on phones to understand evolving user intent from behaviors and improve mobile e-commerce recommendations.
Deep Situation-Aware Interaction Network for Click-Through Rate Prediction
cs.IR 2026-04 unverdicted novelty 4.0

DSAIN introduces situational features and tri-directional fusion to enhance behavior sequence modeling for CTR prediction, delivering 2.7% CTR, 2.62% CPM, and 2.16% GMV lifts in online A/B tests on the Meituan platform.
Joint Model Parameter Scaling and Universal-Domain Data Integration for E-commerce Search Ranking
cs.IR 2026-03 unverdicted novelty 4.0

UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.
Exploring Vision Neural Network Pruning via Screening Methodology
cs.LG 2025-02 unverdicted novelty 4.0

A unified F-statistic screening and weighted evaluation method prunes both unstructured and structured parameters in FNNs and CNNs, claiming order-of-magnitude size reduction with competitive accuracy on vision datasets.
A Unified Framework for Modeling Heterogeneous Financial Data via Dual-Granularity Prompting
cs.CE 2024-04 unverdicted novelty 4.0

FinLangNet applies dual-granularity prompting in a sequential model to heterogeneous financial data, reporting 6.3 pp KS improvement and 9.9% bad debt reduction in real-world deployment.
Infer Implicit Contexts in Real-time Online-to-Offline Recommendation
cs.IR 2019-07 unverdicted novelty 4.0

MACDAE infers implicit contexts via a constrained autoencoder and integrates them into an end-to-end O2O recommender, reporting gains on Yelp/Dianping/Koubei and 2.9%/5.6% lifts in online CTR/conversion.