FuxiCTR
FuxiCTR copied to clipboard
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
data:image/s3,"s3://crabby-images/90dbf/90dbfd9d1cbfe35b040c953d55b36b78acb8760f" alt="Logo"
Click-through rate (CTR) prediction is a critical task for many industrial applications such as online advertising, recommender systems, and sponsored search. FuxiCTR provides an open-source library for CTR prediction, with key features in configurability, tunability, and reproducibility. We hope this project could benefit both researchers and practitioners with the goal of open benchmarking for CTR prediction.
This repo is the community dev version of the original release at huawei-noah/benchmark/FuxiCTR.
:bell: If you find our code or benchmarks helpful in your research, please kindly cite the following papers.
Jieming Zhu, Jinyang Liu, Shuai Yang, Qi Zhang, Xiuqiang He. Open Benchmarking for Click-Through Rate Prediction. The 30th ACM International Conference on Information and Knowledge Management (CIKM), 2021. [Bibtex]
Jieming Zhu, Quanyu Dai, Liangcai Su, Rong Ma, Jinyang Liu, Guohao Cai, Xi Xiao, Rui Zhang. BARS: Towards Open Benchmarking for Recommender Systems. The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022. [Bibtex]
Key Features
-
Configurable: Both data preprocessing and models are modularized and configurable.
-
Tunable: Models can be automatically tuned with easy configuration.
-
Reproducible: All the benchmarks can be easily reproduced.
Model List
- :point_right: Check reusable dataset splits for CTR prediction.
- :point_right: Check benchmarking configurations and steps.
- :point_right: Check BARS benchmark website.
Installation
Please follow the guide for installation. In particular, FuxiCTR has the following dependent requirements.
- python 3.6
- pytorch v1.0/v1.1
- pyyaml >=5.1
- scikit-learn
- pandas
- numpy
- h5py
- tqdm
Tutorials | 中文教程
-
Run the demo to understand the overall workflow
-
How to use dataset and model config files
-
How to preprocess raw csv data to h5 data
-
How to use h5 data as input
-
How to make configurations?
-
How to tune the model hyper-parameters via grid search
-
How to use sequence features
-
How to load pretrained embeddings as features
API Documentation
Check an overview of code structure for details on API design.
Discussion
Welcome to join our WeChat group for any question and discussion.
Join Us
We have open positions for internships and full-time jobs. If you are interested in research and practice in recommender systems, please send your CV to [email protected].