Git Product home page Git Product logo

sdm's Introduction

SDM: Sequential Deep Matching Model for Online Large-scale Recommender System

New Released Code!!!

Thanks to the DeepMatch Group members for providing doc and code.

Demo Code

Code (Python2.7, TF1.4) of the sequential deep matching (SDM) model for recommender system at Taobao. Current version only contains the core code of our model. The processes of data processing and evaluation are executed on our internal cloud platform ODPS.

Paper

Here is the arxiv link (accepted by CIKM 2019)

Citation:

@inproceedings{lv2019sdm,
  title={SDM: Sequential deep matching model for online large-scale recommender system},
  author={Lv, Fuyu and Jin, Taiwei and Yu, Changlong and Sun, Fei and Lin, Quan and Yang, Keping and Ng, Wilfred},
  booktitle={Proceedings of the 28th ACM International Conference on Information and Knowledge Management},
  pages={2635--2643},
  year={2019},
  organization={ACM}
}

Dataset

JD Dataset: raw data, train and test data in the paper (tfrecord). The schema of raw data is shown in data/sample_data/.

Disclaimer

This is an implementation on experiment of offline JD dataset rather than the online official version. There may be differences between results reported in the paper and the released one, because the former one is achieved in distribution tensorflow on our internal deep learning platform PAI.

sdm's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

sdm's Issues

请问验证时的训练、测试的数据怎么定义?能否给出完整示例?

\SDM-master\code\train\run.py

def run_validating(hparams, model):
acc_keys = ["user_id", "ds", "user_embedding_output"]

#  user defined function
#  you should write your own code here for reading and writing data
**train_file = get_your_train_files()
test_file = get_your_test_files()
writer = open_your_test_result_file()**

不太明白需要什么样的数据格式,是否需要转换,恳请指教。

label generation logic of JD offline datasets

Hi,fuyu:
Great job for this match framework. However,I'm a little confused about the label generation logic of JD offline datasets as provided in your codes.
If convenient, would you please upload the preprocessing script of the offline dataset. Thx a lot.

数据集问题

请问你们京东数据集中各个子段的含义是什么,比如说action.csv中的type的每个取值是什么意思

prefer feature choose and tfrecord file generate code

hi, thanks you for open source code. there is two problems

  1. is the same prefer feature filtered, such as item_id, brand_id, but not for short feature?
  2. Is it convenient to provide the code to generate tfrecord from the original csv file?

Astonishing work! How could you come up with it?

I'm a new one who comes first into the recommender system. I happen to find this work and believe it is excellent and valuable. I want to learn from you and could you impart me some experience?

Datasets

Please upload the datasets as well.

num_labels含义

您好,最近在研究您的sdm源码,有一点疑惑:样例数据中num_labels=5,是把每个session最后的5个item作为目标吗?
image
通过sampled_softmax_loss负采样,inputs=rnn_outputs_split,是用户长短兴趣结合后各个时刻embedding组成的矩阵,那么这边的loss是session中各个时刻都会针对目标num_labels=5进行负采样,然后将所有loss相加?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.