This repository contains the code for our winning solution to the KDD Cup 2020 Reinforcement Learning track.
In this competition, we used a framework that combines the KM (Kuhn-Munkres) algorithm with temporal-difference learning. We finished in 5th place on task 1 and 4th place on task 2.
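For readers unfamiliar with the two ingredients, the following is a minimal, self-contained sketch of how they can fit together. It is not the code in this repository. The function names (`td0_update`, `km_max_matching`), the state labels, the reward values, and the idea of scoring each candidate pair with a TD-learned value are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def td0_update(values: Dict[str, float], state: str, reward: float,
               next_state: str, alpha: float = 0.1, gamma: float = 0.9) -> float:
    """One tabular TD(0) step: V(s) += alpha * (r + gamma * V(s') - V(s))."""
    v = values.get(state, 0.0)
    target = reward + gamma * values.get(next_state, 0.0)
    values[state] = v + alpha * (target - v)
    return values[state]


def km_max_matching(weights: List[List[float]]) -> List[Tuple[int, int]]:
    """Maximum-weight assignment (Kuhn-Munkres) for an n x m matrix, n <= m.

    Returns sorted (row, column) pairs; every row is matched to one column.
    """
    n, m = len(weights), len(weights[0])
    assert n <= m, "expects at most as many rows as columns"
    inf = float("inf")
    # Internally minimize cost = -weight, using 1-based indices and potentials.
    u = [0.0] * (n + 1)      # row potentials
    v = [0.0] * (m + 1)      # column potentials
    p = [0] * (m + 1)        # p[j] = row matched to column j (0 = unmatched)
    way = [0] * (m + 1)      # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = -weights[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        # Flip the matching along the augmenting path that was found.
        while j0:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    return sorted((p[j] - 1, j - 1) for j in range(1, m + 1) if p[j] != 0)


# Hypothetical usage: learn a state value, then match rows to columns.
values: Dict[str, float] = {}
td0_update(values, "A", reward=1.0, next_state="B", alpha=0.5)
# Hypothetical scores for 2 orders (rows) x 2 drivers (columns).
scores = [[5.0, 4.0], [6.0, 1.0]]
matching = km_max_matching(scores)
```

In this toy example a greedy per-row choice would assign both orders to the first driver, whereas the KM step returns the one-to-one assignment with the largest total score. In a dispatching setting the entries of `scores` could, for instance, be built from an immediate reward plus a discounted TD-learned value of the destination, but that particular weighting is an assumption and is not taken from this repository.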
More details will be provided if anyone is interested.