Git Product home page Git Product logo

booking's Introduction

Booking.com provides a unique dataset based on millions of real anonymized bookings to encourage the research on sequential recommendation problems. Many travelers go on trips which include more than one destination. Our mission at Booking.com is to make it easier for everyone to experience the world, and we can help to do that by providing real-time recommendations for what their next in-trip destination will be. By making accurate predictions, we help deliver a frictionless trip planning experience. Teams are encouraged to compete and submit their trip predictions before January 28th 2021 to qualify for WSDM WebTour challenge.

The main challenge is to use a dataset based on millions of real anonymized accommodation reservations to come up with a strategy for making the best recommendation for their next destination in real-time.

Dataset


The training dataset consists of over a million (1,166,835) of anonymized hotel reservations, based on real data, with the following features:

  • user_id - User ID
  • checkin - Reservation check-in date
  • checkout - Reservation check-out date- created_date - Date when the reservation was made
  • affiliate_id - An anonymized ID of affiliate channels where the booker came from (e.g. direct, some third party referrals, paid search engine, etc.)
  • device_class - desktop/mobile
  • booker_country - Country from which the reservation was made (anonymized)
  • hotel_country - Country of the hotel (anonymized)
  • city_id - city_id of the hotel's city (anonymized)
  • utrip_id - Unique identification of user's trip (a group of multi-destinations bookings within the same trip).

Each reservation is a part of a customer's trip (identified by utrip_id) which includes consecutive reservations. The evaluation dataset is constructed similarly (378,667 reservations), however the city_id (and the country) of the final reservation of each trip is concealed and requires a prediction.

Evaluation Criteria


The main challenge is to predict (and recommend) the final city (city_id) of each trip (utrip_id). The quality of the predictions is evaluated based on the top four recommended cities for each trip by using Accuracy@Top 4 metric (4 representing the four suggestion slots at Booking.com website). When the true city is one of the top 4 suggestions (regardless of the order), it is considered correct.

Attachments


  • train_set.csv - Training dataset
  • test_set.csv - Validation test set data (with a concealed last destination) as used in Booking.com WSDM WebTour challenge
  • ground_truth.csv - The true values of the test set
  • submission.csv - an example submission for the test data
  • evaluation_demo.ipynb - a Jupyter notebook exampling train set loading, submission generation for test set and evaluation function

Terms and Conditions


  • The dataset is a property of Booking.com and may not be reused for commercial purposes.
  • The dataset may not be used in a manner that is harmful or competitive in nature with Booking.com or Booking Holdings.
  • The dataset may not be used in any manner or for any purpose that may violate any law or regulation.

booking

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.