Git Product home page Git Product logo

ltr-tools's Introduction

Learning To Rank Tools Project

Description

This project is containing both the ltr-tools cli and the solr configuration to provide backend search functionalities. The main scope of this project will be to provide usable tools to help with Learning To Rank integrations and developments.

The ltr-tools is a command line interface to run a set of utility tasks to visualize, debug and understand better Learning To Rank models and training sets. The ltr-tools project is meant to work with a Solr backend, related configuration are in the configuration project ( and they are used in the tests).

Tools available so far :

  1. Model Indexer

This tool allow to index the splitting point of your lambdaMART model. Each splitting branch point is indexed. This allow to visualize the most occurring features and thresholds in the branches. Can be really useful to :

  • find anomalies in the model and in the training set( e.g. thresholds that don't make sense)
  • find important features and thresholds
  1. Training Set Indexer

This tool allow to index the training set. Each document is a training sample with its feature vector. It is really useful to explore and possibly visualize the training set. It could be important also to validate specific leaf paths from the model

  1. Top Scoring Leaves Viewer

This tool visits the lambdaMART model and prints the top scoring leaves. It is really useful to find out the most promising paths in the trees of the ensemble. Can be used to discuss with the business people if the most promising paths make sense in the domain .

BUILD ( includes Unit Tests )

gradle build

UNIT TESTS

gradle test

Reports

/ltr-tools/build/reports/test/index.html

Usage

  1. First of all you need your Solr running, install a Solr version ( 6.0 would be fine but it is not a requirement) and copy the configuration from the configuration project. This will create 2 collections : models and trainingSet. models will be used to index the LambdaMART models . trainingSet will be used to index your training set.

  2. Help

    java -jar ltr-tools-1.0.jar -help

  3. Model Indexer

    java -jar ltr-tools-1.0.jar -tool modelIndexer -model /models/lambdaMARTModel1.json -solrURL http://localhost:8983/solr/models

  4. Training set Indexer

    java -jar ltr-tools-1.0.jar -tool trainingSetIndexer -trainingSet /trainingSets/training1.txt -features /featureMappings/feature-mapping1.json -categoricalFeatures /feature/categoricalFeatures1.txt -solrURL http://localhost:8983/solr/trainingSet

  5. Top Scoring Leaves Viewer

    java -jar ltr-tools-1.0.jar -tool topScoringLeavesViewer -model /models/lambdaMARTModel1.json -topK 10

More details are available here : https://sease.io/2016/08/solr-is-learning-to-rank-better-part-3-ltr-tools.html

ltr-tools's People

Contributors

agazzarini avatar alessandrobenedetti avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.