Git Product home page Git Product logo

irene's Introduction

Sr. Software Engineer, Fortanix

Previously:

  • Assistant Professor of Computer Science at Middlebury College (2020-2022)
  • Visiting Assistant Professor at Smith College (2018-2020)
  • Research Assistant, PhD Student at UMass Amherst (2013-2019)

irene's People

Contributors

dependabot[bot] avatar jjfiv avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

irene's Issues

TREC CAR 2018

Creating an issue so I can group together commits that work on this task.

TREC News 2018

Creating an issue so I can group together commits that work on this task.

Galago-compatible query parsing

Want people like @daltonj who have existing libraries to generate string queries to be able to ingest them.

Conflicts a little with #3, but ultimately this belongs in the irene-galago subproject.

Galago-style HTML front-end

With #4 we ought to be able to support a front-end like that of Galago, which will render detailed queries and present results, snippets, etc.

  • results
  • Jetty versions with Galago
  • parsing of Galago queries #4
  • How to render Lucene's more free-form documents?
  • snippets

Minimize dependency on Galago

Frequency of import/typealias in src/main.

  18 import org.lemurproject.galago.utility.Parameters
   4 import org.lemurproject.galago.utility.StreamCreator
   4 import org.lemurproject.galago.core.eval.QueryResults
   3 import org.lemurproject.galago.core.eval.SimpleEvalDoc
   2 import org.lemurproject.galago.utility.MathUtils
   2 import org.lemurproject.galago.core.retrieval.LocalRetrieval
   2 import org.lemurproject.galago.core.index.stats.FieldStatistics
   2 import org.lemurproject.galago.core.eval.metric.QueryEvaluatorFactory
   2 import org.lemurproject.galago.core.eval.QueryJudgments
   1 typealias GResults = org.lemurproject.galago.core.retrieval.Results
   1 typealias GExpr = org.lemurproject.galago.core.retrieval.query.Node
   1 typealias GDocArgs = org.lemurproject.galago.core.parse.Document.DocumentComponents
   1 typealias GDoc = org.lemurproject.galago.core.parse.Document
   1 import org.lemurproject.galago.utility.lists.Ranked
   1 import org.lemurproject.galago.utility.StringPooler
   1 import org.lemurproject.galago.core.util.WordLists
   1 import org.lemurproject.galago.core.retrieval.query.NodeParameters
   1 import org.lemurproject.galago.core.retrieval.query.AnnotatedNode
   1 import org.lemurproject.galago.core.retrieval.processing.ScoringContext
   1 import org.lemurproject.galago.core.retrieval.iterator.TransformIterator
   1 import org.lemurproject.galago.core.retrieval.iterator.ScoreIterator
   1 import org.lemurproject.galago.core.retrieval.iterator.CountIterator
   1 import org.lemurproject.galago.core.retrieval.Retrieval
   1 import org.lemurproject.galago.core.parse.TagTokenizer
   1 import org.lemurproject.galago.core.index.stats.NodeStatistics
   1 import org.lemurproject.galago.core.index.corpus.CorpusReader
   1 import org.lemurproject.galago.core.eval.metric.QueryEvaluator
   1 import org.lemurproject.galago.core.eval.QuerySetJudgments

LuceneExpr

  • Crashes when a segment does not have any results -- this returns a null Scorer, I think.

Move to multi-module maven project

This blocks everything when it is done.

I'm envisioning the following sub-projects.

  • irene-lang No dependencies, just our CountStats and QExpr classes and our query compilation passes.
  • irene-ltr depends irene-lang, our LTRDoc and LTRDocField and re-rank context scoring. Regressions against Galago can be done in test scope.
  • irene-galago depends on both galago-core and irene-lang. Galago's Nodes <-> our QExpr will be supported here.
  • irene-lucene depends on both irene-lang and lucene.
  • irene-full depends on all projects, has graphical front-end, etc.

JS, Native-code backends

Once we split out the language to different models (#8), the language will be pure Kotlin and compilable with both Kotlin-js and Kotlin-native.

This means (potentially) we can run re-ranking queries in the browser for the user and pursue an efficient Indri (et al?) backend that shares code without paying the JNI price to use indri-java.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.