Git Product home page Git Product logo

autoanki's Introduction

AutoAnki

This project aims to automate the process of creating cloze notes using a combination of existing research and new findings in the field of Natural Language Processing. It will at first aims to work alongside Anki as an addon.

The target language currently is English.

Pipeline

In this section we shall discuss about the different modules that makes up the process. To put it shortly the pipeline is designed as such:

  • Parsing from a Source & Preprocessing
  • Learn key information from the text and generate clozes
  • Postprocessing
  • Export to Anki

Parsing from a Source & Preprocessing

The scope of this project limits itself to text based content. For example, we will not attempt to automatically generate clozes for :

  • Schematics
  • Mathemathical Equations
  • Anatomy
  • etc.

At first, we will get content from verified Wikipedia articles. The parsing should be relatively straightforward, we simply want to extract the text, clean it if possible: remove hyperlinks, superscripts, etc. anything that brings no value to learning key information from a text or that is too hard or exotic for models to process.

Some preprocessing can also be done. A model could actually be used here as well. The idea is to remove ambiguity, like replacing pronouns with the actual nouns they represent, or shortening long sentences into a collection of shorter ones.

autoanki's People

Contributors

aleksacrveni avatar lthiet avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

aleksacrveni

autoanki's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.