Gina

isi ka naam hai

Gina is a rule-based program that builds translation pairs (English to Hindi) from parallel corpora. Given corresponding English-Hindi data, she derives correspondence pairs using two things: 1) her initial lexicon, updated with every learnt pair, whose entries 'overlap' with newly encountered data; 2) an understanding of basic syntactic ordering in Hindi and English.

Both of these premises severely constrain Gina's capacity for learning as well as her accuracy. However, this is only a rudimentary implementation of one possible approach to lexicon building, one that engages with problems such as translating from a genderless to a gendered language, or from a non-ergative to a partially ergative language. Given the simplistic setup, Gina recognizes and learns only four syntactic categories: nouns, pre/postpositions, verbs and adjectives. The approach can, however, be extended to certain (sub)categories (like verb auxiliaries and plural nouns), but not to others (like adverbs, in most cases).
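
The overlap idea described above can be sketched in a few lines of Python. This is only an illustration, not Gina's actual code: the function name `learn_by_overlap`, the dictionary-based lexicon and the romanized toy sentences are all assumptions made for the example. It removes every word pair the lexicon already explains and, if exactly one word is left over on each side, pairs those two words by elimination.

```python
def learn_by_overlap(english, hindi, lexicon):
    """Deduce one new (english, hindi) pair by elimination, or return None.

    Hypothetical helper: `lexicon` is assumed to be a plain dict that maps
    a lowercase English word to a single Hindi word.
    """
    english_words = english.lower().split()
    remaining_hindi = hindi.lower().split()
    unknown_english = []

    for word in english_words:
        known = lexicon.get(word)
        if known is not None and known in remaining_hindi:
            # The lexicon already explains this word: cross it off.
            remaining_hindi.remove(known)
        else:
            unknown_english.append(word)

    # Only learn when the leftover words are unambiguous.
    if len(unknown_english) == 1 and len(remaining_hindi) == 1:
        pair = (unknown_english[0], remaining_hindi[0])
        lexicon[pair[0]] = pair[1]
        return pair
    return None


if __name__ == "__main__":
    # Toy data: auxiliaries are left out to keep the example small.
    lexicon = {"boy": "ladka", "eats": "khata"}
    print(learn_by_overlap("boy eats mango", "ladka aam khata", lexicon))
```

Note that the new pair is found even though the English sentence is subject-verb-object and the Hindi one is subject-object-verb. Pure elimination stops working as soon as two or more words are unknown, which is where knowledge of word order in the two languages would have to come in.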

Setup

Authentication

Gina is still under development. You will need to set up credentials from an NLP API-enabled account on GCP to get Gina up and running on your device.
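
Before running Gina, it can help to confirm that the credentials are visible to Python. The sketch below assumes that Gina relies on the Google Cloud client libraries' Application Default Credentials, which look for a service account key file via the `GOOGLE_APPLICATION_CREDENTIALS` environment variable; the README does not say which mechanism Gina actually uses, and the helper name `find_gcp_credentials` is hypothetical.

```python
import os


def find_gcp_credentials(env=os.environ):
    """Return the key-file path from GOOGLE_APPLICATION_CREDENTIALS, or None.

    Assumption: credentials are supplied as a service account key file
    referenced by this environment variable.
    """
    path = env.get("GOOGLE_APPLICATION_CREDENTIALS")
    if path and os.path.isfile(path):
        return path
    return None


if __name__ == "__main__":
    key_path = find_gcp_credentials()
    if key_path is None:
        print("GOOGLE_APPLICATION_CREDENTIALS is unset or does not point to a file")
    else:
        print("Using service account key:", key_path)
```

If the check fails, set the variable to the path of your downloaded key file in the same shell session that will run Gina.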

Install Dependencies

  1. Install pip and virtualenv

  2. Create a virtualenv and activate it

$ virtualenv --python python3 env
$ source env/bin/activate

If you're on Windows, you may have to specify the full path to your Python installation directory:

$ virtualenv --python "c:\python36\python.exe" env
$ .\env\Scripts\activate

  3. Install the dependencies required by Gina

$ pip install -r requirements.txt

Run

Take a look at Gina's initial lexicon. Then give her 1) an English sentence and 2) its literal Hindi translation. Try to include new words that you want her to learn, as well as old ones she can use as reference. You can check how much she has learnt from your sentence; you can also test her knowledge and delete inaccurate deductions on her part. Teach her well!

  $ python interface.py

To Do (Early 2019)

  • Build a web interface for Gina
  • Move to a database

License

CC BY-NC 4.0 International

Contributors

abhinavmasalia, deeprajpandey, niyatibafna
