Lexical Database Bangla

Automatic construction of a lexical database for Bangla, inspired by WordNet, using a bilingual dictionary and Princeton WordNet.

Usage

  1. Download the package and install it from its root directory:

     python setup.py install
    
  2. In your Python code:

     import lexical_db_bangla
     syns_set = lexical_db_bangla.syns(word)
     print(syns_set)
    

where word is any Bangla word.

Approach

For each Bangla word in the bilingual (Bangla-to-English) dictionary, we look up all possible English translations. We then find the synsets for those English words in Princeton WordNet, extract the whole network of those synsets, and copy it to our target wordnet for Bangla. Finally, we translate the structure into Bangla wherever possible: the names of the features attached to each word or synset, the features of these words, and of course the words themselves.
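The first two steps of this approach (dictionary lookup, then grouping by synset) can be sketched as follows. This is a minimal illustration under stated assumptions, not the package's actual implementation: the toy dictionary, the toy synset table, and the function names build_bangla_synsets and synonyms are all hypothetical. The synset lookup is passed in as a function so that a real WordNet lookup could replace the stand-in. Copying the relations between synsets is not shown.

```python
from collections import defaultdict

# Hypothetical toy data standing in for the bilingual dictionary and WordNet.
BANGLA_TO_ENGLISH = {
    "জল": ["water"],
    "পানি": ["water"],
    "বই": ["book"],
}
ENGLISH_SYNSETS = {
    "water": ["water.n.01"],
    "book": ["book.n.01"],
}


def toy_synset_lookup(english_word):
    """Return synset identifiers for an English word (toy stand-in for WordNet)."""
    return ENGLISH_SYNSETS.get(english_word, [])


def build_bangla_synsets(bangla_to_english, synset_lookup):
    """Group Bangla words by the synsets their English translations belong to."""
    synset_to_bangla = defaultdict(set)
    for bangla_word, english_words in bangla_to_english.items():
        for english_word in english_words:
            for synset_id in synset_lookup(english_word):
                synset_to_bangla[synset_id].add(bangla_word)
    return synset_to_bangla


def synonyms(word, bangla_to_english, synset_lookup):
    """Return the Bangla words that share at least one synset with `word`."""
    synset_to_bangla = build_bangla_synsets(bangla_to_english, synset_lookup)
    result = set()
    for members in synset_to_bangla.values():
        if word in members:
            result |= members
    result.discard(word)  # a word is not its own synonym
    return result


# Example call on the toy data.
example = synonyms("জল", BANGLA_TO_ENGLISH, toy_synset_lookup)
```

On the toy data, the two words translated as "water" end up in the same synset, so each is returned as a synonym of the other, while the word translated as "book" has no synonyms.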

Note

The Bangla-to-Bangla dictionary has already been generated; it is parsed and used to find the synonyms. You can also regenerate it using the read_dict.py file. Also, remember that the file has a very high memory footprint. The dumps in the folder english_bangla_datasets were downloaded from http://www.bengalinux.org/english-to-bengali-dictionary/dumps/. Their license can be found in the file Copying.txt in the same folder.

Dependencies:

    • NLTK

After you have installed the NLTK library, do the following to download the required NLTK corpus:

  1. Open a Python shell and type:

     import nltk
     nltk.download()
    
  2. In the downloader, select and download:

    • WordNet corpus
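
To confirm that the corpus was installed, a small check like the one below may help. It is only a sketch: the helper name wordnet_available is hypothetical, and it assumes that a missing corpus surfaces as a LookupError when the corpus is first accessed.

```python
def wordnet_available():
    """Return True if NLTK and its WordNet corpus can be loaded."""
    try:
        from nltk.corpus import wordnet
        # The corpus is loaded lazily, so make one query to force the read.
        wordnet.synsets("water")
        return True
    except (ImportError, LookupError):
        # ImportError: NLTK is not installed.
        # LookupError: NLTK is installed but the corpus data is missing.
        return False


if wordnet_available():
    print("WordNet corpus is ready.")
else:
    print("WordNet not found; install NLTK, then run nltk.download('wordnet').")
```

Calling nltk.download('wordnet') fetches only the WordNet corpus without opening the interactive downloader.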

Contributors

abhishekgupta92
