Git Product home page Git Product logo

srix / pytamil Goto Github PK

View Code? Open in Web Editor NEW
50.0 9.0 9.0 4.41 MB

பைந்தமிழ் (pytamil) library is intended to be used in analysis of tamil literary work. A wealth of knowledge is hidden in old literature. They are time machines to past. Ever wondered what is the popular color or food in tamil speaking world in 500AD. The answer is hidden in literature. With right computer tools it becomes possible for us to dig in to this wealth of knowledge.

License: MIT License

Python 94.93% ANTLR 4.88% Shell 0.19%
tamil compiler language-processing

pytamil's Introduction

பைந்தமிழ் (pytamil)

A Library that can do the following

தமிழ் எழுத்து மற்றும் புணர்ச்சி

எழுத்து.மெல்லினம்

['ங்', 'ஞ்', 'ண்', 'ந்', 'ம்', 'ன்']

எழுத்து.குறில்

['அ', 'இ', 'உ', 'எ', 'ஒ']

புணர்ச்சி.தனிமொழி_ஆக்கு('விருந்தோம்பல்')

['விருந்து', 'ஓம்பல்']

புணர்ச்சி.தொடர்மொழி_ஆக்கு('விருந்து', 'ஓம்பல்' )

விருந்தோம்பல்

மாத்திரை.மாத்திரை_கொடு('பைந்தமிழ்')

[2, 0.5, 1, 1, 0.5]

தற்போதைய எழுத்துக்களை பண்டைய எழுத்துக்களாக மாற்றுதல்

தமிழ்.பிரம்மி('வணக்கம்')

𑀯𑀡𑀓𑀓𑀫

தமிழ்.பண்டைய_வாக்கியம்_ஆக்கு(வாக்கியம் = 'வணக்கம்', வருடம் = 300 )

யாப்பு ஆராய்தல்

திருக்குறள் : பொருட்பால் : குறள் 467

எண்ணித் துணிகக் கருமம் துணிந்தபின்
எண்ணுவம் என்பது இழுக்கு

kural_parse_tree

புகழேந்திப் புலவர் இயற்றிய நளவெண்பா : 1

ஆதித் தனிக்கோல மானா னடியவற்காச்
சோதித் திருத்தூணிற் றோன்றினான் வேதத்தின்
முன்னின்றான் வேழம் முதலே யெனவழைப்ப
என்னென்றா னெங்கட் கிறை

nerisai_parse_tree

Why Pytamil

பைந்தமிழ் (pytamil) library is intended to be used in analysis of tamil literary work. A wealth of knowledge is hidden in old literature. They are time machines to past. Ever wondered what is the popular color or food in tamil speaking world in 500AD. The answer is hidden in literature. With right computer tools it becomes possible for us to dig in to this wealth of knowledge.

Core philosophy of பைந்தமிழ் (pytamil) library is to clearly separarte tamil language conepts from the programming language. For example, Tamil புணர்ச்சி rules are captured in human readable text file புணர்ச்சிவிதிகள்.yaml in YAML format. This approach has two major benefits

  1. This allows people with no prior knowledge in computer programming to contribute to the project and have more meaningful and natural discussion on the language concepts.
  2. Similar approach can be used to implement libraries for other human languages like Sanskrit, Telugu, Kannada etc.
  3. Developers can use the core tamil language files to port this library to other computer languages like Javascript, c# etc.

List of Core tamil language files

TODO

If you have a feature in mind, Please add a feature request here with label as enhancement.

  • return original words when a combined word is presented by predictive deomposition using புணர்ச்சி விதிகள்
  • built pip module
  • and many more

For Developers

Getting started

pytamil's People

Contributors

srix avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

pytamil's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.