xprogramer Goto Github PK
Type: User
Company: Université 8 Mai 1945 Guelma
Bio: NLP/IR/Computional Linguistics/Robotics/IoT
Location: Algeria
Blog: http://abainia.info
Type: User
Company: Université 8 Mai 1945 Guelma
Bio: NLP/IR/Computional Linguistics/Robotics/IoT
Location: Algeria
Blog: http://abainia.info
Library to build AndroidAuto headunit
Unofficial Android Auto SDK
ANTSIX is a small Arabic corpus dedicated to Automatic Topic Identification of written texts. The corpus contains noisy texts collected over different Arabic discussion forums related to 6 topics, where the texts may be corrupted with the following noises: URLs, Citations in other language, Tags, Abbreviations, Misspelling errors, Typing errors, Html tags and objects, Insignificant characters, SMS writing style and Letters Mistakenness. Moreover, the corpus is unbalanced in terms of text sizes, where the text length ranges between 32 and 318 words, and further, each topic contains 50 texts encoded with UTF-8 encoding. Therefore, in overall the ANTSIX corpus contains 300 short noisy Arabic texts related to 6 different topics.
A repository of different Arabic stemmers
ARASTEM is a new corpus dedicated to the Arabic stemming field, where it contains several documents containing grouped words which are semantically and morphologically related. Hence, the corpus was constructed manually by the full intervention of native Arabic speakers after collecting several texts from different Arabic discussion forums. Furthermore, it contains words belonging to the Standard Arabic, Dialectical Arabic and Modern Pseudo Arabic languages.
Repo for customized CNC 2D/3D parts
DLI32 and DLI32-2 are two small corpora dedicated to Automatic Language identification of written texts. They are collected over different discussion forums, and they contain noisy texts encoded with UTF-8 encoding. The texts may contain any kind of the following noises: URLs, Citations in other language, Tags, Abbreviations, Unaccented characters, Misspelling errors, Typing errors, Html tags and objects, Insignificant characters and SMS writing style. The DLI32 corpus contains 320 texts corresponding to 10 texts per language, in which the text length ranges between 93 and 146 words. The DLI32-2 corpus is a subdivision of the DLI32, where it contains 640 texts (20 texts per languages) and the text length ranges between 43 and 67 words.
The corpus for offensive language detection in under-resourced Algerian dialectal Arabic
Facebook comments crawler
Homemade intelligent vehicule system to prevent the conductors from the near objects. The project is based on Arduino Uno, buzzer, nokia 5110 LCD and Ultrasonic sensor. The source code can be downloaded from the following link:
The program of the new homemade 8 DOFs biped robot. It is based on Arduino Uno and 8 servos, as well as is controlled via the nRF24L01+ radio module.
The source codes of 5 statistical algorithms (i.e. CBA, WBA, SCA, HA1 and HA2) that were conceived for language identification. Hence, the algorithms are featured only with 32 languages belonging to DLI32 corpus, and they figured out quite interesting accuracies comparing to Google LID API and Microsoft Office LID (refer to publications for more details).
NLTK Source
A low cost non-contact (infrared) Thermometer mainly based on Arduino and MLX90614 sensor
Corpus of offensive Bambara language
Offensive Tamazight corpus
This is the project of a homemade RC controller based on pic 16F84, PS2 joystick, nRF24L01+ radio transceiver and Nokia 5110 LCD. The source code is written in assembly language.
Scrapy, a fast high-level web crawling & scraping framework for Python.
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.