Git Product home page Git Product logo

xprogramer's Projects

aasdk icon aasdk

Library to build AndroidAuto headunit

antsix-corpus icon antsix-corpus

ANTSIX is a small Arabic corpus dedicated to Automatic Topic Identification of written texts. The corpus contains noisy texts collected over different Arabic discussion forums related to 6 topics, where the texts may be corrupted with the following noises: URLs, Citations in other language, Tags, Abbreviations, Misspelling errors, Typing errors, Html tags and objects, Insignificant characters, SMS writing style and Letters Mistakenness. Moreover, the corpus is unbalanced in terms of text sizes, where the text length ranges between 32 and 318 words, and further, each topic contains 50 texts encoded with UTF-8 encoding. Therefore, in overall the ANTSIX corpus contains 300 short noisy Arabic texts related to 6 different topics.

arastem-corpus icon arastem-corpus

ARASTEM is a new corpus dedicated to the Arabic stemming field, where it contains several documents containing grouped words which are semantically and morphologically related. Hence, the corpus was constructed manually by the full intervention of native Arabic speakers after collecting several texts from different Arabic discussion forums. Furthermore, it contains words belonging to the Standard Arabic, Dialectical Arabic and Modern Pseudo Arabic languages.

dli32-corpus icon dli32-corpus

DLI32 and DLI32-2 are two small corpora dedicated to Automatic Language identification of written texts. They are collected over different discussion forums, and they contain noisy texts encoded with UTF-8 encoding. The texts may contain any kind of the following noises: URLs, Citations in other language, Tags, Abbreviations, Unaccented characters, Misspelling errors, Typing errors, Html tags and objects, Insignificant characters and SMS writing style. The DLI32 corpus contains 320 texts corresponding to 10 texts per language, in which the text length ranges between 93 and 146 words. The DLI32-2 corpus is a subdivision of the DLI32, where it contains 640 texts (20 texts per languages) and the text length ranges between 43 and 67 words.

dziriofn icon dziriofn

The corpus for offensive language detection in under-resourced Algerian dialectal Arabic

intelligent-vehicule-system-v1.0 icon intelligent-vehicule-system-v1.0

Homemade intelligent vehicule system to prevent the conductors from the near objects. The project is based on Arduino Uno, buzzer, nokia 5110 LCD and Ultrasonic sensor. The source code can be downloaded from the following link:

kheirobot_v2 icon kheirobot_v2

The program of the new homemade 8 DOFs biped robot. It is based on Arduino Uno and 8 servos, as well as is controlled via the nRF24L01+ radio module.

language-identification icon language-identification

The source codes of 5 statistical algorithms (i.e. CBA, WBA, SCA, HA1 and HA2) that were conceived for language identification. Hence, the algorithms are featured only with 32 languages belonging to DLI32 corpus, and they figured out quite interesting accuracies comparing to Google LID API and Microsoft Office LID (refer to publications for more details).

obam icon obam

Corpus of offensive Bambara language

otam icon otam

Offensive Tamazight corpus

rc-controller icon rc-controller

This is the project of a homemade RC controller based on pic 16F84, PS2 joystick, nRF24L01+ radio transceiver and Nokia 5110 LCD. The source code is written in assembly language.

scrapy icon scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.