Git Product home page Git Product logo

aspose.ocr-for-.net's Introduction

Nuget Nuget GitHub

Optical Character Recognition (OCR) .NET API

It is a standalone OCR API that enhances your .NET apps to perform OCR on JPEG, PNG, GIF, BMP & TIFF images for extraction of English, French, Spanish & Portuguese content.

Aspose.OCR for .NET not only provides the Optical Character Recognition engine but more. You can also apply Blur, Gaussian Blur, and Median filter to reduce noise before document recognition and can set the OcrEngine to ignore non-textual blocks, maintain correct text order during document text recognition & automatically correct spellings of the document text.

Directory Description
Applications_Aspose_Ocr_Net_6 A collection of .NET projects that can be easily built with a script.
Examples A collection of .NET examples that help you learn the product features.

Image OCR API Features

  • Programmatically detect, identify and read characters from images.
  • Currently, it supports 26 European languages and Chinese.
  • Detect and read popular font faces such as Arial, Times New Roman, Courier New, Tahoma, Calibri & Verdana.
  • Supports regular, bold and italic font styles.
  • Scan whole image or only a specific portion of the image.
  • Scan rotated images.
  • Application of various noise removal filters to assist image recognition.
  • Calculate the bounding boxes of lines, paragraphs, words.
  • Get possible choices for each recognized character.
  • Pass URI and recognize the image from it.
  • Recognize multiple images in a folder, zip archive or in the list.
  • Get result in JSON or XML format.
  • Save results in text, DOCX, PDF, XLSX, XML, JSON formats

Aspose.OCR can recognize a large number of languages and all popular writing scripts, including texts with mixed languages.

Extended Latin alphabet

  • Croatian
  • Czech
  • Danish
  • Dutch
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian
  • Polish
  • Portuguese
  • Romanian
  • Slovak
  • Slovenian
  • Spanish
  • Swedish

Supported characters

! " # $ % & ' ( )
* + , - . / 0 1 2 3
4 5 6 7 8 9 : ; < =
> ? @ [ \ ] _ ` { |
} ~ A B C D E F G H
I J K L M N O P Q R
S T U V W X Y Z a b
c d e f g h i j k l
m n o p q r s t u v
w x y z À Á Â Ã Ä Å
Æ Ā Ă Ą Ç Ć Ĉ Ċ Č Ð
Ď Đ È É Ê Ë Ē Ė Ę Ě
Ə à á â ã ä å æ ā ă
ą ç ć ĉ ċ č ð ď đ è
é ê ë ē ė ę ě ə Ĝ Ğ
Ġ Ģ Ĥ Ħ Ì Í Î Ï Ī Į
İ IJ Ĵ Ķ Ļ Ł Ñ Ń Ņ Ň
ĝ ğ ġ ģ ĥ ħ ì í î ï
ī į ı ij ĵ ķ ļ ł ñ ń
ņ ň Ò Ó Ô Õ Ö Ø Ő Œ
Ơ Ŕ Ř Ś Ŝ Ş Ș Š Þ
Ţ Ť Ù Ú Û Ü Ū Ŭ Ů Ű
Ų Ư Ŵ Ý Ŷ Ÿ Ź Ż Ž ò
ó ô õ ö ø ő œ ơ ŕ ř
ß ś ŝ ş ș š þ ţ ť ù
ú û ü ū ŭ ů ű ų ư ŵ
ý ŷ ÿ ź ż ž c

Cyrillic alphabet

  • Belorussian
  • Bulgarian
  • Kazakh
  • Russian
  • Serbian
  • Ukrainian

Supported characters

! \ " # $ % & ' (
) * + , - . / 0 1 2
3 4 5 6 7 8 9 : ; <
= > ? @ [ ] _ { | }
~ А Б В Г Д Е Ё Ж З
И Й К Л М Н О П Р С
Т У Ф Х Ц Ч Ш Щ Ъ Ы
Ь Э Ю Я а б в г д е
ё ж з и й к л м н о
п р с т у ф х ц ч ш
щ ъ ы ь э ю я Ў ў І
і Ѕ Ј Љ Њ Ќ Џ ѕ ј љ
њ ќ џ Ђ ђ Ћ ћ Ә Ғ Қ
Ң Ө Ұ Һ ә ғ қ ң ө ұ
һ Ї ї Є є

Asian languages

  • Chinese (more than 6,000 characters)

Load Images for OCR

Raster Formats: JPEG, PNG, GIF, BMP, TIFF

Platform Independence

You can use Aspose.OCR for .NET to develop applications in any development environment that targets the .NET Framework 4.6.1 and higher, .NET Core 2.0 and higher. Supports x64, x86, AnyCPU This includes support for Mono, Xamarin and Client Profiles.

Get Started with Aspose.OCR for .NET

Are you ready to give Aspose.OCR for .NET a try? Simply execute Install-Package Aspose.OCR from Package Manager Console in Visual Studio to fetch the NuGet package. If you already have Aspose.OCR for .NET and want to upgrade the version, please execute Update-Package Aspose.OCR to get the latest version.

Perform OCR on PNG Image via C# Code

// initialize an instance of AsposeOcr
AsposeOcr api = new AsposeOcr();

// recognize image
string result = api.RecognizeImage(dataDir + "sample.png");

// display the recognized text
Console.WriteLine(result);

Home | Product Page | Docs | Demos | API Reference | Examples | Blog | Search | Free Support | Temporary License

aspose.ocr-for-.net's People

Contributors

adam-skelton avatar ahmad-chishti avatar annushka34 avatar aspose-ocr-gists avatar asposeomrdeveloper avatar babar-raza avatar farooqsheikhpk avatar fatimaahmed avatar ikram-haq avatar jawadaspose avatar kashifiqb avatar m-ikramulhaq avatar muhammad-ijaz avatar naeem244 avatar rizwanniazigroupdocs avatar saqib-razzaq-aspose avatar saqibmasood avatar saqibrazzaq avatar saudaspose avatar shahzad-latif avatar valeriy-ten-aspose avatar valeriyten avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

aspose.ocr-for-.net's Issues

compiliing samples fails ASP Core 6 , BS 5 update

Hello I am trying to find a sample that can extract text from images and PDF and give me an success rating 90% confidence etc.

Its not compiling in VS 2022 with asp core 6
the nugets are also outdated
how to get the confidence rating on the OCR after processing.
how to display text side by side with original document / scrolling web view, so user can compare both

AsposeOcr().RecognizeImage produces additional images

Hi,
I cannot find that info in the documentation for .NET.

Looks like that new AsposeOcr().RecognizeImage(dataDir + "Sampleocr.bmp"); produces additional images "Row_exampleX.png" and stores it in the project folder.

Is it possible to manage it? Now, I need to remember to delete it after performing the image recognition.

OCR not reading tiff file effectively

The included tiff file (SampleTiff.tiff) does not seem to be well processed by the OCR. I tried targetting it with PerformOCROnImage.Run() and tried just uncommenting OCROnMultipageTIFF.Run(), both with lackluster results. Both output only one line (similar in both cases), and its unclear where the line is coming from in the tiff file.

Build Errors in Example due to OMR references

I believe OMR has been removed from OCR in one of the recent updates to ASPOSE OCR, however the examples still contain numerous references to OMR and therefore fail to build successfully.

Aspose.OCR reads only 10 pages

Hi,
I am working on a POC using a temporary AsposeOCR license. When I try to read the sample file provided in this repo (multi_page_1.pdf), it reads only 10 Pages.

Is it because of the temporary license?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.