IronOcr 4.3.0.1

The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org. Prefix Reserved
There is a newer version of this package available.
See the version list below for details.
dotnet add package IronOcr --version 4.3.0.1
NuGet\Install-Package IronOcr -Version 4.3.0.1
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="IronOcr" Version="4.3.0.1" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add IronOcr --version 4.3.0.1
#r "nuget: IronOcr, 4.3.0.1"
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install IronOcr as a Cake Addin
#addin nuget:?package=IronOcr&version=4.3.0.1

// Install IronOcr as a Cake Tool
#tool nuget:?package=IronOcr&version=4.3.0.1

Code Examples

C# Code Example (Getting Started):

/*****************************/
var Ocr = new IronOcr.AutoOcr();
var Result = Ocr.Read(@"C:\path\to\image.png");
Console.WriteLine(Result.Text);

C# Code Example (Kitchen Sink):

/*****************************/
var Ocr = new IronOcr.AdvancedOcr()
{
    CleanBackgroundNoise = true,
    EnhanceContrast = true,
    EnhanceResolution = true,
    Language =  IronOcr.Languages.English.OcrLanguagePack,
    Strategy = IronOcr.AdvancedOcr.OcrStrategy.Advanced,
    ColorSpace = AdvancedOcr.OcrColorSpace.Color,
    DetectWhiteTextOnDarkBackgrounds = true,
    InputImageType = AdvancedOcr.InputTypes.AutoDetect,
    RotateAndStraighten = true,
    ReadBarCodes = true,
    ColorDepth =4
};

var testDocument = @"C:\path\to\scan.pdf";
var Results = Ocr.Read(testDocument);
Console.WriteLine(Results.Text);
Console.WriteLine("Barcodes:" + String.Join(",", Results.Barcodes.Select(b => b.Value)));

Learn More about Iron OCR The IronOCR library reads text from images, scans and pdfs. It also detects and decodes barcodes and QR codes. 'Image to text' and 'PDF to Text' functionality is added to Desktop, Console and Web applications in about 5 minutes. It supports over 20 international languages.

Product Compatible and additional computed target framework versions.
.NET Framework net40 is compatible.  net403 was computed.  net45 was computed.  net451 was computed.  net452 was computed.  net46 was computed.  net461 was computed.  net462 was computed.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

This package has no dependencies.

NuGet packages (132)

Showing the top 5 NuGet packages that depend on IronOcr:

Package Downloads
IronOcr.Languages.German The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * German * GermanBest * GermanFast * GermanFraktur ==================================== Deutschsprachige OCR in C# & .NET. Optimierte C# Tesseract 5 OCR in einer eigenständigen .NET OCR-API. Konvertiert Scannerdokumente, Bilder und PDF in Text. C# & VB Beispiele: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also German support including: * German (also known as Deutsch) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in German * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Japanese The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

Japanese Language pack for the IronOCR C# and VB.Net OCR library. Reads Japanese language text from images and PDFs in .NET. Ocr Dictionaries in this package: * JapaneseAlphabet * JapaneseAlphabetBest * JapaneseAlphabetFast * JapaneseVerticalAlphabet * JapaneseVerticalAlphabetBest * JapaneseVerticalAlphabetFast * Japanese * JapaneseBest * JapaneseFast * JapaneseVertical * JapaneseVerticalBest * JapaneseVerticalFast This package installs IronOCR and also Japanese support including: * Japanese (also known as 日本語 (にほんご)) OCR for screenshots, cameras, images files, tiffs and PDFs. * Custom OCR that significantly outperforms Tesseract on real world documents. * Can read scans with distortion, skewing, low resolution & contrast, and digital noise. * Also supports Tesseract 3, 4 and 5 in Japanese. * Support for 122 other languages also available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework(4.5 +) * .NET CORE(2.0 +) * .NET Standard(2.0 +) Works on: *Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: * Images * TIFFS * PDFs * Screenshots * Camera Input * Scans * Barcodes * QR codes This package also installs: https://www.nuget.org/packages/IronOcr/ For product and licensing support please email us at developers@ironsoftware.com ====== C# と .NET での日本語 OCR スタンドアロン .NET OCR API で最適化された C# Tesseract 5 OCR。 スキャナーのドキュメント、画像、PDF をテキストに変換します。 C# と VB の例: https://ironsoftware.com/csharp/ocr/languages/Japanese/

IronOcr.Languages.Spanish The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Spanish * SpanishBest * SpanishFast * SpanishOld * SpanishOldBest * SpanishOldFast ==================================== OCR en español en C# y .NET. C# Tesseract 5 OCR optimizado en una API de .NET OCR independiente. Convierte documentos, imágenes y PDF del escáner en texto. Ejemplos de C# y VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Spanish support including: * Spanish (also known as Español, Castellano) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Spanish * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.Arabic The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Arabic * ArabicBest * ArabicFast * ArabicAlphabet * ArabicAlphabetBest * ArabicAlphabetFast ==================================== OCR للغة العربية في C# & .NET. محسن C# Tesseract 5 OCR في .NET OCR API مستقل. يحول مستندات الماسح الضوئي والصور و PDF إلى نص. أمثلة على C# و VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Arabic support including: * Arabic (also known as العربية) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Arabic * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

IronOcr.Languages.French The ID prefix of this package has been reserved for one of the owners of this package by NuGet.org.

The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * French * FrenchBest * FrenchFast ==================================== OCR de langue française en C# & .NET. OCR C# Tesseract 5 optimisé dans une API OCR .NET autonome. Convertit les documents du scanner, les images et les PDF en texte. Exemples C# et VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also French support including: * French (also known as Français, Langue Française) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in French * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
2024.6.3 1,089 6/4/2024
2024.5.25 8,057 4/29/2024
2024.4.6 9,740 4/5/2024
2024.3.4 13,613 3/8/2024
2024.2.41 17,298 1/29/2024
2024.1.17 11,235 12/29/2023
2023.12.34 64,146 11/27/2023
2023.11.35 10,854 10/27/2023
2023.10.9 12,229 9/26/2023
2023.9.4 6,768 9/8/2023
2023.8.34 36,569 8/1/2023
2023.7.28 50,434 7/3/2023
2023.6.6 40,470 5/29/2023
2023.5.35 16,525 5/1/2023
2023.4.13 19,014 4/6/2023
2023.3.2 32,047 3/1/2023
2023.1.11644 22,713 1/18/2023
2022.12.10830 31,360 12/5/2022
2022.11.10109 59,516 10/26/2022
2022.10.9390 14,863 9/27/2022
2022.8.8198 41,567 8/18/2022
2022.8.7804 73,915 7/26/2022
2022.3.0 168,313 3/10/2022
2022.1.0 47,217 1/17/2022
2021.12.0 21,142 12/21/2021
2021.11.0 311,681 10/29/2021
2021.9.0 26,212 8/24/2021
2021.6.0 31,047 6/24/2021
2021.2.1 39,475 2/24/2021
2020.12.2 27,945 12/14/2020
2020.11.2 120,655 11/13/2020
4.4.0 297,474 6/21/2018
4.3.0.1 36,724 4/9/2018
4.2.2.51 5,562 1/22/2018
4.2.2.1 4,347 12/1/2017
4.2.1.5 5,586 9/9/2017
4.1.1 9,402 8/4/2017
4.0.10 3,069 1/12/2017
4.0.9 2,672 12/20/2016

- Overall speed improvement though better OCR image processing core code.
- PDF OCR Reading improved for speed and stability
- Memory management improved; memory load reduced by 75%
- Certified 'Very' Thread-Safe.  Safe to use in BackgroundWorkers, Tasks, Async, and other multithreaded applications.
- Multi-threading automatically applied to any multi-page OCR job without any changes to the developer API.  .Net 4.0 compatible multithreading.
- Character list read and detected by OCR can now be set via AdvancedOCR.AcceptedOcrCharacters for improved speed and accuracy.
- Added explicit support for OCR of Multi-Frame (multi-page) TIFF and GIF files.
- Core image management rebuilt to avoid any use of windows use of GDI+. This paves a route for .Net Core / Standard Support and linux compatibility.

Added support for 4 new languages:
- Czech
- Polish
- Greek
- Thai