Easily add OCR functionality to Python applications

Create Python applications that can extract text from images, screenshots, photos, and scanned PDFs by calling Aspose.OCR Cloud with this open source SDK.

Aspose.OCR Cloud SDK for Python

Overview

Aspose.OCR Cloud provides a REST API for optical character recognition. With it, you can add OCR functionality to your applications without worrying about CPU usage, RAM, or overall system performance: all resource-intensive tasks run on a high-performance cloud maintained by Aspose. The API supports 45 languages written in Latin, Cyrillic, Chinese, and other scripts, and can recognize images, PDF files, photos, and screenshots, returning results in the most popular document and data exchange formats, including JSON.

This SDK greatly simplifies calls to Aspose.OCR Cloud services from Python code, allowing you to focus on business needs rather than technical details. It handles all the routine operations, such as establishing a connection, sending API requests, and parsing responses, and wraps them in a few lines of code that are easy to read and maintain, even for inexperienced developers.
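
To illustrate the kind of routine work the SDK takes off your hands, here is a minimal sketch of a hand-rolled request that uses only the Python standard library. It does not use the SDK and does not reproduce the real Aspose.OCR Cloud API: the endpoint URL, the JSON field names (image, language, text), and the bearer-token header are placeholders assumed purely for illustration. Consult the official API reference for the actual contract.

```python
import base64
import json
import urllib.request

# Hypothetical endpoint, used only for illustration (not the real API URL).
HYPOTHETICAL_ENDPOINT = "https://ocr.example.com/recognize"


def build_request(image_bytes: bytes, language: str, token: str) -> urllib.request.Request:
    """Prepare (but do not send) a JSON POST request carrying an image."""
    payload = {
        # Field names are assumptions, not the documented request schema.
        "image": base64.b64encode(image_bytes).decode("ascii"),
        "language": language,
    }
    return urllib.request.Request(
        HYPOTHETICAL_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def parse_response(body: bytes) -> str:
    """Extract the recognized text from a JSON response body."""
    # The "text" key is an assumed response field.
    return json.loads(body.decode("utf-8")).get("text", "")


request = build_request(b"fake image bytes", "English", "my-token")
# Sending the request would be: urllib.request.urlopen(request).read()
sample_body = b'{"text": "Hello, world"}'
print(parse_response(sample_body))
```

Even in this simplified form, the caller has to deal with encoding, headers, and response parsing, which is the boilerplate the SDK is described as hiding.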

The Python SDK and demo notebooks are open source under the MIT license. You can use them for any purpose and change any part of the code to suit your needs.

At a Glance

Core Features

Extract text from photos
Create searchable PDFs
Automatic image corrections
Support multiple typefaces
Preserve text formatting
Detect text fragments
Multi-page processing
Spell checking

Supported Languages

English
Chinese
German
French
Ukrainian
Spanish
Czech
Polish
Arabic
Hindi
Russian
and many more...

Supported File Formats

Source files

PDF
JPEG
PNG
TIFF
GIF
BMP
EMF
EPS
SVG

Recognition results

Plain text
Searchable PDF
Microsoft Excel
CSV
hOCR

Platform Independence

Features and capabilities of Aspose.OCR Cloud

Extracts text from scanned images and PDFs

Supports raster and vector images

Reads languages written in Latin, Cyrillic, Devanagari, Arabic, and other scripts

Recognizes more than 6,000 Chinese characters

Processes tables and receipts

Processes the whole image or specific areas only

Automatically corrects rotated, skewed, and noisy images

Finds and automatically corrects misspelled words

Requires minimal resources on end-user devices

45 Recognition Languages

Our cloud API can recognize a large number of languages written in different scripts.

Extended Latin alphabet: Azerbaijani, Albanian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Indonesian, Italian, Javanese, Latin, Latvian, Lithuanian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Turkish, Uzbek, and Vietnamese.
Cyrillic alphabet: Bulgarian, Russian, Serbian, Ukrainian.
Middle Eastern scripts: Arabic, Hebrew, Persian (Farsi), Urdu.
Indic scripts: Bengali, Hindi.
Far East scripts: Chinese, Japanese, Korean, Thai, Tibetan.
Other European alphabets: Georgian, Greek.
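
For client code that needs to check a language before requesting recognition, the list above can be captured in a small lookup table. This is only a sketch: the dictionary and the script_of helper are illustrative names and are not part of the SDK, and the spelling of each language follows this page rather than any identifier the API may expect.

```python
# Languages grouped by script, copied from the list above.
LANGUAGES_BY_SCRIPT = {
    "Extended Latin": [
        "Azerbaijani", "Albanian", "Croatian", "Czech", "Danish", "Dutch",
        "English", "Estonian", "Finnish", "French", "German", "Indonesian",
        "Italian", "Javanese", "Latin", "Latvian", "Lithuanian", "Norwegian",
        "Polish", "Portuguese", "Romanian", "Slovak", "Slovenian", "Spanish",
        "Swedish", "Turkish", "Uzbek", "Vietnamese",
    ],
    "Cyrillic": ["Bulgarian", "Russian", "Serbian", "Ukrainian"],
    "Middle Eastern": ["Arabic", "Hebrew", "Persian (Farsi)", "Urdu"],
    "Indic": ["Bengali", "Hindi"],
    "Far East": ["Chinese", "Japanese", "Korean", "Thai", "Tibetan"],
    "Other European": ["Georgian", "Greek"],
}


def script_of(language: str):
    """Return the script group for a language, or None if it is not listed."""
    for script, languages in LANGUAGES_BY_SCRIPT.items():
        if language in languages:
            return script
    return None


# Count every listed language across all script groups.
total = sum(len(languages) for languages in LANGUAGES_BY_SCRIPT.values())
print(total, script_of("Ukrainian"))
```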

Read photos and low-quality scans

Our API has powerful built-in image pre-processing filters that correct rotated and skewed images and automatically remove dirt, spots, scratches, glare, unwanted gradients, and other image defects. Combined with support for all common image formats, this allows reliable recognition of even smartphone photos. Most of the pre-processing and image correction is done automatically, so you only have to intervene in difficult cases.

Recognize and convert

The API can read virtually any image you can get from a scanner, camera, or smartphone: PDF documents and JPEG, PNG, TIFF, GIF, and BMP images. Multi-page PDF documents and TIFF files are fully supported.

Recognition results are returned in the most popular document and data exchange formats: plain text, PDF, Microsoft Excel, CSV, and hOCR.

Minimal System Requirements

Aspose.OCR Cloud is an on-demand optical character recognition service. As such, it has no special hardware or operating system requirements: you can use it even on entry-level systems and mobile devices without loss of accuracy or performance.

We host our OCR engine on highly reliable, high-performance GPU-based Amazon servers, ensuring the fastest possible speed regardless of the number of requests.

Spell Check

While OCR produces reliable results, dust and print defects might cause some symbols to be recognized incorrectly. The Cloud OCR API has a built-in spell checker that automatically replaces misspelled words, freeing you from having to correct the recognition results manually.
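
The general idea of post-recognition spell checking can be sketched with Python's standard difflib module. This is a generic illustration, not the algorithm used by the service: the tiny vocabulary, the correct_word helper, and the 0.8 similarity cutoff are all assumptions chosen for the example.

```python
import difflib

# A toy vocabulary; a real spell checker would use a full dictionary.
VOCABULARY = ["optical", "character", "recognition", "document"]


def correct_word(word: str, vocabulary=VOCABULARY, cutoff: float = 0.8) -> str:
    """Replace a word with its closest vocabulary entry, if one is similar enough."""
    if word.lower() in vocabulary:
        return word
    matches = difflib.get_close_matches(word.lower(), vocabulary, n=1, cutoff=cutoff)
    # Leave the word unchanged when nothing in the vocabulary is close.
    return matches[0] if matches else word


raw = "optical charactor recogniton"
corrected = " ".join(correct_word(word) for word in raw.split())
print(corrected)
```

Words with no sufficiently close match are left as they are, which avoids replacing names or numbers with unrelated dictionary entries.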

Create searchable PDFs

Convert a scanned PDF file to a searchable PDF document, which can be easily navigated and indexed. Text in searchable PDF documents can be selected, copied, and marked up.

Recognize images from the Internet

There is no need to upload images and PDF documents to cloud storage for recognition. Just send a web link to the image to Cloud OCR and get the text back.

Unlimited possibilities with Aspose Cloud solutions

An account in Aspose Cloud grants you access to the full range of our cloud APIs. You can combine image recognition with OMR, easily modify and convert recognized documents to almost any format, and analyze and combine data from multiple sources. All tasks are performed in the same way, which significantly speeds up development and reduces learning and maintenance costs, even for the most advanced business solutions.