Aspose.OCR Cloud SDK for Python

Easily add OCR functionality to Python applications

Create Python applications that can extract text from images, screenshots, photos, and scanned PDFs by calling Aspose.OCR Cloud with this open source SDK.


Aspose.OCR Cloud provides a REST API for optical character recognition. With it, you can add OCR functionality to your applications without worrying about CPU usage, RAM, or overall system performance: all resource-intensive tasks run on a high-performance cloud maintained by Aspose. The API supports 45 languages written in Latin, Cyrillic, Chinese, and other scripts. It can recognize images, PDF files, photos, and screenshots, and returns results in popular document and data exchange formats, including JSON.

This SDK simplifies calls to Aspose.OCR Cloud services from Python code, allowing you to focus on business needs rather than technical details. It handles routine operations such as establishing a connection, sending API requests, and parsing responses, and wraps them in a few lines of code that are easy to read and maintain, even for inexperienced developers.
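To give a sense of the routine work the SDK takes care of, here is a minimal sketch of the request and response handling written with the Python standard library only. It is not the SDK's code: the endpoint URL, the JSON field names ("image", "language", "text"), and the function names are assumptions made for illustration, and no network call is made.

```python
import base64
import json
import urllib.request

# Hypothetical endpoint, for illustration only; the real URL and request
# schema are defined by the Aspose.OCR Cloud API reference.
API_URL = "https://example.invalid/ocr/recognize"


def build_request(image_bytes: bytes, language: str, token: str) -> urllib.request.Request:
    """Package an image and its settings into an authenticated POST request."""
    payload = {
        # Field names are assumptions, not the actual API schema.
        "image": base64.b64encode(image_bytes).decode("ascii"),
        "language": language,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def parse_response(body: bytes) -> str:
    """Extract the recognized text from a JSON response body."""
    return json.loads(body.decode("utf-8")).get("text", "")


if __name__ == "__main__":
    request = build_request(b"fake image bytes", "English", "demo-token")
    print(request.get_method(), request.full_url)
    # A canned response stands in for the network round trip.
    print(parse_response(b'{"text": "Hello, OCR"}'))
```

With the SDK, steps like these are expected to collapse into a handful of calls, so application code does not need to deal with encoding, headers, or response parsing directly.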

The Python SDK and demo notebooks are open source under the MIT license. You can use them for any purpose and change any part of the code to suit your needs.

Features and capabilities of Aspose.OCR Cloud

Extracts text from scanned images and PDFs

Supports raster and vector images

Reads languages written in Latin, Cyrillic, Devanagari, Arabic, and other scripts

Recognizes more than 6,000 Chinese characters

Processes tables and receipts

Processes the whole image or specific areas only

Automatically corrects rotated, skewed and noisy images

Finds and automatically corrects misspelled words

Requires minimal resources on end-user devices

45 Recognition Languages

Our cloud API can recognize a large number of languages written in different scripts.

  • Extended Latin alphabet: Azerbaijani, Albanian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Indonesian, Italian, Javanese, Latin, Latvian, Lithuanian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Turkish, Uzbek, and Vietnamese.
  • Cyrillic alphabet: Bulgarian, Russian, Serbian, Ukrainian.
  • Middle Eastern scripts: Arabic, Hebrew, Persian (Farsi), Urdu.
  • Indic scripts: Bengali, Hindi.
  • Far East scripts: Chinese, Japanese, Korean, Thai, Tibetan.
  • Other European alphabets: Georgian, Greek.
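If an application lets users choose a recognition language, it can be handy to keep the list above as a lookup table and validate the choice before sending a request. The sketch below is one way to do that; the names are illustrative and not part of the SDK, and the language identifiers the API actually expects may differ from these display names.

```python
from typing import Optional

# Recognition languages grouped by script, copied from the list above.
LANGUAGES_BY_SCRIPT = {
    "Extended Latin": [
        "Azerbaijani", "Albanian", "Croatian", "Czech", "Danish", "Dutch",
        "English", "Estonian", "Finnish", "French", "German", "Indonesian",
        "Italian", "Javanese", "Latin", "Latvian", "Lithuanian", "Norwegian",
        "Polish", "Portuguese", "Romanian", "Slovak", "Slovenian", "Spanish",
        "Swedish", "Turkish", "Uzbek", "Vietnamese",
    ],
    "Cyrillic": ["Bulgarian", "Russian", "Serbian", "Ukrainian"],
    "Middle Eastern": ["Arabic", "Hebrew", "Persian (Farsi)", "Urdu"],
    "Indic": ["Bengali", "Hindi"],
    "Far East": ["Chinese", "Japanese", "Korean", "Thai", "Tibetan"],
    "Other European": ["Georgian", "Greek"],
}


def total_languages() -> int:
    """Count every language across all script groups."""
    return sum(len(names) for names in LANGUAGES_BY_SCRIPT.values())


def script_of(language: str) -> Optional[str]:
    """Return the script group of a language, or None if it is not listed."""
    for script, names in LANGUAGES_BY_SCRIPT.items():
        if language in names:
            return script
    return None


if __name__ == "__main__":
    print(total_languages())
    print(script_of("Thai"))
```

The total computed from the table matches the 45 languages in the heading.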

Read photos and low-quality scans

Our API has powerful built-in image pre-processing filters that correct rotated and skewed images and automatically remove dirt, spots, scratches, glare, unwanted gradients, and other image defects. Combined with support for common image formats, this allows reliable recognition even of smartphone photos. Most of the pre-processing and image correction is done automatically, so you only have to intervene in difficult cases.

Recognize and convert

The API can read images from a scanner, camera, or smartphone in the following formats: PDF documents and JPEG, PNG, TIFF, GIF, and BMP images. Multi-page PDF documents and TIFF files are fully supported.

Recognition results are returned in the most popular document and data exchange formats: plain text, PDF, Microsoft Excel, CSV, and hOCR.

Minimal System Requirements

Aspose.OCR Cloud is an on-demand optical character recognition service. As such, it has no special hardware or operating system requirements - you can use it even on entry-level systems and mobile devices without loss of accuracy and performance.

We host our OCR engine on highly reliable, high-performance GPU-based Amazon servers, which keeps recognition fast regardless of the number of requests.

Spell Check

While OCR produces reliable results, dust and print defects might cause some symbols to be recognized incorrectly. The Cloud OCR API has a built-in spell checker that automatically replaces misspelled words, freeing you from having to correct the recognition results manually.
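The general idea of replacing a misrecognized word with its closest dictionary entry can be sketched in a few lines. This is not how the Cloud OCR API's spell checker is implemented (its algorithm is not described here); it is a simple stand-in that uses fuzzy matching from Python's standard difflib module, with a tiny hand-made vocabulary and an arbitrarily chosen similarity cutoff.

```python
import difflib


def correct_words(text: str, vocabulary: list[str], cutoff: float = 0.8) -> str:
    """Replace each unknown word with its closest vocabulary entry, if any."""
    known = set(vocabulary)
    corrected = []
    for word in text.split():
        if word.lower() in known:
            corrected.append(word)
            continue
        # get_close_matches returns candidates scoring at least `cutoff`,
        # ordered from most to least similar; n=1 keeps only the best one.
        matches = difflib.get_close_matches(word.lower(), vocabulary, n=1, cutoff=cutoff)
        # Words with no sufficiently close match are left untouched.
        corrected.append(matches[0] if matches else word)
    return " ".join(corrected)


if __name__ == "__main__":
    vocabulary = ["optical", "character", "recognition"]
    print(correct_words("optcal character recogniton", vocabulary))
```

A production spell checker would also need to handle punctuation, letter case, and context, which this sketch ignores.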

Create searchable PDFs

Convert a scanned PDF file to a searchable PDF document, which can be easily navigated and indexed. Text in searchable PDF documents can be selected, copied, and marked up.

Recognize images from the Internet

There is no need to upload images and PDF documents to cloud storage for recognition. Just send the image's web link to the Cloud OCR API and get the text back.

Unlimited possibilities with Aspose Cloud solutions

An Aspose Cloud account grants you access to the full range of our cloud APIs. You can combine image recognition with OMR, modify recognized documents and convert them to almost any format, and analyze and combine data from multiple sources. All tasks are performed in the same way, which speeds up development and reduces learning and maintenance costs, even for the most advanced business solutions.

  

Support and Learning Resources