What Is OCR?

Optical Character Recognition (OCR) allows you to detect and recognize printed characters in images and convert the characters into editable text in JSON format.

OCR provides services through open application programming interfaces (APIs). You can use programming languages such as Python and Java to call OCR APIs to recognize images as text, helping you automatically collect key data and build an intelligent service system to improve service efficiency. For details about how to obtain APIs, see the API Reference.

OCR also provides software development kits (SDKs) for multiple programming languages. For details about how to use SDKs, see the Optical Character Recognition SDK Reference.

Before You Start

You must have programming capabilities and be familiar with the Java, Python, iOS, Android, and Node.js programming languages.

OCR provides APIs for you to convert characters in images or scanned copies into editable text and returns the recognition result in JSON format. You can encode the recognition result and save it to a service system or save it in TXT or Excel format.

For details about OCR statements, see Service Statement.

OCR Capabilities

  • General OCR

    Automatically identifies text on images of any format (including network images, images with tables, images in documents, and more) and adaptively analyzes various layouts and tables.

  • Card OCR

    Automatically identifies valid information on images of certificates such as passports , ID cards, driving licenses, and vehicle licenses and converts the information into text.

  • Receipt OCR

    Automatically identifies valid information on forms and invoices, such as VAT invoices, motor vehicle sales invoices, and medical invoices, and converts the information into text.

  • Domain OCR

    Extracts structured information from images of logistics waybills and medical forms, facilitating industry automation.

  • Custom OCR

    Allows you to tailor character recognition to suit your specific needs. You can customize templates to specify key fields to be recognized in images.

Using OCR for the First Time

If you are a first-time user, the following information will help you get familiar with OCR: