Help Center > > Service Overview> What Is OCR?

What Is OCR?

Updated at: Feb 24, 2021 GMT+08:00

Optical Character Recognition (OCR) allows you to detect and recognize printed characters in images and convert the characters into editable text in JSON format.

OCR provides services through open application programming interfaces (APIs). You can use programming languages such as Python and Java to call OCR APIs to recognize images as text, helping you automatically collect key data and build an intelligent service system to improve service efficiency. For details about how to obtain APIs, see the API Reference.

OCR also provides software development kits (SDKs) for multiple programming languages. For details about how to use SDKs, see the Optical Character Recognition SDK Reference.

Before You Start

You must have programming capabilities and be familiar with the Java, Python, iOS, Android, and Node.js programming languages.

OCR provides APIs for you to convert characters in images or scanned copies into editable text and returns the recognition result in JSON format. You can encode the recognition result and save it to a service system or save it in TXT or Excel format.

OCR Capabilities

  • General OCR

    Automatically identifies text on images of any format (including network images and more) .

  • Card OCR

    Automatically identifies valid information on images of certificates such as passports , ID cards, driving licenses, and converts the information into text.

Using OCR for the First Time

If you are a first-time user, the following information will help you get familiar with OCR:

Did you find this page helpful?

Submit successfully!

Thank you for your feedback. Your feedback helps make our documentation better.

Failed to submit the feedback. Please try again later.

Which of the following issues have you encountered?







Please complete at least one feedback item.

Content most length 200 character

Content is empty.

OK Cancel