Updated on 2022-12-21 GMT+08:00

Introduction to OCR

Optical Character Recognition (OCR) detects and extracts text from images, scanned copies, PDFs, and OFD files, and converts the recognition results into editable text.

OCR provides open APIs, so you can use programming languages such as Python and Java to call OCR APIs to extract text from images. OCR allows you to automate the collection of key data. It helps you build an intelligent service system to improve efficiency. For details about how to obtain APIs, see Optical Character Recognition API Reference.

OCR also provides software development kits (SDKs) for multiple programming languages. For details about how to use SDKs, see Optical Character Recognition SDK Reference.

You can use either of the following methods to call OCR APIs after subscription:

  • A software tool such as curl or Postman

    These are good options if you are more comfortable writing code, HTTP requests, and API calls. For details about how to use the tool, see Using Postman to Call the Passport OCR API.

  • Software development kit (SDK)

    SDKs for Java, Python, and Node.js are available for quick integration. For details about how to use the tool, see Calling the Passport OCR SDK.