Help Center> Optical Character Recognition> Service Overview> What Is OCR?

What Is OCR?

Optical Character Recognition (OCR) allows you to detect and recognize printed characters in images and convert the characters into editable text in JSON format.

OCR provides services through open application programming interfaces (APIs). You can use programming languages such as Python and Java to call OCR APIs to recognize images as text, helping you automatically collect key data and build an intelligent service system to improve service efficiency. For details about how to obtain APIs, see the API Reference.

OCR also provides software development kits (SDKs) for multiple programming languages. For details about how to use SDKs, see the Optical Character Recognition SDK Reference.

Before You Start

You must have programming capabilities and be familiar with the Java, Python, iOS, Android, and Node.js programming languages.

OCR provides APIs for you to convert characters in images or scanned copies into editable text and returns the recognition result in JSON format. You can encode the recognition result and save it to a service system or save it in TXT or Excel format.

For details about OCR statements, see Service Statement.

OCR Capabilities

General OCR
Automatically identifies text on images of any format (including network images, images with tables, images in documents, and more) and adaptively analyzes various layouts and tables.
Card OCR
Automatically identifies valid information on images of certificates such as passports , ID cards, driving licenses, and vehicle licenses and converts the information into text.
Receipt OCR
Automatically identifies valid information on forms and invoices, such as VAT invoices, motor vehicle sales invoices, and medical invoices, and converts the information into text.
Domain OCR
Extracts structured information from images of logistics waybills and medical forms, facilitating industry automation.
Custom OCR
Allows you to tailor character recognition to suit your specific needs. You can customize templates to specify key fields to be recognized in images.

Using OCR for the First Time

If you are a first-time user, the following information will help you get familiar with OCR:

Function Description
This section helps you learn about OCR functions of different types, including Receipt OCR, General OCR and Card OCR.
Getting Started
OCR provides services through open APIs. You can learn how to use OCR by referring to the Optical Character Recognition Getting Started.
Using OCR
If you are a development engineer familiar with code compilation and want to directly call OCR APIs or SDKs, see the API Reference or SDK Reference.
From Beginners to Experts
You can learn how to use OCR by referring to Introduction to OCR.

Next Article: Function Description

Did this article solve your problem?

Thank you for your score！Your feedback would help us improve the website.

Products

Compute

Application

Dedicated Cloud

Storage

Management & Deployment

Migration

Network

Enterprise Intelligence

Video

Database

Edge Cloud Services

DevCloud

Security

Cloud Communications

Internet of Things

Solutions

Industry-Specific Solutions

General-Purpose Solutions

Security

DevOps

Enterprise Intelligence

Essential Platform

Big Data

Visual Cognition

Speech and Semantics

Support

Help Center

Customer Services

Developers

Console

语言 - Language

中国站 - 简体中文

中国站 - English

International - 简体中文

International - English

Help Center

What Is OCR?

Before You Start

OCR Capabilities

Using OCR for the First Time