What Is OCR?
Optical Character Recognition (OCR) allows you to detect and recognize printed characters in images and convert the characters into editable text in JSON format.
OCR provides services through open application programming interfaces (APIs). You can use programming languages such as Python and Java to call OCR APIs to recognize images as text, helping you automatically collect key data and build an intelligent service system to improve service efficiency. For details about how to obtain APIs, see the API Reference.
OCR also provides software development kits (SDKs) for multiple programming languages. For details about how to use SDKs, see the Optical Character Recognition SDK Reference.
Before You Start
You must have programming capabilities and be familiar with the Java, Python, iOS, Android, and Node.js programming languages.
OCR provides APIs for you to convert characters in images or scanned copies into editable text and returns the recognition result in JSON format. You can encode the recognition result and save it to a service system or save it in TXT or Excel format.
For details about OCR statements, see Service Statement.
OCR Capabilities
- General OCR
Automatically identifies text on images of any format (including network images, images with tables, images in documents, and more) and adaptively analyzes various layouts and tables.
- Card OCR
Automatically identifies valid information on images of certificates such as passports , ID cards, driving licenses, and vehicle licenses and converts the information into text.
- Receipt OCR
Automatically identifies valid information on forms and invoices, such as VAT invoices, motor vehicle sales invoices, and medical invoices, and converts the information into text.
- Domain OCR
Extracts structured information from images of logistics waybills and medical forms, facilitating industry automation.
- Custom OCR
Allows you to tailor character recognition to suit your specific needs. You can customize templates to specify key fields to be recognized in images.
Using OCR for the First Time
If you are a first-time user, the following information will help you get familiar with OCR:
- Function Description
This section helps you learn about OCR functions of different types, including Receipt OCR, General OCR and Card OCR.
- Getting Started
OCR provides services through open APIs. You can learn how to use OCR by referring to the Optical Character Recognition Getting Started.
- Using OCR
If you are a development engineer familiar with code compilation and want to directly call OCR APIs or SDKs, see the API Reference or SDK Reference.
- From Beginners to Experts
You can learn how to use OCR by referring to Introduction to OCR.
Next Article: Function Description
Did this article solve your problem?
Thank you for your score!Your feedback would help us improve the website.