Document Extraction

Sub-module for scanning and extracting (OCR) the text present on the KYC documents of the consumer, partner or business.

Scan & Extract

Important Security Update

Please Note: The URL provided should have one of the following extensions:

  • .jpeg
  • .png
  • .jpg
  • .pdf

Apart from having the correct extension, we check the MIME Type as well as the content if it corresponds to the permissible types.

Businesses can integrate with Decentro to extract information from a customer’s standard KYC document images as well as validate the same using the Validate APIs. The major benefit of this offering is that customers do not have to enter any information and business can just pre-fill the information from the document.

Decentro powers 2 flows here.
* Scan: This flow allows businesses to extract information from standard ID documents
* Scan & Verify: This flow allows businesses to extract information from standard ID documents as well as verify the same from the associated public repository

Documents Supported for OCR

Decentro supports the following documents for OCR:

  • PAN
  • Aadhaar
  • Driving License
  • Voter ID


Document Type Enumeration Standardized

In the request body, "document_type" parameter will only consider capital letters.

The special character's are not allowed -!@#$%^&*()~{}|<>,.:;"'?/+=

Accepted Enumeration Examples -

  1. PAN

Not Accepted Enumeration Examples -

  1. pan
  2. Driving_License

Except for Aadhaar, all the documents mentioned above are supported in the Scan and Verify flow as well.

The platform using this API can configure the flows at an API level by setting kyc_validate = 1 in the API's request body.

Currently, we support the following states for Driving License OCR:

  • Delhi
  • Karnataka
  • Madhya Pradesh
  • Rajasthan
  • Punjab
  • Bihar

Document Formats Supported for OCR

  • JPEG
  • PNG
  • JPG
  • PDF (1 Pager)