Document Data Extractor UI Components | Android Document Scanner

Introduction

The Scanbot SDK provides the ability to detect various types of documents in an image, crop them and recognize data fields via the Document Data Extractor.

Currently, the Document Data Extractor supports the following types of documents:

German ID Card
German Passport
German Driver's license
German Residence permit
European Health Insurance Card (EHIC)
German Driver Qualification Card

The Document Data Extractor is available both as an RTU UI and a classic component (types of components are explained here).

Integration

Take a look at our Example Apps to see how to integrate the Document Data Extractor.

Ready-To-Use UI: ready-to-use-ui-demo
Classic UI Components: Document Data Extractor example For Live Detection or Document Data Extractor Example For Auto Snapping

Add Feature as a Dependency

DocumentDataExtractor is available with SDK Package 3 (Data Capture Modules). You have to add the following dependencies for it:

implementation("io.scanbot:sdk-package-3:$latestSdkVersion")
implementation("io.scanbot:sdk-documentdata-assets:$latestSdkVersion")

caution

Please do not use multiple scanners at the same time. For example, do not combine generic document scanner, health insurance scanner, text data scanner, etc. at the same time! Each scanner instance requires a lot of memory, GPU, and processor resources. Using multiple scanners will lead to performance issues for the entire application.

Initialize the SDK

The Scanbot SDK must be initialized before use. Add the following code snippet to your Application class:

Initialize SDK
loading...

See full example on GitHub

caution

Unfortunately, we have noticed that all devices using a Cortex A53 processor DO NOT SUPPORT GPU acceleration. If you encounter any problems, please disable GPU acceleration for these devices.

ScanbotSDKInitializer()
        .allowGpuAcceleration(false)

Ready-To-Use UI Component

Ready-To-Use UI Component (activity) that is responsible for scanning documents supported by the DocumentDataExtractor is DocumentDataExtractorActivity.

alt text

Have a look at our end-to-end working example of the RTU components usage here.

Starting and configuring RTU Document Data Extractor

First of all, you have to add the SDK package and feature dependencies as described here.

Initialize the SDK as described here. More information about the SDK license initialization can be found here.

To use any of the RTU UI components you need to include the corresponding dependency in your build.gradle file:

implementation("io.scanbot:sdk-package-ui:$scanbotSdkVersion")

Get the latest $scanbotSdkVersion from the Changelog.

To start the RTU Document Data Extractor you only have to start a new activity and be ready to process its result later.

info

Starting from version 1.90.0, the SDK RTU components contain predefined AndroidX Result API contracts. They handle part of the boilerplate for starting the RTU activity component and mapping the result once it finishes.

If your code is bundled with Android's deprecated startActivityForResult API - check the other approach we offer for this case.

AndroidX Result API
old 'startActivityForResult' approach

Start RTU Document Data Extractor and Process Result
loading...

See full example on GitHub

(DEPRECATED) Start RTU Document Data Extractor
loading...

See full example on GitHub

(DEPRECATED) Process Data Extractor Result
loading...

See full example on GitHub

info

We offer some syntactic sugar for handling the result from RTU components via AndroidX Result API:

every RTU component's activity contains a Result class which, in turn, along with the resultCode value exposes a Boolean resultOk property. This will be true if resultCode equals Activity.RESULT_OK;
when you only expect Activity.RESULT_OK result code - you can use the AppCompatActivity.registerForActivityResultOk extension method instead of registerForActivityResult - it will be triggered only when there is a non-nullable result entity present.

caution

Always use the corresponding activity's static newIntent method to create intent when starting the RTU UI activity using deprecated startActivityForResult approach. Creating android.content.Intent object using its constructor (passing the activity's class as a parameter) will lead to the RTU UI component malfunctioning.

An instance of DocumentDataExtractorConfiguration is required for starting the RTU UI activity. It allows configuration changes through methods it exposes:

Configure Document Data Extractor RTU UI activity
loading...

See full example on GitHub

Excluding fields from scanning in RTU UI

It is also possible to exclude certain fields from the scanning process altogether. When implemented, these excluded fields will not even be attempted to be recognized. This is useful for security and/or privacy reasons. All other fields will be scanned as usual. Fields should be set ONLY as normalized field names.

Exclude fields from being recognized in the configuration
loading...

See full example on GitHub

info

All parameters in DocumentDataExtractorConfiguration are optional.

Full API references for these methods can be found on this page.

Handling the result

Below is a simple example of handling the result: List<DocumentDataExtractionResult>.

For simplicity we will take only the first document

Get the first document from the result list
loading...

See full example on GitHub

Now you can simply show the detected document fields in a Toast notification:

Show the detected document fields in a Toast notification
loading...

See full example on GitHub

It is also possible to use the GenericDocumentWrapper successors to use strongly typed objects and conveniently get access to the fields of the corresponding document.

To receive an instance of the scanned document wrapper as follows:

Manual data parsing snippet
loading...

See full example on GitHub

Full API references for the result wrapper class are available here and for GenericDocument class here.

Classic component

To integrate the classic component of the Document Data Extractor you can take a look at our DocumentDataExtractor Example For Live Detection, DocumentDataExtractor Example For Auto Snapping or check the following step-by-step integration instructions.

DocumentDataExtractor can be used both in conjunction with ScanbotCameraXView (e.g. live detection for preview) and by itself for detection on a Bitmap or JPEG byte array. Let's have a look at an example with ScanbotCameraXView.

Add feature dependencies and initialize the SDK

First of all, you have to add the SDK package and feature dependencies as described here.

Initialize the SDK as described here. More information about the SDK license initialization can be found here.

Add `ScanbotCameraXView` to layout

<io.scanbot.sdk.ui.camera.ScanbotCameraXView
    android:id="@+id/camera_view"
    android:layout_width="match_parent"
    android:layout_height="match_parent" />

Get `DocumentDataExtractor` instance from `ScanbotSDK`, set the required document types and blurriness acceptance score, then attach it to `ScanbotCameraXView`

Get DocumentDataExtractor instance and attach it to ScanbotCameraXView
loading...

See full example on GitHub

Excluding fields from scanning for Document Data Extractor

Exclude fields from being recognized
loading...

See full example on GitHub

Add a result handler for `DocumentDataExtractorFrameHandler`

Add a frame handler which, for example, observes consecutive successful recognition statuses and shows a toast notification whenever two or more such statuses are received.

Add a frame handler for DocumentDataExtractor
loading...

See full example on GitHub

Method handle(result: FrameHandlerResult<DocumentDataExtractionResult, SdkLicenseError>) will be triggered every time DocumentDataExtractor detects a document in the camera preview frame or if a license error has occurred.

If the result of the scanning was successful, the user gets the DocumentDataExtractionResult object which contains a cropped document image and a GenericDocument object. Each field is represented by the Field class, holding the field's type, cropped visual source, recognized text and confidence level value.

You can now run your app and should see a simple camera preview that can scan your documents.

Pass snapped picture to `DocumentDataExtractor`, process results

First, decode the image ByteArray obtained from the camera's callback, taking into account the image orientation. Our ImageProcessor component can be used for this:

Rotate image
loading...

See full example on GitHub

Next, we perform a recognition:

Extract data from the image
loading...

See full example on GitHub

As an example of further application, set the obtained extraction results to a TextView:

Set the obtained extraction results to a TextView
loading...

See full example on GitHub

It is also possible to use the GenericDocumentWrapper successors to use strongly typed objects and conveniently get access to the fields of the corresponding document.

To receive an instance of the scanned document wrapper, check follow example:

Document Data Extractor example For Live Detection or Document Data Extractor Example For Auto Snapping

Add a Finder Overlay

In addition, it is recommended to add a "Finder Overlay". This feature allows you to predefine a document over the ScanbotCameraXView screen. By using this overlay the Document Data Extractor can skip the time-consuming step "Search for the document area" and perform the recognition directly in the specified "Finder Overlay" area. By using this approach the Document Data Extractor finds and extracts the document content much faster.

Details about applying finder view logic in the layout and in the code can be found here.

Want to scan longer than one minute?

Generate a free trial license to test the Scanbot SDK thoroughly.

Get your free Trial License

Document Data Extractor UI Components | Android Document Scanner

Introduction

Integration

Add Feature as a Dependency

Initialize the SDK

Ready-To-Use UI Component

Starting and configuring RTU Document Data Extractor

Excluding fields from scanning in RTU UI

Handling the result

Classic component

Add feature dependencies and initialize the SDK

Add `ScanbotCameraXView` to layout

Get `DocumentDataExtractor` instance from `ScanbotSDK`, set the required document types and blurriness acceptance score, then attach it to `ScanbotCameraXView`

Excluding fields from scanning for Document Data Extractor

Add a result handler for `DocumentDataExtractorFrameHandler`

Pass snapped picture to `DocumentDataExtractor`, process results

Add a Finder Overlay

Want to scan longer than one minute?

What do you think of this documentation?

On this page

Document Data Extractor UI Components | Android Document Scanner

Introduction​

Integration​

Add Feature as a Dependency​

Initialize the SDK​

Ready-To-Use UI Component​

Starting and configuring RTU Document Data Extractor​

Excluding fields from scanning in RTU UI​

Handling the result​

Classic component​

Add feature dependencies and initialize the SDK​

Add ScanbotCameraXView to layout​

Get DocumentDataExtractor instance from ScanbotSDK, set the required document types and blurriness acceptance score, then attach it to ScanbotCameraXView​

Excluding fields from scanning for Document Data Extractor​

Add a result handler for DocumentDataExtractorFrameHandler​

Pass snapped picture to DocumentDataExtractor, process results​

Add a Finder Overlay​

Want to scan longer than one minute?

What do you think of this documentation?

On this page

Introduction

Integration

Add Feature as a Dependency

Initialize the SDK

Ready-To-Use UI Component

Starting and configuring RTU Document Data Extractor

Excluding fields from scanning in RTU UI

Handling the result

Classic component

Add feature dependencies and initialize the SDK

Add `ScanbotCameraXView` to layout

Get `DocumentDataExtractor` instance from `ScanbotSDK`, set the required document types and blurriness acceptance score, then attach it to `ScanbotCameraXView`

Excluding fields from scanning for Document Data Extractor

Add a result handler for `DocumentDataExtractorFrameHandler`

Pass snapped picture to `DocumentDataExtractor`, process results

Add a Finder Overlay