Classic UI for the Android Document Data Extractor Module
Overview
To integrate the Document Data Extractor using the SDK's Classic UI Components, you can take a look at our examples for live detection and auto-snapping or check the following step-by-step integration instructions.
DocumentDataExtractor
can be used both in conjunction with ScanbotCameraXView
(e.g., live detection for preview) and by itself for detection on a Bitmap
or JPEG byte array. Let's have a look at an example with ScanbotCameraXView
.
Integration
Adding the feature as a dependency
DocumentDataExtractor
is included in Scanbot SDK package 3. Therefore, add the dependency io.scanbot:sdk-package-3
or higher in your build.gradle
:
implementation("io.scanbot:sdk-package-3:$latestSdkVersion")
implementation("io.scanbot:sdk-documentdata-assets:$latestSdkVersion")
Do not use multiple scanners (e.g., MRZ Scanner and Credit Card Scanner) at the same time.
Each scanner instance requires a lot of memory, GPU, and processor resources. Using multiple scanners will lead to performance issues for the entire application.
Initializing the SDK
The Scanbot SDK must be initialized before use. Add the following code snippet to your Application
class:
loading...
Unfortunately, we have noticed that all devices using a Cortex A53 processor DO NOT SUPPORT GPU acceleration. If you encounter any problems, please disable GPU acceleration for these devices.
ScanbotSDKInitializer()
.allowGpuAcceleration(false)
Adding ScanbotCameraXView
to the layout
<io.scanbot.sdk.ui.camera.ScanbotCameraXView
android:id="@+id/camera_view"
android:layout_width="match_parent"
android:layout_height="match_parent" />
Getting the DocumentDataExtractor
instance from ScanbotSDK
, setting the required document types and blurriness acceptance score, and attaching it to ScanbotCameraXView
loading...
Excluding fields from scanning
It is also possible to exclude certain fields from the scanning process altogether. When implemented, these excluded fields will not even be attempted to be recognized.
This can be useful for security and privacy reasons. All other fields will be scanned as usual. Fields should be set ONLY as normalized field names.
loading...
Adding a result handler for DocumentDataExtractorFrameHandler
Add a frame handler which, for example, observes consecutive successful recognition statuses and shows a toast notification whenever two or more such statuses are received.
loading...
Method handle(result: FrameHandlerResult<DocumentDataExtractionResult, SdkLicenseError>)
will be triggered every time DocumentDataExtractor
detects a document in the camera preview frame or if a license error has occurred.
If the result of the scanning was successful, the user gets the DocumentDataExtractionResult
object, which contains a cropped document image and a GenericDocument
object.
Each field is represented by the Field
class, holding the field's type, cropped visual source, recognized text and confidence level value.
You can now run your app and should see a simple camera preview that can scan your documents.
Passing a snapped picture to DocumentDataExtractor
and processing the results
First, decode the image ByteArray
obtained from the camera's callback, taking into account the image orientation. Our ImageProcessor
component can be used for this:
loading...
Next, we perform a recognition:
loading...
As an example of further application, set the obtained extraction results to a TextView
:
loading...
It is also possible to use the GenericDocumentWrapper
successors to use strongly typed objects and conveniently get access to the fields of the corresponding document.
As for how to receive an instance of the scanned document wrapper, take a look at our examples for live detection and auto-snapping.
Adding a finder overlay
In addition, it is recommended to add a finder overlay. This feature allows you to predefine a document area over the ScanbotCameraXView
screen. By using this overlay, the Document Data Extractor skips the time-consuming step of searching for the document area and performs the recognition directly in the specified area. By using this approach, the Document Data Extractor finds and extracts the document content much faster.
Details about applying finder view logic in the layout and in the code can be found here.
Want to scan longer than one minute?
Generate a free trial license to test the Scanbot SDK thoroughly.
Get free trial license