Scanning a generic document from an image | Android Document Scanner

The Scanbot SDK Generic Document Scanner is not only capable of live detection of Passports, ID Cards, etc., but it can also extract content from still images imported from the gallery or any other source.

Integration

The Example Apps show the Generic Document Scanner in action and demonstrate how to integrate it into your project.

Feature Dependency

The GenericDocumentRecognizer feature is available with SDK Package 3 (Data Capture Modules). The following dependencies need to be added:

implementation("io.scanbot:sdk-package-3:$latestSdkVersion")
implementation("io.scanbot:sdk-genericdocument-assets:$latestSdkVersion")

SDK Initialization

Before use, the Scanbot SDK needs to be initialized. The following code snippet should be added to your Application class:

import android.app.Application
import io.scanbot.sdk.ScanbotSDKInitializer

class ExampleApplication : Application() {

    override fun onCreate() {
        super.onCreate()

        // The Scanbot Scanner SDK initialization:
        ScanbotSDKInitializer()
            ...
            .initialize(this)
    }
}

Image Processing

To select an image from the photo library and run detection on it, create an ActivityResultContract for importing images using the Android Activity Result API.

import android.app.Activity
import android.content.Context
import android.content.Intent
import android.graphics.Bitmap
import android.provider.MediaStore
import androidx.activity.result.contract.ActivityResultContract

class ImportImageContract(private val context: Context) : ActivityResultContract<Unit, Bitmap?>() {
    override fun createIntent(context: Context, input: Unit): Intent {
        // Build an intent that lets the user pick a single image from the photo library:
        val imageIntent = Intent()
        imageIntent.type = "image/*"
        imageIntent.action = Intent.ACTION_GET_CONTENT
        imageIntent.putExtra(Intent.EXTRA_LOCAL_ONLY, false)
        imageIntent.putExtra(Intent.EXTRA_ALLOW_MULTIPLE, false)

        return Intent.createChooser(imageIntent, "Select a picture")
    }

    private fun processGalleryResult(data: Intent): Bitmap? {
        // The picker may return no URI; return null instead of passing it on:
        val imageUri = data.data ?: return null
        return MediaStore.Images.Media.getBitmap(context.contentResolver, imageUri)
    }

    override fun parseResult(resultCode: Int, intent: Intent?): Bitmap? {
        // Only a successful result with a data intent yields a Bitmap:
        return if (resultCode == Activity.RESULT_OK && intent != null) {
            processGalleryResult(intent)
        } else {
            null
        }
    }
}
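
MediaStore.Images.Media.getBitmap, which processGalleryResult relies on, is deprecated as of Android API level 29. The following is a minimal sketch of an alternative, assuming the platform ImageDecoder API (available from API level 28) is acceptable for your app. The function name loadBitmap is hypothetical and is not part of the Scanbot SDK. The sketch requests a software bitmap on the assumption that later processing needs to read the pixels, which hardware bitmaps do not allow directly.

```kotlin
import android.content.ContentResolver
import android.graphics.Bitmap
import android.graphics.ImageDecoder
import android.net.Uri
import android.os.Build
import android.provider.MediaStore

// Hypothetical helper (not part of the Scanbot SDK): loads a Bitmap from a content URI.
// Both decoding paths can throw an IOException if the image cannot be read.
@Suppress("DEPRECATION")
fun loadBitmap(contentResolver: ContentResolver, imageUri: Uri): Bitmap {
    return if (Build.VERSION.SDK_INT >= Build.VERSION_CODES.P) {
        // API 28+: decode through ImageDecoder.
        val source = ImageDecoder.createSource(contentResolver, imageUri)
        ImageDecoder.decodeBitmap(source) { decoder, _, _ ->
            // Request a software bitmap so that its pixels can be read later.
            decoder.allocator = ImageDecoder.ALLOCATOR_SOFTWARE
        }
    } else {
        // Older devices: fall back to the deprecated MediaStore call.
        MediaStore.Images.Media.getBitmap(contentResolver, imageUri)
    }
}
```

Inside ImportImageContract, such a helper could stand in for the direct MediaStore call in processGalleryResult, wrapped in whatever error handling fits your app.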

To open the gallery picker and receive a Bitmap through ImportImageContract, the following code is used:

val galleryImageLauncher =
    registerForActivityResult(ImportImageContract(this)) { resultEntity ->
        lifecycleScope.launch(Dispatchers.Default) {
            val activity = this@MainActivity
            val sdk = ScanbotSDK(activity)
            if (!sdk.licenseInfo.isValid) {
                withContext(Dispatchers.Main) {
                    Toast.makeText(
                        activity,
                        "License has expired!",
                        Toast.LENGTH_LONG
                    ).show()
                }
            } else {
                resultEntity?.let { bitmap ->
                    // Image processing is carried out
                    // processImage()
                }
            }
        }
    }
findViewById<View>(R.id.import_image).setOnClickListener {
    galleryImageLauncher.launch(Unit)
}

Scanner Creation

val scanbotSDK = ScanbotSDK(this)
val documentRecognizer = scanbotSDK.createGenericDocumentRecognizer()

Caution

Each call to createGenericDocumentRecognizer() creates a new GenericDocumentRecognizer and allocates memory for it. If more than one import operation is expected, create the recognizer once and reuse it instead of creating a new one for each operation.
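
One way to follow this advice is to create the recognizer lazily and keep it for the lifetime of the screen. The following is a minimal, SDK-free sketch of that pattern: StandInRecognizer and ImportScreen are hypothetical names used only for illustration, and in a real app the factory lambda would call createGenericDocumentRecognizer() instead.

```kotlin
// Hypothetical stand-in for the SDK recognizer, so the sketch runs without Android.
class StandInRecognizer {
    fun scan(imageName: String): String = "scanned:$imageName"
}

// Hypothetical screen that imports several images over its lifetime.
class ImportScreen(createRecognizer: () -> StandInRecognizer) {
    // Created on first use, then reused for every later import.
    private val recognizer by lazy(createRecognizer)

    fun onImageImported(imageName: String): String = recognizer.scan(imageName)
}

fun main() {
    var created = 0
    val screen = ImportScreen { created++; StandInRecognizer() }

    screen.onImageImported("first.jpg")
    screen.onImageImported("second.jpg")

    // The factory ran only once, although two images were processed.
    println("recognizers created: $created")
}
```

The lazy delegate also defers the memory cost until the first import actually happens, which helps when the user may never open the gallery.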

Detection

The recognizer is then run on the imported bitmap to detect the document and extract its content.

private fun processImage(
    documentRecognizer: GenericDocumentRecognizer,
    bitmap: Bitmap
) {
    val result = documentRecognizer.scanBitmap(bitmap, true, 0)
    // Document recognition results are processed
    // processResult(result)
}
