I had an issue regarding checkboxes.let me explain problem. i had check boxes with

duplicates in checkboxes with same value about amazon-textract-textractor HOT 2 CLOSED

aws-samples commented on May 29, 2024

duplicates in checkboxes with same value

from amazon-textract-textractor.

Comments (2)

schadem commented on May 29, 2024

Assuming that in your example Amazon Textract identified the following selection elements as key/value pairs:

male - Not Selected
female - Not Selected
male - Not Selected
female - Not Selected

You can go by geometry information from the keys and values to get the context.
Example how to get the geometry from the Textract Response Parser (https://github.com/aws-samples/amazon-textract-response-parser):

for page in doc.pages: for field in page.form.fields: t = f"key: {field.key.text}({field.key.geometry.boundingBox}): {geo: field.value.text} (geo: {field.value.geometry.boundingBox})" print(f"{t}")

The geometry information allows you to identify which male|female belongs to which context by looking at the position relative to each other.

Hope this helps to unblock you.

from amazon-textract-textractor.

schadem commented on May 29, 2024

Check out the geofinder, which helps with identifying context.

from amazon-textract-textractor.

duplicates in checkboxes with same value about amazon-textract-textractor HOT 2 CLOSED

Comments (2)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent