JPEG conversion in `analyze_document` significantly impacts table predictions

When obtaining predictions through `analyze_document`, the image is converted to JPEG https://github.com/aws-samples/amazon-textract-textractor/blob/master/textractor/textractor.py#L845. The compression is enough to degrade the table predictions.

We should check and keep the format, assuming that it is supported by Textract to avoid discrepancies between calling Textract with Textractor and calling Textract with `boto3`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

JPEG conversion in `analyze_document` significantly impacts table predictions #341

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

JPEG conversion in analyze_document significantly impacts table predictions #341

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

JPEG conversion in `analyze_document` significantly impacts table predictions #341