-
Notifications
You must be signed in to change notification settings - Fork 162
Closed
Description
amazon-textract-caller==0.2.2
amazon-textract-response-parser==1.0.2
amazon-textract-textractor==1.7.4
Why is start_document_analysis() using so much memory? This is for pdfs of below 1000 pages.
Even with memory size of 32GB my container gets killed due to overloaded memory. Is there a leak?
Happens both locally and on ECS
I use the boto3 library with no issues on same pdfs.
Usage:
document = extractor.start_document_analysis(
file_source=TRIAL_DOCUMENT_S3_URI,
features=[TextractFeatures.LAYOUT,TextractFeatures.TABLES],
s3_output_path=<some_s3_path>
)
Metadata
Metadata
Assignees
Labels
No labels