Code

Discussion on AWS Amazon Textract - Extract Text Forms Tables from Images and PDFs with ML

Discussion on AWS Amazon Textract - Extract Text Forms Tables from Images and PDFs with ML

By
Cart 100 sales

Berkine supports this item

Supported

This author's response time can be up to 1 business day.

49 comments found.

Hi, when do you plan to release new version?

Hi,

Somewhere towards the end of the month a new SaaS version will be released. We will update this script afterwards as well.

Ok, very excited to wait. Surely there is a promotion for the new Saas version?

I will notify you ones it will be released

will handwritten documents work

Textract officially supports handwritten documents in english starting recently, but we haven’t tested it on our script… we will update this script towards the end of the year with different framework.

We like to have to customize the the textract for comparison document, wouLd you please email us for details thanks

Send us an email via our profile with further details on what are you planning to achieve…

Is possible extrac text and images from pdf ??

You can extract only text, both from images and pdf files, and also current version of this script only supports English language.

i need extract images from PDF, do you know if is possible? i send a email for private custom..

It won’t be possible with Amazon Textract. I will reply over an email as well.

Hi Berkine, nice job! I’m about to buy your script. Can you please explain in detail this : “Additionally, you can create smart search indexes, build automated approval workflows, and better maintain compliance with document archival rules by flagging data that may require redaction.” How to do that with your script? Thanks.

Hi,

That part was copied from the main Textract page, and not implemented in our script. To achieve this, additional AWS services and components needs to be implemented. Thanks for letting us know.

Hello, I am looking for a product like yours that can do the following: read PDF/JPG text. mainly invoices. Text need to be extractable and it would be ideal to save th data as an xml or any data format that can be imported in my website during a cron run. Can your software do that?

Hi,

Yes it does support PDF/JPG extensions, and tables are saved in comma separated csv format. You will need AWS account for it, and there you can test with Textract your files, if you are satisfied with the results, our script will give you the same results since we use exact same Textract APIs.

Hi Are you open for customize this solution with GOOGLE CLOUD VISION API?

Hi,

Can you tell me what feature are you looking for? Is it just to grab the text out of it or tables and key pairs as well?

Google Vision if I recall correctly has to be compared to Amazon Rekognition. I don’t know how good Google Vision is for full size texts, but it does not grab tables or separate key pairs, at least didn’t hear anything new on this features from them.

Due to limited languages support on Amazon. Cloud Vision has something that we need. Are you open to do customization? we are happy with the extra cost.

hi there I want image to text service for new language which doesn’t support Amazon How can we rain that?

Hi,

You can’t. Only Amazon can train a new language and make it publicly available.

Textract supports: English, and starting recently: Spanish, Italian, French, Portuguese and German – these languages will be added in our next update.

I want to buy a couple of your products. Do you provide installation service? If yes what will be the cost? Is there any image annotation tool available?

Hi,

Yes we provide complete manual setup and installation from our side for additional cost, we can discuss the details at berkinedesign@gmail.com

Is there any image annotation tool available? – I didn’t get this part, can you leverage a bit more on this?

Hi The script seems impressive. I have created the user and bucket , still nothing happens. The documentation is missing some images or instructions. Please help.

Hi,

Thank you for your purchase.

AWS setup requires a bit more than IAM user and a bucket for this script, can you send us a support request at berkinedesign@gmail.com and we will help you with setup.

Hello, my need is the following:

a) In my custom App, I will take a photo of an ID Card (containing text) and upload the resulting image to a server.

b) On the server, receive the image, pass it to an OCR (api like yours or include it in a PHP routine) and extract the characters from the image. Then form a TXT file. In this step there should be no interface with a human or human intervention (that is, I do not need a sophisticated interface like the one in your demo).

c) Take the TXT file for further automatic patterns processing.

Can you tell me if your product can help me to do the tasks in step b) (only that step, not a) or c), that will be in my charge)

Thank you so much.

Hi,

In a nutshell it requires custom work based on this script, since it solve task of processing Textract results.

a) ok b) requires customization of code obviously, better to setup a Lambda function to output the result to S3 bucket as txt format, and pull it from there for further processing.

I’m Interested in Buying, before that i like to know ; 1) Can you add a login screen? 2)save the extracted data in a database?

We will be have admin panels for all of our current products, coming up 1 by 1.

1 – are you looking for login screen before the frontend? 2 – yes, processed file results will be stored in a db

Great, I will definitely purchase if both these features are available.Are these features already their or need further customisation.

can you provide your contact info. Or contact me at rahular1512@gmail.com

I have sent you an email.

can I add custom language?

custom language? you mean alphabet?

Hello, I checked the system. I’m interested to purchased it.But I have one question before that, please reply me ASAP. Let’s say in a invoice PDF file, I have 9 columns in a table and I need 6 of them, I need the capability to select that 6 specific columns. Then after analyzing that 6 columns will show in the .CSV file. Is this possible to add in your application?

Hi,

To achieve this, you can either customize client side js file to grab first 6 columns or edit it afterwards the csv file as needed. For editing js file a bit of understanding javascript might be required.

do you have any plans to make a system using aws ses (Amazon Simple Email Service), something like that would be very good

yes we do, most of aws services are in the roadmap

Hi, may I know if you are available for customization.. I’m planning to have a document analysis…. then by upload PDF..I’m able to set/create some sort of pattern …and able to identify the frequency of keywords

Hi,

let me know more details over an email, especially regarding pattern that you want to achieve to see what are you trying to achieve.

You can send an email to berkinedesign (at) gmail.com

Buttons for upload pdf and images are not clickable …

Yep, that’s why you have a tooltip notification there…

so, english is not 1 language in thee world – i want to test for by in my language – but its impossible, sure i am not 1 who think so …

I agree with your comment about english not being the only language in the world… but for the rest I would ask you to read the description of this app..

Amazon Textract currently supports only Latin-Script Characters(English Alphabet), and we don’t know when they will support cyrillic or any other character types… otherwise we would include demo documents in these languages as well

Hi, in Maximum Textract Lambda function invokes error during runtime:

{ "errorType": "TypeError", "errorMessage": "Cannot read property 'Type' of undefined", "stack": [ "TypeError: Cannot read property 'Type' of undefined", " at get_table_text (/var/task/index.js:311:25)", " at get_rows_columns_map (/var/task/index.js:274:53)", " at generate_table_csv (/var/task/index.js:208:18)", " at Runtime.exports.handler (/var/task/index.js:161:26)", " at processTicksAndRejections (internal/process/task_queues.js:97:5)" ] } Please advice accordingly. Thanks.

Hi,

Thanks for your purchase,

I’ve replied to your support request over an email.

Hi, I have some customization request. can you share your email?

Hi, you can send your request to: berkinedesign (at) gmail.com

Hi, Thanks for your awesome web app. Would you please tell me where should I set google secret key?

I mean recaptcha v3 secret key

Hi,

Thank you for your purchase!

It is explained with screenshots on Stage 6, in Maximum Textract Setup section.

Send us a support request if you will still face difficulties

by
by
by
by
by
by

Tell us what you think!

We'd like to ask you a few questions to help improve CodeCanyon.

Sure, take me to the survey