WORKLOAD AUTOMATION COMMUNITY
  • Home
  • Blogs
  • Forum
  • Resources
  • Events
  • About
  • Contact
  • What's new

Amazon Textract with Workload Automation

7/7/2025

0 Comments

 
Let us begin with understanding Amazon Textract, what it is all about along with our Amazon Textract Plug-in, and how it benefits our HCL Automation Orchestration users.
 
The Amazon Textract plugin can be downloaded from Automation Hub to enhance your Workload Automation setup.
Picture
​In today’s fast-paced digital world, businesses are constantly looking for ways to streamline operations and improve efficiency. Document processing, a critical yet time-consuming task, often becomes a bottleneck. Enter Amazon Textract, a powerful machine learning service by AWS that extracts text, handwriting, and data from scanned documents. With the Amazon Textract Plug-in for Workload Automation, you can now seamlessly integrate this capability into your workflows, automating document processing and enabling smarter business operations.
Key Features of the Amazon Textract Plug-in:
●      Amazon Textract Integration: Seamlessly connect Workload Automation with the Amazon Textract service to submit documents for analysis and retrieve extracted data.
●      Document Analysis Orchestration: Automate the process of sending documents (images or PDFs) to Amazon Textract for text detection, form data extraction, and table extraction.
●      Asynchronous Processing Management: Handle the asynchronous nature of Amazon Textract processing by monitoring job status and retrieving results upon completion.
●      Data Retrieval and Integration: Automatically retrieve the extracted text and structured data from Amazon Textract and integrate it into downstream workflows or applications.
●      Error Handling and Retry Mechanisms: Implement robust error handling to manage potential issues during document processing and configure automated retry attempts.
Use Cases:
The Amazon Textract Plug-in opens a world of possibilities for automating document-centric workflows. Here are some common use cases:
●      Invoice Processing: Automate the extraction of key information from invoices, such as vendor details, invoice number, line items, and totals.
●      Forms Processing: Automatically extract data from various types of forms, such as applications, surveys, and medical questionnaires.
●      Document Archiving and Search: Extract text content from scanned documents to create searchable archives.
●      Compliance and Regulatory Reporting: Automate the extraction of data required for compliance reports from scanned documents.
●      Data Entry Automation: Reduce manual data entry by automatically extracting information from documents and populating databases or applications.
Example Workflow: Policy Document Processing
Insurance Company Utilizes Workload Automation to Orchestrate Policy Document Processing with Amazon Textract
●      Orchestration: Workload Automation monitors a designated storage location for new scanned policy documents (e.g., PDFs of applications or claims).
●      Document Submission: Upon detection of a new document, Workload Automation triggers the Amazon Textract plugin to submit the document for analysis to extract text and form data.
●      Asynchronous Monitoring: The plugin monitors the status of the Textract job.
●      Data Retrieval: Once the Textract job is complete, the plugin retrieves the extracted data, including policyholder information, coverage details, and claim information.
●      Data Integration: Workload Automation then integrates this extracted data into the insurance company's policy management system or claims processing system for further action.
Getting Started with the Amazon Textract Plug-in.
 
To begin using the Amazon Textract Plug-in, follow these steps:
 
1.      Connect to Amazon Textract
 
Log in to the Dynamic Workload/UnO Console and open the Workload Designer/Designer. Create a new job definition/task template and select the Amazon Textract.
 
Access Key and Secret Key: Provide the AWS access key ID and secret key.
Region: Specify the AWS region where Amazon Textract is hosted (e.g., us-east-1).
Role ARN: Provide the Amazon Resource Name (ARN) of the role to assume for accessing Textract.
Test Connection: Verify the connection to ensure the credentials and region are correctly configured.
Picture
Picture
Picture
Picture
 ​2. Define the Action
In the Action tab, specify the details of the document to be processed:
 
Adapter Id: Select the adapter id.
Adapter Version: Select adapter version.
Document Path: Provide the path to the document or the S3 object key.
Bucket Name: Specify the S3 bucket where the document is stored.
 
3. Add/Save and Submit/Run your Job
Once the job is defined, add/save it and submit/run it to the current plan. Add the job to a job stream to automate your business process flow. You can monitor the job execution in the Monitoring View.
 
4. Monitor Job Execution
Track the job’s progress in the Monitor Page. If the job completes successfully, the status will update to “Successful”.
Picture
Picture
Picture
Picture
Picture
Picture
Why Choose the Amazon Textract Plug-in?
 
The Amazon Textract Plug-in empowers businesses to automate document processing workflows, saving time and reducing errors. By integrating with Workload Automation, you can centralize and streamline operations, ensuring scalability and efficiency. Whether you’re processing invoices, forms, or compliance documents, this plug-in is your gateway to smarter, automated workflows.
 
 
Job Log Details:
Picture
Picture
In Conclusion:
The Amazon Textract plugin is a powerful tool that empowers you to integrate intelligent document processing capabilities into your Workload Automation workflows. By automating the extraction of text and structured data from documents, you can significantly reduce manual effort, improve data accuracy, accelerate business processes, and unlock valuable insights from your document repositories.

Picture
​Nilesh Kumar Mishra - Senior Software Engineer at HCL Software
Works as a Plug-in Developer and L3 support in the Workload Automation Plug-in Factory team. Proficient in Java, Git, Maven, Docker, Kafka, Spring MVC, Spring Boot and SQL.
In his leisure time, he enjoys listening to a variety of music and admires the wonders of nature finding inspiration in its wonders.

Picture
​​Ernesto Carrabba, Product Manager, HCL Clara, HCL HERO and HCL Workload Automation 
Ernesto Carrabba is the Product Manager for HCL Clara, HCL HERO and HCL Workload Automation. Ernesto is a very dynamic product manager with experience in building and launching IoT products, combined with a master's degree in mechanical engineering and study researches on Augmented and Virtual Reality

0 Comments

Your comment will be posted after it is approved.


Leave a Reply.

    Archives

    October 2025
    July 2025
    June 2025
    May 2025
    March 2025
    February 2025
    January 2025
    December 2024
    November 2024
    October 2024
    September 2024
    August 2024
    July 2024
    June 2024
    May 2024
    April 2024
    March 2024
    February 2024
    January 2024
    October 2023
    August 2023
    July 2023
    June 2023
    May 2023
    April 2023
    March 2023
    February 2023
    January 2023
    December 2022
    September 2022
    August 2022
    July 2022
    June 2022
    May 2022
    April 2022
    March 2022
    February 2022
    January 2022
    December 2021
    October 2021
    September 2021
    August 2021
    July 2021
    June 2021
    May 2021
    April 2021
    March 2021
    February 2021
    January 2021
    December 2020
    November 2020
    October 2020
    September 2020
    August 2020
    July 2020
    June 2020
    May 2020
    April 2020
    March 2020
    January 2020
    December 2019
    November 2019
    October 2019
    August 2019
    July 2019
    June 2019
    May 2019
    April 2019
    March 2019
    February 2019
    January 2019
    December 2018
    November 2018
    October 2018
    September 2018
    August 2018
    July 2018
    June 2018
    May 2018
    April 2018
    March 2018
    February 2018
    January 2018
    December 2017
    November 2017
    October 2017
    September 2017
    August 2017
    July 2017
    June 2017
    May 2017

    Categories

    All
    Analytics
    Azure
    Business Applications
    Cloud
    Data Storage
    DevOps
    Monitoring & Reporting

    RSS Feed

www.hcltechsw.com
About HCL Software 
HCL Software is a division of HCL Technologies (HCL) that operates its primary software business. It develops, markets, sells, and supports over 20 product families in the areas of DevSecOps, Automation, Digital Solutions, Data Management, Marketing and Commerce, and Mainframes. HCL Software has offices and labs around the world to serve thousands of customers. Its mission is to drive ultimate customer success with their IT investments through relentless innovation of its products. For more information, To know more  please visit www.hcltechsw.com.  Copyright © 2024 HCL Technologies Limited
  • Home
  • Blogs
  • Forum
  • Resources
  • Events
  • About
  • Contact
  • What's new