Feb 4, 2026

How to Extract Data From PDFs in Power Automate

Stig Zerener

This guide shows how to automate extracting structured information from PDFs inside a Power Automate flow using a Cradl AI model. You’ll end up with JSON output you can send on to tools like Excel, Dataverse, SQL, or SharePoint — with optional human review for uncertain predictions.

Before we begin

You'll need:

A Cradl AI account
Access to Microsoft Power Automate where you can build and run flows.

Setting up your data extraction agent

Login to Cradl AI and create a new agent. We'll use the pre-configured order confirmations agent as our starting point, but you can choose any of the others you like or you can choose the Custom option.

Before jumping into Power Automate, you need a Cradl AI model that understands your document type.

Create a new project: Log in to Cradl AI and start a new project. You can use a pre-defined template for popular document types like invoices, order confirmations, or ID cards.
Configure the fields: Define which fields to extract — e.g., invoice number, due date, total amount.
Enable HITL: In your Cradl AI workflow, Human-in-the-Loop validation is enabled by default. This ensures that when the model is unsure, the document is routed to a human reviewer before the final output is sent. Yourself is added as a Validator by default, but you can also invite your colleagues to review AI predictions.

Connecting your agent to Power Automate

Now let’s connect your Cradl AI workflow to Power Automate.

In Cradl AI:

Open your workflow in the visual builder.
Add an Input Trigger and select Power Automate.
This will automatically generate the Client ID and Client Secret. We will use these later.

In Power Automate:‍

Create a new flow and choose your preferred trigger (e.g., when a file is uploaded to OneDrive, received via email, or added to SharePoint).
Add a new action: search for Cradl AI in the list of connectors and select Cradl AI → Create Document.
Configure the connection:
- Give it a name
- Paste your Client ID and Client Secret‍
In the action settings:
- Select your Cradl AI workflow
- Pass in the file content from your trigger (e.g., file content from SharePoint or an email attachment

This will send the document to Cradl AI for parsing and validation.

Step 3: Create a Webhook in Power Automate to receive the response

To receive the structured (and validated) data back in Power Automate, we’ll set up a webhook.

In Power Automate:

Add a new flow with the trigger: “When an HTTP request is received”.
This will generate a webhook URL — copy it.

Back in Cradl AI:

In your workflow, add an Output Destination and select Power Automate.
Paste the webhook URL from Power Automate into the Webhook URL field.
Cradl AI will also generate a JSON schema based on your model’s output — copy this and paste it into the Request Body JSON Schema section in Power Automate.

Now Cradl AI knows where to send the results — and Power Automate knows what to expect.

Wrapping up

With this setup, you’ve added an end-to-end document automation workflow to your Power Automate flow that is scalable, accurate and reliable.

Need help setting up a workflow like this? Book a free call with an expert

‍

‹ How to Convert PDFs to JSON using AI

How to set up an AI agent to extract PDF tables to Excel ›

Start automating today

Try for free

14 day free trial. No credit card required.