Unify and manage your data

Agentflow Unstructured workflow

Learn about the workflow AgentFlow Unstructured uses to extract document data, validate the results, and process approved output in Reltio.

AgentFlow Unstructured helps you get started quickly with document processing in Reltio.It helps you turn unstructured documents into usable Reltio data through a simple workflow through these simple steps:

  • Create a template

  • Configure a source
  • Run a pipeline to process documents.

The extracted data then becomes available in Reltio for review, matching, and downstream use.

The following diagram shows the workflow.

The following table explains each stage of the workflow:

Table 1. AgentFlow Unstructured workflow
StageTaskDescription
1Template creationThe workflow begins with template creation. A template defines what structured data AgentFlow Unstructured must extract and how the extracted output maps to Reltio entities and attributes in your data model. You can create templates for specific document types and reuse them across processing runs. For example, create a template for contracts and use it to process a set of contracts that share a similar structure, which supports more consistent and efficient extraction.
2Extraction and validationAs part of template creation, AgentFlow Unstructured processes a sample document and generates extracted output for review. You then review the document, examine the extracted JSON, and verify how the extracted fields map to Reltio entities and attributes. This validation stage ensures that the extraction output is complete, accurate, and suitable for repeated use.
3Publish templateAfter reviewing the extraction results, publish the template. Publishing makes the template available for use in pipelines, enabling consistent and reliable processing across future document runs of similar type and structure.
4Source configurationSources define where AgentFlow Unstructured retrieves files for processing. You can configure sources for supported cloud storage services, such as AWS S3 and Google Cloud Storage. AgentFlow Unstructured document pipelines use these source definitions to access files for processing.
5Pipeline configurationA pipeline is configured to use a selected source and a published template to process documents. It allows you to define the following settings:
  • Crosswalk source, which defaults to the cloud source
  • Whether to create a data change request before publishing extracted data to Reltio, if needed

  • The schedule for pipeline runs

6Pipeline executionWhen a pipeline is executed, AgentFlow Unstructured begins processing documents from the selected source. The system creates a new execution record per run. Each execution captures run-level details, documents processed, including start time, end time, status, and processing outcomes.
7Execution reviewAfter a pipeline runs, you can review execution results at both the pipeline level and the document level. Execution views show overall run status, success and failure counts, and the extracted entities associated with processed documents. This helps you confirm outcomes and identify runs that need follow-up.
8Publishing structured output to ReltioAs part of this workflow, AgentFlow Unstructured processes extracted and approved data for use in Reltio. Depending on the configured mappings and processing outcome, the results will contribute to entity creation, linkage, and downstream master data workflows.