Data Loader

Learn about preparing and uploading your data using the Reltio Data Loader.

Important: Some Data Loader functionality is specific to a cloud platform. Platform-specific functionality is indicated where it appears in the documentation.

Ready to get going?

Are you a data steward looking to upload several sources of your data? Want to consolidate, clean, and unify them into a perfect golden version?

The Data Loader, Reltio’s intuitive application, enables you to:

  • Load your data from the most common locations and in the most common formats. For more information, see topic Supported data sources and file types.
  • Visualize your data, define mappings, and apply basic transformations before loading the data.
  • Define data load options, such as using Life Cycle Actions (LCA), or full or partial updates.

What's next?

To learn about the Data Loader interface and how to navigate it, see topic Get started with Data Loader.

There you'll also learn about other things you need to know before you begin:

Threshold errors

During pre-processing of a data load job, the Data Loader can automatically fail or cancel the job. This safeguard keeps long-running jobs from being interrupted midway, contributing nothing, or generating large error files. A data load job may also time out if it has more than 10,000 rows or takes longer than 6 hours to complete.

To decide when to stop a job, the software uses a defined rate of permitted errors, known as the threshold rate or error threshold. For example, if your job encounters bad data at or above the defined rate, the software cancels or fails the job. When a job is canceled or fails for this reason, the Job Status page prompts you to review the error file. Jobs that reach the error threshold display a status of COMPLETED_WITH_ERRORS.
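The threshold check described above can be sketched in a few lines of Python. This is an illustration only, not the Data Loader's actual implementation: the function name `reached_error_threshold` and its parameters are hypothetical, and the only details taken from this topic are the 15% default and the "at or above" comparison.

```python
def reached_error_threshold(failed_rows: int, processed_rows: int,
                            threshold_percent: int = 15) -> bool:
    """Return True when the error rate is at or above the threshold.

    Hypothetical helper for illustration; the Data Loader's internal
    check is not documented in this topic.
    """
    if processed_rows <= 0:
        # Nothing processed yet, so no error rate can be computed.
        return False
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return failed_rows * 100 >= threshold_percent * processed_rows


# 2,500 bad rows out of 10,000 is exactly 25%, which reaches a 25% threshold.
print(reached_error_threshold(2500, 10000, threshold_percent=25))  # True
# 1,400 bad rows out of 10,000 is 14%, below the default 15% threshold.
print(reached_error_threshold(1400, 10000))  # False
```

Comparing with integers rather than computing a floating-point percentage keeps the result exact when the error rate sits right on the threshold.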

When would I set the ErrorThreshold?

You apply the ErrorThreshold when you use the periodic task and when there are more than 10,000 records. You define the error threshold at the Job Definition level. Set the value as appropriate or leave it at the default, which is 15%.

Here’s an example where the error threshold is set to 25%:


    {
      "additionalAttributes": {
        "alwaysCreateDCR": false,
        "acceptFileType": ".csv",
        "errorThreshold": 25
      }
    }

Jobs still complete with the status COMPLETED_WITH_ERRORS when they reach or exceed the error threshold.
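If you generate job definitions from a script, a small helper can build this fragment and catch an out-of-range value before you submit it. The following is a minimal sketch: the attribute names come from the example above, but the helper `build_additional_attributes` is hypothetical, and the 0 to 100 range check is an assumption based on the threshold being a percentage.

```python
import json


def build_additional_attributes(error_threshold: int = 15,
                                accept_file_type: str = ".csv",
                                always_create_dcr: bool = False) -> str:
    """Return the additionalAttributes fragment as a JSON string.

    Hypothetical helper; the 0-100 range check is an assumption,
    not a documented Data Loader rule.
    """
    if not 0 <= error_threshold <= 100:
        raise ValueError("error_threshold must be a percentage from 0 to 100")
    fragment = {
        "additionalAttributes": {
            "alwaysCreateDCR": always_create_dcr,
            "acceptFileType": accept_file_type,
            "errorThreshold": error_threshold,
        }
    }
    return json.dumps(fragment, indent=2)


# Prints a well-formed fragment with the threshold set to 25.
print(build_additional_attributes(error_threshold=25))
```

Serializing with `json.dumps` guarantees balanced braces and valid literals such as `false`, which are easy to get wrong when editing the fragment by hand.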