Unify and manage your data

Reindex Relations Task

Reindexes relations in a tenant.

This task reindexes relations in a tenant.

Stop and Pause are supported.

Requests

Tenant admin role is required:

POST {ApplicationURL}/reindexRelations

Tenant admin role is required:

POST {ApplicationURL}/api/{tenantId}/reindexRelations
Table 1. Parameters
ParameterRequiredDescription
tenantId YesID of the tenant to reindex entities.
updateRelations NoIf set to true (default), in addition to refreshing the relations in the tenant, also updates history, match tables, and the analytics layer (RI). This may interfere with normal event streaming.

If set to false, the task updates ElasticSearch relations data only but does not perform rematching or update history or analytics.

See Reindexing and Streaming for details.

relationType NoIf provided, restricts reindex scope to relations of a specific type.
skipRelationsCount NoNumber of relations which are skipped during reindex. Default value: 0.
relationsLimit NoNumber of relations which are processed during reindex. Default value: infinity.
updatedSince NoTimestamp in Unix format. If this parameter is specified, then only relations with greater or equal timestamp are reindexed.
checkCrosswalksConsistency NoSpecify true to start a task which checks whether all crosswalks are unique before reindexing.
distributed No
If set to true, the task runs in distributed mode. Default value is false. For more information, see Distributed mode.
taskPartsCount No
Specifies the maximum number of sub-tasks for distributed execution. The platform determines the optimal number based on performance limits. Default value is 2.
Note: This parameter is only applicable when distributed=true. Otherwise, it s ignored.
forceIgnoreInStreamingNoIf set to true, events generated by the task are excluded from streaming to any external queue.

Default: false

enableSeparateIndexingNoIf set to true, the Reindex Relations task builds a new index. After building the new index, the task replaces the old index with the new one. With this parameter, the Reindex Relations task runs in the background and you can continue your activities in Reltio Platform without waiting for the task to finish.

Default value : false

Body (optional):

JSON Array of relations URIs to reindex. Only these relations are reindexed in this case.

Reindexing and Streaming

The Reindex relations task can do the following:

  • Reindex tenant's relations
  • Update the history table
  • Update the match tables to reflect each relation change/creation/deletion caused by reindexing
  • Repopulate the analytics layer (RI)

Each of these activities can generate a large volume of events, which may overwhelm the tenant's AMQ/SQS queues.

There are two ways to prevent this problem:

  • Run reindexRelations with updateRelations set to false, as in the following example:

    POST /reltio/{tenantId}/reindexRelations?updateRelations=false

    When run with this option, reindexRelations only reindexes the ElasticSearch index; the History, Match, and Analytics processors are not triggered.

  • Run reindexRelations with forceIgnoreInStreaming set to true, as in the following example:

    POST /reltio/{tenantId}/reindexRelations?forceIgnoreInStreaming=true

    This option forces reindexRelations to ignore streaming when running a reindex job.