Unify and manage your data

Remove Attribute Duplicates Task

Use this API to remove duplicate attribute values.

This task removes duplicate values for attributes (that is, when value is the same, but URI is different), which may appear during simultaneous updates of the same object. The root cause was fixed earlier, but corrupted data may still be found in old tenants.
Note: The Stop and Pause tasks are supported.

Requests

Use the following request to remove duplicate attribute values in all tenants. You must have the Admin role assigned.

POST {ApplicationURL}/removeAttributeDuplicates
	 ["entities"]    

Use the following request to remove duplicate attribute values for an entity in a single tenant. You must have the Tenant admin role assigned.

POST {ApplicationURL}/api/{tenantId}/removeAttributeDuplicates
            ["entities/<entity id>"]
Use the following request to remove all duplicate attribute values for all entities in a single tenant. You must have the Tenant admin role assigned. Ensure that you keep the request body empty.
POST {ApplicationURL}/api/{tenantId}/removeAttributeDuplicates
Note: The RemoveAttributeDuplicates task does not process data when the internal queue size is high and more than 15K events are there in the queue. So, it is recommended to run the task when there are no activities on the tenant. For example, dataload or other activities.
Table 1. Parameters
Parameter Required Description
tenantIdYesID of the tenant to remove duplicated attributes.
objectTypeNoType of objects to process. Possible values: entities and relations.Default value: entities.
skipObjectsCountNoNumber of objects which will be skipped during processing. Default value: 0.
maxResultsToStoreNo The task stores object IDs for which inconsistency has been found in its status. This parameter is needed to prevent consuming huge volumes of memory when numerous objects with inconsistencies are found. Default value: 100.
bufferSizeNoSize of the buffer which is used to fetch objects from Cassandra. Default value: 1500.