Accelerate the Value of Data

Matching on Data in Multiple Languages

Reltio Match Engine supports matching data in multiple languages.following languages: Chinese, Korean, Japanese and Arabic.

Before Reltio can match data from other languages, it must transliterate characters from some source languages into characters it recognizes. Transliteration involves mapping characters between written character sets, not translating the meaning or words or matching verbal pronunciation. Using a standard mapping system provides a consistent representation of data from the source language into target language. For example, if source data is in a non-Latin character set such as Mandarin, Reltio transliterates those characters into the Latin character set, which the Match engine uses. Configure the transliteration system to use as an attribute in an entity match group by the transliterated name.

How does matching on multiple languages work?

The following example illustrates how the Match Engine can be configured. The configuration settings can be configured in your L3 layer. To do this, contact support.

{
                    "uri": "configuration/entityTypes/HCP/matchGroups/HCPbyTransLiterator",
                    "label": "HCP by transliterated Name",
                    "type": "suspect",
                    "rule": {
                        "and": {
                            "exact": [
                                "configuration/entityTypes/HCP/attributes/NonLatin_Name"
                            ],
                            "cleanse": [
                                {
                                    "cleanseAdapter": "com.reltio.cleanse.impl.TransliterateCleanser"
                                    ,
                                    "attributes": [
                                        "configuration/entityTypes/HCP/attributes/NonLatin_Name"
                                    ]
                                }
                            ]
                        },
                        "matchTokenClass": "com.reltio.match.token.ExactMatchToken"
                    },
                    "matchServiceClass": "com.reltio.businesslogic.match.providers.internal.InternalMatchService"
                }

The following example illustrates how you can configure the Match Engine to the characters inside a match group using the transliteratorCommand" : "Any-Latin command in the cleanseAdapterParams section:

{
   "uri": "configuration/entityTypes/HCP/matchGroups/HCPbyTransLiterator",
                    "label": "HCP by transliterated Name",
                    "type": "suspect",
                    "rule": {
                        "and": {
                            "exact": [
                                "configuration/entityTypes/HCP/attributes/NonLatin_Name"
                            ],
                            "cleanse": [
                                {
                                    "cleanseAdapter": "com.reltio.cleanse.impl.TransliterateCleanser",
                                    "cleanseAdapterParams": {
                                        "transliteratorCommand" : "Any-Latin"
                                    },
                                    "mappings": [
                                        {
                                            "attribute": "configuration/entityTypes/HCP/attributes/NonLatin_Name",
                                            "cleanseAttribute": "configuration/entityTypes/HCP/attributes/NonLatin_Name"
                                        }
                                    ]
                                }
                            ]
                        },
                        "matchTokenClass": "com.reltio.match.token.ExactMatchToken"
                    },
                    "matchServiceClass": "com.reltio.businesslogic.match.providers.internal.InternalMatchService"
                }