Clone the repository into your Databricks workspace
Learn how to clone the Reltio-provided notebooks repository into your Databricks workspace so that you can access the notebook required to run the entity resolution pipeline.
Clone the Reltio-provided notebooks repository into your Databricks workspace to access the notebook required to run the entity resolution pipeline.
Prerequisites
Before you begin, ensure that you have completed the following step:
Steps to clone the repository into your Databricks workspace
To clone the repository, follow these steps:
- Sign in to the Databricks workspace.
- In the left navigation pane, select Workspace.
- In the Workspace open the folder where you want to clone the repository (for example, Workspace/Users/john.doe@databricks.com).
- On the top-right toolbar, click Create, then select Git folder to open the Create Git folder dialog.
- In the Git repository URL field, enter the Bitbucket repository URL provided by Reltio. Remove the
git cloneprefix from the copied URL before using it. - Select Bitbucket Cloud as the Git provider.
- In the Git repository URL field, enter the Bitbucket repository URL provided by Reltio. Remove the
- Click Create Git folder.
Verification
Verify that the repository is cloned successfully in your Databricks workspace:
- The Git folder appears in the workspace.
- The repository files and folders (for example,
src,.gitignore) are visible inside the Git folder. - You can open the Git folder and browse its contents.
Result
The repository is cloned into your Databricks workspace. The Git folder appears in the workspace, containing the repository files.
To proceed, provide your Delta Sharing identifier to Reltio so that the required models can be shared with your workspace.