Paste IDs → download the input CSV → run the CVI notebook in Databricks → download the output → upload to CDQ for duplicate matching.
Open workflowPaste partner IDs → download the input CSV → run the Email Cleaning notebook in Databricks → download the output report.
Open workflowPaste partner IDs → download the input CSV → run the Contact notebook in Databricks → download the output report.
Open workflowEnd-to-end reference: Databricks extraction → General Mapping → CVI linkage → CDQ duplicate matching and download rules.
View guideEnter IDs below and click Download Input CSV. Then go to Databricks, upload the CSV, run the notebook, and download the output.
Log into Databricks. Go to your workspace, upload the downloaded CSV to the FileStore, and run the CVI Extraction notebook.
Open Databricks WorkspaceAfter the notebook finishes, go to Databricks FileStore and download the CVI output CSV file.
Open Databricks FileStoreGo to CDQ → Data Mirror Management → Upload using General Mapping. Then run Duplicate Matching and download Duplicate Consolidation.
Open CDQ PortalEnter the BP / Partner numbers to check for Celaning email addresses, then download the CSV.
Log into Databricks. Upload the CSV to FileStore and run the Email notebook in your workspace.
Open Databricks WorkspaceAfter the notebook completes, download the output CSV from Databricks FileStore.
Open Databricks FileStoreReview the output. Block or merge duplicate email records in the source system before migration.
Enter BP / Partner numbers to identify duplicate contact persons, then download the CSV.
Log into Databricks. Upload the CSV to FileStore and run the Contact notebook.
Open Databricks WorkspaceAfter the notebook finishes, download the output CSV from Databricks FileStore.
Open Databricks FileStoreReview the output. Delete or merge duplicate contacts in the source system. Document findings before migration.
Upload the Excel golden list to Databricks FileStore. Load with Pandas → convert to Spark DataFrame.
Cast and rename columns: Name (CONCAT NAME1–4), Country, City, Postal Code (STRING), Tax Numbers 1–5 (STRING — empty for US data), VAT Number.
Join PMD_but000_view + PMD_cvi_cust_link_view + PMD_cvi_vend_link_view. Always filter: OPtype ≠ 'D' to exclude deleted records.
Formula: BP_Number + '_C' + Customer_Number + '_V' + Vendor_Number. Use COALESCE on all fields.
UNION ALL of Customer_View (KNA1) and Vendor_View (LFA1) into Final_Template.
Export Final_Template as CSV. Upload via CDQ → Data Mirror Management → General Mapping.
Open CDQ PortalUpload Configuration File. Set thresholds. Mode: self-match (one source) or linkage (Pattern + Candidate).
Always select Duplicate Consolidation. Validate and block/delete duplicates before migration.
Always exclude OPtype = 'D' in all CVI joins — deleted records only.
Tax Number 1–5 must be NULL / empty for all US records before CDQ upload.
Cast Postal Code and all Tax Numbers to STRING — no numeric formatting.
CONCAT(COALESCE(NAME1,''), ' ', COALESCE(NAME2,''), ' ', COALESCE(NAME3,''), ' ', COALESCE(NAME4,''))
Always use COALESCE to handle NULL values in the External ID concat formula.
Always select "Duplicate Consolidation" when downloading results from CDQ.