2.2.  Duplicate analysis

2.2.1. Prerequisites
2.2.2. Overview of the entire process
2.2.3. Create new report
2.2.4. Report selection
2.2.5. User interface details
2.2.5.1. Structure tree
2.2.5.1.1. What does the tree represent?
2.2.5.1.2. Navigation in the structure tree
2.2.5.1.3. Coloring in the structure tree
2.2.5.1.4. Connection of tree and results (cluster) and statistical overview
2.2.5.2. Results
2.2.5.2.1. Edit cluster
2.2.5.2.2. Geometric similarity
2.2.5.2.3. Reference part
2.2.5.2.4. Color coding of variables
2.2.5.2.5. Duplicates" area
2.2.5.2.6. Workflow when deleting a main part
2.2.5.2.7. Part annotated in another cluster
2.2.5.3. Overview
2.2.5.4. Filter
2.2.6. Parts comparison
2.2.7. Export
2.2.8. Error messages

Duplicate analysis is a key tool for data quality control - especially for large volumes of data and imported catalogs. It finds duplicate candidates, but does not automatically eliminate them; instead, it forms the basis for downstream cleansing processes.

In concrete terms, this means for the process: