DW Handbook
This page shows the name of the construct, its description
Attribute
|
Value
|
Name
|
Remove Duplicates
|
Description
|
None
|
Function
|
None
|
Aim
|
Allow a data engineer (user) to create a new relation containing no duplicate tuples
|
Context
|
None
|
Rationale
|
None
|
Mechanisim
|
None
|
Formalisim
|
δR = {(a1...an) | (a1...an) ϵ R ^ t1...tm ⊆ R } where R is a relation of n attributes (a) and m tuples (t) and tx!= ty for all 1 ≤ x,y ≤m and x!=y.
|
Relational Algebra (RA)
|
Similar to RA operation
Distinct (δ)
|
Type
|
Atomic
|
Class
|
None
|
Transformation_category
|
None
|
Inputs
|
Inputs | Number of input datasets |
None | None |
|
Outputs
|
Outputs | Number of output datasets |
None | None |
|
Used in stage(s)
|
Cleaning
|