DW Handbook

This page shows the name of the construct, its description

Attribute Value
Name Join
Description None
Function None
Aim Allow a data engineer (user) to combine two datasets horizontally using a common attribute in a row-wise join
Context enriching a dataset by combining other datasets to it which are required to increases the value of analysis the dataset is being prepared for.
Rationale combining datasets required for subsequent operations or end-goal analysis increases the value of analysis the dataset is being prepared for.
Mechanisim merge datasets horizontally to increase analysis value. This can be done by exploring the facilities found in GUI-based tools and programming language functions.
Formalisim R1joinConditionR2 = {(a,b,c) | (a,b) ∈ R1 ^ (b,c) ∈ R2}, where R1 and R2 are relations (Codd, 1970)
Relational Algebra (RA) Similar to RA operation Join (⨝)
Type Atomic
Class N-Ary
Transformation_category M:1
Inputs
InputsNumber of input datasets
Input dataset to merge, attribute(s) to join byM
Outputs
OutputsNumber of output datasets
combined datasets1
Used in stage(s) Integration

Back