Attribute
|
Value
|
Name
|
Join
|
Description
|
None
|
Function
|
None
|
Aim
|
Allow a data engineer (user) to combine two datasets horizontally using a common attribute in a row-wise join
|
Context
|
Enriching a dataset by combining it with other datasets that are required to increase the value of the analysis the dataset is being prepared for.
|
Rationale
|
Combining the datasets required for subsequent operations or end-goal analysis increases the value of the analysis the dataset is being prepared for.
|
Mechanism
|
Merge datasets horizontally to increase analysis value. This can be done with the join facilities found in GUI-based tools or with programming-language functions.
|
Formalism
|
R1 ⨝_joinCondition R2 = {(a, b, c) | (a, b) ∈ R1 ∧ (b, c) ∈ R2}, where R1 and R2 are relations (Codd, 1970)
|
Relational Algebra (RA)
|
Similar to the RA operation Join (⨝)
|
Type
|
Atomic
|
Class
|
N-Ary
|
Transformation_category
|
M:1
|
Inputs
|
Inputs | Number of input datasets |
Input datasets to merge, attribute(s) to join by | M |
|
Outputs
|
Outputs | Number of output datasets |
Combined dataset | 1 |
|
Used in stage(s)
|
Integration
|
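The Formalism and the M:1 transformation category above can be illustrated with a minimal Python sketch. It is not taken from the source: the function names `join` and `join_many`, the list-of-dicts representation of a dataset, and the sample relations `r1`, `r2`, `r3` are all assumptions made for illustration, and only an inner equi-join on a single shared attribute is shown.

```python
from functools import reduce


def join(left, right, key):
    """Inner equi-join of two datasets (lists of dict rows) on the shared attribute `key`.

    Hypothetical helper: mirrors {(a, b, c) | (a, b) in R1 and (b, c) in R2}.
    """
    # Index the right-hand rows by their join-key value.
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    # Pair each left row with every matching right row; unmatched rows are dropped.
    return [{**l, **r} for l in left for r in index.get(l[key], [])]


def join_many(datasets, key):
    """Fold M input datasets into a single output dataset (M:1)."""
    return reduce(lambda acc, ds: join(acc, ds, key), datasets)


# Illustrative relations: r1 has attributes (a, b), r2 has (b, c), r3 has (b, d).
r1 = [{"a": 1, "b": "x"}, {"a": 2, "b": "y"}]
r2 = [{"b": "x", "c": 10}, {"b": "z", "c": 30}]
r3 = [{"b": "x", "d": True}]

joined = join(r1, r2, "b")
combined = join_many([r1, r2, r3], "b")
```

Here `joined` holds only the row whose `b` value appears in both `r1` and `r2`, and `combined` extends that row with the `d` attribute from `r3`. In practice a data engineer would more likely rely on an existing facility such as pandas `DataFrame.merge` or a SQL `JOIN` than on a hand-written helper like this one.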