Attribute
|
Value
|
Name
|
Create Column
|
Description
|
None
|
Function
|
None
|
Aim
|
Allow a data engineer (user) to append an empty column to the end of the dataset
|
Context
|
This is used when a user requires a new column to be appended to the dataset to be filled with values in subsequent operations to fulfil requirements of the analysis for which the dataset is to serve as input.
|
Rationale
|
adding a column is used when certain values are implicit in the data and are required for subsequent operations or end-goal analysis the dataset is being prepared for.
|
Mechanisim
|
Increase dataset dimension horizontally by adding a column of blank values to be filled in subsequent operations. This can be done by exploring the facilities found in GUI-based tools and programming language functions.
|
Formalisim
|
α(R, x)={(a1,...,an,x) | (a1,...,an) ∈ R}, Where: R is a relation with n columns. x is a value representing the new column's name. (Raman, V and Hellerstein, J 2001)
|
Relational Algebra (RA)
|
Similar to RA operation
Attribute Extension (ε)/Generalized Projection
|
Type
|
Atomic
|
Class
|
Unary
|
Transformation_category
|
1:1
|
Inputs
|
Inputs | Number of input datasets |
Input dataset, new column name | 1 |
|
Outputs
|
Outputs | Number of output datasets |
dataset | 1 |
|
Used in stage(s)
|
Structuring1
|