Abinitio Interview Questions 24 Abinitio Parallelism Advanced
Watch my YouTube video for explanation :
Abinitio Parallelism
component parallelism
Ab Initio components that increase component parallelism
replicate
input file
reformat
FBE (Filter by Expression)
PBE (Partition by Expression)
dedup sorted
split
Ab Initio components that decrease component parallelism
gather
join
fuse
combine
concatenate
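To make the idea of component parallelism concrete, here is a minimal Python sketch (not Ab Initio code). It assumes two hypothetical branch functions, to_upper and keep_long, that stand in for independent components. The input is copied to each branch, roughly as replicate would do, the branches run at the same time on separate threads, and the outputs are then collected, roughly as gather would do.

```python
from concurrent.futures import ThreadPoolExecutor


def to_upper(records):
    # Branch 1: a reformat-like transform (hypothetical example rule).
    return [r.upper() for r in records]


def keep_long(records):
    # Branch 2: a filter-like step (hypothetical rule: keep names longer than 3 characters).
    return [r for r in records if len(r) > 3]


def run_branches(records):
    # "Replicate": each branch receives its own copy of the input.
    branches = [to_upper, keep_long]
    with ThreadPoolExecutor(max_workers=len(branches)) as pool:
        # Both branches are submitted before either result is awaited,
        # so they can run concurrently.
        futures = [pool.submit(branch, list(records)) for branch in branches]
        # "Gather": collect the branch outputs, in branch order.
        return [f.result() for f in futures]


if __name__ == "__main__":
    print(run_branches(["ann", "bob", "carol"]))
```

The more independent branches a graph has, the more components can run at once; components that merge flows reduce that number again.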
pipeline parallelism
SORT
ROLLUP
JOIN
Dedup Sorted
Phases / Check Points
component folding
Continuous Graphs: Checkpoint--> Compute points
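A small Python sketch (not Ab Initio code) can show why a component such as SORT matters for pipeline parallelism: a sort cannot emit its first record until it has read its whole input, whereas a record-at-a-time step can pass each record downstream immediately. The stage names read_records, double and sort_all are hypothetical, and the log list exists only to record the order of events.

```python
def read_records(values, log):
    # Source stage: hands out one record at a time.
    for v in values:
        log.append(f"read {v}")
        yield v


def double(records, log):
    # Streaming stage: processes each record as soon as it arrives.
    for r in records:
        log.append(f"double {r}")
        yield r * 2


def sort_all(records, log):
    # Blocking stage: must consume every record before emitting the first one.
    buffered = sorted(records)
    log.append("sort done")
    for r in buffered:
        yield r


def run_streaming(values):
    log = []
    list(double(read_records(values, log), log))
    return log


def run_with_sort(values):
    log = []
    list(double(sort_all(read_records(values, log), log), log))
    return log


if __name__ == "__main__":
    print(run_streaming([3, 1, 2]))   # reads and doubles interleave
    print(run_with_sort([3, 1, 2]))   # all reads finish before any doubling
```

In the first run the downstream stage works while the upstream stage is still reading; in the second, the downstream stage has to wait until the sort has seen everything.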
data parallelism
types of layouts:
data layout
processing layout
file:serial
mfile:mfs / ad hoc multifile
dynamic layout:
fixed-depth dynamic layout => mfile:dynamic:$DEPTH_OF_PARALLELISM
for example: mfile:dynamic:16:/data/warehouse/2023/dec
(note: can be used for both reading and writing)
variable-depth dynamic layout => mfile:dynamic:-1:$DATA_PATH:[mb per partition]:[max-depth]
for example: mfile:dynamic:-1:/data/warehouse/2022/dec
for example: mfile:dynamic:-1:s3a:////bucket/data:100:16
(note: can only be used for reading data)
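The two layout string patterns above can be sketched as small Python helpers. This is only an illustration of the string shapes shown in the examples, not an Ab Initio API: the function names are hypothetical, and so is the rule that the two optional fields are given together. The estimated_depth function is an assumption about how "mb per partition" and "max-depth" might combine (one partition per chunk of data, capped at the maximum); the source does not spell out the exact rule.

```python
import math


def fixed_depth_layout(depth, path):
    # Shape taken from the example: mfile:dynamic:16:/data/warehouse/2023/dec
    if depth < 1:
        raise ValueError("depth must be at least 1")
    return f"mfile:dynamic:{depth}:{path}"


def variable_depth_layout(path, mb_per_partition=None, max_depth=None):
    # Shape taken from: mfile:dynamic:-1:$DATA_PATH:[mb per partition]:[max-depth]
    # ASSUMPTION: the two optional fields are supplied together or not at all.
    if (mb_per_partition is None) != (max_depth is None):
        raise ValueError("give both mb_per_partition and max_depth, or neither")
    layout = f"mfile:dynamic:-1:{path}"
    if mb_per_partition is not None:
        layout += f":{mb_per_partition}:{max_depth}"
    return layout


def estimated_depth(total_mb, mb_per_partition, max_depth):
    # ASSUMPTION (not from the source): one partition per mb_per_partition
    # of data, at least 1 and at most max_depth.
    return max(1, min(max_depth, math.ceil(total_mb / mb_per_partition)))


if __name__ == "__main__":
    print(fixed_depth_layout(16, "/data/warehouse/2023/dec"))
    print(variable_depth_layout("/data/warehouse/2022/dec"))
    print(estimated_depth(450, 100, 16))
```

Under that assumed rule, 450 MB of data at 100 MB per partition would be read 5 ways, while a much larger input would be capped at the maximum depth of 16.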
For more Ab Initio, AWS, and data engineering videos, please subscribe to my YouTube channel, DataPundit, and view, like, and share the videos.