MAX CORE in Rollup in Abinitio | Rollup algorithm for unsorted data
MAX CORE in Rollup in Abinitio | Rollup algorithm for unsorted data
hash Tabale willl have KEY VALUE record along with pass through filed and values
Roll Up reads the records and keep on adding/updating the Hash Table untill it reaches to the MAX_CORE limit
Then it read record and if the key is there in the HASH table it updates that records else it has to spill to disk
It does so untill all inputs have been read.
Spillage Depends on - The size of the group ( distinct key value group)+ the pass through fields + data distribution of the incoming data
How much spillage : after the max core everything will be spilled to disk
2nd Iteration of Spillage - when the distinct keys groups are huge , then it may not hold the data in the hash table again
so it may spill 2nd time as well.
For More Abinitio, AWS, Database content please visit my youTube channel.
https://www.youtube.com/@datapundit
@datapundit
Comments
Post a Comment