AWS Integration with Abinitio | How to access AWS S3 using Abinitio
Watch my YouTube video for explanation :
Please look into the class notes here for your references:
AWS Integration with AI
Storage - S3, DB (RDS , Redhsift)
How CO>operating System connects to the cloud object storage S3
Configuring access to Amazon S3
1. JAVA , java 8+
2. Configure access credential for the respective cloud store
a. default credential provider chain
b. aws credential profile file with named profile
c. Multiplexer file
Multiplexer
. to interact with apachee hadoop copatible file system
. to calculate the resource required for containerized jobs
AB_JOB_MUX_CONFIGURATION
AB_MUX_URI_SCHEMES
a. default credentials
In Linux
~/.aws/credentials
In Windows
C:\Users\Username\.aws\credentials
----------------------------------------------
AWS Credential profile file with named profiles
[default]
aws_access_key_id=<default-access-key>
aws_secret_access_key=<default-secret-key>
[custom]
aws_access_key_id=<custom-access-key>
aws_secret_access_key=<custom-secret-key>
[custom2]
aws_access_key_id=<custom-access-key>
aws_secret_access_key=<custom-secret-key>
----------------------------------------------
Multiplexer File
<configuraion>
<property>
<name>fs.s3a.access.key</name>
<value><Key-ID></value>
<description> aws access key id </description>
</property>
<property>
<name>fs.s3a.secret.key</name>
<value> <Secret Key> </value>
<description> AWS Secret Key</description>
</property>
<property>
<name> fs.s3a.endpoint </name>
<value> <s3-enpoint-url></value>
<description> amazon s3 endpoint to connect to s3</description>
</property>
</configuraion>
---------------------------------------------
Building Graph to access S3
File Components:
Input File
RMF
Read Parallel
Output File
WMF
aws s3 ls s3://
aws s3 cp <one bucket> <another bucket>
aws s3 cp <one bucket> <local path>
aws s3 rm <bucketpath>
m_ls s3://hostname//<BUCKET-NAME>/revenue/jan_11022023.dat
m_ls gs://hostname//<BUCKET_NAME>/revenue/jan_11022023.dat
We can use Read Multiple Files/Write Multiple Files for Reading/Writing data from/to S3
(config-file)
For Vendor Specific Component(Amazon S3 API ):
credential-provider=default/profile
S3 Subscribe,
S3 Publish,
Batch S3 Subscribe
Fore more Abinitio, AWS and data engineering videos please subscribe , view , like and share my YouTube channel
Click DataPundit
Comments
Post a Comment