🔌Add an AWS Athena connector
Configuring an AWS Athena connection
Fill in the connection parameters:
Name (mandatory)
String
Use it to identify your connection
MyAthenaConnection
S3 Output Bucket (mandatory)
String
The S3 bucket and prefix where the results of your queries will be saved.
s3://mybucket/athena-queries
AWS Access Key Id (mandatory)
String
The ID of the AWS access key that will be used to connect to Athena.
AKIAIOSFODNN7EXAMPLE
AWS Secret Access Key (mandatory)
String
The AWS secret access key that will be used to connect to Athena (it will be stored as a secret).
wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
Region Name (mandatory)
String
The name of the AWS region you need to query.
eu-west-3
Retry Policy (optional)
Boolean
Enables a retry policy, which is useful if the connection is flaky. When enabled, the following settings are available:
max_attempts: maximum number of retries before giving up
max_delay: maximum delay in seconds, above which the connection is dropped
wait_time: time in seconds between each retry
Slow Queries' Cache Expiration Time (optional)
Integer
How long, in seconds, the cached results of slow queries are kept before they expire.
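To make the three retry settings more concrete, here is a minimal Python sketch of how such a policy could behave. It is an illustration only: the helper `call_with_retry` is hypothetical, and the semantics it uses (`max_attempts` as the total number of calls, `max_delay` as a cap on the total elapsed time) are assumptions, not the connector's documented behavior.

```python
import time


def call_with_retry(func, max_attempts=3, max_delay=30.0, wait_time=1.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call func() until it succeeds or the retry policy is exhausted.

    Assumed semantics (hypothetical, for illustration only):
    - max_attempts: total number of calls before giving up
    - max_delay: total time budget in seconds; no retry starts beyond it
    - wait_time: pause in seconds between two attempts
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    start = clock()
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except Exception as exc:  # a real policy would catch narrower errors
            last_error = exc
        out_of_attempts = attempt == max_attempts
        out_of_time = (clock() - start) + wait_time > max_delay
        if out_of_attempts or out_of_time:
            break
        sleep(wait_time)
    raise last_error


# Usage: a fake query that fails twice, then succeeds on the third call.
calls = {"count": 0}


def flaky_query():
    calls["count"] += 1
    if calls["count"] < 3:
        raise ConnectionError("temporary failure")
    return "ok"


result = call_with_retry(flaky_query, max_attempts=5, max_delay=10, wait_time=0)
```

In this sketch, the last error is re-raised once either limit is reached, so the caller still sees why the connection failed.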
Click on the TEST CONNECTION button, then SAVE the connection.
After successfully configuring the connector, you can find it in the Connector section of the DataHub "Datasource" tab.
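For readers who want to relate these fields to the AWS SDK, the sketch below shows how the same values could be arranged for boto3's Athena client. It only builds the argument dictionaries, so it runs without AWS access. The function `build_athena_arguments`, the dictionary keys, and the database name `mydatabase` are hypothetical; how the connector works internally is not described on this page.

```python
def build_athena_arguments(params, database, query):
    """Split connection parameters into client and query arguments.

    The key names in `params` are hypothetical, chosen for this example.
    """
    required = ("s3_output_bucket", "aws_access_key_id",
                "aws_secret_access_key", "region_name")
    missing = [key for key in required if not params.get(key)]
    if missing:
        raise ValueError(f"missing mandatory parameters: {missing}")
    if not params["s3_output_bucket"].startswith("s3://"):
        raise ValueError("S3 Output Bucket must look like s3://bucket/prefix")

    # Arguments for creating the Athena client.
    client_kwargs = {
        "region_name": params["region_name"],
        "aws_access_key_id": params["aws_access_key_id"],
        "aws_secret_access_key": params["aws_secret_access_key"],
    }
    # Arguments for starting a query; results land in the S3 output location.
    query_kwargs = {
        "QueryString": query,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": params["s3_output_bucket"]},
    }
    return client_kwargs, query_kwargs


# Example values taken from the parameter descriptions above.
params = {
    "name": "MyAthenaConnection",
    "s3_output_bucket": "s3://mybucket/athena-queries",
    "aws_access_key_id": "AKIAIOSFODNN7EXAMPLE",
    "aws_secret_access_key": "wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY",
    "region_name": "eu-west-3",
}
client_kwargs, query_kwargs = build_athena_arguments(
    params, database="mydatabase", query="SELECT 1"
)

# With boto3 installed and valid credentials, these could be used as:
#   client = boto3.client("athena", **client_kwargs)
#   client.start_query_execution(**query_kwargs)
```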
Create a dataset from an AWS Athena connection
To create a dataset from AWS Athena, click on the "create from" icon. You will then be able to:
Select the database
Select a table
Add a Query (optional): you can query using the SQL syntax accepted by Athena
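As an illustration of the choices above, the following Python sketch builds the SQL text for a database and table, or passes a custom query through unchanged. The helper `build_dataset_query` and the names `sales_db` and `orders` are hypothetical, and falling back to `SELECT *` when no query is given is an assumption, not something this page states.

```python
def quote_identifier(name):
    """Wrap an identifier in double quotes, as used in Athena SELECT queries."""
    if '"' in name:
        raise ValueError(f"unsupported character in identifier: {name!r}")
    return f'"{name}"'


def build_dataset_query(database, table, query=None):
    """Return the custom query if one is given, else select the whole table."""
    if query and query.strip():
        return query.strip()
    return f"SELECT * FROM {quote_identifier(database)}.{quote_identifier(table)}"


# No custom query: the whole table is selected (assumed default).
default_query = build_dataset_query("sales_db", "orders")

# Custom query written in the SQL syntax accepted by Athena.
custom_query = build_dataset_query(
    "sales_db",
    "orders",
    query='SELECT order_id, amount FROM "sales_db"."orders" WHERE amount > 100 LIMIT 50',
)
```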
After selecting data from your connector, you will be able to create a dataset with YouPrep, using the selection as the "source step".