🔌Setting up an AWS Athena connector
Last updated
Last updated
Fill connection parameters:
NAME : the name of your connector.
S3 OUTPUT BUCKET : the S3 bucket and prefix where results from your queries will be saved (Ex s3://mybucket/athena-queries).
AWS ACCESS KEY ID : the ID of the the AWS access key that will be used to connect to Athena.
AWS SECRET ACCESS KEY : the AWS secret access key that will be used to connect to Athena.
REGION NAME : the name of the AWS region you need to query.
After entering those informations, you can test the connection with AWS, to make sure your inputs are correct and working.
If all settings are valid, you’re going to have a success message like this
Depending on your rights, you will have the “Can list databases” checked or not, but note that this doesn’t prevent you to be able to use the connector itself.
After successfully configuring the connector, you will be able to find it in the Connector section of the DataHub "Datasource" tab
To create a dataset from AWS Athena, click on the "create from icon", you will then be able to:
Select the Database
Add a Query (optional): you can query using the SQL syntax accepted by Athena
Select a table
For more info, see the dedicated section Creating datasets
After selecting data from your connector you will be able to create a dataset thanks to YouPrep using the selection as "source step".