🔢Create a new dataset from a dataset
Use this documentation to know how to create a dataset from another dataset. if you are looking to create a dataset from a datasource refer to this documentation.
How to create a dataset from another dataset
Toucan allows to create a dataset from another dataset.
the resulted dataset will be a child dataset of the parent dataset. You can create You can create as many child datasets of a parent dataset as you want. When a child dataset is executed, its parent dataset(s) are executed beforehand.
To create a child dataset from another dataset
Go to the
datasets
tab in the DataHub
Click on the with a plus icon (“Home 1” in the example) on hover a label “Create a new dataset from this one” will be displayed
After clicking on
Create a new dataset from this one
, you will have access to YouPrep™ (know more about YouPrep here)Save the new dataset by clicking on the button "
Create
" (at the bottom).A modal will opened, choose an unique name for the dataset, and select the storage type
stored
orlive
Then click on
Save
If you store the dataset in Toucan, you also canSave and refresh
the dataset in order to make it available to use.
Tip: you can even create a child dataset from the Story Panel. You'll be redirected to the DataHub tab while you create your new child dataset. Take a look at this video.
Stored datasets specificities
Dataset refresh
Refreshing a dataset will also refresh all direct and indirect parent datasets.
Save a stored dataset
If you save a "stored dataset" without refreshing it, it won't make it available to create other datasets with it, or to build charts by using the dataset.
Dataset download
When the dataset is stored in Toucan, it's possible to download it as CSV file, though the action menu of the dataset (on the right part in the listing of the dataset).
Create a child dataset when configuring a tile/story
You can create a child dataset when you are configuring a tile/story as shown in this video
Permissions
If you create a new dataset A, from dataset B that has permissions applied, and define the new dataset A as a Stored dataset, the permissions of the parent dataset won't be applied anymore. You will have to define the permissions again on your dataset if you need to secure the access to the data.
Dataset column naming
When the dataset is stored within Toucan, the column names should respect the following constraints:
The dataset shouldn't contain a column named "_id"
The dataset name shouldn't contain points (".")
Limits on number of rows
The maximum number of rows is set at 1M (for both live and stored dataset). However, when being under NativeSQL that results on a dataset that has less than 1M of rows, there wont be any limitation.
Last updated
Was this helpful?