🔢Editing, Duplicating and Deleting a dataset
Last updated
Last updated
Now that you have datasets in the DataHub, you can edit, duplicate and switch the storage type of a dataset.
Once you have created a dataset, you will still of course be able to edit it.
To edit a dataset:
Click on the "edit" button of the dataset you want to edit.
Edit the pipeline of your dataset, and click on "Save".
You can change the storage type of the dataset if needed, and click again on "Save" or "Save and refresh"
Note
You can also save the dataset you're editing as a new one by clicking on the "Save as new" button within the YouPrep window. Thus, a new dataset considering your changes will be created, and the old dataset will remain unchanged.
Limitation
Datasets generated by the import of a file are not editable.
To duplicate a dataset:
Click on the "edit" button of the dataset you want to duplicate
Click on the "Save as new" button
Give a name to your dataset (the name shouldn't be already used by another dataset), and choose the storage type. Then, click "Save" or "Save and refresh," depending on your case.
To delete a dataset:
Go on the action menu of the dataset (3 dots on the right), and click on "Delete"
Confirm the deletion of the connector. If the datasets is used by other datasets or visualization, you will have a warning message and be able to select child datasets that you would like to delete simultaneously.
You have created a dataset that is Stored or Live, and you would like to change the storage type ... you can do it in a few clicks:
Go to the action menu of the dataset, and click on "Switch storage"
Select your new storage type and click on Save.
Note
You can't switch a Live dataset into a Stored dataset if it uses a variable.
Warning
Switching a Live dataset for which its parent(s) dataset(s) has permissions applied, to a Stored dataset will cancel the application of those permissions. You will have to define the permissions again on your dataset if you need to secure access to the data.
Note
Changing the storage of a dataset can affect the dataset, but also its child datasets.