From 2e89b214b0ef6e621dda0894d69f351a6776154a Mon Sep 17 00:00:00 2001 From: souravg-db Date: Tue, 2 Jan 2024 18:09:48 +0000 Subject: [PATCH] updated information --- docs/GDPR_RoE.md | 11 +++++++++-- 1 file changed, 9 insertions(+), 2 deletions(-) diff --git a/docs/GDPR_RoE.md b/docs/GDPR_RoE.md index 5c3b77c..d840d3c 100644 --- a/docs/GDPR_RoE.md +++ b/docs/GDPR_RoE.md @@ -1,6 +1,6 @@ ## GDPR Right of Erasure (Delete) -Any European citizen has the right to ask for his data to be deleted. If the user data is spread across multiple tables, erasing all the user data can be a daunting task. +Any EU citizen/resident has the right to ask for his data to be deleted. If the user data is spread across multiple tables, erasing all the user data can be a daunting task. If all your tables share the same column name for a user identifier, then the operation is quite straightforward. You can select all the tables that have that column, and run a delete statement on all tables at once. @@ -16,6 +16,13 @@ dx.from_tables("*.*.*")\ ## Vaccum -Note: You need to regularly [vacuum](https://docs.delta.io/latest/delta-utility.html#remove-files-no-longer-referenced-by-a-delta-table) all your delta tables to remove all traces of your deleted rows. +You need to regularly [vacuum](https://docs.delta.io/latest/delta-utility.html#remove-files-no-longer-referenced-by-a-delta-table) all your delta tables to remove all traces of your deleted rows. Check out how to [vacuum all your tables at once with DiscoverX](Vacuum.md). + +NOTE: Delta Lake latest features enable users to soft delete data. For example, + +* Dropping columns with [column mapping](https://learn.microsoft.com/en-us/azure/databricks/delta/delta-column-mapping) enabled. +* Deleting rows with [deletion vectors](https://learn.microsoft.com/en-us/azure/databricks/delta/delta-column-mapping) enabled. + +For detailed instructions on executing VACUUM with these features, please refer to the documentation.