Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scan across table versions #48

Open
tdikland opened this issue Jul 3, 2023 · 1 comment
Open

Scan across table versions #48

tdikland opened this issue Jul 3, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@tdikland
Copy link

tdikland commented Jul 3, 2023

I would love to be able to scan across all my active versions (i.e. not vacuumed) of my (Delta) tables to make sure that I classify/discover all data that can theoretically be accessed by users with SELECT permission on the table.

One of the areas where I feel this is a key capability is around the GDPR use case. DiscoverX already exposes functionality to remove rows across tables (which is awesome!), but the current documentation rightly mentions that a vacuum is needed to truly make these rows inaccessible. In situations where the responsibility for removing rows for GDPR and auditing GDPR are separated, it would be great for the second group to have functionality available to check whether the required vacuum operation has run (i.e. the data is no longer accessible).
Other use cases could be the case where a user mistakingly adds unwanted data to a table and after realising that deletes these rows without running vacuum. This poses similar challenges as the aforementioned use case.

@edurdevic
Copy link
Contributor

We don't have in plan to query the history versions, that would require quite some logic to be efficient (avoid re-processing the same rows for each version).
We are planning thought to add support for running vacuum over multiple tables, so that it can be scheduled easily across multiple tables with dx.vacuum(from_tables="*.*.*")

@david-tempelmann david-tempelmann added the enhancement New feature or request label Jul 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants