TFDV - Slicing data based on date range #132
Comments
@srinivasaraov,
@rmothukuru: I see the following documentation in the source code. Raises: So, I'm assuming specifying a date range is supported. Is that correct?
What is the type of your date/timestamp feature? I don't think the default slicer can slice by ranges, but you can implement your own slicer.
By the way, we are looking at allowing SQL statements to be used for slicing, which may support your use case. However, there is no timeline yet.
Thanks, @brills. The type of the date/timestamp feature is DATETIME. Could you please point me to an example of a custom slicer implementation, if possible?
Sorry, which DATETIME type do you mean? I don't think TFDV supports such types (only integral, floating-point and string/bytes). Our feature value slicer is no different from other potential custom slicers:
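As a rough illustration of the idea of a custom range slicer (not TFDV's own slicer code, and not its API), the sketch below encodes timestamps as integers, since only integral, floating-point and string/bytes features are supported, and then splits a batch by inclusive ranges. Everything here is an assumption made for illustration: the plain dict-of-lists `Batch`, the helper names `to_epoch_seconds` and `make_range_slicer`, the feature names, and the concrete dates standing in for t1 to t10. TFDV's real slice functions operate on its own batch representation, so the body would need adapting.

```python
from datetime import datetime, timezone
from typing import Callable, Dict, Iterator, List, Sequence, Tuple

# Hypothetical stand-in for a batch of examples: column name -> list of values.
Batch = Dict[str, List]
# (slice name, inclusive lower bound, inclusive upper bound), in epoch seconds.
Range = Tuple[str, int, int]


def to_epoch_seconds(iso_timestamp: str) -> int:
    """Encode an ISO-8601 date or timestamp as integer seconds since the Unix epoch."""
    parsed = datetime.fromisoformat(iso_timestamp)
    if parsed.tzinfo is None:
        # Assumption: timestamps without a zone are treated as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return int(parsed.timestamp())


def make_range_slicer(
    feature: str, ranges: Sequence[Range]
) -> Callable[[Batch], Iterator[Tuple[str, Batch]]]:
    """Return a function that yields (slice name, sub-batch) for each non-empty range."""

    def slicer(batch: Batch) -> Iterator[Tuple[str, Batch]]:
        for name, low, high in ranges:
            # Indices of rows whose feature value lies inside [low, high].
            rows = [
                i
                for i, value in enumerate(batch[feature])
                if value is not None and low <= value <= high
            ]
            if rows:
                yield name, {col: [vals[i] for i in rows] for col, vals in batch.items()}

    return slicer


# Usage: three date ranges with a gap, mirroring t1-t3, t4-t6 and t8-t10.
dates = ("2020-01-01", "2020-01-05", "2020-01-07", "2020-01-09")
batch = {
    "event_time": [to_epoch_seconds(d) for d in dates],
    "amount": [10.0, 20.0, 30.0, 40.0],
}
ranges = [
    ("t1-t3", to_epoch_seconds("2020-01-01"), to_epoch_seconds("2020-01-03")),
    ("t4-t6", to_epoch_seconds("2020-01-04"), to_epoch_seconds("2020-01-06")),
    ("t8-t10", to_epoch_seconds("2020-01-08"), to_epoch_seconds("2020-01-10")),
]
slices = dict(make_range_slicer("event_time", ranges)(batch))
```

A row whose timestamp falls in the gap between ranges (here 2020-01-07) ends up in no slice, so whether gaps are intended is worth deciding before comparing per-slice statistics.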
Did you eventually implement this slicer yourself, @srinivasaraov?
@axeltidemann: Not yet. This has been deprioritised for us for now. I will post an update when I implement it.
Cool, I think it would be a very useful feature.
In TensorFlow Data Validation, there is a method, slicing_util.get_feature_value_slicer(), to slice data based on a feature value.
Is it possible to slice the data based on a date range using this method and compare the sliced datasets?
Let's say I have 'n' records within the date range t1-t10. If I want to split the data into 3 sets that fall in the date ranges t1-t3, t4-t6 and t8-t10, is that possible with the above slicing method?