Replacing feature name with feature path in statistics proto #73

Open
paulgc opened this issue Jul 12, 2019 · 0 comments

paulgc commented Jul 12, 2019

As a step toward supporting statistics over structured data (e.g., arbitrary protocol buffers, Parquet data), TFDV will populate path instead of name for each feature in the output statistics proto. A path contains a repeated string field, step, so a top-level feature is represented as a path with a single step.

Current behavior

features {
  name: 'foo'
}

New behavior

features {
  path {
    step: 'foo'
  }
}

The new behavior takes effect in TFDV 0.14. The validation API will remain backward compatible with statistics protos that have name populated.
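
Code that reads statistics protos written both before and after this change needs to accept either form. Below is a minimal sketch of that normalization in plain Python. The Path and FeatureStats classes are hypothetical stand-ins for the proto messages, and feature_steps is an illustrative helper, not part of the TFDV API.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Path:
    # Hypothetical stand-in for the proto's path message: a repeated string field `step`.
    step: List[str] = field(default_factory=list)


@dataclass
class FeatureStats:
    # Hypothetical stand-in for one `features` entry; either name or path is set.
    name: Optional[str] = None
    path: Optional[Path] = None


def feature_steps(feature: FeatureStats) -> Tuple[str, ...]:
    """Return the feature's identity as a tuple of path steps."""
    if feature.path is not None and feature.path.step:
        # New behavior: the path is populated.
        return tuple(feature.path.step)
    if feature.name:
        # Old behavior: treat the name as a single-step path.
        return (feature.name,)
    raise ValueError("feature has neither name nor path populated")


old_style = FeatureStats(name="foo")
new_style = FeatureStats(path=Path(step=["foo"]))
assert feature_steps(old_style) == feature_steps(new_style)
```

Normalizing to a tuple of steps means downstream lookups can key on the same value regardless of which TFDV version produced the statistics.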

tfx-copybara pushed a commit that referenced this issue Jul 17, 2019
… prepare for the support for STRUCT features.

*** BREAKING CHANGE ***
Correspondingly, made TFDV populate FeatureNameStatistics.path __instead__ of .name.

Also see the announcement: #73
***********************

Also changed BasicStatsGenerator to assume a single chunk in its input table.

PiperOrigin-RevId: 258579936