Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]: Tempo drop the interpolated column in Databricks DBR14 #417

Open
1 task done
srggrs opened this issue Nov 21, 2024 · 1 comment
Open
1 task done

[BUG]: Tempo drop the interpolated column in Databricks DBR14 #417

srggrs opened this issue Nov 21, 2024 · 1 comment
Assignees
Labels
bug Something isn't working

Comments

@srggrs
Copy link

srggrs commented Nov 21, 2024

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

In Databricks DBR13.3 cluster. This code will return the interpolated column with no problem.

# this table has id, day, ts (timestamp) and signal columns
input_table = spark.table("my_input_table")

assert input_table.columns == ["id", "day", "ts", "signal"], "cols are not the same"

transformed_data = TSDF(input_table, ts_col="ts", partition_cols=["id", "day"])

interpolated = (
    transformed_data.resample(freq="5 minutes", func="mean")
    .interpolate(method="linear")
    .df
)

interpolated.columns == ["id", "day", "ts", "signal"], "cols are not the same"

Expected Behavior

When upgrading to DBR14 I would expect there are no columns dropped and the interpolated dataframe has the same columns as the input one

Steps To Reproduce

  1. Set up a compute cluster with DBR13 LTS and one with DBR 14 LTS
  2. have an input table with similar columns as above, perharps even just one partition column
  3. Run the code above to see the difference between the two enviroments

Cloud

AWS

Version

dbl-tempo==0.1.27

Relevant log output

No response

@srggrs srggrs added the bug Something isn't working label Nov 21, 2024
@tnixon tnixon self-assigned this Dec 4, 2024
@tnixon
Copy link
Contributor

tnixon commented Dec 4, 2024

Hi @srggrs - thank you for reporting this issue to us. We are currently in process of a major re-write of the Tempo API, and we expect this to be available very soon. We will make sure that this issue does not come up in the new version. If this is a blocker for you, we might be able to try to develop a fix, but right now all our time is focused on the new release.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants