Enable rounding for Decimal32 and Decimal64 in cuDF #17332

a-hirota · 2024-11-15T00:28:24Z

Description

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

Example of Behavior After Modification

The following example demonstrates the behavior of the round function for both float and Decimal32Dtype before and after the modification.

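import cudf

def check(t):
    # Build a one-column frame and cast it to the dtype under test.
    a = cudf.DataFrame({'a': [1.00, 2.054, 3.01]}).astype(t)
    print(type(t), a)
    # Round to one decimal place using the "half_up" rounding mode.
    a = a.round(decimals=1, how="half_up")
    print(type(t), a)

print('float')
check(float)
print('decimal')
# precision=3, scale=2 stores values such as 2.05 (two fractional digits).
check(cudf.Decimal32Dtype(precision=3, scale=2))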

Output

After Modification

For Decimal32Dtype, rounding works as expected:

float
<class 'type'>        a
0  1.000
1  2.054
2  3.010
<class 'type'>      a
0  1.0
1  2.1
2  3.0

decimal
<class 'cudf.core.dtypes.Decimal32Dtype'>       a
0  1.00
1  2.05
2  3.01
<class 'cudf.core.dtypes.Decimal32Dtype'>      a
0  1.0
1  2.1
2  3.0

Before Modification

For Decimal32Dtype, rounding did not modify the values:

float
<class 'type'>        a
0  1.000
1  2.054
2  3.010
<class 'type'>      a
0  1.0
1  2.1
2  3.0

decimal
<class 'cudf.core.dtypes.Decimal32Dtype'>       a
0  1.00
1  2.05
2  3.01
<class 'cudf.core.dtypes.Decimal32Dtype'>       a
0  1.00
1  2.05
2  3.01

This example shows that, prior to this modification, rounding had no effect on Decimal32Dtype columns, while after the change, Decimal32Dtype columns round as expected.
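As a CPU-side cross-check of the "After Modification" values, the same rounding can be reproduced with Python's standard decimal module. This is only a reference sketch under stated assumptions: it does not call cuDF, the helper name round_half_up is made up for illustration, and the inputs are the stored decimal values from the output above (1.00, 2.05, 3.01) rather than the original floats.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_half_up(values, decimals):
    """Round each Decimal to `decimals` places, ties going away from zero.

    Hypothetical helper for illustration; not part of cuDF.
    """
    # decimals=1 -> quantum of Decimal("0.1")
    quantum = Decimal(1).scaleb(-decimals)
    return [v.quantize(quantum, rounding=ROUND_HALF_UP) for v in values]


# Values as stored in the Decimal32Dtype(precision=3, scale=2) column above.
stored = [Decimal("1.00"), Decimal("2.05"), Decimal("3.01")]
rounded = round_half_up(stored, decimals=1)
print([str(v) for v in rounded])  # ['1.0', '2.1', '3.0']
```

The result matches the decimal column printed after the change, which is what a test for this PR would be expected to assert.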

Additional Information

Problem:
cuDF currently does not support rounding for Decimal types, limiting its functionality compared to libcudf, which does support it.
Solution:
Updated the cols definition in cudf/python/cudf/cudf/core/indexed_frame.py to allow rounding for Decimal32 and Decimal64 types. The modified code now checks whether col.dtype is a Decimal32 or Decimal64 dtype, so these columns are rounded directly without making a copy.
Alternatives Considered:
Using apply in pandas can achieve similar functionality, but it is not as efficient as a native rounding method in cuDF.
Additional Context:
- libcudf documentation indicates that Decimal32 and Decimal64 are supported for rounding.

This enhancement improves consistency between cuDF and libcudf by enabling direct rounding of Decimal types in cuDF.

copy-pr-bot · 2024-11-15T00:28:27Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

vyasr · 2024-11-15T00:55:57Z

Thanks @a-hirota! Could you please add a test that would have failed with the old code that succeeds now? You should be able to construct the expected decimal column and then compare it to what you get out of rounding.

- Added test cases for "half_up" and "half_even" rounding modes.
- Tested both Decimal32Dtype and Decimal64Dtype with various precisions and scales.
- Ensured coverage for edge cases like .5 rounding to the nearest even number in "half_even".
- Verified correctness of rounding logic across different decimal places.

a-hirota · 2024-11-15T08:48:18Z

Thank you @vyasr for your feedback!

I’ve added test cases that validate the rounding behavior for both Decimal32Dtype and Decimal64Dtype with the half_up and half_even rounding modes. These include scenarios where the previous code would have produced incorrect results, such as rounding .5 values inconsistently in the half_even mode. The expected output is constructed explicitly for each test case, and assertions ensure the correctness of the implementation.
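To make the difference between the two modes concrete, here is a minimal sketch of how explicit expected values for tie cases can be built and checked. It uses only Python's standard decimal module as a reference implementation; the names MODES and reference_round are assumptions made for this illustration, and this is not the test that was added to test_series.py.

```python
from decimal import Decimal, ROUND_HALF_EVEN, ROUND_HALF_UP

# Map the `how` strings used in this PR onto stdlib rounding constants.
MODES = {"half_up": ROUND_HALF_UP, "half_even": ROUND_HALF_EVEN}


def reference_round(values, decimals, how):
    """Round decimal strings to `decimals` places (hypothetical helper)."""
    quantum = Decimal(1).scaleb(-decimals)
    return [Decimal(v).quantize(quantum, rounding=MODES[how]) for v in values]


# Every input is an exact tie at one decimal place.
ties = ["0.25", "0.35", "1.45", "2.05"]
expected = {
    "half_up": ["0.3", "0.4", "1.5", "2.1"],    # ties move away from zero
    "half_even": ["0.2", "0.4", "1.4", "2.0"],  # ties move to the even digit
}

for how, want in expected.items():
    got = [str(v) for v in reference_round(ties, 1, how)]
    assert got == want, (how, got, want)
```

Only the tie inputs distinguish the two modes; a value such as 2.04 rounds to 2.0 under both, so a test that omits ties would not catch a mode mix-up.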

Let me know if there's anything else you'd like me to adjust or add!

vyasr · 2024-11-19T17:40:00Z

/ok to test

vyasr · 2024-11-20T00:29:11Z

@a-hirota looks like you need to fix the style, could you please run pre-commit?

a-hirota · 2024-11-20T02:07:19Z

@vyasr
Apologies, I have run pre-commit.

vyasr · 2024-11-20T04:19:34Z

No worries! Thank you.

vyasr · 2024-11-20T04:19:36Z

/ok to test

vyasr · 2024-11-20T16:46:44Z

/ok to test

vyasr · 2024-11-20T20:50:45Z

@a-hirota it looks like pre-commit is still making changes to the test that you added.

a-hirota · 2024-11-21T11:32:51Z

> @a-hirota it looks like pre-commit is still making changes to the test that you added.

@vyasr
Sorry, I forgot to push the test file...

vyasr · 2024-11-22T07:47:31Z

/ok to test

vyasr · 2024-12-07T00:50:24Z

/ok to test

mroeschke

Looks good. Thanks @a-hirota

mroeschke · 2024-12-10T01:41:16Z

/merge

a-hirota added 4 commits November 14, 2024 16:51

Include Decimal32 and Decimal64 in round operation

e030e52

my recent work

38d40d9

my recent work

6d75b0f

my recent work

83b384f

a-hirota requested a review from a team as a code owner November 15, 2024 00:28

a-hirota requested review from vyasr and brandon-b-miller November 15, 2024 00:28

github-actions bot assigned a-hirota Nov 15, 2024

github-actions bot added the Python (Affects Python cuDF API.) label Nov 15, 2024

Merge branch 'branch-24.12' into include-decimal-round

d89a6fb

mroeschke reviewed Nov 15, 2024

python/cudf/cudf/tests/test_series.py (outdated, resolved)

mroeschke reviewed Nov 15, 2024

python/cudf/cudf/tests/test_series.py (outdated, resolved)

mroeschke reviewed Nov 15, 2024

python/cudf/cudf/tests/test_series.py (resolved)

mroeschke reviewed Nov 15, 2024

python/cudf/cudf/core/indexed_frame.py (outdated, resolved)

a-hirota and others added 3 commits November 16, 2024 12:58

Update python/cudf/cudf/tests/test_series.py

3482094

Co-authored-by: Matthew Roeschke <[email protected]>

Update python/cudf/cudf/tests/test_series.py

1b897a2

Co-authored-by: Matthew Roeschke <[email protected]>

Addressed review feedback from mroeschke:

3a00194

Removed comments and print statements from the test function as requested. Clarified the approach for handling "half_up" and "half_even" rounding methods.

vyasr added the feature request (New feature or request) and non-breaking (Non-breaking change) labels Nov 19, 2024

Merge branch 'branch-24.12' into include-decimal-round

1d3d243

github-actions bot assigned vyasr Nov 19, 2024

a-hirota and others added 2 commits November 20, 2024 10:59

Merge branch 'branch-24.12' into include-decimal-round

05d1cf8

style: apply pre-commit fixes

3f1bade

Ran pre-commit to address style issues as per review feedback. This ensures consistency with project coding standards.

Merge branch 'branch-24.12' into include-decimal-round

a46f768

a-hirota and others added 3 commits November 21, 2024 20:25

fix: add missing test file after running pre-commit

3ebad5e

Merge branch 'branch-24.12' into include-decimal-round

f78bb57

Merge branch 'include-decimal-round' of https://github.com/a-hirota/cudf

a13cd51

vyasr changed the base branch from branch-24.12 to branch-25.02 November 22, 2024 07:46

Merge branch 'branch-25.02' into include-decimal-round

985d293

a-hirota and others added 2 commits December 3, 2024 10:49

Merge branch 'branch-25.02' into include-decimal-round

e313901

Merge branch 'branch-25.02' into include-decimal-round

43bbd09

vyasr requested a review from mroeschke December 7, 2024 00:50

mroeschke approved these changes Dec 10, 2024

rapids-bot bot merged commit 4764395 into rapidsai:branch-25.02 Dec 10, 2024
104 of 105 checks passed
