fix: Handle potential overflow in internal state for `avg(decimal)` by AdamGS · Pull Request #22714 · apache/datafusion

AdamGS · 2026-06-02T12:18:01Z

Which issue does this PR close?

Closes #22713 (Decimal average can overflow because its inner intermediate sum state overflows its storage size).

Rationale for this change

Fixes a bug in avg that currently prevents us from running TPC-DS q1. I think this issue is masked with Parquet because the current implementation infers those columns as Decimal128.

What changes are included in this PR?

Marks Decimal32/64 as R in sqllogictest, like the larger decimal types, and fixes some tests that used ?. (This could be a separate PR, but it seems very small.)
Adds a test for Decimal32 overflow.
DecimalAvgAccumulator now takes another type to hold its inner sum accumulator, which can differ from the input/output type.
Decimal32 and Decimal64 use i64 and i128 (respectively) to prevent an overflow (should i128 use i256 here?).
Adds some unit tests for the AVG impl building blocks.
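
The widening idea in the list above can be sketched without any Arrow types. This is a minimal illustration, not the PR's code: `WideSumAvg` is a hypothetical name, it works on raw unscaled `i32` values standing in for Decimal32, and it skips the rescaling to the target precision and scale that the real accumulator performs.

```rust
/// Hypothetical stand-in for an AVG accumulator over Decimal32 inputs.
/// Inputs are unscaled i32 values; the running sum is held in a wider i64.
#[derive(Debug, Default)]
pub struct WideSumAvg {
    sum: i64,   // wider than the i32 input; it takes billions of maximal rows to overflow
    count: u64, // number of non-null inputs seen so far
}

impl WideSumAvg {
    /// Fold a batch of optional unscaled values into the state, skipping nulls.
    pub fn update_batch(&mut self, values: &[Option<i32>]) {
        for v in values.iter().flatten() {
            self.sum += i64::from(*v);
            self.count += 1;
        }
    }

    /// Merge another partial state, e.g. one produced by a different partition.
    pub fn merge(&mut self, other: &WideSumAvg) {
        self.sum += other.sum;
        self.count += other.count;
    }

    /// Truncated average in the input's unscaled representation,
    /// or None if no non-null rows were seen.
    pub fn evaluate(&self) -> Option<i32> {
        if self.count == 0 {
            return None;
        }
        let avg = self.sum / self.count as i64;
        // Narrow back to the native type only at the very end.
        i32::try_from(avg).ok()
    }
}
```

Feeding three copies of `i32::MAX` into this state still evaluates to `i32::MAX`, whereas an `i32` running sum would already have overflowed on the second row.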

Are these changes tested?

An additional SLT test that would have overflowed internally, plus more focused unit tests for AvgGroupsAccumulator.

Are there any user-facing changes?

None, just more code that doesn't currently work and will work now.

kosiew

@AdamGS

Thanks for working on this. The widening approach for Decimal32/64 looks good, but I think there are still correctness gaps around Decimal128 accumulation that need to be addressed before this can land. I also have a couple of non-blocking suggestions around SLT normalization and reducing duplication in the decimal averaging paths.

kosiew · 2026-06-05T07:52:38Z

-                    target_precision: *target_precision,
-                    target_scale: *target_scale,
-                })),
+                ) => Ok(Box::new(DecimalAvgAccumulator::<Decimal128Type>::new(


I think there is still a correctness issue here for Decimal128 AVG. The accumulator state remains Decimal128Type, and the new summation path uses add_wrapping (for example at lines 651, 672, and 947). That means intermediate overflow can still silently wrap before DecimalAverager gets a chance to run.

For example, avg(arrow_cast('999999999999999999999999999999.9999', 'Decimal128(34, 4)')) over roughly 20,000 rows has a valid Decimal128(38, 8) result, but the intermediate Decimal128 sum exceeds i128::MAX and wraps along the way. At that point the average is already corrupted even though the final result would be representable.

Could we widen the Decimal128 accumulation state (for example to Decimal256) or otherwise use checked/compensated accumulation so intermediate overflow does not invalidate averages whose final result fits?
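
To make the arithmetic concrete, here is a small standalone sketch using plain `i128` values rather than Arrow arrays. The function names are hypothetical, and `wrapping_add` stands in for the `add_wrapping` call mentioned above. The unscaled form of the example value is thirty-four nines, so the running sum passes `i128::MAX` at 17,015 rows, which is in line with the "roughly 20,000 rows" estimate.

```rust
/// Unscaled value of 999999999999999999999999999999.9999 as Decimal128(34, 4):
/// thirty-four nines, i.e. 10^34 - 1.
pub const VALUE: i128 = 10_i128.pow(34) - 1;

/// Sum `rows` copies of `value` with wrapping addition on the native i128.
/// Overflow is silent: the result simply wraps around.
pub fn wrapping_sum(value: i128, rows: u32) -> i128 {
    (0..rows).fold(0_i128, |acc, _| acc.wrapping_add(value))
}

/// The same sum with checked addition: returns None at the first overflow.
pub fn checked_sum(value: i128, rows: u32) -> Option<i128> {
    (0..rows).try_fold(0_i128, |acc, _| acc.checked_add(value))
}

/// Smallest row count at which the running i128 sum of a positive `value` overflows.
pub fn first_overflowing_row_count(value: i128) -> i128 {
    i128::MAX / value + 1
}
```

With checked addition the overflow surfaces as `None` instead of a corrupted sum, which is one of the two options raised above; the other is to hold the sum in a 256-bit state.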

kosiew · 2026-06-05T07:53:46Z

                (
                    Decimal32(_, scale),
                    Decimal32(target_precision, target_scale),
                ) => Ok(Box::new(DecimalDistinctAvgAccumulator::<Decimal32Type>::with_decimal_params(


I think the same issue still exists for AVG(DISTINCT) on decimals. The distinct path constructs DecimalDistinctAvgAccumulator::<Decimal32Type>, Decimal64Type, and Decimal128Type, and those accumulators still sum distinct values in the native type using wrapping arithmetic.

AVG(DISTINCT) is still an average, so it can hit the same intermediate overflow problem whenever the average is representable but the sum is not. Could the widening/state-type fix be applied here as well? Otherwise it would be good to explicitly narrow the supported contract and add tests that document the unsupported behavior.
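
A minimal sketch of the distinct case, assuming raw unscaled `i32` values in place of Decimal32 and ignoring rescaling. Both function names are hypothetical and neither is DataFusion's `DecimalDistinctAvgAccumulator`. Three distinct values near the Decimal32 maximum are enough to overflow an `i32` sum even though their average fits.

```rust
use std::collections::HashSet;

/// Average of the distinct unscaled values, with the sum held in i64.
/// Truncates toward zero; returns None for empty input.
pub fn distinct_avg_wide(values: &[i32]) -> Option<i32> {
    let distinct: HashSet<i32> = values.iter().copied().collect();
    if distinct.is_empty() {
        return None;
    }
    let sum: i64 = distinct.iter().map(|&v| i64::from(v)).sum();
    i32::try_from(sum / distinct.len() as i64).ok()
}

/// The same computation with the sum kept in the native i32 using wrapping
/// addition, which silently corrupts the result once the sum overflows.
pub fn distinct_avg_wrapping(values: &[i32]) -> Option<i32> {
    let distinct: HashSet<i32> = values.iter().copied().collect();
    if distinct.is_empty() {
        return None;
    }
    let sum = distinct.iter().fold(0_i32, |acc, &v| acc.wrapping_add(v));
    Some(sum / distinct.len() as i32)
}
```

For the inputs 999999999, 999999998, 999999997 (with one duplicate), the widened version returns 999999998 while the wrapping version returns a negative number.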

kosiew · 2026-06-05T07:54:47Z

            DataType::Float16
            | DataType::Float32
            | DataType::Float64
+            | DataType::Decimal32(_, _)


Small suggestion: mapping Decimal32/Decimal64 globally to DFColumnType::Float makes SLT comparisons approximate for these decimal types and could hide formatting or rounding regressions that would otherwise be caught.

If the motivation is only the affected power/round queries, would it make sense to keep exact/text comparisons there instead? At minimum, it may be worth documenting why all Decimal32/64 SLT output is now treated approximately.

kosiew · 2026-06-05T07:55:50Z

@@ -365,17 +370,27 @@
                Decimal32(_sum_precision, sum_scale),


Small refactoring suggestion: the Decimal32 and Decimal64 branches both follow the same pattern of creating a wider DecimalAverager, dividing by a widened count, and then try_from-ing back to the output native type.

It might be nice to factor that into a helper such as avg_decimal_with_wider_sum. That would encode the widening invariant in one place and help keep the accumulator and group-accumulator paths aligned over time.
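
One possible shape for such a helper, as a sketch only: the name comes from the suggestion above, but the signature and trait bounds are assumptions, and the rescaling that DecimalAverager applies is left out. It divides a widened sum by a widened count and narrows the result with `try_from`.

```rust
use std::ops::Div;

/// Hypothetical helper: `Wide` is the sum/state type (e.g. i64 for Decimal32,
/// i128 for Decimal64) and `Narrow` is the output native type.
/// Rescaling to the target scale is intentionally omitted from this sketch.
pub fn avg_decimal_with_wider_sum<Wide, Narrow>(sum: Wide, count: Wide) -> Result<Narrow, String>
where
    Wide: Div<Output = Wide> + PartialEq + Default,
    Narrow: TryFrom<Wide>,
{
    // Default::default() is zero for the integer types used here.
    if count == Wide::default() {
        return Err("cannot average zero rows".to_string());
    }
    // Divide in the wide type, then narrow once at the end.
    Narrow::try_from(sum / count)
        .map_err(|_| "average does not fit the output type".to_string())
}
```

Both the accumulator and the group-accumulator paths could call a helper like this with `(i64, i32)` for Decimal32 and `(i128, i64)` for Decimal64, so the widening rule lives in one place.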

fix decimal intermidiate overflow

cdd0af7

github-actions bot added the sqllogictest (SQL Logic Tests, .slt) and functions (Changes to functions implementation) labels on Jun 2, 2026

AdamGS mentioned this pull request Jun 2, 2026

Document why the Arrow exporter keeps Decimal128 as the default decimal width vortex-data/vortex#8197

Draft

kosiew requested changes Jun 5, 2026

AdamGS wants to merge 1 commit into apache:main from AdamGS:adamg/fix-avg-decimal-overflow
