HIVE-29457: HiveSortExchangePullUpConstantsRule doesn't remove consta… by soumyakanti3578 · Pull Request #6316 · apache/hive

soumyakanti3578 · 2026-02-12T02:07:11Z

…nt column from distribution keys

What changes were proposed in this pull request? & Why are the changes needed?

Explained in https://issues.apache.org/jira/browse/HIVE-29457

Does this PR introduce any user-facing change?

No

How was this patch tested?

mvn test -pl itests/qtest -Pitests -Dtest=TestMiniLlapLocalCliDriver -Dtest.output.overwrite=true -o -Dqfile="distribution_key_constant_value.q"

…nt column from distribution keys

ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/HiveRelDistribution.java

zabetak · 2026-02-18T20:18:18Z

ql/src/test/queries/clientpositive/distribution_key_constant_value.q

+SELECT col1 FROM test
+WHERE col2 = 'a'
+DISTRIBUTE BY col1, col2     
+SORT BY col1, col2; 


Can we drop the SORT BY to minimize the repro?

SELECT col1, col2 FROM test WHERE col2 = 'a' DISTRIBUTE BY col1, col2

Unfortunately this fails with:

EXPLAIN CBO SELECT col1 FROM test WHERE col2 = 'a' DISTRIBUTE BY col1, col2 fname=distribution_key_constant_value.q See ./ql/target/tmp/log/hive.log or ./itests/qtest/target/tmp/log/hive.log, or check ./ql/target/surefire-reports or ./itests/qtest/target/surefire-reports/ for specific test cases logs. org.apache.hadoop.hive.ql.parse.SemanticException: Line 6:20 Invalid table alias or column reference 'col2': (possible column names are: col1) at org.apache.hadoop.hive.ql.parse.CalcitePlanner.genAllRexNode(CalcitePlanner.java:5224) at org.apache.hadoop.hive.ql.parse.CalcitePlanner.genAllRexNode(CalcitePlanner.java:5154) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$OrderByRelBuilder.getOrderByExpression(CalcitePlanner.java:5475) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$OrderByRelBuilder.genSortByKey(CalcitePlanner.java:5441) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$OrderByRelBuilder.addRelDistribution(CalcitePlanner.java:5507) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$CalcitePlannerAction.genSBLogicalPlan(CalcitePlanner.java:3945) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$CalcitePlannerAction.genLogicalPlan(CalcitePlanner.java:4975) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$CalcitePlannerAction.apply(CalcitePlanner.java:1611) at org.apache.hadoop.hive.ql.parse.CalcitePlanner$CalcitePlannerAction.apply(CalcitePlanner.java:1553) at org.apache.calcite.tools.Frameworks.lambda$withPlanner$0(Frameworks.java:140) at org.apache.calcite.prepare.CalcitePrepareImpl.perform(CalcitePrepareImpl.java:936) at org.apache.calcite.tools.Frameworks.withPrepare(Frameworks.java:191) at org.apache.calcite.tools.Frameworks.withPlanner(Frameworks.java:135) at org.apache.hadoop.hive.ql.parse.CalcitePlanner.logicalPlan(CalcitePlanner.java:1331) at org.apache.hadoop.hive.ql.parse.CalcitePlanner.genOPTree(CalcitePlanner.java:588) at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:13222) at org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(CalcitePlanner.java:481) at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:358) at org.apache.hadoop.hive.ql.parse.ExplainSemanticAnalyzer.analyzeInternal(ExplainSemanticAnalyzer.java:187) at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:358) at org.apache.hadoop.hive.ql.Compiler.analyze(Compiler.java:224)

I think this is a bug and should be resolved in another ticket.

If you use the query I shared above (with col2 in the SELECT) it doesn't throw the Invalid table alias.

Yes, that worked, thanks! And I have updated the test in the latest commit.

zabetak

We could do a bit of refactoring in HiveSortPullUpConstantsRule but it does not have to happen necessarily as part of this PR. We could leave it for a follow-up.

zabetak · 2026-02-20T18:21:15Z

ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/rules/HiveSortPullUpConstantsRule.java

+    private RelDistribution applyToDistribution(
+        RelDistribution distribution, Mappings.TargetMapping mapping) {
+      List<Integer> newKeys = new ArrayList<>();
+      for (int key : distribution.getKeys()) {
+        final int target = mapping.getTargetOpt(key);
+        if (target < 0) {
+          // It is a constant, we can ignore it
+          continue;
+        }
+        newKeys.add(target);
+      }
+
+      return new HiveRelDistribution(distribution.getType(), newKeys);
+    }


This is very similar to applyToFieldCollations it would be nice to see if we can refactor some of the commons parts together.

I tried moving the for loop which checks if the key is present in the mapping to a new method. However, this doesn't really simplify applyToFieldCollations as we still need this loop: for (RelFieldCollation fc : relCollation.getFieldCollations()).

Maybe we can revisit this in a follow up.

sonarqubecloud · 2026-02-23T19:30:59Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

soumyakanti3578 · 2026-02-23T21:58:05Z

@zabetak
Since you have approved this and I have minimized the test and the subsequent tests are green, I am merging this to master.

HIVE-29457: HiveSortExchangePullUpConstantsRule doesn't remove consta…

e59ffa6

…nt column from distribution keys

asf-ci-hive added tests pending tests passed and removed tests pending labels Feb 12, 2026

zabetak reviewed Feb 18, 2026

View reviewed changes

address review comments

14774e1

asf-ci-hive added tests pending tests passed and removed tests passed tests pending labels Feb 19, 2026

zabetak approved these changes Feb 20, 2026

View reviewed changes

Remove unnecessary SORT BY clause

4f077f4

asf-ci-hive added tests pending and removed tests passed labels Feb 23, 2026

asf-ci-hive added tests passed and removed tests pending labels Feb 23, 2026

soumyakanti3578 merged commit e876bb4 into apache:master Feb 23, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HIVE-29457: HiveSortExchangePullUpConstantsRule doesn't remove consta…#6316

HIVE-29457: HiveSortExchangePullUpConstantsRule doesn't remove consta…#6316
soumyakanti3578 merged 3 commits intoapache:masterfrom
soumyakanti3578:HIVE-29457

soumyakanti3578 commented Feb 12, 2026

Uh oh!

Uh oh!

zabetak Feb 18, 2026

Uh oh!

soumyakanti3578 Feb 19, 2026

Uh oh!

zabetak Feb 20, 2026

Uh oh!

soumyakanti3578 Feb 23, 2026

Uh oh!

zabetak left a comment

Uh oh!

zabetak Feb 20, 2026

Uh oh!

soumyakanti3578 Feb 23, 2026

Uh oh!

sonarqubecloud bot commented Feb 23, 2026

Uh oh!

soumyakanti3578 commented Feb 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

soumyakanti3578 commented Feb 12, 2026

What changes were proposed in this pull request? & Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

Uh oh!

zabetak Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

soumyakanti3578 Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

zabetak Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

soumyakanti3578 Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

zabetak left a comment

Choose a reason for hiding this comment

Uh oh!

zabetak Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

soumyakanti3578 Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Feb 23, 2026

Quality Gate passed

Uh oh!

soumyakanti3578 commented Feb 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants