Skip to content

[WIP]#6413

Open
ramitg254 wants to merge 7 commits intoapache:masterfrom
ramitg254:HIVE-29413
Open

[WIP]#6413
ramitg254 wants to merge 7 commits intoapache:masterfrom
ramitg254:HIVE-29413

Conversation

@ramitg254
Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

@deniskuzZ
Copy link
Copy Markdown
Member

@ramitg254 please take a look: 9e7535c. I would suggest following similar approach

@ramitg254
Copy link
Copy Markdown
Contributor Author

ramitg254 commented Apr 10, 2026

9e7535c

but here we are creating separate method getEffectivePartCols() and leaving getPartCols() as it is, which as per our discussion on that closed pr we shouldn't do that, and only go ahead with updating getPartCols()

@deniskuzZ
Copy link
Copy Markdown
Member

deniskuzZ commented Apr 10, 2026

9e7535c

but here we are creating separate method getEffectivePartCols() and leaving getPartCols() as it is, which as per our discussion on that closed pr we shouldn't do that, and only go ahead with updating getPartCols()

Where did I say that? The ask was to keep the original method unchanged. same here

@ramitg254
Copy link
Copy Markdown
Contributor Author

ramitg254 commented Apr 10, 2026

oh I got confused due to this comment: #6337 (comment) in which getSupportedPartCols() was just separate method similar to getEffectivePartCols()

@ramitg254
Copy link
Copy Markdown
Contributor Author

ramitg254 commented Apr 10, 2026

I am fine with that earlier approach as well but recently I saw this one: https://issues.apache.org/jira/browse/HIVE-29525 so I thought we should have unified getPartCols() and getCols() which gives similar results as native hive tables as first step towards solving this after that those plan logics can be taken care of later on when that ticket will be addressed.
So I was first focussing on making getPartCols() unified for iceberg tables as well.

please share your thoughts on this idea

@deniskuzZ
Copy link
Copy Markdown
Member

deniskuzZ commented Apr 10, 2026

I am fine with that earlier approach as well but recently I saw this one: https://issues.apache.org/jira/browse/HIVE-29525 so I thought we should have unified getPartCols() and getCols() which gives similar results as native hive tables as first step towards solving this after that those plan logics can be taken care of later on when that ticket will be addressed. So I was first focussing on making getPartCols() unified for iceberg tables as well.

please share your thoughts on this idea

I also tried unified getPartCols(), but that ended up requiring column deduplication or adding supportNonNativePart in several places. So effectivePartCol was more of a cleaner workaround.

That said, you’re right—we might be missing some optimizations with the new method. If you’re already looking into it, maybe try getting everything working with a single getPartCols() first. That way you can identify all the spots that need adjustments (re-enable optimizations), and then switch back to the effectivePartCol API and drop those interim patches.

@sonarqubecloud
Copy link
Copy Markdown

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants