
NIFI-15568: Fix Iceberg timestamp handling and add S3 storage class support #10877

Open
NirYanay2005 wants to merge 2 commits into apache:main from NirYanay2005:NIFI-15568

Conversation

@NirYanay2005

Summary

NIFI-15568

This change improves Apache Iceberg integration in NiFi by addressing two related issues:

  1. Adds support for configuring the S3 storage class in S3IcebergFileIOProvider, which is required for certain on-prem or S3-compatible object stores.
  2. Fixes timestamp type compatibility issues when writing Parquet-backed Iceberg tables by converting java.sql.Timestamp values to java.time.LocalDateTime, including correct handling for nested records, collections, and partitioned tables.

The timestamp fix ensures compatibility with Iceberg’s internal expectations and allows writes to succeed when timestamp columns are used as partition keys.
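The core of the conversion described above can be sketched as follows. This is a minimal illustration, not the actual NiFi code; the class and method names are assumptions:

```java
import java.sql.Timestamp;
import java.time.LocalDateTime;

// Minimal sketch of the conversion described in the summary: Iceberg
// expects java.time.LocalDateTime for timestamp columns, while NiFi
// Records may carry java.sql.Timestamp. Names here are illustrative,
// not the actual NiFi API.
public class TimestampConversionSketch {

    // Convert Timestamp values; pass everything else through unchanged
    public static Object convertIfTimestamp(final Object value) {
        if (value instanceof Timestamp) {
            return ((Timestamp) value).toLocalDateTime();
        }
        return value;
    }
}
```

For nested records and collections, the same check would have to be applied recursively to each element.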

Tracking

Please complete the following tracking steps prior to pull request creation.

Issue Tracking

Pull Request Tracking

  • Pull Request title starts with Apache NiFi Jira issue number, such as NIFI-15568
  • Pull Request commit message starts with Apache NiFi Jira issue number, such as NIFI-15568
  • Pull request contains commits signed with a registered key indicating Verified status

Pull Request Formatting

  • Pull Request based on current revision of the main branch
  • Pull Request refers to a feature branch with one commit containing changes

Verification

Please indicate the verification steps performed prior to pull request creation.

Build

  • Build completed using ./mvnw clean install -P contrib-check
    • JDK 21
    • [ ] JDK 25

Licensing

  • New dependencies are compatible with the Apache License 2.0
  • New dependencies are documented in applicable LICENSE and NOTICE files

Documentation

  • Documentation formatting appears as expected in rendered files

Contributor

@exceptionfactory exceptionfactory left a comment


Thanks for submitting the revised pull request @NirYanay2005.

Please review the pull request instructions and run a build with the contrib-check profile enabled to correct formatting issues such as missing license headers.

@exceptionfactory
Contributor

Please note that each commit must also be signed with a key associated with your GitHub profile for verification.

@NirYanay2005
Author

I added a key to my commits and fixed all contrib-check problems.

@exceptionfactory
Contributor

> I added a key to my commits and fixed all contrib-check problems.

Thanks @NirYanay2005, the initial commit is now signed, but the subsequent commits are not. Can you squash all commits on this branch to ensure they are all signed?

  • Removed unnecessary code that already existed in main
  • Added licenses and formatting
  • Fixed all contrib-check issues
@NirYanay2005
Author

> I added a key to my commits and fixed all contrib-check problems.
>
> Thanks @NirYanay2005, the initial commit is now signed, but the subsequent commits are not. Can you squash all commits on this branch to ensure they are all signed?

Ok, I squashed all commits

Contributor

@exceptionfactory exceptionfactory left a comment


Thanks for making the initial adjustments @NirYanay2005.

The new Storage Class property looks good. If you are interested in getting that merged more quickly, it would be worth breaking that out to a separate Jira issue and pull request.

Regarding the Record field conversion, the problem makes sense, but the proposed solution does not appear to be the best way forward. Introducing a new IcebergRecordConverter in the Parquet module does not seem like the right location for shared Record formatting, although that is worth further consideration. More importantly, running each Record through a conversion process places additional overhead on the entire process.

Instead of this approach, constructing the Iceberg Record object as needed in the PutIcebergRecord Processor seems like the optimal place for changes. The initial implementation of the delegating Record depends heavily on NiFi Record field conversion. That would be the first potential place to make a change for handling type conversion in an optimal way.

If you are willing to work through an alternative approach, I can review options. Alternatively, I may take a closer look at the problem and could propose an alternative solution.

Thanks again for working on these issues and feel free to follow up on how you would like to proceed.

@NirYanay2005
Author

About the Storage Class: I need both and can't use one without the other, so I don't mind waiting a bit more.
I want to work through an improved approach, but I need a bit more direction on where the change should be made.
I understand that introducing a separate IcebergRecordConverter is not ideal, and that the better location would be during Iceberg Record construction inside PutIcebergRecord.
However, I’m not yet fully familiar with how PutIcebergRecord constructs the Iceberg Record object and where NiFi Record field conversion is applied.
Could you point me to the specific method in PutIcebergRecord where timestamp type handling would be most appropriate?
I’m happy to iterate on the implementation once I understand the intended extension point.

@exceptionfactory
Contributor

> About the Storage Class: I need both and can't use one without the other, so I don't mind waiting a bit more. I want to work through an improved approach, but I need a bit more direction on where the change should be made. I understand that introducing a separate IcebergRecordConverter is not ideal, and that the better location would be during Iceberg Record construction inside PutIcebergRecord. However, I'm not yet fully familiar with how PutIcebergRecord constructs the Iceberg Record object and where NiFi Record field conversion is applied. Could you point me to the specific method in PutIcebergRecord where timestamp type handling would be most appropriate? I'm happy to iterate on the implementation once I understand the intended extension point.

Thanks for the reply. I recommend tracing through PutIcebergRecord and looking at DelegatedRecord as a starting point.

@NirYanay2005
Author

NirYanay2005 commented Feb 22, 2026

I changed the implementation entirely and moved it into the DelegatedRecord get methods.
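To illustrate the on-access idea: converting when a field is read avoids a separate up-front pass over every Record. This is a hedged sketch; SimpleDelegatedRecord and its get method are illustrative names, not NiFi's actual DelegatedRecord API:

```java
import java.sql.Timestamp;
import java.util.Map;

// Hypothetical sketch of converting on access: instead of rewriting
// every Record in a separate conversion pass, the delegating record
// converts Timestamp values to LocalDateTime only when a field is
// read. Names below are illustrative, not the NiFi API.
public class SimpleDelegatedRecord {
    private final Map<String, Object> values;

    public SimpleDelegatedRecord(final Map<String, Object> values) {
        this.values = values;
    }

    // Convert lazily at read time so unaffected fields carry no overhead
    public Object get(final String fieldName) {
        final Object value = values.get(fieldName);
        if (value instanceof Timestamp) {
            return ((Timestamp) value).toLocalDateTime();
        }
        return value;
    }
}
```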

Contributor

@exceptionfactory exceptionfactory left a comment


Thanks for adjusting the direction @NirYanay2005. This looks to be on a good track. Regarding the InternalRecordWrapper, is that still needed? Are there any options to avoid the RecordWrapper in ParquetPartitionedWriter?

@NirYanay2005
Author

I tried without it and it failed: the conversion to a long is still needed after the conversion to LocalDateTime, but at a different phase.
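For context on the two phases: Iceberg computes partition values from timestamps represented as microseconds since the epoch, which is the role a wrapper such as InternalRecordWrapper plays after the LocalDateTime conversion. A hedged sketch of that second conversion, assuming UTC and an illustrative helper name:

```java
import java.time.Instant;
import java.time.LocalDateTime;
import java.time.ZoneOffset;

// Hypothetical illustration of the second conversion phase: the
// LocalDateTime produced for the writer still has to become a long
// (epoch microseconds) when partition values are computed. The class
// and method names are illustrative; UTC is assumed here.
public class TimestampMicros {
    public static long toEpochMicros(final LocalDateTime localDateTime) {
        final Instant instant = localDateTime.toInstant(ZoneOffset.UTC);
        // Combine whole seconds and the sub-second nanos, both in micros
        return instant.getEpochSecond() * 1_000_000L + instant.getNano() / 1_000L;
    }
}
```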

