@zhengruifeng
Contributor

What changes were proposed in this pull request?

Upgrade the minimum version of pyarrow to 18.0.0

Why are the changes needed?

1. pyarrow 18.0.0 was released on Oct 28, 2024;
2. pyarrow has a security issue, PYSEC-2024-161, that affects versions 4.0.0 through 16.1.0; upgrading to 17+ is recommended;
3. since 18.0.0, pyarrow no longer depends on numpy, which makes the dependencies simpler to resolve.
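
To illustrate what a minimum-version bump means in practice, here is a minimal sketch of a numeric version gate. It is not PySpark's actual check; the names `MINIMUM_PYARROW_VERSION`, `parse_version`, and `meets_minimum` are hypothetical and chosen for this example only. It compares release components as integers, because a plain string comparison would wrongly rank "4.0.0" above "18.0.0".

```python
import re

# Assumption for illustration: the minimum version proposed in this PR.
MINIMUM_PYARROW_VERSION = "18.0.0"


def parse_version(version: str) -> tuple:
    """Return the leading numeric release components, padded to three parts.

    Suffixes such as ".dev0" or "rc1" are ignored in this sketch.
    """
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"Unrecognized version string: {version!r}")
    parts = [int(part) for part in match.group(0).split(".")]
    # Pad so that "18" compares equal to "18.0.0".
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(installed: str, minimum: str = MINIMUM_PYARROW_VERSION) -> bool:
    """Return True if the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)
```

In a real environment the installed version could be taken from `pyarrow.__version__` and passed to `meets_minimum`; under this sketch, 16.1.0 (the last version listed as affected by PYSEC-2024-161) and 17.0.0 would both be rejected, while 18.0.0 and later would pass.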

Does this PR introduce any user-facing change?

No.

How was this patch tested?

PR builder with

default: '{"PYSPARK_IMAGE_TO_TEST": "python-minimum", "PYTHON_TO_TEST": "python3.10"}'

Was this patch authored or co-authored using generative AI tooling?

No.

@zhengruifeng changed the title from "[SPARK-54849][PYTHON] Upgrade the minimum version of pyarrow to 18.0.0" to "[WIP][SPARK-54849][PYTHON] Upgrade the minimum version of pyarrow to 18.0.0" on Dec 26, 2025
@zhengruifeng
Contributor Author

zhengruifeng commented Dec 26, 2025

A known bug in pyarrow.RecordBatch.filter in pyarrow 17.x/18.x causes the mapInArrow tests to fail:

https://github.com/zhengruifeng/spark/actions/runs/20514586584/job/58940753469

I will try 19.0.0 instead.
