fix: Pre-create S3A event log dir before SparkContext init #6317

ntkathole merged 5 commits into feast-dev:master
Conversation
Force-pushed from c8351c5 to 448212d
R-behera left a comment
This looks like a useful guard for the S3A event log edge case, and the focused tests help. One follow-up worth considering is whether some Feast users rely on credentials or endpoint details only through Spark/Hadoop config rather than environment variables. If so, a short note or test around that path could prevent surprises when the pre-create step runs before Spark fully applies the config.
    "spark.hadoop.fs.s3a.endpoint",
    os.environ.get("FEAST_S3A_ENDPOINT", ""),
)
access_key = os.environ.get("AWS_ACCESS_KEY_ID", "")
access_key = spark_config.get(
    "spark.hadoop.fs.s3a.access.key",
    os.environ.get("AWS_ACCESS_KEY_ID", ""),
)
secret_key = spark_config.get(
    "spark.hadoop.fs.s3a.secret.key",
    os.environ.get("AWS_SECRET_ACCESS_KEY", ""),
)
session_token = spark_config.get(
    "spark.hadoop.fs.s3a.session.token",
    os.environ.get("AWS_SESSION_TOKEN", ""),
) or None
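As a quick illustration of the precedence this suggestion establishes, here is a minimal, self-contained sketch (not the Feast code itself; `spark_config` is a plain dict standing in for the real Spark conf):

```python
import os

def resolve_s3a_credentials(spark_config: dict) -> tuple:
    """Resolve credentials: Spark/Hadoop config first, env vars as fallback."""
    access_key = spark_config.get(
        "spark.hadoop.fs.s3a.access.key",
        os.environ.get("AWS_ACCESS_KEY_ID", ""),
    )
    secret_key = spark_config.get(
        "spark.hadoop.fs.s3a.secret.key",
        os.environ.get("AWS_SECRET_ACCESS_KEY", ""),
    )
    # `or None` collapses an empty string to None so boto3 can fall back
    # to its own default credential chain.
    session_token = spark_config.get(
        "spark.hadoop.fs.s3a.session.token",
        os.environ.get("AWS_SESSION_TOKEN", ""),
    ) or None
    return access_key, secret_key, session_token

# Spark config wins over the environment when both are set.
os.environ["AWS_ACCESS_KEY_ID"] = "env-key"
creds = resolve_s3a_credentials({"spark.hadoop.fs.s3a.access.key": "conf-key"})
print(creds[0])  # conf-key
```

This keeps users who configure credentials only through Spark/Hadoop config (the case R-behera raised) working, while env-only setups are unaffected.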
@abhijeet-dhumal Let's handle both the comment from devin and @R-behera's suggestion.
Force-pushed from b60d47c to 19bdd11
@ntkathole Addressed both your comments ✅

@R-behera Good catch on the Spark/Hadoop config credentials path ✅
endpoint = spark_config.get(
    "spark.hadoop.fs.s3a.endpoint",
    os.environ.get("FEAST_S3A_ENDPOINT", ""),
Wondering if this can be AWS_ENDPOINT_URL instead, or at least we should document this new env var in our docs?
Good call, switched to AWS_ENDPOINT_URL. No custom env vars to document now. Spark config (spark.hadoop.fs.s3a.endpoint) still takes precedence when set.
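The endpoint lookup after this change reduces to a two-level fallback. A minimal sketch (illustrative only; `spark_config` is a plain dict standing in for the real Spark conf):

```python
import os

def resolve_s3a_endpoint(spark_config: dict):
    """spark.hadoop.fs.s3a.endpoint wins; AWS_ENDPOINT_URL is the fallback."""
    endpoint = spark_config.get(
        "spark.hadoop.fs.s3a.endpoint",
        os.environ.get("AWS_ENDPOINT_URL", ""),
    )
    # An empty string means "no custom endpoint"; boto3 expects None then.
    return endpoint or None

os.environ["AWS_ENDPOINT_URL"] = "http://localhost:9000"
print(resolve_s3a_endpoint({}))  # http://localhost:9000
print(resolve_s3a_endpoint({"spark.hadoop.fs.s3a.endpoint": "https://s3.example.com"}))
```

AWS_ENDPOINT_URL is the standard variable the AWS SDKs already honor, so nothing Feast-specific needs documenting.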
@abhijeet-dhumal let's fix the linting
aws_access_key_id=access_key or None,
aws_secret_access_key=secret_key or None,
aws_session_token=session_token,
config=BotoConfig(signature_version="s3v4"),
Also, consider supporting MinIO or other path-style addressing:
addressing_style = (
"path"
if spark_config.get("spark.hadoop.fs.s3a.path.style.access", "false").lower() == "true"
else "auto"
)
config=BotoConfig(
    signature_version="s3v4",
    s3={"addressing_style": addressing_style},
)
Added ✅. _ensure_s3a_event_log_dir now reads spark.hadoop.fs.s3a.path.style.access and passes addressing_style: "path" to BotoConfig when it's "true", otherwise defaults to "auto". Tests cover both paths.
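The toggle boils down to a single conditional; a self-contained sketch covering both paths (with `spark_config` as a plain dict standing in for the real conf):

```python
def resolve_addressing_style(spark_config: dict) -> str:
    """Map the S3A path-style flag to boto3's addressing_style value."""
    # "path" for MinIO-style path addressing, "auto" otherwise.
    flag = spark_config.get("spark.hadoop.fs.s3a.path.style.access", "false")
    return "path" if flag.lower() == "true" else "auto"

# Both paths, mirroring the tests mentioned above.
assert resolve_addressing_style({"spark.hadoop.fs.s3a.path.style.access": "true"}) == "path"
assert resolve_addressing_style({}) == "auto"
```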
…prevent silent materialize failure

Spark's EventLogFileWriter.requireLogBaseDirAsDirectory() is called inside SparkContext.__init__. When spark.eventLog.dir points to an S3A path that doesn't exist yet (S3 has no real directories), SparkContext fails to initialise — silently from Feast's perspective, because _materialize_one() catches the exception and returns an ERROR job.

Add _ensure_s3a_event_log_dir() to utils.py: before building the SparkSession, check if the S3A prefix exists and write a zero-byte placeholder if it doesn't. Uses boto3 (already a Feast dep via the S3 offline store). Non-fatal: logs a warning and lets Spark surface its own error if the write fails.

Signed-off-by: abhijeet-dhumal <abhijeetdhumal652@gmail.com>
… config, add session token support
Signed-off-by: abhijeet-dhumal <abhijeetdhumal652@gmail.com>

…linting
Signed-off-by: abhijeet-dhumal <abhijeetdhumal652@gmail.com>
Signed-off-by: abhijeet-dhumal <abhijeetdhumal652@gmail.com>
Signed-off-by: abhijeet-dhumal <abhijeetdhumal652@gmail.com>
Force-pushed from 22b7e8e to 70215e2
What this PR does / why we need it:
When spark.eventLog.enabled: "true" and spark.eventLog.dir points to an S3A path, feast materialize-incremental silently writes nothing to the online store and exits with code 0.
The failure chain:
S3 has no real directories. An empty prefix is indistinguishable from "does not exist", so Spark's pre-flight check always fails on a fresh bucket.
Which issue(s) this PR fixes:
In get_or_create_new_spark_session() (compute_engines/spark/utils.py), before building the SparkSession, call _ensure_s3a_event_log_dir() which:
No-ops for non-S3A paths (hdfs://, file://, etc.) and when event logging is disabled.
Checks
Testing Strategy
Misc