Skip to content

Notebook

xuwenyihust edited this page Jan 8, 2024 · 8 revisions

Summary

DataPulse supports running Jupyter notebooks with PySpark integration on Kubernetes.

QuickStart

Create Notebook

Notebook is delpoyed as a Kubernetes deployment, and exposed as a Kubernetes service.

The notebook image uses the wenyixu101/all-spark-notebook.

To create a notebook, access the following service:

Alt text

Within the notebook, run the following code to automatically create a Spark session:

start()

Create Spark Session

Check Spark UI

Notebook Persistence

Event Log Persistence

Clone this wiki locally