PySpark SparkSession Builder with Kubernetes Master PySpark SparkSession Builder with Kubernetes Master kubernetes kubernetes

PySpark SparkSession Builder with Kubernetes Master


pyspark client mode works on Spark's latest version 2.4.0

This is how I did it (in Jupyter lab):

import osos.environ['PYSPARK_PYTHON']="/usr/bin/python3.6"os.environ['PYSPARK_DRIVER_PYTHON']="/usr/bin/python3.6"from pyspark import SparkContext, SparkConffrom pyspark.sql import SparkSessionsparkConf = SparkConf()sparkConf.setMaster("k8s://https://localhost:6443")sparkConf.setAppName("KUBERNETES-IS-AWESOME")sparkConf.set("spark.kubernetes.container.image", "robot108/spark-py:latest")sparkConf.set("spark.kubernetes.namespace", "playground")spark = SparkSession.builder.config(conf=sparkConf).getOrCreate()sc = spark.sparkContext

Note: I am running kubernetes locally on Mac with Docker Desktop.