pyspark-playground icon indicating copy to clipboard operation
pyspark-playground copied to clipboard

Spark History

Open marcelo-franceschini opened this issue 8 months ago • 3 comments

Thank you for sharing this repository.

I've been experimenting with it and encountered an issue with the Spark History Server. I've tried adjusting some environment configurations for both the master and worker nodes, but I still can't get it to display the logs.

image image

Here's my configuration docker compose file:

services:
  spark-master:
    container_name: da-spark-master
    user: root
    build: .
    image: da-spark-image
    entrypoint: ['./entrypoint.sh', 'master']
    environment:
      - SPARK_EVENTLOG_ENABLED=true
      - SPARK_EVENTLOG_DIR=/opt/spark/spark-events 
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8080"]
      interval: 5s
      timeout: 3s
      retries: 3
    volumes:
      - ./book_data:/opt/spark/data
      - ./spark_apps:/opt/spark/apps
      - /mnt/c/Users/boss/Desktop/logs:/opt/spark/spark-events
      - /mnt/d/DADOS:/home/jovyan/work
    env_file:
      - .env.spark
    ports:
      - '9090:8080'
      - '7077:7077'

  spark-history-server:
    container_name: da-spark-history
    user: root
    image: da-spark-image
    entrypoint: ['./entrypoint.sh', 'history']
    environment:
      - SPARK_EVENTLOG_ENABLED=true
      - SPARK_EVENTLOG_DIR=/opt/spark/spark-events 
    depends_on:
      - spark-master
    env_file:
      - .env.spark
    volumes:
      - /mnt/c/Users/boss/Desktop/logs:/opt/spark/spark-events
      - /mnt/d/DADOS:/home/jovyan/work
    ports:
      - '18080:18080'

  spark-worker:
    image: da-spark-image
    user: root
    entrypoint: ['./entrypoint.sh', 'worker']
    environment:
      - SPARK_EVENTLOG_ENABLED=true
      - SPARK_EVENTLOG_DIR=/opt/spark/spark-events 
    depends_on:
      - spark-master
    env_file:
      - .env.spark
    volumes:
      - ./book_data:/opt/spark/data
      - ./spark_apps:/opt/spark/apps
      - /mnt/c/Users/boss/Desktop/logs:/opt/spark/spark-events
      - /mnt/d/DADOS:/home/jovyan/work

  jupyter:
    image: quay.io/jupyter/pyspark-notebook:latest
    user: root
    container_name: Jupyter_Notebook
    depends_on:
      - spark-master
    ports:
      - '8888:8888'
      - '4040:4040'
    volumes:
      - /mnt/d/DADOS:/home/jovyan/work      
    environment:
      - JUPYTER_ENABLE_LAB=yes
      - PYSPARK_PYTHON=python3
      - SPARK_HOME=/usr/local/spark
      - PYSPARK_DRIVER_PYTHON=jupyter
      - PYSPARK_DRIVER_PYTHON_OPTS=notebook
      - USE_JUPYTER_LAB=yes
    restart: always
    command: start-notebook.sh --NotebookApp.token='' --NotebookApp.password='' --NotebookApp.notebook_dir='/home/jovyan/work'

image

Thanks!

marcelo-franceschini avatar Jun 14 '24 20:06 marcelo-franceschini