If you prefer local access (rather than running an EC2 instance for the Apache Spark history server), you can instead use Docker to start the Apache Spark history server and view the Spark UI locally. This Dockerfile is a sample; modify it to meet your requirements.
- Install Docker
- Download a Dockerfile from GitHub
- Run the command shown below
$ docker build -t glue/sparkui:latest .
Using an AWS access key and secret key pair
- Run the commands shown below
- Set LOG_DIR by replacing s3a://path_to_eventlog with your event log directory
- Set AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY with your valid AWS credentials
$ LOG_DIR="s3a://path_to_eventlog/"
$ AWS_ACCESS_KEY_ID="AKIAxxxxxxxxxxxx"
$ AWS_SECRET_ACCESS_KEY="yyyyyyyyyyyyyyy"
$ docker run -itd -e SPARK_HISTORY_OPTS="$SPARK_HISTORY_OPTS \
    -Dspark.history.fs.logDirectory=$LOG_DIR \
    -Dspark.hadoop.fs.s3a.access.key=$AWS_ACCESS_KEY_ID \
    -Dspark.hadoop.fs.s3a.secret.key=$AWS_SECRET_ACCESS_KEY" \
    -p 18080:18080 glue/sparkui:latest \
    "/opt/spark/bin/spark-class org.apache.spark.deploy.history.HistoryServer"
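Under the hood, the docker run command above simply hands your settings to the history server's JVM as Java system properties via SPARK_HISTORY_OPTS. A minimal sketch of how that string is assembled, using the placeholder values from this guide (substitute your own path and credentials):

```shell
# Placeholder values -- replace with your real event log path and credentials.
LOG_DIR="s3a://path_to_eventlog/"
AWS_ACCESS_KEY_ID="AKIAxxxxxxxxxxxx"
AWS_SECRET_ACCESS_KEY="yyyyyyyyyyyyyyy"

# Compose the Java system properties the history server will read.
# ${SPARK_HISTORY_OPTS:-} preserves any options already set in your shell.
SPARK_HISTORY_OPTS="${SPARK_HISTORY_OPTS:-} \
  -Dspark.history.fs.logDirectory=$LOG_DIR \
  -Dspark.hadoop.fs.s3a.access.key=$AWS_ACCESS_KEY_ID \
  -Dspark.hadoop.fs.s3a.secret.key=$AWS_SECRET_ACCESS_KEY"

echo "$SPARK_HISTORY_OPTS"
```

Note that the variables are expanded by your local shell before docker run starts, so they do not need to be exported; only the already-expanded SPARK_HISTORY_OPTS string is passed into the container.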
Using AWS temporary credentials
- Run the commands shown below
- Set LOG_DIR by replacing s3a://path_to_eventlog with your event log directory
- Set AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and SESSION_TOKEN with your valid AWS credentials
$ LOG_DIR="s3a://path_to_eventlog/"
$ AWS_ACCESS_KEY_ID="ASIAxxxxxxxxxxxx"
$ AWS_SECRET_ACCESS_KEY="yyyyyyyyyyyyyyy"
$ SESSION_TOKEN="zzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzzz"
$ docker run -itd -e SPARK_HISTORY_OPTS="$SPARK_HISTORY_OPTS \
    -Dspark.history.fs.logDirectory=$LOG_DIR \
    -Dspark.hadoop.fs.s3a.access.key=$AWS_ACCESS_KEY_ID \
    -Dspark.hadoop.fs.s3a.secret.key=$AWS_SECRET_ACCESS_KEY \
    -Dspark.hadoop.fs.s3a.session.token=$SESSION_TOKEN \
    -Dspark.hadoop.fs.s3a.aws.credentials.provider=org.apache.hadoop.fs.s3a.TemporaryAWSCredentialsProvider" \
    -p 18080:18080 glue/sparkui:latest \
    "/opt/spark/bin/spark-class org.apache.spark.deploy.history.HistoryServer"
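Because a missing or empty variable here silently produces a malformed S3A property and only surfaces later as an authentication error, it can help to fail fast before launching the container. A minimal sketch, not part of the official instructions; check_vars is a hypothetical helper:

```shell
# Hypothetical pre-flight check: verify every variable the docker run
# command interpolates is set and non-empty, and report the first one
# that is missing instead of debugging an S3A auth failure later.
check_vars() {
  for name in LOG_DIR AWS_ACCESS_KEY_ID AWS_SECRET_ACCESS_KEY SESSION_TOKEN; do
    # Indirect lookup of the variable named in $name (POSIX sh compatible).
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "ERROR: $name is not set" >&2
      return 1
    fi
  done
  echo "all required variables are set"
}
```

Run check_vars immediately before the docker run command; it prints an error to stderr and returns non-zero if anything is missing, so you can guard the launch with `check_vars && docker run ...`.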
- Open http://localhost:18080 in your browser