spark-example

This repository contains a sample Spark job in Python. The script demonstrates:

Reading data from a CSV file into a Spark DataFrame (titanic dataset).
Performing a simple transformation (grouping by the Age column and counting).
Optionally running a user-provided SQL query on the data (via Valohai parameters).
Writing the transformation and optional SQL results to disk.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
preprocess.py		preprocess.py
requirements.txt		requirements.txt
valohai.yaml		valohai.yaml

Provide feedback