Adapted from Software Carpentry.
This example data analysis project counts the occurrences of every word in 4 novels and produces a report listing the 10 most frequent words in each book.
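As a rough illustration of the kind of computation involved, here is a minimal sketch of a top-N word count. It is not the code in this repo: the function name `top_words` and the tokenization rule (lowercased runs of letters and apostrophes) are assumptions made for the example.

```python
import re
from collections import Counter


def top_words(text, n=10):
    """Return the n most frequent words in text as (word, count) pairs."""
    # Assumed tokenization: lowercase, then keep runs of letters/apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common(n)


sample = "the cat and the hat and the bat"
print(top_words(sample, n=2))  # [('the', 3), ('and', 2)]
```

The scripts in this project may tokenize differently (for example, in how they treat punctuation or hyphenated words), so their counts need not match this sketch exactly.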
Clone this repo and, using the command line, navigate to the root of the project:
git clone <repo_name>
cd <folder_name>
Option 1: Use `conda-lock.yml`
2.1.1 Run the following command to create the conda environment:
conda-lock install --name ia4 conda-lock.yml
2.1.2 Activate the conda environment:
conda activate ia4
2.1.3 Run the analysis:
bash runall.sh
Option 2: Use `environment.yml`
2.2.1 Create a conda environment using `environment.yml`:
conda env create -n ia4 -f environment.yml
2.2.2 Activate the conda environment:
conda activate ia4
2.2.3 Run the analysis:
bash runall.sh
Option 3: Use `docker-compose.yml`
2.3.1 Pull and launch the Docker container. There is no GUI; the command opens a terminal inside the container:
docker compose run --rm ia4
2.3.3 You will land directly in the terminal of the container. Run the analysis:
bash runall.sh
2.3.4 When you are done, type `exit` to leave the Docker container.
Your task is to add a "smarter" data analysis pipeline using GNU Make!
It should accomplish the same task as `bash runall.sh` when you type `make all`.
It should reset the analysis to the starting point (the state when you first copied this repo) when you type `make clean`.
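What makes Make "smarter" than a plain shell script is that it reruns a step only when the step's output is missing or older than one of its inputs. The Python sketch below illustrates that rule only; it is not part of the assignment, and the function name `needs_rebuild` is hypothetical.

```python
import os


def needs_rebuild(target, dependencies):
    """Return True if target is missing or older than any dependency."""
    if not os.path.exists(target):
        return True
    target_mtime = os.path.getmtime(target)
    # Make applies the same idea: compare file modification times.
    return any(os.path.getmtime(dep) > target_mtime for dep in dependencies)
```

In a Makefile you express this declaratively: each rule names a target, the files it depends on, and the command that builds it, and Make performs the timestamp comparison for you.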
- GNU Make
- Quarto
- Python & Python libraries:
  - click
  - matplotlib
  - pandas