Ensure the survival of at-risk species in order to maintain the level of biodiversity within their parks.
- Understand the characteristics of the species and their conservation status using descriptive statistics
- Explore the relationship between the species and the national parks using statistical inference
- Project goals: Define the high-level objectives and set the intentions for this project.
- Data: The data for this project is already provided, but it still needs to be checked to confirm that the project goals can be met with it.
- Analysis: Think through the methods and questions that align with the project goals.
- Evaluation: Build conclusions and findings from the analysis.
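As an illustration of the kind of statistical inference the analysis step might use, here is a minimal sketch of a chi-square test of independence on a 2x2 table, for example "protected vs. not protected" counts for two species categories. The function name `chi_square_2x2` and the counts are hypothetical and are not taken from the project data. The sketch uses only the standard library and avoids syntax that would break under Python 2.7.

```python
import math


def chi_square_2x2(table):
    """Return (statistic, p_value) for a 2x2 contingency table.

    `table` is [[a, b], [c, d]]. No continuity correction is applied.
    """
    (a, b), (c, d) = table
    n = float(a + b + c + d)  # float() keeps the division exact on Python 2.7
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected

    # With 1 degree of freedom, the chi-square tail probability
    # reduces to erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


# Made-up counts: rows are two categories, columns are
# [protected, not protected]. These are NOT real project results.
example_table = [[10, 90], [20, 80]]
statistic, p_value = chi_square_2x2(example_table)
print("chi-square = {0:.3f}, p-value = {1:.4f}".format(statistic, p_value))
```

A small p-value would suggest that protection status differs between the two categories. Whether a 2x2 test is the right tool depends on the questions chosen during the analysis step.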
This project requires Python 2.7 and the following Python libraries installed:
You will also need software installed to run and execute a Jupyter Notebook.
If you do not have Python installed yet, it is highly recommended that you install the Anaconda distribution of Python, which already has the above packages and more included. Make sure that you select the Python 2.7 installer and not the Python 3.x installer.
Template code is provided in the eda-bio-diversity.ipynb Jupyter Notebook file.
In a terminal or command window, navigate to the top-level project directory (that contains this README) and run one of the following commands:
jupyter notebook eda-bio-diversity.ipynb
or
ipython notebook eda-bio-diversity.ipynb
This will open the Jupyter Notebook software and project file in your web browser.
Features
The species_info.csv file contains information on the different species in the national parks. The columns in the data set include:
- category - The taxonomic category of each species
- scientific_name - The scientific name of each species
- common_names - The common names of each species
- conservation_status - The conservation status of each species
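The columns above lend themselves to a simple descriptive summary. The following is a minimal sketch that counts species per category and conservation status using only the standard library. The sample rows are made up and only mimic the column layout. Treating an empty `conservation_status` as `(blank)` is an assumption about how missing values appear in the file.

```python
import csv
from collections import Counter

# Made-up rows that only mimic the column layout described above.
SAMPLE_SPECIES_CSV = """category,scientific_name,common_names,conservation_status
Mammal,Species alpha,Alpha Mouse,Endangered
Mammal,Species beta,Beta Fox,
Bird,Species gamma,Gamma Wren,Endangered
Bird,Species delta,Delta Owl,
Bird,Species epsilon,Epsilon Finch,
"""


def status_counts(rows):
    """Count rows per (category, conservation_status) pair."""
    counts = Counter()
    for row in rows:
        # Assumption: a missing status shows up as an empty string.
        status = row["conservation_status"] or "(blank)"
        counts[(row["category"], status)] += 1
    return counts


# csv.DictReader accepts any iterable of lines, so a list of strings works.
species_rows = list(csv.DictReader(SAMPLE_SPECIES_CSV.splitlines()))
counts = status_counts(species_rows)

for (category, status), n in sorted(counts.items()):
    print("{0:<8} {1:<12} {2}".format(category, status, n))
```

To run this against the real data, the rows could instead be read from an open file handle on species_info.csv and passed to `csv.DictReader` in the same way.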
The Observations.csv file contains information from recorded sightings of different species throughout the national parks in the past 7 days. The columns included are:
- scientific_name - The scientific name of each species
- park_name - The name of the national park
- observations - The number of observations in the past 7 days
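Because both files share the `scientific_name` column, the observations can be linked back to each species' category. The following is a minimal sketch of that join, written with the standard library only. The sample rows, park names, and the helper name `observations_by_park_and_category` are hypothetical. Labelling unmatched species as `(unknown)` is a choice made for this illustration.

```python
import csv
from collections import defaultdict

# Made-up rows that only mimic the two column layouts described above.
SAMPLE_SPECIES_CSV = """category,scientific_name,common_names,conservation_status
Mammal,Species alpha,Alpha Mouse,Endangered
Bird,Species gamma,Gamma Wren,
"""

SAMPLE_OBSERVATIONS_CSV = """scientific_name,park_name,observations
Species alpha,Park One,12
Species alpha,Park Two,30
Species gamma,Park One,7
Species gamma,Park Two,5
Species omega,Park One,3
"""


def observations_by_park_and_category(species_rows, observation_rows):
    """Sum observations per (park_name, category) pair."""
    # If a scientific_name appears more than once, the last row wins.
    category_of = {row["scientific_name"]: row["category"] for row in species_rows}

    totals = defaultdict(int)
    for row in observation_rows:
        # Species missing from the species table are grouped as "(unknown)".
        category = category_of.get(row["scientific_name"], "(unknown)")
        totals[(row["park_name"], category)] += int(row["observations"])
    return dict(totals)


species_rows = list(csv.DictReader(SAMPLE_SPECIES_CSV.splitlines()))
observation_rows = list(csv.DictReader(SAMPLE_OBSERVATIONS_CSV.splitlines()))
totals = observations_by_park_and_category(species_rows, observation_rows)

for (park, category), n in sorted(totals.items()):
    print("{0:<10} {1:<10} {2}".format(park, category, n))
```

A table like this is one possible starting point for comparing how sightings of each category are distributed across parks.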