- Python for Datascience
- Calculus
- Linear Algebra
- Probability
- Descriptive Statistics
- Random Variables
- Hypothesis Testing
- Optimizing Algorithms
- Data Science project structure.
2.1. Static Files (csv, json, yml).
2.2. Web Scraping tools and techniques. - How to connect to a SQL database using Python.
Project part I: Involves creating your own dataset from web scraping and storing it on SQL DB.
- Exploratory data analysis
- Data Cleaning
- Feature Engineering: creating new features from existing features.
- How to deal with outliers
- How to deal with missing data
- Label encoding and Normalization Techniques
Project Part II: Data cleaning
- Supervised and Unsupervised Learning
- Cross Validation (Overfitting vs underfitting)
- Introduction to the scikit-learn library.
- Metrics: Measuring your results
- Model parameters
- Regression Algorithms
- Classification Algorithms
- Clustering
- Hypertuning your machine learning algorithm
- Time Series Forecasting and Recommender Systems
- Introduction to Deep Learning.
- Business intelligence tools
- Using the ideal graph to show insights
- Communicating statistics in a simple way
- How to create a machine learning web app with Heroku.
- Look for a statistics case that can be solved in paper (finish the solution) and scale it to coding big amounts of data ---> can be the rats problem but reshaped.
- Look for an image classification good case-----> can be a case applied to medicine (xrays, etc)
- Look for an NLP, neural networks or recommender system(clustering) good case-----> Look for the new Spotify case in Kaggle competitions.