Skip to content

Documentation and projects from the DataCamp Associate Data Scientist in Python track.

Notifications You must be signed in to change notification settings

kar137/datascience-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DataCamp Associate Data Scientist (Python Track) - Journey Documentation

Overview

This repository serves as a personal documentation of my journey through DataCamp's Associate Data Scientist (Python Track). Here, I will track my progress, summarize key concepts, and share insights gained from each course and project. This README will be updated regularly to reflect my learning milestones.


Table of Contents

  1. About the Track
  2. Courses and Progress
  3. Key Takeaways
  4. Projects
  5. How to Use This Repository

About the Track

The Associate Data Scientist (Python Track) by DataCamp is designed to equip learners with the foundational skills required to become a data scientist. The track covers essential topics such as data manipulation, visualization, statistical analysis, machine learning, and more, all using Python.


Courses and Progress

Below is a list of courses in the track, along with my progress and notes for each:

Course Status Notes
Introduction to Python Completed Learned basics of Python syntax, Python Lists, Functions and Packages and Numpy.
Intermediate Python Completed Matplotlib, Dictionaries & Pandas, Logic, Control flow & filtering, Loops and Case study: Hacker Stastistics.
Data manipulation with Pandas In Progress Transforming DataFrames, Aggregating DataFrames, Slicing & Indexing DataFrames, and Creating & visualizing DataFrames
Python Data Science Toolbox (Part 2) Not Started Focuses on iterators, generators, and list comprehensions.
Importing Data in Python (Part 1) Not Started Learn to import and clean data from various sources.
Importing Data in Python (Part 2) Not Started Dive deeper into data importing and cleaning techniques.
Cleaning Data in Python Not Started Focus on handling missing data and data quality issues.
pandas Foundations Not Started Master the pandas library for data manipulation and analysis.
Manipulating DataFrames with pandas Not Started Learn advanced DataFrame manipulation techniques.
Merging DataFrames with pandas Not Started Explore how to combine and merge datasets.
Introduction to Data Visualization with Python Not Started Learn to create visualizations using Matplotlib and Seaborn.
Intermediate Data Visualization with Python Not Started Dive deeper into advanced visualization techniques.
Introduction to Statistics in Python Not Started Learn statistical concepts and their application in Python.
Supervised Learning with scikit-learn Not Started Explore machine learning algorithms for supervised learning.
Unsupervised Learning with scikit-learn Not Started Learn clustering and dimensionality reduction techniques.
Machine Learning with Tree-Based Models in Python Not Started Focus on decision trees, random forests, and gradient boosting.

Key Takeaways

  • Python Basics: Understanding Python syntax, data structures, and control flow is essential for any data science project.
  • Data Manipulation: Mastering pandas is crucial for cleaning, transforming, and analyzing data.
  • Data Visualization: Visualizing data helps in uncovering patterns and communicating insights effectively.
  • Machine Learning: Learning supervised and unsupervised learning techniques is key to building predictive models.

Projects

Here are the projects included in the track, along with my progress and reflections:

Project Status Reflections
Investigating Netflix Movies In Progress Will use foundational python skills[Introduction & Intermediate] to manipulate and visualize movie data.
Exploring NYC Public School Test Result Scores Not Started Will analyze and visualize educational data.
Exploring 67 Years of LEGO Not Started Dive into the history of LEGO sets and themes.
The GitHub History of the Scala Language Not Started Analyze the development of Scala through GitHub data.

How to Use This Repository

  1. Track Progress: Use the Courses and Progress table to see my journey through the track.
  2. Learn from Notes: Refer to the notes and key takeaways for summaries of each course.
  3. Explore Projects: Check out the projects section for detailed reflections and outcomes.
  4. Contribute: If you have suggestions or feedback, feel free to open an issue or pull request.

Happy learning! 🚀

About

Documentation and projects from the DataCamp Associate Data Scientist in Python track.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published