Welcome to the 883 Group Onboarding for Newbies! This repository is your gateway to a structured 10-chapter program, designed to seamlessly integrate you into our dynamic Data Ops team. The onboarding program is designed like "100 DAYS OF CODE", each chapter divided into days, and each day is dedicated to a specific topic or skill set that you'll be exploring. Each day will have a set of tasks, goals, and resources to guide you through the learning process. You'll also have the opportunity to engage in discussions, Q&A sessions, and hands-on exercises to reinforce your understanding and apply your knowledge in real-world scenarios.
- Day is not a day, it's just a session.
- Day 00: Welcome & Introduction - Understand the onboarding process and the 883 Group's vision and mission 🎯
- Day 01: Introduction to Big Data - Core Concepts
- Day 02: Customer Interaction and Insight into Data Operations & Data Lifecycle in the 883 Group :bow_and_arrow:
- Day 03: Introduction to Data concept & Hadoop Ecosystem (Hadoop, HDFS, MapReduce, YARN, Hive, ZooKeeper):elephant:
- Day 04: Introduction to Data concept & Hadoop Ecosystem (HBase, Spark, Kafka, Impala, Partitioning, Kerberos, Oozie):elephant::elephant:
- Final Exercise 01: Introduction to Data Ops Team and Hadoop Ecosystem concepts Q&A ❓
- Practical Exercise 01: Meet the Hadoop Ecosystem in Action :computer:
- Showcase 01: Develop a high-level understanding of the Hadoop ecosystem and its role in big data processing. 🔎
- Day 05: Mastering OOP and Design Principles with Test-Driven Development (TDD)
- Day 06: Writing Clean Code
- Day 07: Embarking on the Docker Journey
- Final Exercise 02: Clean Code Refactoring
- Final Exercise 03: Embracing DevOps: GitLab, Docker, and CI/CD Pipeline
- Bonus - 60 Linux Commands you NEED to know (in 10 minutes) 🔥
- Day 08: Openshift Fundamentals for Data Ops using "zero to hero" Openshift labs
- Day 09: Helm for Kubernetes Package Management using "Hero to Master" Openshift labs
- Day 10: Openshift "Best Practises" Openshift labs
- Day 11: Basic Spark Topics
- Day 12: Advanced Spark Topics
- Day 13: Exploring Spark SQL
- Final Exercise 04: Spark Q&A and Discussion
- Final Exercise 05: Spark RDD Exercises with Joker and Batman
- SKILA: 💃
- Showcase 02: In-Depth Exploration of Spark Component(s)
- Day 14: Introduction to Apache Airflow concepts, terminology and architecture
- Day 15: exploring the Airflow web UI for workflow management
- Day 16: Introduction to Trino concepts, terminology and architecture
- Day 17: Introduction to 883 Culture concepts, terminology and architecture.
- BIG Final Exercise: Senior Data Infra Engineer Onboarding Final Exercise 🚀
- Showcase: Prepare a presentation to the group to showcase the story, architecture, insights gained, and troubleshooting strategies of your final exercise solution in just two sentences.
- Day 18: Create documentation for a sample project from the 883 Group and present it to the team.(ask your mentor for the link to the project)
- Day 19: Open Source Contribution
- Day 20: Review the onboarding process and suggest improvements.
If you have any suggestions or ideas to enhance the onboarding for future members, feel free to contribute. Fork, create a feature branch, commit changes, and create a pull request.