Welcome to the 883 Group Onboarding for Newbies! This repository is your gateway to a structured 10-chapter program, designed to seamlessly integrate you into our dynamic Data Ops team. The onboarding program is designed like "100 DAYS OF CODE", each chapter divided into days, and each day is dedicated to a specific topic or skill set that you'll be exploring. Each day will have a set of tasks, goals, and resources to guide you through the learning process. You'll also have the opportunity to engage in discussions, Q&A sessions, and hands-on exercises to reinforce your understanding and apply your knowledge in real-world scenarios.
PAY ATTENTION:
- Day 00: Welcome & Introduction - Understand the onboarding process and the 883 Group's vision and mission 🎯
- Day 01: Introduction to Big Data - Core Concepts
- Day 02: Customer Interaction and Insight into Data Operations & Data Lifecycle in the 883 Group :bow_and_arrow:
- Day 03: Introduction to Data concept & Hadoop Ecosystem
- Day 04: Hadoop Distributed File System (HDFS)
- Day 05: MapReduce Programming Model, YARN & Oozie
- Day 06: Introduction to Partitioning & Hive
- Day 07: Introduction to Impala
- Day 08: Introduction to HBase
- [Final Exercise 01 - SKILA]: Introduction to Data Ops Team and Hadoop Ecosystem concepts Q&A ❓
- Practical Exercise 01: Meet the Hadoop Ecosystem in Action :computer:
- Showcase 01: Develop a high-level understanding of the Hadoop ecosystem and its role in big data processing. 🔎
- Day 09: Opening lecture
- Day 10: Embracing Docker
- Day 11: Openshift fundamentals
- Day 12: Python internals
- Day 13: The philosophy of writing code
- Day 14: Setting up your development environment
- Day 15: Getting familiar with Python syntax
- Day 16: Mastering testing & CI
- Day 17: Code shipping & Deployment
- Appendix: Group codebases introduction
- Day 19: Basic Spark Topics
- Day 20: Advanced Spark Topics
- Day 21: Exploring Spark SQL
- Final Exercise 04: Spark Q&A and Discussion
- Final Exercise 05: Spark RDD Exercises with Joker and Batman
- SKILA: 💃
- Showcase 02: In-Depth Exploration of Spark Component(s)
- Day 22: Introduction to Apache Airflow concepts, terminology and architecture
- Day 23: exploring the Airflow web UI for workflow management
- Day 24: Introduction to Trino concepts, terminology and architecture
- Day 25: Introduction to 883 Culture concepts, terminology and architecture.
- BIG Final Exercise: Senior Data Infra Engineer Onboarding Final Exercise 🚀
- Showcase: Prepare a presentation to the group to showcase the story, architecture, insights gained, and troubleshooting strategies of your final exercise solution in just two sentences.
- Day 26: Create documentation for a sample project from the 883 Group and present it to the team.(ask your mentor for the link to the project)
- Day 27: Open Source Contribution
- Day 28: Review the onboarding process and suggest improvements.
If you have any suggestions or ideas to enhance the onboarding for future members, feel free to contribute. Fork, create a feature branch, commit changes, and create a pull request.