This repository provides a structured MLOps project that integrates SQL, Python, Tableau, and Excel-based analysis.
It includes SQL data queries, Python scripts for automation, and Tableau dashboards for visualization.
mlops_sql_project/
├── env/ # Environment Configuration
│ ├── .env # Environment variables (e.g. DB credentials)
│ ├── db_config.yaml # Database configuration (host, port, user, etc.)
│ ├── logging_config.yaml # Logging configuration (for Python logs)
│ └── settings.json # General project settings
│
├── Excel/ # Excel-based Analysis (exported reports or manual exploration)
│ ├── transactions_overview.xlsx
│ ├── sales_summary.xlsx
│ ├── customer_behavior.xlsx
│
├── Tableau/ # Tableau Dashboards & Reports
│ ├── sales_dashboard.twb # KPI dashboard for sales
│ ├── transaction_analysis.twb # Visualization of transaction flow
│ ├── customer_insights.twb # Customer segmentation & behavior
│
├── python/ # Data Generation & Automation
│ ├── generate_customers.py # Generates random customer data
│ ├── generate_orders.py # Generates random orders
│ ├── generate_products.py # Generates product catalog
│ ├── generate_transactions.py # Simulates transaction history
│ ├── transactions_overview.py # Generates Excel summary
│ ├── sales_summary.py # KPIs for Tableau dashboard
│ ├── customer_behavior.py # Behavior analysis summary
│
├── sql/ # Structured SQL Queries
│ ├── ddl/ # Schema definition (Create, Constraints)
│ │ ├── 01_create_database.sql
│ │ ├── 02_create_tables.sql
│ │ ├── 03_constraints.sql
│ │
│ ├── dml/ # Data Manipulation (Insert, Update, Delete)
│ │ ├── 00_truncate_tables.sql
│ │
│ ├── dql/ # Queries & Analysis
│ │ ├── a_checks/ # Data Validation & Structure
│ │ │ ├── 01_check_constraints.sql
│ │ │ ├── 02_check_all_foreign_keys.sql
│ │ │ ├── 03_check_table_dependencies.sql
│ │ │ ├── 04_check_indexes_primary_keys.sql
│ │ │ ├── 05_check_privileges.sql
│ │ │ └── 06_null_value_check.sql # Check for NULLs in key columns
│ │ │
│ │ ├── b_aggregations/ # Aggregation & Statistical Analysis
│ │ │ ├── 06_table_counts.sql
│ │ │ ├── 07_check_total_records.sql
│ │ │ ├── 08_counts_the_number_of_products.sql
│ │ │ ├── 09_min_max_and_average_price.sql
│ │ │ ├── 10_stock_statistics.sql
│ │ │ └── 11_sales_by_category.sql
│ │ │
│ │ ├── c_transactions/ # Orders & Transactions
│ │ │ ├── 11_top_expensive_orders.sql
│ │ │ ├── 12_orders_by_month.sql
│ │ │ ├── 13_random_orders_check.sql
│ │ │ ├── 14_customers_orders_join.sql
│ │ │ ├── 15_transaction_amount_summary.sql
│ │ │ ├── 16_transactions_by_payment.sql
│ │ │ ├── 17_daily_transaction_volume.sql
│ │ │ ├── 18_top_10_biggest_transactions.sql
│ │ │ ├── 19_top_10_biggest_customers.sql
│ │ │ └── 20_avg_transaction_per_customer.sql
│ │ │
│ │ ├── d_joins/ # Multi-table Joins & Relationships
│ │ │ ├── 20_join_customers_orders_products.sql
│ │ │ ├── 21_join_orders_transactions.sql
│ │ │ ├── 22_top_10_customers_by_spent.sql
│ │ │ ├── 23_avg_order_value.sql
│ │ │ ├── 24_returning_customers.sql
│ │ │ ├── 25_bonus.sql
│ │ │ ├── 26_cleaned_bonus.sql
│
├── environment.yaml # Conda environment setup
├── requirements.txt # pip packages (for production or alt install)
├── .gitignore # Git exclusions
├── LICENSE # Project License (e.g., MIT)
├── README.md # Project Documentation
✅ SQL-Powered Analytics with structured queries and joins.
✅ Excel + Tableau for reporting and visualization.
✅ Python Automation for data generation and preprocessing.
✅ Scalable Architecture for BI & Data Analysis.
- PostgreSQL – SQL-based data storage & queries
- Tableau – Interactive dashboards and reporting
- Excel – Static reports and aggregated insights
- Python – Data automation and preprocessing
- GitHub Actions – CI/CD for automation
git clone https://github.com/your-username/mlops_sql_project.git
cd mlops_sql_project2️⃣ Create a virtual environment (Optional)
conda env create -f environment.yaml
conda activate mlops_envpython -m venv venv
source venv/bin/activate # On macOS/Linux
venv\Scripts\activate # On Windows
pip install -r requirements.txt✅ Advanced SQL optimization ✅ Improved Tableau dashboards ✅ CI/CD for SQL workflow automation
This project is distributed under the MIT License. Feel free to use the code! 🚀
💻 GitHub Repository: Evgenii Matveev
🌐 Portfolio: Data Science Portfolio
📌 LinkedIn: Evgenii Matveev
🔥 If you like this project, don't forget to star ⭐ the repository! 🔥