Skip to content
View y-preethi's full-sized avatar

Block or report y-preethi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. DataPipeline_airflow_pyspark DataPipeline_airflow_pyspark Public

    An end-to-end ETL pipeline that uses Apache Airflow DAGs to orchestrate multi-stage data ingestion, transformation, and loading workflows. Large-scale datasets are processed using PySpark integrate…

    Python

  2. batch_medallion batch_medallion Public

    A scalable batch ETL pipeline using PySpark and Azure Data Lake Storage that implements the Bronze-Silver-Gold medallion architecture to progressively clean, validate, and aggregate raw data into a…

    Python

  3. kafka_streaming kafka_streaming Public

    A real time streaming pipeline using Apache Kafka producers/consumers and Spark Structured Streaming to process event data with subsecond latency. Built with consumer group partitioning, offset che…

    Python

  4. data data Public

    Forked from saayam-for-all/data

    ML based micro service that uses historical data stored on AWS S3 and real time data to come up with real time responses.

    Jupyter Notebook