spark-streaming

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

python spark faker pyspark spark-streaming data-generation databricks synthetic-data datagen datagenerator deltalake datageneration delta-live-tables

Updated May 12, 2025
Python

harbby / sylph

Star

Stream computing platform for bigdata

java sql big-data spark-streaming flink sylph streamsql

Updated Apr 24, 2024
Java

microsoft / data-accelerator

Star

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Updated Mar 31, 2025
C#

databrickslabs / dqx

Star

Databricks framework to validate Data Quality of pySpark DataFrames

spark spark-streaming databricks data-quality-checks data-quality data-profiling dlt data-quality-monitoring

Updated May 28, 2025
Python

paypal / gimel

Star

Big Data Processing Framework - Unified Data API or SQL on Any Storage

python elasticsearch paypal scala kafka big-data spark cassandra jdbc hbase restapi pyspark spark-streaming aerospike teradata data-api gimel streaming-sql

Updated Dec 18, 2024
Scala

Azure / azure-event-hubs-spark

Star

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

microsoft streaming real-time scala kafka spark apache-spark stream connector azure bigdata apache spark-streaming eventhubs ingestion continuous event-hubs databricks structured-streaming

Updated Feb 14, 2025
Scala

mkuthan / example-spark

Star

Spark, Spark Streaming and Spark SQL unit testing strategies

testing spark spark-streaming

Updated Oct 12, 2016
Scala

Chabane / bigdata-playground

Star

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Updated Feb 1, 2019
TypeScript

spirom / spark-streaming-with-kafka

Star

Self-contained examples of Apache Spark streaming integrated with Apache Kafka.

scala kafka spark spark-streaming

Updated Apr 15, 2018
Scala

Improve this page

Add a description, image, and links to the spark-streaming topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spark-streaming topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spark-streaming

Here are 1,115 public repositories matching this topic...

Angel-ML / angel

lw-lin / CoolplaySpark

LuckyZXL2016 / Movie_Recommend

dotnet / spark

jacksu / utils4s

edp963 / wormhole

microsoft / Mobius

cdapio / cdap

lw-lin / streaming-readings

Stratio / sparta

spirom / LearningSpark

databrickslabs / dbldatagen

harbby / sylph

microsoft / data-accelerator

databrickslabs / dqx

paypal / gimel

Azure / azure-event-hubs-spark

mkuthan / example-spark

Chabane / bigdata-playground

spirom / spark-streaming-with-kafka

Improve this page

Add this topic to your repo