Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments.
Druid excels as a data warehousing solution for fast aggregate queries on petabyte-sized data sets. It supports a variety of flexible filters, exact calculations, and approximate algorithms.
Druid can load both streaming and batch data and integrates with Samza, Kafka, Storm, and Hadoop.
More information about Druid can be found at http://www.druid.io.
You can find the latest Druid documentation on the project website.
If you would like to contribute documentation, please do so under
/docs/content in this repository and submit a pull request.
We have a series of tutorials to help you get started with Druid. If you are new to Druid, we suggest beginning with the first Druid tutorial.
If you find any bugs, please file a GitHub issue.
Community support is available on the druid-user mailing list.
Development discussions occur on the druid-development list.
We also have a couple of people hanging out on IRC in #druid-dev on
irc.freenode.net.