Apache® Druid data modeling
This guide teaches you the fundamentals of ingesting your data into Apache Druid. This course consists of:
- Designing a good schema: expert interviews and Python notebooks that help you create a good table
- Data processing: how to think about when and where to process and query your data
- Optimizing segment layout: understanding partitioning in Druid and how it affects processing
- Summarization and sketching: how to summarize your data at ingestion time and take advantage of approximation
Each module contains an introductory video, links to Python notebooks in the learn-druid environment, and related material such as docs and community videos.
If you'd like to talk to through what you learn in this course, why not say "hi" to Imply's DevRel team?
Welcome video
Watch the following video for an introduction to this course:
We're in the process of migrating this content. Check back soon.
Set up your learning environment
Before you start, set up the learning environmentset on your local machine.
Go to the learn-druid Github repository. Follow the instructions on the main page to download all the resources you need to start running Apache Druid and associated components locally.
The video below demonstrates set up process.
We're in the process of migrating this content. Check back soon.