Deep Dives
AI Reveals Why BI Still Matters (Hint: It’s Not Dashboards)
This blog looks at how BI evolved, how dashboards are actually used today, and what survives when AI enters the picture — starting with the foundation that was never really about dashboards in the first place, and ending with the problem nobody in the AI hype cycle wants to talk about: who maintains it all.
Building an Agent-Friendly, Local-First Analytics Stack with MotherDuck and Rill
The convergence of embedded analytics engines (DuckDB/MotherDuck), declarative BI-as-code (Rill), and AI agent protocols (MCP) is creating a new architecture for business intelligence, one where dashb
Why Coinbase and Pinterest Chose StarRocks: Lakehouse-Native Design and Fast Joins at Terabyte Scale
Discover why Coinbase and Pinterest migrated to StarRocks for sub-second analytics. Learn how its lakehouse-native design and colocated joins handle terabyte-scale data on S3.
dlt+ClickHouse+Rill: Multi-Cloud Cost Analytics, Cloud-Ready
FinOps Made Easy: A Starter Repo to Oversee Cloud Costs from Different Hyperscalers.
Multi-Cloud Cost Analytics: From Cost-Export to Parquet to Rill
Learn how to unify AWS and GCP costs with revenue data in a single dashboard. Step-by-step guide using dlt, Parquet, and Rill. Clone and run immediately.
Data Modeling for the Agentic Era: Semantics, Speed, and Stewardship
Master the three pillars of agentic data modeling: Metrics SQL for semantics, sub-second analytics for speed, and AI guardrails for trusted insights.
Data Modeling Guide for Real-Time Analytics with ClickHouse
Learn how to build sub-second real-time analytics with ClickHouse. Complete guide covering data modeling strategies, optimization techniques, and practical S3-to-dashboard examples.
Has Self-Serve BI Finally Arrived Thanks to AI?
How conversational BI and MCP deliver on two decades of promises
The Open Table Format Revolution: Why Hyperscalers Are Betting on Managed Iceberg
This blog explores the four layers of the ICE Stack, from storage to catalogs, and why managed Iceberg might represent the post-Modern Data Stack future where data independence truly matters.
What "Shifting Left" Means and Why it Matters for Data Stacks
By shifting left, data teams can create more maintainable, performant, and reliable data systems while reducing duplication and inconsistency throughout the data stack.
Scaling Beyond Postgres: How to Choose a Real-Time Analytical Database
This blog explores how real-time databases address critical analytical requirements. We highlight the differences between cloud data warehouses like Snowflake and BigQuery, legacy OLAP databases like Vertica, and a new class of real-time analytical databases like ClickHouse and StarRocks that combine elements of both of these categories. We will also examine the categories of today's analytics solutions and how to choose the right one.
Designing a Declarative Data Stack: From Theory to Practice
This blog chronicles that journey of discovery, examining the key architectural considerations and trade-offs in building a declarative data stack and its engine. We'll explore existing approaches, compare different implementation strategies, and work through practical examples
BI-as-Code and the New Era of GenBI
Imagine creating business dashboards by simply describing what you want to see. This is the promise of Generative Business Intelligence (GenBI). The key lies in the declarative BI stack where dashboards and metrics are defined as code rather than hidden behind graphical user interfaces.
The Rise of the Declarative Data Stack
The future looks more code-first. From ingestion to transformation, orchestration, and measures in dashboards—all defined declaratively. But what does this shift actually mean?
GenBI - one-click dashboards with generative AI and BI-as-code
Rill’s open-source IDE, uses AI to transform an S3 data source into an operational dashboard in one click. This creative magic is made possible by combining (1) Rill’s BI-as-code philosophy defines dashboards entirely with SQL & YAML code and (2) Large language models, like OpenAI’s GPT series, excel at code generation
Lessons Learned in Developing Interactive Time Exploration in Rill Dashboards
Time is an integral component of data analysis. Learn about how Rill is built to handle the complexity of time in modern data visualization.
Accelerating the Core Analysis Loop
Achieve flow in your work with tools that bring data insights at the speed of conversation.
Seeking the Perfect Apache Druid Rollup
Apache Druid Rollups improve storage and query performance. Keep these 8 important concepts in mind to use rollups effectively and avoid mistakes.
Fast Path to Streaming Data Analysis
This blog is a step-by-step how-to demo for streaming data into Rill’s platform for sub-second analysis. We selected aircraft telemetry data to answer time-series analytical questions.
Setting up Apache Druid on Kubernetes in under 30 minutes
Kubernetes is an orchestration engine which can run and manage containerized applications. Each application has different ways to autoscale...
How to achieve fast query speed with no DevOps maintenance
The transition from on-prem to cloud has picked up speed. While just five years ago, companies were resisting moving to the cloud due to data security concerns, the more common question now is, “How can I move to the cloud as quickly and cost efficiently as possible?”
Apache Druid and Rill: better together
Apache Druid is an open source data store designed for high performance (sub-second) OLAP queries on large (terabyte) datasets. Learn how you can experience all of the benefits of Apache Druid's high performance real-time analytics database without the maintenance.