The 2025 & 2026 Ultimate Guide to the Data Lakehouse and the Data Lakehouse Ecosystem
Looking ahead to 2026, the lakehouse is no longer just a central repository; it extends outward to power real-time analytics, agentic AI, and even edge inference.
Looking ahead to 2026, the lakehouse is no longer just a central repository; it extends outward to power real-time analytics, agentic AI, and even edge inference.
The world of data lake deletion formats might seem complex, but it’s really about solving a fundamental problem: how do you efficiently manage changing data at scale? Apache Iceberg and Delta Lake…
Dremio helps cut through this complexity. It brings together key technologies like Apache Iceberg, query federation, semantic modeling, and autonomous performance tuning—all in a single platform. In this post, we’ll explore…
Apache Iceberg provides a path to standardize data across the enterprise, bringing structure, scalability, and openness to data lakes. But standing up an Iceberg lakehouse on your own comes with…
Dremio Enterprise Catalog, powered by Apache Polaris, removes that burden by handling optimization automatically. It continuously analyzes your tables—watching for small-file buildup, oversized files, outdated partitions, and metadata sprawl—and then…
The momentum behind Apache Polaris is building quickly. By starting with the Apache Iceberg REST Catalog standard, it has inherited instant compatibility with a broad set of engines and tools….
Apache Iceberg 1.10.0 represents a turning point in the evolution of the open lakehouse. With the general availability of format-version 3, Iceberg now offers a more complete solution for organizations…
Over the last twelve months, the open lakehouse ecosystem has taken a decisive step forward. Three projects in particular, Apache Arrow, Apache Iceberg, and…
When you save data, the format you choose makes all the difference. Think about it like keeping notes: writing them in a plain text file is simple, but finding…
Learn how to leverage the integration of Superset and Dremio to visualize your data lake.