Event Description
The convergence of Data Lakes and Databases has become an inevitable trend in modern data infrastructure. While Data Lakes provide open, cost-efficient, and scalable storage, Databases excel at query performance, concurrency, and usability. The key challenge is how to combine the strengths of both worlds to achieve flexible yet high-performance real-time queries, a problem that is attracting growing industry attention.
In this talk, we will take a deep dive into how Apache Doris supports real-time queries on Iceberg tables, covering:
- The background and industry trend of Data Lake–Database convergence
- How Apache Doris integrates with Iceberg at the architectural level
- The core techniques behind Doris’s real-time query engine, including vectorized execution, smart caching, and materialized views
- Typical real-world use cases and performance benchmarks of Doris on Iceberg
- A forward-looking perspective on the evolution of the Data Lakehouse ecosystem
By the end of this session, you will understand how Apache Doris empowers organizations to run low-latency, high-concurrency analytics directly on Data Lakes, and gain insights into the latest trends in data architecture.
Speakers

Rayner (Mingyu) Chen
Apache Doris PMC Chair
Rayner (Mingyu) Chen is the Apache Doris PMC Chair and Vice President of Technology at VeloDB. Over his years of contributing to Apache Doris, he has fostered the project's development and community growth while helping to advance its technical innovation.