Ecosystem

14:15-15:00, UTC+8 , January 15

Pulsar in the Lakehouse: Overview of Apache Pulsar and Delta Lake Connector

The Apache Pulsar community is collaborating with the Delta Lake community to add up to both ecosystems: Pulsar - Delta Lake Connector.
In this session, we will first provide an overview of Pulsar - Delta Lake connector and Delta Lake Standalone Reader, then introduce the design of Pulsar - Delta Lake CDC source connector including how to capture data change of Delta lake and how to recover from last checkpoint with the help of Pulsar Function state store . We will also discuss the scalability of this Pulsar - Delta lake CDC source connector.

Speaker

Ke Xie

Apache Pulsar Contributor

Software Engineer at StreamNative, focusing on the distributed storage.