Ecosystem

2:10 PM - 2:40 PM, PST , October 25

Databricks Love Pulsar: integrate Pulsar with Databricks Runtime through pulsar-spark connector

Pulsar streaming data source was introduced into Databricks recently. It supports both Scala, Python and SQL interfaces. The different trigger modes of Spark Structured Streaming also enable various use cases including data ingestion and real time streaming applications. The Databricks pulsar data source is based on and fully compatible with the open source pulsar-spark connector, which we work on closely with StreamNative engineers. It is also enriched with many Databricks functionality such as Delta Live Table and Unity Catalog, which allows users to build data pipelines and perform advanced data analytics efficiently and securely.

Speaker

Chaoqin Li

Software Engineer, Databricks