Ecosystem

12:20 PM - 1:00 PM PDT, June 18

Feature Stores: Building Machine Learning Infrastructure on Apache Pulsar

Input features are the building blocks of machine learning models. You cannot have a great model without great features. Using Apache Pulsar's infinite retention of events, we built infrastructure to serve features in production and to generate training datasets. It allowed our machine learning teams to change, test, and deploy personalization features rapidly for tens of millions of end users.

This talk will discuss:
- What event-sourcing is and why it's so powerful for machine learning infrastructure.
- How we built the StreamSQL feature store on top of Pulsar, Flink, and Cassandra.
- How a feature store accelerates ML development.
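
As a rough illustration of the event-sourcing idea behind the first point, the sketch below treats a feature as a pure function of an immutable event log and a point in time. It is a minimal sketch, not the StreamSQL implementation: the `Event` type, `click_count`, `training_rows`, and the sample log are all hypothetical names made up for this example, and an in-memory list stands in for a retained Pulsar topic.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class Event:
    """Hypothetical immutable event; stands in for a message on a retained topic."""
    user_id: str
    kind: str        # e.g. "click" or "purchase"
    timestamp: int   # seconds since an arbitrary epoch


def click_count(events: Iterable[Event], user_id: str, as_of: int) -> int:
    """Feature: number of clicks by `user_id` at or before `as_of`."""
    return sum(
        1
        for e in events
        if e.user_id == user_id and e.kind == "click" and e.timestamp <= as_of
    )


def training_rows(
    events: List[Event], labels: List[Tuple[str, int, int]]
) -> List[Tuple[int, int]]:
    """Build (feature, label) rows; each label is (user_id, label_time, label).

    The feature is computed as of the label's timestamp, so events that
    happened after the label cannot leak into the training data.
    """
    return [(click_count(events, user, t), y) for user, t, y in labels]


# Hypothetical event log, ordered by time.
LOG = [
    Event("u1", "click", 10),
    Event("u1", "click", 20),
    Event("u2", "click", 25),
    Event("u1", "purchase", 30),
    Event("u1", "click", 40),
]

# Serving: evaluate the feature as of "now" (time 50).
serving_value = click_count(LOG, "u1", as_of=50)

# Training: evaluate the same feature as of each historical label time.
rows = training_rows(LOG, [("u1", 30, 1), ("u2", 20, 0)])
```

Because the log is never mutated, the same feature definition can be replayed over history to produce training rows and evaluated at the current time for serving, and changing a feature only means changing the function and replaying the log.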

Speaker

Simba Khadder

Founder & CEO of StreamSQL.io

Simba Khadder is a product leader with a strong engineering background. He was previously a software engineer at Google, where he worked on Cloud Datastore and Search. At StreamSQL, he leads a team delivering a next-generation datastore built on event-sourcing. He is a published astrophysicist for his work on finding Planet 9, and he ran the SF marathon in basketball shoes.