
D2 Product Preview: The Multimodal Shift - Rethinking Data Infra for the Age of Vision, A/V & LLMs
As AI applications increasingly combine vision, audio, video, and LLMs, teams are drowning in fragmented infrastructure. A typical production AI workflow requires stitching together 5+ services: cloud storage for raw files, a separate metadata database, an orchestrator like Airflow, embedding services, vector databases for similarity search, and hundreds of lines of glue code to keep everything in sync. Every time a model improves or business requirements change, you're rewriting pipelines and re-processing entire datasets. This isn't building AI - it's maintaining infrastructure.
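To make the "glue code" concrete, here is a sketch of what a single ingest step often looks like in that fragmented setup. The bucket name, table schema, DAG id, endpoints, and credentials are all illustrative, and this is only the happy path, with no retries, backfills, or consistency handling:

```python
# Typical "glue" for ingesting one video in a fragmented stack (happy path only).
# Bucket, schema, DAG id, and endpoints are illustrative placeholders.
import boto3
import psycopg2
import requests

def ingest_video(local_path: str, key: str) -> None:
    # 1. Raw file goes to object storage.
    boto3.client("s3").upload_file(local_path, "raw-videos", key)

    # 2. Metadata row goes to a separate relational database.
    conn = psycopg2.connect("dbname=media user=etl")
    with conn, conn.cursor() as cur:
        cur.execute(
            "INSERT INTO videos (s3_key, status) VALUES (%s, %s)",
            (key, "uploaded"),
        )
    conn.close()

    # 3. An orchestrator run is triggered to extract audio, transcribe,
    #    embed, and upsert into the vector database (Airflow REST API).
    requests.post(
        "http://airflow:8080/api/v1/dags/video_pipeline/dagRuns",
        json={"conf": {"s3_key": key}},
        auth=("airflow", "airflow"),
        timeout=10,
    ).raise_for_status()
```

Each of these systems can drift out of sync with the others, and a model upgrade usually means editing the DAG and reprocessing everything it has already produced.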
In this session, you'll see how a declarative approach to AI data infrastructure eliminates the glue code and maintenance burden. Instead of writing pipelines, you define transformations as computed columns and tables.
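To make the computed-column idea concrete, here is a toy, self-contained sketch (not any product's actual API): a column is declared once as a function of other columns, and the table evaluates it for every existing and future row, so there is no separate pipeline to maintain.

```python
# Toy illustration of the computed-column idea (not a real product API).
class Table:
    def __init__(self, columns):
        self.columns = list(columns)   # base column names
        self.computed = {}             # name -> (fn, input column names)
        self.rows = []                 # list of dicts

    def add_computed_column(self, name, fn, inputs):
        """Declare `name` as fn(*inputs); backfill all existing rows."""
        self.computed[name] = (fn, inputs)
        for row in self.rows:
            row[name] = fn(*(row[c] for c in inputs))

    def insert(self, **values):
        """Insert a row; every computed column is evaluated automatically."""
        row = dict(values)
        for name, (fn, inputs) in self.computed.items():
            row[name] = fn(*(row[c] for c in inputs))
        self.rows.append(row)

# Stub transformations standing in for real audio extraction and ASR.
def extract_audio(video_path):
    return video_path.replace(".mp4", ".wav")

def transcribe(audio_path):
    return f"<transcript of {audio_path}>"

# Declare the transformations once; every insert picks them up automatically.
videos = Table(columns=["video_path"])
videos.add_computed_column("audio_path", extract_audio, ["video_path"])
videos.add_computed_column("transcript", transcribe, ["audio_path"])
videos.insert(video_path="talk.mp4")
print(videos.rows)
```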
We'll then walk through a real multimodal workflow: starting with raw video, extracting the audio track, transcribing it with Whisper, generating embeddings with CLIP, and building semantic search on top. Finally, we'll show what happens when you update upstream data or add new transformations.
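For reference, here is roughly what that same workflow looks like as plain imperative Python using common open-source tools (ffmpeg, openai-whisper, and the CLIP model shipped with sentence-transformers). File names, model choices, and the example query are illustrative:

```python
# Imperative version of the demo workflow using common open-source tools.
import subprocess
from pathlib import Path

import numpy as np
import whisper                                          # pip install openai-whisper
from PIL import Image
from sentence_transformers import SentenceTransformer   # pip install sentence-transformers

# 1. Extract the audio track and one frame per second from the raw video.
Path("frames").mkdir(exist_ok=True)
subprocess.run(["ffmpeg", "-y", "-i", "talk.mp4", "-vn", "-ar", "16000", "-ac", "1", "talk.wav"], check=True)
subprocess.run(["ffmpeg", "-y", "-i", "talk.mp4", "-vf", "fps=1", "frames/%05d.jpg"], check=True)

# 2. Transcribe the audio with Whisper.
asr = whisper.load_model("base")
transcript = asr.transcribe("talk.wav")["text"]

# 3. Generate CLIP embeddings for each extracted frame.
clip = SentenceTransformer("clip-ViT-B-32")
frames = sorted(Path("frames").glob("*.jpg"))
frame_vecs = clip.encode([Image.open(f) for f in frames], normalize_embeddings=True)

# 4. Semantic search: embed a text query and rank frames by cosine similarity.
query = clip.encode(["a slide with an architecture diagram"], normalize_embeddings=True)[0]
scores = frame_vecs @ query
best = int(np.argmax(scores))
print(transcript[:200])
print(f"best match: {frames[best].name} at ~{best}s (score {scores[best]:.3f})")
```

Every step above is also a step that has to be re-run by hand when the source video changes or a better model comes along, which is exactly the contrast the session draws with the declarative version.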
This session is most relevant for ML Engineers, AI App Developers, and technical leaders who are:
- Building production AI applications that combine images, video, audio, or documents with LLMs
- Currently managing multiple services (S3 + Postgres + Airflow + Vector DB + custom scripts)
- Hitting scaling pain points: slow iteration, complex model updates, or lack of reproducibility
- Looking to reduce infrastructure complexity without sacrificing capability