Building an AI Ready Global Scale Data Platform

Abstract

As organizations move from single-cloud setups to hybrid and multi-cloud strategies, they are under pressure to build data platforms that are both globally available and AI-ready. This talk walks through how to design and operate a global-scale data platform that spans regions and providers, supports multiple storage paradigms (files, object stores, NoSQL, relational), and exposes a clean experience to application teams. We’ll look at how to decouple storage, compute, and AI workloads so analytics, vector search, and LLM inference can run efficiently on shared datasets without creating a new kind of vendor lock-in. Along the way, we’ll cover patterns for embeddings pipelines and vector indexes, approaches for reliability and disaster recovery across regions and failure domains, and cost-management strategies that account for data gravity and GPU-heavy AI workloads. You’ll leave with concrete patterns, trade-offs, and pitfalls to avoid when taking real, messy, business-critical data platforms into an AI-centric, multi-cloud world.


Speaker

George Peter Hantzaras

Engineering Director, Core Platforms @MongoDB, Open Source Ambassador, Published Author

George is a distributed systems expert and a hands-on engineering leader. He is a Director of Engineering at MongoDB, focusing on implementing cloud native technologies at enterprise scale. He is an Ambassador of the Data on Kubernetes community and the author of The Platform Engineering Playbook, by Packt. Most recently, he has been a speaker at global events like Kubecon, OpenSource Summit, Hashiconf, LeadDev, SaaStr, and more.

Read more
Find George Peter Hantzaras at:

From the same track

Session agentic coding

The Right 300 Tokens Beat 100k Noisy Ones: The Architecture of Context Engineering

Wednesday Mar 18 / 10:35AM GMT

Your agent has 100k tokens of context. It still forgets what you told it two messages ago.

Speaker image - Patrick Debois

Patrick Debois

AI Product Engineer @Tessl, Co-Author of the "DevOps Handbook", Content Curator at AI Native Developer Community

Speaker image - Baruch Sadogursky

Baruch Sadogursky

DevRel Team and Context Engineering Management @Tessl AI, Co-Author of #LiquidSoftware and #DevOps Tools for #Java Developers, Java Champion, Microsoft MVP

Session

Explicit Semantics for AI Applications: Ontologies in Practice

Wednesday Mar 18 / 03:55PM GMT

Modern AI applications struggle not because of a lack of models, but because meaning is implicit, fragmented, and brittle. In this talk, we’ll explore how making semantics explicit (using ontologies and knowledge graphs) changes how we design, build, and operate AI systems.

Speaker image - Jesus Barrasa

Jesus Barrasa

Field CTO for AI @Neo4j

Session

Your Agent Sandbox Doesn't Know My Authz Model: A Standard-Shaped Hole

Wednesday Mar 18 / 02:45PM GMT

Sandboxes are the first line of defence for agentic systems: restrict the bash commands, filter the URLs, lock down the filesystem. But sandboxes operate on the syntax of requests, not the semantics of your authorization model.

Speaker image - Paul Carleton

Paul Carleton

Member of Technical Staff @Anthropic, Core Maintainer of MCP

Session

Beyond Benchmarks: How Evaluations Ensure Safety at Scale in LLM Applications

Wednesday Mar 18 / 11:45AM GMT

As LLM systems move from prototypes to production, the gap between benchmark performance and real-world reliability becomes impossible to ignore. Models that score well on benchmarks can still fail unpredictably when facing the complexity, ambiguity, and edge cases of real users.

Speaker image - Clara Matos

Clara Matos

Director of Applied AI @Sword Health, Focused on Building and Scaling Machine Learning Systems