Building a Global Scale Data Platform with Cloud-Native Tools

As businesses increasingly operate in hybrid and multi-cloud environments, managing data across these complex setups presents unique challenges and opportunities. This presentation provides a comprehensive guide to building a global-scale data platform using cloud-native tools. We will explore considerations, best practices, and pitfalls for deploying and managing scalable, cost-efficient, and high-performing data architectures that leverage various data storage solutions—including files, object stores, databases, and more—in cloud-native environments. Key topics include best practices for cloud storage, compute decoupling, ensuring high availability and data redundancy, and balancing the power of cloud providers while controlling costs.

Attendees will gain insights into automating data operations, leveraging cloud-native solutions, and avoiding common pitfalls in building scalable data infrastructures.


Speaker

George Hantzaras

Engineering Director, Core Platforms @MongoDB, Open Source Ambassador, Published Author

George is a distributed systems expert and a hands-on engineering leader. He is a Director of Engineering at MongoDB, focusing on implementing cloud native technologies at enterprise scale. He is an Ambassador of the Data on Kubernetes community and the author of The Platform Engineering Playbook, by Packt. Most recently, he has been a speaker at global events like Kubecon, OpenSource Summit, Hashiconf, LeadDev, SaaStr, and more.

Read more
Find George Hantzaras at:

From the same track

Session Data Architecture

Reliable Data Flows and Scalable Platforms: Tackling Key Data Challenges

Wednesday Apr 9 / 10:35AM BST

There are a few common and mostly well-known challenges when architecting for data. For example, many data teams struggle to move data in a stable and reliable way from operational systems to analytics systems.

Speaker image - Matthias Niehoff

Matthias Niehoff

Head of Data and Data Architecture @codecentric AG, iSAQB Certified Professional for Software Architecture

Session

Achieving Precision in AI: Retrieving the Right Data Using AI Agents

Wednesday Apr 9 / 11:45AM BST

In the race to harness the power of generative AI, organizations are discovering a hidden challenge: precision.

Speaker image - Adi Polak

Adi Polak

Director, Advocacy and Developer Experience Engineering @Confluent, Author of "Scaling Machine Learning with Spark" and "High Performance Spark 2nd Edition"

Session Data Architecture

Beyond the Warehouse: Why BigQuery Alone Won’t Solve Your Data Problems

Wednesday Apr 9 / 03:55PM BST

Many organizations mistake the adoption of a data warehouse, like BigQuery, as the golden ticket to solving all their data challenges. But without a robust data strategy and architecture, you’re simply shifting chaos into the cloud.

Speaker image - Sarah Usher

Sarah Usher

Data & Backend Engineer, Community Director, Mentor

Session

The Data Backbone of LLM Systems

Wednesday Apr 9 / 02:45PM BST

Any LLM application has four dimensions you must carefully engineer: the code, data, models and prompts. Each dimension influences the other. That's why you must learn how to track and manage each. The trick is that every dimension has particularities requiring unique strategies and tooling.

Speaker image - Paul-Emil Iusztin

Paul-Emil Iusztin

Senior ML/AI Engineer, MLOps, Founder @Decoding ML