Any LLM application has four dimensions you must carefully engineer: the code, data, models, and prompts. Each dimension influences the others, so you must learn how to track and manage each one. The trick is that every dimension has particularities that require unique strategies and tooling. That's why directly applying the SWE and DevOps principles that work for code fails for the other dimensions.
This presentation will dig into the data dimension and how it looks when building LLM applications. We will start by exploring a general framework that acts as the foundation. We will look into how the data flows, focusing on the data and feature pipelines and how the data should be stored so it can be correctly shared, versioned, processed, and analyzed for RAG, training, and inference. Next, we will zoom into how the data is accessed during LLM fine-tuning and within the inference pipeline, which can be implemented as a RAG workflow or as something more complex, such as an agent.
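To make that data flow concrete, here is a minimal, purely illustrative Python sketch; all names (`feature_pipeline`, `embed`, the in-memory `store`) are hypothetical stand-ins, not the tooling covered in the talk. The idea it shows: raw documents are chunked, embedded, and versioned once by the feature pipeline, and the same stored artifacts are then queried at inference time.

```python
from dataclasses import dataclass
import hashlib

@dataclass
class Chunk:
    doc_id: str
    text: str
    embedding: list[float]
    version: str  # ties the chunk to a dataset version for reproducibility

def embed(text: str) -> list[float]:
    # Hypothetical stand-in for a real embedding model call.
    digest = hashlib.sha256(text.encode()).digest()
    return [b / 255 for b in digest[:8]]

def feature_pipeline(doc_id: str, raw_text: str, version: str) -> list[Chunk]:
    # Clean/chunk/embed raw documents once, so the same stored
    # artifacts can serve RAG, fine-tuning, and inference.
    chunks = [raw_text[i : i + 200] for i in range(0, len(raw_text), 200)]
    return [Chunk(doc_id, c, embed(c), version) for c in chunks]

# In-memory stand-in for a vector DB / feature store.
store: dict[str, list[Chunk]] = {}
store["post-42"] = feature_pipeline("post-42", "Some raw article text...", "v1")

def retrieve(query: str, k: int = 3) -> list[Chunk]:
    # Minimal RAG-style retrieval: rank stored chunks by dot-product
    # similarity against the query embedding.
    q = embed(query)
    all_chunks = [c for chunks in store.values() for c in chunks]
    all_chunks.sort(key=lambda c: sum(a * b for a, b in zip(q, c.embedding)), reverse=True)
    return all_chunks[:k]
```

The point of the sketch is the separation of concerns: the write path (feature pipeline) and the read path (retrieval) share one versioned store, which is what makes the data shareable and reproducible across RAG, training, and inference.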
To fully understand how the framework works for building LLM applications, we will look at two concrete use cases, architecting the data layer of the following systems:
- An LLM Twin: Your Digital AI Replica (Use case used in our LLM Engineer's Handbook)
- A Second Brain AI Assistant (Use case used in our latest open-source course, freely available on Decoding ML)
We will present the specific implementation details, tooling, and problems encountered while building these two use cases, to fully understand the data-related generalities and particularities of an LLM system.
Interview:
What is the focus of your work?
I am actively working on and building LLM, RAG, and information retrieval systems.
What’s the motivation for your talk?
I want to show people a framework for designing the data layer of RAG and LLM systems using MLOps/LLMOps best practices.
Who is your talk for?
Software/ML/AI/Data engineers or data scientists.
What do you want someone to walk away with from your presentation?
How to architect the data layer of an LLM/RAG system.
What do you think is the next big disruption in software?
Workflows and agents
Speaker

Paul-Emil Iusztin
Senior ML/AI Engineer, MLOps, Founder @Decoding ML
Paul Iusztin is a senior AI/ML engineer with over seven years of experience building GenAI, Computer Vision, and MLOps solutions. He most recently worked at Metaphysic, where he was one of the core AI engineers who took large, GPU-heavy models to production. He previously worked at CoreAI, Everseen, and Continental.
He is the co-author of the LLM Engineer's Handbook, a bestseller on Amazon, which presents a hands-on framework for building LLM applications.
Paul is the Founder of Decoding ML, an educational channel on production-grade AI that provides code, posts, articles, and courses, inspiring others to build real-world AI systems. Through Decoding ML, he has collaborated with MongoDB, Comet, Qdrant, ZenML, and 11 other AI companies.
Connect with him on LinkedIn.
Subscribe to Decoding ML for weekly content on AI.