Summary

Disclaimer: This summary has been generated by AI. It is experimental, and feedback is welcomed. Please reach out to info@qconlondon.com with any comments or concerns.

Presentation Title: How to Build a Database Without a Server

Speaker: Alex Seaton, Engineer at ArcticDB, Man Group.

Overview:

The presentation discusses the transition from managing traditional databases like MongoDB to a serverless model using ArcticDB. The focus is on leveraging client-side libraries and object storage to simplify the scalability and management of large data systems.

Main Topics Covered:

Database Complexity: Managing large database installations can be costly and complex.
Serverless Architecture: Discusses a model where a database can function using only a client-side library and object storage.
ArcticDB: A solution developed by Man Group that interacts directly with object storage, reducing the need for traditional server-based databases.
Data Structures and ACID Compliance: Examines the choice of core data structures and methods for ensuring ACID properties.
Global State Management: Utilizes Conflict-free Replicated Data Types (CRDTs) to manage global state without extensive synchronization.
Object Storage Evolution: Covers how object storage technology has advanced and its implications for database systems.

Challenges and Solutions:

Discusses handling high latency in commodity storage and maintaining high read and write speeds.
Emphasizes the importance of version control and data chunking for efficient data management.
Explores potential future developments and improvements in object storage and serverless database design.

Conclusion:

The approach to databases presented offers a simplified, scalable solution that is particularly beneficial for distributed computing and large-scale data analytics, leveraging advanced features of modern object storage systems.

This is the end of the AI-generated content.

Abstract

Modern data analytics workflows rely on scaling out to huge numbers of users and compute nodes. Managing database installations to handle this scale can be unsustainably complex and expensive. Is it instead possible to get rid of all this complexity and build a database with just a client-side library and object storage?

At Man Group we have evolved over time from managing one of the largest MongoDB installations in Europe to a serverless model where users interact directly with object storage using ArcticDB, our own database engine. What we've learnt from this should be interesting to anyone interested in distributed computing, not just database development.

We will focus on topics such as:

Our choices around ACID and our core data structures
How to manage global state with lock-free techniques such as CRDTs
How we manage to work with relatively high latency commodity object storage
How object storage has evolved over time, and how advanced it is becoming

Interview:

What is the focus of your work?

I work on ArcticDB, a client-side database engine optimised for timeseries data that's been developed from scratch in Man Group and now its own business. A lot of my work on the project has been on an optional set of server side processes to manage data replication and streaming data ingestion.

What’s the motivation for your talk?

ArcticDB has an unusual serverless architecture where users use our library to interact directly with object storage, with no co-ordinating server. We've learnt a lot about distributed computing and working with object storage by building this, and I want to share some interesting techniques and design choices that we've used.

Who is your talk for?

The concepts in my talk should be interesting to anyone working on distributed computing problems, whether for database development or not. It should be particularly interesting to people who work with object storage like S3. We will discuss specific design choices we've made and why, especially around our data structures and data format, so a good audience would be senior developers and technical architects who make similar decisions in their own projects.

What do you want someone to walk away with from your presentation?

New ideas about how to make useful software in a serverless architecture and an appreciation of how powerful modern object storage technologies are.

What do you think is the next big disruption in software?

It might not be as big a disruption as AI but it will be interesting to see how columnar file formats evolve and whether a successor to Parquet as a de facto standard will emerge.

Speaker

Alex Seaton

Staff Engineer @ArcticDB, Previously Working on Quant Trading Systems @Man Group

Alex Seaton is an engineer working on ArcticDB at Man Group. ArcticDB is a high-performance dataframe database that is optimised for timeseries data, data-science workflows and scales to petabytes of data and thousands of simultaneous users. At ArcticDB his focus has been on data replication and tick streaming infrastructure. Before joining ArcticDB, Alex built trade execution and market data systems.

How to Build a Database Without a Server

Summary

Abstract

Interview:

What is the focus of your work?

What’s the motivation for your talk?

Who is your talk for?

What do you want someone to walk away with from your presentation?

What do you think is the next big disruption in software?

Speaker

Alex Seaton

Find Alex Seaton at:

Speaker

Alex Seaton

Date

Location

Track

Topics

Share

From the same track

The Way We Manage Compliance Is Wrong… And Is Changing! Bringing DevOps Principles to Controls and Audit

Thinking Out of the Sandbox - Using Product Management to De-Risk Technical Innovation

Extreme DevOps Automation

Latency: The Race to Zero...Are We There Yet?

Follow QCon

Contact

Menu

Conferences around the World