With the latest advancements in Natural Language Processing and Large Language Models (LLMs), and big companies like OpenAI dominating the space, many people wonder: Are we heading further into a black box era with larger and larger models, obscured behind APIs controlled by big tech monopolies?
I don’t think so, and in this talk, I’ll show you why. I’ll dive deeper into the open-source model ecosystem, some common misconceptions about use cases for LLMs in industry, practical real-world examples, and how basic principles of software development and best practices such as modularity, testability and flexibility still apply. LLMs are a great new tool in our toolkits, but the end goal remains to create a system that does what you want it to do. Explicit is still better than implicit, and composable building blocks still beat huge black boxes.
Interview:
What's the focus of your work these days?
I'm the co-founder and CEO of Explosion and a core developer of spaCy, a popular open-source library for Natural Language Processing in Python, and Prodigy, a modern annotation and data development tool for machine learning.
The majority of my time is spent on making it easier for developers of all kinds of different backgrounds to use the latest developments in Natural Language Processing and helping teams develop high-quality datasets efficiently and build modular and transparent natural language understanding applications.
What's the motivation for your talk at QCon London 2024?
There's a lot of talk about Large Language Models and I believe some people are wrong about how exactly they will transform the way we build AI systems. As ideas develop, we’re seeing more and more ways to use compute efficiently, producing AI systems that are cheaper to run and easier to control.
I want to share some practical approaches that you can apply today. If you’re trying to build a system that does a particular thing, you don’t need to transform your request into arbitrary language and call into the largest model that understands arbitrary language the best. The people developing those models are telling that story, but the rest of us aren’t obliged to believe them.
How would you describe your main persona and target audience for this session?
ML / NLP Engineers
Is there anything specific that you'd like people to walk away with after watching your session?
The future won't just consist of larger and larger black-box models by big tech companies and you don't need to spend tons of money to join in. Best practices from software development still apply and there's so much you can do right now to build better and transparent AI systems.
Speaker
Ines Montani
Co-Founder & CEO @Explosion, Core Developer of spaCy
Ines Montani is a developer specializing in tools for AI and NLP technology. She’s the co-founder and CEO of Explosion and a core developer of spaCy, a popular open-source library for Natural Language Processing in Python, and Prodigy, a modern annotation tool for creating training data for machine learning models.