Announcing Kedro 1.0
Kedro passes a major milestone signifying its maturity as a data science framework
After six years of focused development in the open, we’re proud to announce the release of Kedro 1.0: a major milestone that marks the maturity of Kedro as a data science framework.
In June 2019, we released Kedro 0.14.0 as an open-source framework to enable data scientists, data engineers, and machine-learning engineers worldwide to create maintainable, modular, and reproducible code. In 2020 and 2021 we focused on engineering and community building, and we announced our adoption by the Linux Foundation (AI & Data) in January 2022. In March 2023, we launched the Kedro blog; Kedro’s new branding was unveiled later that year, in June 2023. Then, in December 2024, Kedro achieved Graduation status within the LF AI & Data Foundation. What a journey!
As we reach Kedro 1.0, we want to be clear. It isn’t just a version number: it’s a statement of intent.
Kedro 1.0 is about a stable, curated core that gives you confidence to build, and a modular, extensible ecosystem that empowers you to grow.
Whether you’re designing your first pipeline or scaling complex workflows across teams, Kedro provides a foundation you can trust and the flexibility to adapt it to your individual project’s requirements.
At the heart of this release is a reimagined developer experience: a revamped DataCatalog, clearer namespace management, improvements to the runners, and a polished public API designed for longevity. Surrounding that core is a flourishing ecosystem, including the long-awaited “run only missing” feature, Kedro Viz’s new run status view, and a completely redesigned documentation experience that helps you onboard faster.
Kedro 1.0 provides a solid launchpad: a stable, reliable foundation that supports teams confidently exploring today’s universe of traditional machine learning and data science. But importantly, it also keeps sight of what’s coming next: the rapidly expanding frontier of generative AI (gen AI) and new data paradigms. A true multiverse! We’re preparing to explore it beyond 1.0.
The core: solid, clear, and built to last
For this release, we focused on providing a stable core that is ready for more. Apart from dozens of bug fixes and small quality improvements, here is a list of highlights included in Kedro 1.0:
- Revamped DataCatalog: The DataCatalog is a core element of Kedro. We’ve been brewing a new DataCatalog for over a year, following a broad research workstream that surfaced several pain points from users, and we are very excited about the result. These improvements happen under the hood, and regular users shouldn’t observe any disruption. You can read more in our documentation.
- User experience improvements to namespaces: We have made significant user experience improvements to namespaces, including the option to use datasets in namespaces without having to prefix them; pipeline filtering for namespaces that returns exact matches instead of partial ones; support for running multiple namespaces within a single session; grouping by namespace; and much more!
- Improvements to runners: Runners are one of the key building blocks of Kedro, and yet we’ve known that some of them had usability issues that were preventing wider adoption. Several improvements to the runners shipped as part of Kedro 1.0, such as a new way to select the multiprocessing start method and some internal refactoring.
- Polished public API: More broadly, we have made several improvements to the public API of Kedro.
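To make the namespace filtering change listed above more concrete, here is a minimal sketch in plain Python. It does not use Kedro's API: the helper names (`matches_partial`, `matches_exact`, `filter_namespaces`) and the sample namespaces are hypothetical, and treating dot-nested namespaces as part of an "exact" match is our assumption for illustration.

```python
from typing import Callable, List, Optional


def matches_partial(node_namespace: Optional[str], target: str) -> bool:
    """Loose matching: any namespace that merely starts with the target string."""
    return node_namespace is not None and node_namespace.startswith(target)


def matches_exact(node_namespace: Optional[str], target: str) -> bool:
    """Strict matching: the namespace itself, or one nested under it via a dot.

    Assumption: nested namespaces such as "data.cleaning" count as part of "data".
    """
    if node_namespace is None:
        return False
    return node_namespace == target or node_namespace.startswith(target + ".")


def filter_namespaces(
    namespaces: List[Optional[str]],
    target: str,
    matcher: Callable[[Optional[str], str], bool],
) -> List[Optional[str]]:
    """Keep only the namespaces accepted by the given matcher."""
    return [ns for ns in namespaces if matcher(ns, target)]


# Hypothetical node namespaces; None stands for a node without a namespace.
NODE_NAMESPACES = ["data", "data.cleaning", "data_science", None]

partial = filter_namespaces(NODE_NAMESPACES, "data", matches_partial)
exact = filter_namespaces(NODE_NAMESPACES, "data", matches_exact)

print(partial)  # ['data', 'data.cleaning', 'data_science']
print(exact)    # ['data', 'data.cleaning']
```

With loose matching, asking for `data` also pulls in the unrelated `data_science` namespace. Strict matching avoids that kind of surprise.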
Technical readers interested in finding out more can head over to the Kedro blog to find further details of the changes.
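For readers unfamiliar with the term, a multiprocessing "start method" is a standard Python concept: it controls how worker processes are created (`spawn`, `fork`, or `forkserver`, depending on the platform). The sketch below uses only the standard library to show how a start method can be selected. It does not show how Kedro's runners are configured, and the `pick_context` helper is a hypothetical name introduced for illustration.

```python
import multiprocessing as mp


def pick_context(preferred: str = "spawn"):
    """Return a multiprocessing context for the preferred start method.

    Falls back to the first start method the platform offers when the
    preferred one is not available.
    """
    available = mp.get_all_start_methods()
    method = preferred if preferred in available else available[0]
    return mp.get_context(method)


# "spawn" is available on every platform Python supports.
ctx = pick_context("spawn")
print(ctx.get_start_method())  # spawn
```

A context obtained this way can be passed to standard library tools that create processes, for example `concurrent.futures.ProcessPoolExecutor(mp_context=ctx)`.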
The ecosystem: modular, extensible, and versatile
Reimagined new documentation
We are thrilled to present our fresh new documentation!
We have completely redesigned the docs to look fresher, more modern, and in line with the Kedro brand. In addition, we reshaped the documentation into a new information architecture that is flatter and easier to navigate, and introduced a table of contents on the right-hand side that makes it easier for readers to browse pages. Finally, we modernized the Kedro installation instructions to make it clearer how to use modern Python tools like uv, while still including guidance for other native Python tools (Poetry, pipx) as well as Conda.
Kedro Viz pipeline status view
We are very excited to announce one of the biggest additions we have made in a while to Kedro Viz: a view that shows the run status of your pipeline!
Another highly anticipated feature, this will allow users to see at a glance whether their pipeline ran correctly and in a reasonable time, and whether the output datasets have the expected properties.
We plan to continue expanding this feature based on feedback from the community in the coming months. Give it a try and let us know what you think!
Community-driven integration with Databricks
As always, the Kedro ecosystem is buzzing with activity well beyond the work of the core team. One such example is kedro-databricks, an extension created by Jens Peder Meldgaard that allows you to easily package your Kedro project as a Databricks Asset Bundle.
Jens has already attended one of our regular Coffee Chats as a guest to present the plugin, and we’ve collaborated closely to bring new documentation to life. We are always happy to collaborate with downstream plugin authors.
If you are using Kedro on Databricks, give this a try!
What’s next?
Of course we won’t stop here. Earlier this year we conducted a user survey to ask our community how they felt about Kedro and what ideas they had for the future. The response was excellent and more than 60 ideas were presented that we then categorized into different themes. The most popular ones were developer experience & learning curve, examples & guidance, integrations, complex projects, conditional nodes, and gen AI. But there were many more!
We are well aware that the work of data scientists is changing. On top of classical machine learning pipelines, the appearance of gen AI and agents has dramatically changed the data landscape.
Some challenges remain the same. Applying good software engineering principles to highly iterative and experimental code is still hard, and so is crafting production-ready agentic workflows.
At the same time, LLM-powered code assistants are enabling users to produce technical debt at the fastest rate in history, while simultaneously struggling to refactor unstructured codebases.
We think that Kedro is ready to tackle these challenges head on. As such, our main priorities in 2025 and beyond will be making Kedro easier to learn and use, as well as opening up new possibilities for complex I/O patterns for the gen AI era.
The humans behind Kedro
This release would not have been possible without the hard work of the Kedro team at QuantumBlack Labs, as well as the rest of the Technical Steering Committee.
We are proud to celebrate the contributions of Huong Nguyen, Sajid Alam, Laura Couto, Nok Lam Chan, Juan Luis Cano, Ankita Katiyar, Elijah Ko, Rashida Kanchwala, Merel Theisen, Elena Khaustova, Dmitry Sorokin, Ravi Kumar Pilla, Jitendra Gundaniya, and Stephanie Kaiser, as well as TSC members Ivan Danov, Yetunde Dada, Deepyaman Datta, Joel Schwarzmann, Marcin Zabłocki, Simon Brugman, and Yolan Honoré-Rougé.
Get started with Kedro 1.0 today
And that’s a wrap. You can install Kedro with your favorite Python package manager and connect with the Kedro community on Slack to ask questions or stay up to date with news. Don’t forget to subscribe to our YouTube channel to receive notifications about new video content.
Over the years, QuantumBlack, AI by McKinsey has helped organizations reinvent themselves to achieve accelerated, sustainable, and inclusive growth with AI. At QuantumBlack Labs, the R&D innovation hub for QuantumBlack, we use the learnings from the work with our clients to develop suites of tools and assets to supercharge future engagements. Kedro is an integral part of our open source ecosystem for AI that includes other projects such as Vizro, Agents at Scale, and MLRun.