Replies: 3 comments 2 replies
-
First, I would point everyone towards the Principles of Kedro in the docs, which articulate the team's wider philosophy when building the framework; this applies to our thinking on how users should approach building out pipelines too. From a personal perspective, I've been using what eventually became the Kedro open source project for about 4 years and have a few personal views about what best practice could look like:
-
Well, I'm a newcomer trying out Kedro these days, so I don't have any best practices to share. However, I do have a question regarding best practices. Are people just using raw strings when defining inputs/outputs of nodes in a large pipeline? I quickly end up with quite a number of these, which easily becomes a mess. And does anyone know of any good open source projects based on Kedro that showcase some of the best practices? It would be great to be able to go through and see how they organize large pipelines, enforce naming schemas, run pipeline tests, etc.
-
One of the best-practice examples I liked the most is described in the kedro-mlflow plugin. Basically, it uses tags to reuse nodes. For example, when defining an ML pipeline, you can use the tags training and serving to mark which nodes are used in each step (or in both steps).
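To make the tagging idea concrete, here is a minimal, library-free sketch. It does not use Kedro or kedro-mlflow: `TaggedNode`, `select_by_tags` and the three node functions are hypothetical stand-ins that only model the idea. In Kedro itself, the corresponding pieces are the `tags` argument of `node(...)` and `Pipeline.only_nodes_with_tags(...)`.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, List


@dataclass(frozen=True)
class TaggedNode:
    """Hypothetical stand-in for a pipeline node: a function plus its tags."""
    name: str
    func: Callable
    tags: FrozenSet[str]


def select_by_tags(nodes: List[TaggedNode], *tags: str) -> List[TaggedNode]:
    """Keep the nodes that carry at least one of the requested tags."""
    wanted = set(tags)
    return [n for n in nodes if n.tags & wanted]


# Illustrative node functions (assumptions, not taken from the thread).
def preprocess(values):
    return [v / 10 for v in values]


def fit_model(features):
    return sum(features) / len(features)


def predict(features, model):
    return [f > model for f in features]


ALL_NODES = [
    # Shared by both steps, so it is defined once and tagged twice.
    TaggedNode("preprocess", preprocess, frozenset({"training", "serving"})),
    TaggedNode("fit_model", fit_model, frozenset({"training"})),
    TaggedNode("predict", predict, frozenset({"serving"})),
]

training_names = [n.name for n in select_by_tags(ALL_NODES, "training")]
serving_names = [n.name for n in select_by_tags(ALL_NODES, "serving")]
```

The point of the pattern is that `preprocess` is written once and picked up by both the training and the serving selection, rather than being duplicated in two separate pipelines.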
-
A good question was asked on the Discord server about what best-practice Kedro looks like. This is something that will evolve and cannot and should not be designed up front.
However, we are now at a point where there are thousands of Kedro practitioners and power users out there in the wild. At the time of writing this post, Kedro has been open source for ~2 years, and we average about 8,000 users visiting the Kedro documentation per month, with only 10% of that traffic coming from internal users.
So, let's kick off a discussion where we share our own ideas about best practice when using Kedro! 🚀