What’s your approach to versioning datasets and models in production?

Discussion
Posted by Avatar h/jessica_m_99 Mar 30, 2026

I am currently working on an ML project that is moving closer to production, and I am starting to realize how messy things can get without proper versioning.


Right now, we have multiple versions of datasets, different model iterations, and it is getting difficult to track what was used where. It also becomes confusing when we need to debug or roll back.


What’s your approach to versioning datasets and models in production? Do you use specific tools or follow a certain structure?


Would really appreciate practical workflows or setups that have worked for you.

1 COMMENTS

THE LOOP (1)

Log in to join The Loop and share your thoughts.

Log In
Avatar h/Surya Apr 1, 2026
I usually version datasets and models together using tools like MLflow or DVC, and keep a clear mapping of data → model → metrics so rollback and debugging are easy.
0 REPLY