What makes `mlflow run .` reproducible across a laptop and a CI runner?

The MLproject declares the locked environment, so both build identical dependencies

Reproducible Training in CI/CD — Semantic Web Academy

The project is the CI contract

Level 2 gave you MLproject + a locked environment. The payoff is CI: a pull request can train and evaluate the model in a clean runner, log everything to a remote tracking server, and gate the merge on a metric threshold. No more 'works on my laptop'.

The pattern has three moving parts:

mlflow run . -P data=s3://... — the entrypoint the runner calls.
A remote MLFLOW_TRACKING_URI so CI runs are visible to everyone.
A post-train check that fails the job if the new metric regresses against the current Production model.

Reproducible Training in CI/CD

Gate merges on model quality

The project is the CI contract

Analogy

Reflect