Chapter 02: ML Lifecycle & Pipelineπ
"An ML system is not just a model β it's data + code + infrastructure + people."
2.1 The Machine Learning Lifecycleπ
The ML lifecycle has 8 stages that repeat in a continuous loop:
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β ML LIFECYCLE (Circular Flow) β
β β
β 1. Business 2. Data 3. Data 4. Model β
β Understanding Collection βββΆ Preparation βββΆ Development β
β β β β
β β (Feedback Loop) β β
β β βΌ β
β 8. Monitor βββ 7. Deploy βββ 6. Package βββ 5. Evaluation β
β β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
2.2 Stage-by-Stage Breakdownπ
Stage 1: Business Understandingπ
- Define the problem (classification? regression? anomaly detection?)
- Identify success metrics (accuracy, F1, RMSE, ROI)
- Determine data availability
Stage 2: Data Collectionπ
Sources:
ββββββββββββββ ββββββββββββββββ ββββββββββββββ
β Databases β β External APIs β β CSV/Files β
βββββββ¬βββββββ ββββββββ¬ββββββββ βββββββ¬βββββββ
βββββββββββββββββββΌβββββββββββββββββββ
β
βββββββΌβββββββ
β Data Lake β
β (GCS/S3) β
ββββββββββββββ
Stage 3: Data Preparationπ
Tasks:
- Cleaning (handle nulls, duplicates, outliers)
- Feature Engineering (create meaningful features)
- Splitting (train / validation / test)
- Versioning with DVC
# Example: DVC data pipeline
dvc run -n preprocess \
-d data/raw/dataset.csv \
-o data/processed/clean.csv \
python src/preprocess.py
Stage 4: Model Developmentπ
Raw Features
β
βΌ
ββββββββββββ ββββββββββββ ββββββββββββ
β Model β β Model β β Model β
β A β β B β β C β
β(RF/XGB) β β (SVM) β β (Neural) β
ββββββ¬ββββββ ββββββ¬ββββββ ββββββ¬ββββββ
βββββββββββββββββββΌββββββββββββββββββ
β
ββββββΌββββββ
β Best β
β Model β β
ββββββββββββ
Stage 5: Model Evaluationπ
Key metrics tracked:
| Task Type | Metrics |
|---|---|
| Classification | Accuracy, Precision, Recall, F1, AUC-ROC |
| Regression | MAE, MSE, RMSE, RΒ² |
| Clustering | Silhouette Score, Davies-Bouldin |
Stage 6: Model Packagingπ
Model artifacts β Docker Image β Container Registry
β
βββ model.pkl / model.pt / model.h5
βββ preprocessor.pkl
βββ requirements.txt
βββ inference.py (FastAPI/Flask server)
Stage 7: Deploymentπ
Model Package
β
βΌ
ββββββββββββββββββββββββββββββββββββ
β Deployment Targets β
β β
β π REST API (Flask/FastAPI) β
β π¦ Docker Container β
β βΈοΈ Kubernetes Cluster (GKE) β
β βοΈ Cloud Functions β
β π Batch Scoring β
ββββββββββββββββββββββββββββββββββββ
Stage 8: Monitoringπ
What to monitor:
ββββββββββββββββββββββββββββββββββββββββββ
β MONITORING DIMENSIONS β
β β
β π Model Performance β
β βββ Accuracy drift over time β
β β
β π Data Quality β
β βββ Input feature distribution β
β β
β βοΈ Infrastructure β
β βββ Latency, Memory, CPU β
β β
β πΌ Business KPIs β
β βββ Revenue impact, conversions β
ββββββββββββββββββββββββββββββββββββββββββ
2.3 ML Pipeline vs ML Systemπ
ML PIPELINE (what you build):
ββββββββ βββββββββββ βββββββββ ββββββββββ
β Data ββββΆβ Feature ββββΆβ Train ββββΆβ Serve β
β Prep β β Eng β β Model β β Model β
ββββββββ βββββββββββ βββββββββ ββββββββββ
ML SYSTEM (what runs in production):
ML Pipeline + Configuration + Data Collection +
Feature Platform + Model Registry + Serving System +
Monitoring + CI/CD infrastructure
2.4 Key Artifacts in the Pipelineπ
| Artifact | What It Is | How It's Tracked |
|---|---|---|
| Raw Data | Original collected data | DVC + GCS |
| Processed Data | Cleaned, engineered features | DVC |
| Model Weights | Trained model parameters | MLflow + DVC |
| Hyperparameters | Config used for training | MLflow |
| Metrics | Evaluation scores | MLflow |
| Docker Image | Packaged model server | GCR (Container Registry) |
2.5 Feature Store Conceptπ
A Feature Store is a centralized repo for ML features β ensuring features used in training match features used at inference (training-serving skew prevention).
βββββββββββββββ
Batch Data ββββββΆβ ββββββ Streaming Data
β FEATURE β
Training βββββββββ STORE ββββββββΆ Online Serving
β (Feast/ β
Historical βββββββ Tecton) ββββββββΆ Batch Scoring
βββββββββββββββ
Next Chapter β 03: Git & GitHub