Game Recommender - MLOps Architecture

1. Data Ingestion & Versioning

Raw Data (CSV Files)

User interaction and game metadata are stored as raw CSV files.

↓

DVC (Data Version Control)

DVC tracks large data files, ensuring reproducibility without committing them to Git.

↓

Google Cloud Storage (Remote)

GCS acts as the central remote storage for our DVC-tracked data.

2. Jenkins Automation Pipeline

Trigger (Git Push to `main`)

The entire pipeline is automatically triggered by a code change in the main branch.

DVC Pull

→

Data Preprocessing

→

Model Training

→

Docker Build

→

Push to GCR

→

Deploy to GKE

3. Application Deployment & Serving

Google Container Registry (GCR)

Stores and manages our application's Docker images.

↓

Google Kubernetes Engine (GKE)

Orchestrates and runs our application containers, handling scaling and availability.

↓

Flask Web App (UI)

The final user-facing application that serves game recommendations.

MLOps Pipeline: Game Recommender System