Alexis Bruneteau c9dbe70bdb Fix DVC pull to only fetch raw data
Changed dvc pull to specifically pull data/raw.dvc instead of all
outputs. The processed data and model files are generated by the
DVC pipeline (dvc repro), not pulled from remote storage.

This prevents errors about missing processed files that haven't
been generated yet.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-01 18:52:16 +02:00
2025-09-30 17:03:15 +02:00
2025-09-30 17:03:15 +02:00
2025-10-01 17:35:13 +02:00
2025-10-01 15:04:13 +02:00
2025-10-01 17:35:13 +02:00
2025-09-30 17:04:43 +02:00
2025-09-30 16:38:14 +02:00
2025-09-30 15:48:38 +02:00
2025-10-01 15:04:13 +02:00
2025-10-01 15:04:13 +02:00

MLOps Project

This is an MLOps project for CSGO data analysis and model training.

Features

  • Data pipeline with Apache Airflow
  • Model training with PyTorch and scikit-learn
  • MLflow for experiment tracking
  • DVC for data versioning
  • Monitoring with Prometheus
  • FastAPI for API serving

Setup

  1. Install dependencies:

    poetry install
    
  2. Run the data pipeline:

    airflow dags unpause csgo_data_pipeline
    

Project Structure

  • dags/: Airflow DAGs
  • src/: Source code
  • models/: Trained models
  • data/: Data files
  • notebooks/: Jupyter notebooks
  • tests/: Test files
  • config/: Configuration files
  • docker/: Docker files
  • kubernetes/: Kubernetes manifests
Description
No description provided
Readme 350 KiB
Languages
Python 73.3%
Typst 25.9%
Dockerfile 0.8%