sortifal/MLOps

Go to file

Alexis Bruneteau 6995102d76 Remove map_wins features - they contain match outcome data

The map_wins_1 and map_wins_2 columns represent maps won DURING
the current match, not historical performance. This is data leakage
as these values are only known during/after the match.

Now using only truly pre-match features:
- rank_1, rank_2: Team rankings before match
- starting_ct: Which team starts CT side
- rank_diff: Derived ranking difference

This should finally give realistic model performance based solely
on information available before the match begins.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-10-01 20:17:07 +02:00

setup dvc

2025-09-30 17:03:15 +02:00

.gitea/workflows

Fix Poetry cache path for proper dependency caching

2025-10-01 18:53:40 +02:00

Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform

2025-09-30 16:14:56 +02:00

Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform

2025-09-30 16:14:56 +02:00

setup dvc

2025-09-30 17:03:15 +02:00

Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform

2025-09-30 16:14:56 +02:00

secrets and mlflow should now work

2025-10-01 17:35:13 +02:00

maybe maybe not

2025-10-01 15:04:13 +02:00

Remove map_wins features - they contain match outcome data

2025-10-01 20:17:07 +02:00

train fix

2025-09-30 17:04:43 +02:00

.coverage

test

2025-09-30 16:38:14 +02:00

.dvcignore

Initialize DVC

2025-09-30 15:48:38 +02:00

.gitignore

Configure DVC credentials explicitly in CI/CD pipeline

2025-10-01 18:45:29 +02:00

docker-compose.yml

Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform

2025-09-30 16:14:56 +02:00

dvc.yaml

maybe maybe not

2025-10-01 15:04:13 +02:00

params.yaml

maybe maybe not

2025-10-01 15:04:13 +02:00

poetry.lock

Add dvc-s3 dependency for S3/MinIO storage support

2025-10-01 17:44:24 +02:00

pyproject.toml

Add dvc-s3 dependency for S3/MinIO storage support

2025-10-01 17:44:24 +02:00

README.md

Add Prometheus client dependency and update README with project details

2025-09-30 16:23:29 +02:00

README.md

MLOps Project

This is an MLOps project for CSGO data analysis and model training.

Features

Data pipeline with Apache Airflow
Model training with PyTorch and scikit-learn
MLflow for experiment tracking
DVC for data versioning
Monitoring with Prometheus
FastAPI for API serving

Setup

Install dependencies:
```
poetry install
```

Run the data pipeline:

airflow dags unpause csgo_data_pipeline

Project Structure

dags/: Airflow DAGs
src/: Source code
models/: Trained models
data/: Data files
notebooks/: Jupyter notebooks
tests/: Test files
config/: Configuration files
docker/: Docker files
kubernetes/: Kubernetes manifests