sortifal/MLOps


Alexis Bruneteau efaf5ff0e1 Fix critical data leakage in feature engineering

Removed features that contain match outcome information:
- result_1, result_2 (actual match scores - only known after match)
- ct_1, t_2, t_1, ct_2 (rounds won per side - only known after match)
- total_rounds, round_diff (derived from results)

These features caused perfect 1.0 accuracy because the model was
essentially "cheating" by knowing the match outcome.

Now using only pre-match information:
- Team rankings (rank_1, rank_2)
- Historical map performance (map_wins_1, map_wins_2)
- Starting side (starting_ct)
- Derived: rank_diff, map_wins_diff

This will give realistic model performance based on what would
actually be known before a match starts.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-10-01 20:01:46 +02:00
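
The feature selection described in this commit can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the column names come from the commit message, while `build_features`, the dict-based row format, and the definition of the derived features as simple "team 1 minus team 2" differences are assumptions made for the example.

```python
# Columns the commit message identifies as leaking the match outcome.
LEAKY_COLUMNS = {
    "result_1", "result_2",          # final scores
    "ct_1", "t_2", "t_1", "ct_2",    # rounds won per side
    "total_rounds", "round_diff",    # derived from the results
}

# Columns the commit message identifies as known before the match.
PRE_MATCH_COLUMNS = ["rank_1", "rank_2", "map_wins_1", "map_wins_2", "starting_ct"]


def build_features(match: dict) -> dict:
    """Return pre-match features for one match row (hypothetical helper)."""
    # Copy only the pre-match columns; outcome columns are never read.
    features = {name: match[name] for name in PRE_MATCH_COLUMNS}

    # Assumption: derived features are plain differences, team 1 minus team 2.
    features["rank_diff"] = match["rank_1"] - match["rank_2"]
    features["map_wins_diff"] = match["map_wins_1"] - match["map_wins_2"]

    # Guard against a leaky column being added back by mistake.
    leaked = LEAKY_COLUMNS.intersection(features)
    if leaked:
        raise ValueError(f"leaky features present: {sorted(leaked)}")
    return features


# Illustrative row with made-up values; result_1/result_2 are ignored.
row = {
    "rank_1": 3, "rank_2": 10,
    "map_wins_1": 12, "map_wins_2": 7,
    "starting_ct": 1,
    "result_1": 16, "result_2": 9,
}
features = build_features(row)
```

Selecting columns from an explicit allow-list, rather than dropping a deny-list, means a newly added outcome column cannot reach the model unnoticed.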

| Path | Last commit | Date |
| --- | --- | --- |
| | setup dvc | 2025-09-30 17:03:15 +02:00 |
| .gitea/workflows | Fix Poetry cache path for proper dependency caching | 2025-10-01 18:53:40 +02:00 |
| | Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform | 2025-09-30 16:14:56 +02:00 |
| | Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform | 2025-09-30 16:14:56 +02:00 |
| | setup dvc | 2025-09-30 17:03:15 +02:00 |
| | Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform | 2025-09-30 16:14:56 +02:00 |
| | secrets and mlflow should now work | 2025-10-01 17:35:13 +02:00 |
| | maybe maybe not | 2025-10-01 15:04:13 +02:00 |
| | Fix critical data leakage in feature engineering | 2025-10-01 20:01:46 +02:00 |
| | train fix | 2025-09-30 17:04:43 +02:00 |
| .coverage | test | 2025-09-30 16:38:14 +02:00 |
| .dvcignore | Initialize DVC | 2025-09-30 15:48:38 +02:00 |
| .gitignore | Configure DVC credentials explicitly in CI/CD pipeline | 2025-10-01 18:45:29 +02:00 |
| docker-compose.yml | Add CI/CD pipeline, monitoring, and model training components for CS:GO MLOps platform | 2025-09-30 16:14:56 +02:00 |
| dvc.yaml | maybe maybe not | 2025-10-01 15:04:13 +02:00 |
| params.yaml | maybe maybe not | 2025-10-01 15:04:13 +02:00 |
| poetry.lock | Add dvc-s3 dependency for S3/MinIO storage support | 2025-10-01 17:44:24 +02:00 |
| pyproject.toml | Add dvc-s3 dependency for S3/MinIO storage support | 2025-10-01 17:44:24 +02:00 |
| README.md | Add Prometheus client dependency and update README with project details | 2025-09-30 16:23:29 +02:00 |

README.md

# MLOps Project

An MLOps project for CS:GO data analysis and model training.

## Features

- Data pipeline with Apache Airflow
- Model training with PyTorch and scikit-learn
- MLflow for experiment tracking
- DVC for data versioning
- Monitoring with Prometheus
- FastAPI for API serving

## Setup

Install dependencies:
```
poetry install
```

Run the data pipeline:

airflow dags unpause csgo_data_pipeline

## Project Structure

- `dags/`: Airflow DAGs
- `src/`: Source code
- `models/`: Trained models
- `data/`: Data files
- `notebooks/`: Jupyter notebooks
- `tests/`: Test files
- `config/`: Configuration files
- `docker/`: Docker files
- `kubernetes/`: Kubernetes manifests