6 Commits

Author SHA1 Message Date
Alexis Bruneteau
8dc524af22 Fix Poetry cache path for proper dependency caching
Changed cache configuration:
- Moved Install Poetry step before cache setup
- Updated cache path to ~/.cache/pypoetry/virtualenvs (actual venv location)
- Removed **/poetry.lock wildcard in favor of direct poetry.lock reference
- This ensures the virtualenv itself is cached, not just metadata

This should significantly speed up CI/CD runs by reusing installed packages.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-01 18:53:40 +02:00
Alexis Bruneteau
c9dbe70bdb Fix DVC pull to only fetch raw data
Changed dvc pull to specifically pull data/raw.dvc instead of all
outputs. The processed data and model files are generated by the
DVC pipeline (dvc repro), not pulled from remote storage.

This prevents errors about missing processed files that haven't
been generated yet.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-01 18:52:16 +02:00
Alexis Bruneteau
662d1a3b8f Configure DVC credentials explicitly in CI/CD pipeline
DVC needs credentials to be configured via 'dvc remote modify' command
rather than just environment variables. This fixes 403 Forbidden errors
when accessing MinIO/S3 storage.

Changes:
- Added dvc remote modify commands to set access_key_id and secret_access_key
- Applied to both pull and push operations in test and train jobs
- Added .dvc/config.local to .gitignore to prevent credential leaks

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-01 18:45:29 +02:00
Alexis Bruneteau
3cb1b23669 Add DVC S3 credentials to CI/CD pipeline
Configure DVC to use AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY
from Gitea secrets (DVC_ID and DVC_PASSWORD) for MinIO/S3 access.

Changes:
- Added DVC credentials to all DVC operations (pull/push)
- Changed poetry install to use --no-root flag for faster installs
- Credentials applied to both test and train jobs

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-01 17:52:21 +02:00
Alexis Bruneteau
d61fad678c Add dependency caching to CI/CD pipeline
Added actions/cache@v3 to cache Poetry and pip dependencies across
workflow runs. This significantly speeds up CI/CD by avoiding
full reinstallation when poetry.lock hasn't changed.

Cache strategy:
- Cache key based on OS and poetry.lock hash
- Caches ~/.cache/pypoetry and ~/.cache/pip
- Falls back to OS-specific cache if exact match not found

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-01 17:45:18 +02:00
Alexis Bruneteau
af9b700a5b secrets and mlflow should now work 2025-10-01 17:35:13 +02:00