dekun
0cce6cda7c
Fix CUDA OOM by mutually unloading Whisper and ChatTTS on 8GB GPU.
Release GPU memory before TTS/ASR switches, lower TTS token limits, and set PYTORCH_CUDA_ALLOC_CONF in PM2.
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-12 17:03:37 +08:00
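
The mutual-unloading idea in commit 0cce6cda7c can be sketched roughly as follows. This is not the code from the commit: the class name `ExclusiveModelSlot`, the loader callables, and the model keys are hypothetical, and the sketch assumes that dropping the last reference to a model followed by `torch.cuda.empty_cache()` is enough to return its memory to the GPU. PyTorch is treated as optional so the sketch runs without it.

```python
import gc
from typing import Any, Callable, Dict, Optional

try:
    import torch  # optional; the sketch still runs when PyTorch is absent
except ImportError:
    torch = None


class ExclusiveModelSlot:
    """Hypothetical helper: keep at most one model resident at a time."""

    def __init__(self, loaders: Dict[str, Callable[[], Any]]) -> None:
        self._loaders = loaders            # name -> function that builds the model
        self._name: Optional[str] = None   # name of the resident model, if any
        self._model: Any = None

    @property
    def resident(self) -> Optional[str]:
        return self._name

    def unload(self) -> None:
        # Drop the only reference, collect garbage, then ask PyTorch to
        # release cached CUDA blocks back to the driver.
        self._model = None
        self._name = None
        gc.collect()
        if torch is not None and torch.cuda.is_available():
            torch.cuda.empty_cache()

    def get(self, name: str) -> Any:
        # Switching models unloads the current one before loading the next,
        # so the two never occupy GPU memory at the same time.
        if self._name != name:
            self.unload()
            self._model = self._loaders[name]()
            self._name = name
        return self._model
```

A caller would register one loader per model, for example `ExclusiveModelSlot({"whisper": load_whisper, "chattts": load_chattts})` with hypothetical loader functions, and call `get("whisper")` before ASR and `get("chattts")` before TTS. The trade-off is a reload on every switch in exchange for a lower peak memory footprint.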
dekun
0f5277c22e
Add Whisper offline loading for air-gapped servers.
Pre-download the model via HF mirror scripts so that intranet deployments avoid Hugging Face Hub "Network is unreachable" errors.
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-12 16:11:57 +08:00
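
A minimal sketch of the offline-loading idea in commit 0f5277c22e, assuming the Whisper weights were already copied to a local directory. The function name `offline_model_path` and the directory layout are hypothetical. `HF_HUB_OFFLINE` is the environment variable that `huggingface_hub` reads to disable network access; whether the commit sets it this way is an assumption.

```python
import os
from pathlib import Path


def offline_model_path(model_dir: str) -> str:
    """Hypothetical helper: validate a pre-downloaded model directory and
    switch the Hugging Face Hub client to offline mode."""
    path = Path(model_dir).expanduser()
    if not path.is_dir() or not any(path.iterdir()):
        # Fail early with a clear message instead of a network error later.
        raise FileNotFoundError(f"no pre-downloaded model found in {path}")
    # Set this before importing any Hugging Face library, because the value
    # is read when the library is imported.
    os.environ["HF_HUB_OFFLINE"] = "1"
    return str(path)
```

The returned path would then be passed to whichever Whisper loader the project uses in place of a Hub model ID, so no lookup against the Hub is attempted.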
dekun
aacdffac77
Fix ChatTTS load: pre-download via HF mirror, avoid GitHub timeout.
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-12 15:16:27 +08:00
dekun
aea39a00ae
Support .env for server-local Ollama config to avoid git pull conflicts.
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-12 14:53:47 +08:00
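
The .env approach in commit aea39a00ae could look roughly like the sketch below, which uses only the standard library. The function names and the variable names in the usage note are hypothetical and are not taken from the repository. The sketch assumes the `.env` file is listed in `.gitignore`, which is what keeps server-local values out of `git pull` conflicts.

```python
import os
from pathlib import Path
from typing import Dict


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks, comments, and malformed lines."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Trim whitespace and one layer of surrounding quotes.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def load_env_file(path: str = ".env") -> Dict[str, str]:
    """Load a .env file if present; variables already set in the real
    environment take precedence over values from the file."""
    env_path = Path(path)
    if not env_path.is_file():
        return {}
    values = parse_env(env_path.read_text(encoding="utf-8"))
    for key, value in values.items():
        os.environ.setdefault(key, value)
    return values
```

With a server-local file containing, say, `OLLAMA_BASE_URL=http://127.0.0.1:11434` (a hypothetical variable name), the application would call `load_env_file()` at startup and read the value with `os.environ.get("OLLAMA_BASE_URL")`.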