feat(ai-runtime): complete ai runtime policy refactor (ADR-035)
CI / CD Pipeline / build (push) Successful in 4m16s
CI / CD Pipeline / deploy (push) Successful in 11m51s

This commit is contained in:
2026-06-12 08:07:15 +07:00
parent 71c5e88181
commit 0227b7b982
63 changed files with 3566 additions and 451 deletions
+6
View File
@@ -57,6 +57,12 @@ OLLAMA_EMBED_MODEL=nomic-embed-text
OLLAMA_RAG_MODEL=typhoon2.5-np-dms:latest
OLLAMA_URL=http://192.168.10.8:11434
# VRAM, Residency & Concurrency settings (Feature-235 AI Runtime Policy)
AI_VRAM_HEADROOM_THRESHOLD_MB=3000
AI_GPU_MAIN_MODEL_PRESSURE_THRESHOLD_MB=12000
AI_OCR_RESIDENCY_WINDOW_SECONDS=120
AI_REALTIME_CONCURRENCY=2
# Qdrant (ADR-023A)
QDRANT_HOST=http://192.168.10.8:6333
QDRANT_COLLECTION=lcbp3_documents