ECO-AIM-AI-012¶
Name: Large model in low-SLA workload
Category: AIM
Family: AI
Primary layer: ai
System layers: ai
Description¶
Using high-cost models where latency/quality needs are modest wastes resources.
Impact¶
- confidence: 0.6
- notes: Usually a quick win.
- type: cost
Detection¶
- languages:
- org
- method: hybrid
Remediation¶
- guidance: Tier models by use case; route requests by need.
- tradeoffs: Routing complexity.
Pattern examples¶
No pattern examples provided.
Remediation examples¶
No remediation examples provided.