Skip to content

ECO-AIM-AI-012

Name: Large model in low-SLA workload

Category: AIM

Family: AI

Primary layer: ai

System layers: ai

Description

Using high-cost models where latency/quality needs are modest wastes resources.

Impact

  • confidence: 0.6
  • notes: Usually a quick win.
  • type: cost

Detection

  • languages:
  • org
  • method: hybrid

Remediation

  • guidance: Tier models by use case; route requests by need.
  • tradeoffs: Routing complexity.

Pattern examples

No pattern examples provided.

Remediation examples

No remediation examples provided.