ECO-AIM-AI-012

Name: Large model in low-SLA workload

Category: AI/ML

Family: AI

Primary layer: ai

System layers: ai

Description

Sending workloads with modest latency or quality requirements to a high-cost model wastes resources.

Impact

  • confidence: 0.6
  • notes: Usually a quick win.
  • type: cost

Detection

  • languages: org
  • method: hybrid

Remediation

  • guidance: Tier models by use case; route requests by need.
  • tradeoffs: Routing complexity.
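
The guidance above can be sketched as a small routing table: each use case declares the minimum quality it needs, and the router picks the cheapest tier that meets it. This is a minimal illustration, not a reference implementation. The tier names, quality levels, relative costs, and use-case names are all hypothetical placeholders, and defaulting unknown use cases to the cheapest tier is an assumption you may want to reverse.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str             # hypothetical model identifier, not a real product
    quality: int          # higher means more capable
    relative_cost: float  # illustrative cost relative to the cheapest tier


# Hypothetical tiers; replace with the models actually available to you.
TIERS = [
    ModelTier("small-model", quality=1, relative_cost=1.0),
    ModelTier("medium-model", quality=2, relative_cost=5.0),
    ModelTier("large-model", quality=3, relative_cost=25.0),
]

# Hypothetical use cases mapped to the minimum quality each one needs.
REQUIRED_QUALITY = {
    "log_summarization": 1,
    "ticket_classification": 1,
    "support_reply_draft": 2,
    "contract_analysis": 3,
}

# Assumption: unlisted use cases start on the cheapest tier.
DEFAULT_QUALITY = 1


def route(use_case: str) -> ModelTier:
    """Return the cheapest tier whose quality meets the use case's need."""
    needed = REQUIRED_QUALITY.get(use_case, DEFAULT_QUALITY)
    eligible = [tier for tier in TIERS if tier.quality >= needed]
    if not eligible:
        raise ValueError(f"no tier satisfies quality {needed} for {use_case!r}")
    return min(eligible, key=lambda tier: tier.relative_cost)


if __name__ == "__main__":
    for case in ("log_summarization", "support_reply_draft", "contract_analysis"):
        print(case, "->", route(case).name)
```

Keeping the mapping in one table limits the routing complexity named under tradeoffs: moving a use case to a different tier is a one-line change rather than a code change at each call site.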

Pattern examples

No pattern examples provided.

Remediation examples

No remediation examples provided.

Metadata

  • catalog_version: 0.4.0