Skip to content

ECO-AIM-AI-005 — Always-on inference endpoints

  • Category: AI/ML (AIM)
  • Family: AI (AI)
  • Layer: AI
  • Tier: 3
  • Severity: warning
  • Tags: ai, scale-to-zero
  • Legacy ID: ECO-AI-005

Summary

Always-on endpoints waste baseline compute when idle.

Rationale

Idle accelerators still burn power; scale needs boundaries.

Impact

{
  "confidence": 0.6,
  "notes": "High for GPU-backed endpoints.",
  "type": "carbon"
}

Detection

{
  "languages": [
    "infra"
  ],
  "method": "trace"
}

Remediation

{
  "guidance": "Autoscale; scale-to-zero where feasible; consolidate endpoints.",
  "tradeoffs": "Cold starts and scheduling complexity."
}

Ontology

{
  "system_layers": [
    "ai"
  ]
}