Resources

FinOps Resources & Guides

Learn how to turn your billing data into clear, actionable answers — using Training Objects and conversational insights that actually explain what changed and what to do next.

FinOps for AI

AI spend patterns that standard FinOps tools miss — cross-service billing disguises, OCU floor costs, inference waste, and detection logic for the new AI infrastructure stack.

Amazon Bedrock Knowledge Base Costs Don't Show Up Under Bedrock
When you create a Bedrock Knowledge Base, AWS silently provisions OpenSearch Serverless at $345/month — under the wrong service name. Five real incidents, detection SQL, and a fix checklist for builders, FinOps practitioners, and engineering leaders.
9 min read
Runaway AI Inference: How a Prompt Loop at 2 AM Becomes a $4,200 Weekend Bill
Token consumption can spike 18× above your rolling baseline in 48 hours. Standard FinOps alerts won't catch it. Six real incidents ($15 to $50K), the FOCUS-native detection SQL, and a three-layer fix for the engineer who built it, the analyst who watches the bill, and the manager who owns the outcome.
10 min read
SageMaker Real-Time Inference Endpoints Bill When Nobody Is Calling Them
SageMaker real-time endpoints can't scale to zero — they bill $724+/month whether they receive 0 requests or 10,000. The charge is invisible in standard billing data until you join it with CloudWatch. Five real incidents, detection SQL, and a fix checklist for builders, FinOps practitioners, and engineering leaders.
9 min read
A Stolen API Key, 48 Hours, $82,000: The AI Billing Signature That Standard FinOps Tools Miss
A Google Maps key gained Gemini scope without the owner knowing. 48 hours, $82,314. 2,863 live exposed keys in Common Crawl. AWS Bedrock API keys carry identical risk. The FOCUS-native detection query, why Cost Anomaly Detection can't catch this, and a full incident response checklist.
11 min read
Your AI Inference Bill Grows Every Sprint Without New Traffic — It's Your System Prompts
Input token consumption climbs 15%+ month over month as developers add few-shot examples, safety rules, and RAG context across sprints — invisible to dashboards watching total Bedrock spend. Four real incidents ($800 to $8,400/month), the FOCUS-native detection query, and a fix checklist for builders, practitioners, and engineering leaders.
10 min read

Billing Anomaly Patterns

Internet-validated detection patterns for cloud cost anomalies — waste, spikes, commitment loss, data transfer leaks, security events, and more.

65% Waste: The Developer VM Your Team Is Paying For at 2 AM
How flat-rate cloud billing turns idle developer VMs into silent budget leaks — and how to detect and fix them using FOCUS billing data.
7 min read
Your Billing Data as a Security Detector: The New-Region Anomaly
How credential compromise shows up in cloud billing exports — the zero-threshold new-region rule, four billing-observable attack phases, and two confirmed 2025 campaigns.
8 min read
The Weekend Test: Why Your Cloud Resources Should Cost Less on Saturdays
How to detect scheduling misses in cloud billing data — the weekend/weekday ratio, real incidents ($6,700 Monday bill, $40K/year savings), and a practical fix checklist.
6 min read
The Hidden Cross-AZ Tax: Why Network Costs Keep Growing Silently
Data transfer misconfiguration — cross-AZ traffic and NAT Gateway misrouting — is an architecture tax that compounds with every gigabyte. How to detect it and eliminate it.
8 min read
The Quiet RI Bleed: Detecting Commitment Loss Before It Compounds
Commitment loss is an inverse anomaly — the bill stays flat while waste climbs to 93%. How to detect the cliff (sudden) and the drift (gradual) using FOCUS CommitmentDiscountStatus.
7 min read
When Cloud Costs Won't Come Back Down: Detecting Persistent Runaway Patterns
The runaway pattern fires on persistence, not magnitude — 4+ of 7 days above baseline. The $47K ECS overnight and $72K Cloud Run incident that nobody caught in time.
6 min read
How to Detect a Cost Spike Before It Becomes a Bill: The 3-Day Baseline Method
The 3-day vs 30-day baseline method for detecting cloud cost spikes — $120K DDoS autoscaling, Lambda recursion, and why 2× threshold is the right bar.
7 min read
The Ghost Inventory: Detecting Orphaned Cloud Resources in Billing Data
Resources still billing, delivering nothing — how to detect configured-and-forgotten resources and over-provisioned commitments using FOCUS ConsumedQuantity.
6 min read
Untagged Spend Is a Budget Risk, Not Just a Policy Violation
How missing cost allocation tags turn cloud resources invisible to alert routing — the $1M sandbox incident, industry stats, and a practical fix checklist.
6 min read

FinOps Basics

Foundational concepts, terminology, and operating models for cloud financial management.

AWS Billing

Deep dives into CUR, FOCUS v1, cost tracking strategies, and common billing pitfalls.

Terraform Automation

IaC patterns, cost guardrails, and automation modules for enforcing FinOps at scale.

Automation Examples

Real Lambda functions and scripts for alerts, tagging, cleanup, and cost control.

ETL & Data Pipelines

Building robust billing data pipelines with S3, Glue, Athena, and query optimization.

Dashboards & Reporting

From static charts to conversational insights — how to turn billing data into decisions.

AI in FinOps

How LLMs enhance cloud cost management — what works, what doesn't, and what's coming.

Starter Kit

Terraform modules, playbooks, and checklists to get a FinOps practice off the ground fast.

Case Studies

Real results: cost reductions, automation wins, and FinOps transformations.