Tag: ai-engineering
All the articles with the tag "ai-engineering".
-
SRE for AI Agents: Error Budgets, Trust Ladders, and 90 Trials
Can an AI agent predict scope without hallucinating? We ran 90 trials. It added 1.7 phantom files per change. Error budgets and trust ladders are the gate.
-
What a Null Result Taught Us About AI Agent Evaluation
• UpdatedWe tested prompt repetition on 20 parallel AI agents. Ceiling effects dominated both experiments. The null result is a finding about evaluation design.
-
Why Your AI Agent Failed in Production
• UpdatedWhy your AI agent failed: missing decision provenance, not metrics. The 3 observability gaps traditional monitoring won't catch.
-
RAG for Legacy Systems: 7,432 Pages to 3s Answers
• Updated7,432 pages to 3-second answers. Production RAG for legacy systems with model-agnostic reranking. No vendor lock-in, validated across 4 LLM families.
-
AI Agents in Legacy Systems: ROI Without Modernization
• UpdatedLayer AI agents over legacy systems without modernization. 30-80% productivity gains in 3-6 months. Patterns that bypass technical debt.
-
Orchestrating AI Agents: A Subagent Architecture for Code
• Updated50% cost reduction with subagent architecture for AI coding. Capable models for planning, fast models for building. Real metrics from Goose.