Engineering notes from production. AI agents, platform infrastructure, and the systems that compound.

Aaron Griffith

Hi, I'm Aaron

// the work

Reliability, AI infrastructure, and the tooling that ties them together.

I lead Site Reliability Engineers, Database Administrators, and Full Stack Developers, keeping production systems running while building the infrastructure that powers them. The technology and AI are the craft I love. The business they drive is what actually motivates the work.

  • AI agent infrastructure

    Daily-driving open-source agent frameworks, running local LLM tooling with Ollama, and shipping upstream fixes when something breaks.

  • Platform & production engineering

    Cloudflare Workers in production with KV caching, debugging edge cases at scale, and optimizing for cost and reliability on real traffic.

  • macOS & Linux automation

    CLI tools that remove friction, from VPN control to Pomodoro timers to AI-powered shell command generation. Small surface, durable leverage.

  • Technical leadership

    Frameworks for onboarding tech leaders, scaling engineering teams, and maintaining technical depth as an executive. Public, not theoretical.

  • Cross-industry reliability

    Shipped code and led teams across Aerospace/Defense, SaaS, Retail, and Education. Four industries, four reliability and compliance regimes.

  • Lab notebook in public

    This blog is the lab notebook. I write about what I'm debugging, what I'm building, and what I'm learning, for engineers who want the actual answer.

// systems shipped

Outcomes, not titles.

$4.2M

Savings delivered

99.95%

Production uptime

40%

Incident reduction

SRE, DBA, FSD

Teams led

// in flight

Side of the desk.

Open-source contribution and a small applied-AI venture. Both quiet, both real.

open source contributor

Contributing to Hermes Agent

I daily-drive Hermes Agent on my own server and ship fixes upstream when something breaks. It's the agent framework I reach for, and the posts about it are the long-form version of those debug sessions.

venture in development

Critana.io

Practical AI agents and automation for local service businesses. The parts that earn their keep on day one: scheduling, triage, follow-up, ops reporting.