Infrastructure & Platform Engineering

Infrastructure with
an AI on call.

I build and run modern infrastructure end to end, and keep it simple to operate. I also built a Claude Code plugin for incident response. Support resolves a case straight from a ticket ID, and a first-pass that used to need an engineer now takes a few minutes.

Platform
Talos · Omni · K8s
GitOps
ArgoCD · Terraform
AI-Ops
Ticket-ID diagnostics
Scroll to explore

Broad technical ownership across infrastructure

I work end-to-end across multiple technology stacks, from cloud and on-prem infrastructure to identity, security, networking, endpoint management, and observability.

AI-Ops From a Ticket ID

  • Enter a ticket ID and the agent reads the logs, analyses the metrics, and finds the root cause
  • Files one bug for the right team, or flags an upstream provider outage
  • Caught at first contact, not after three escalations, and a human makes the final call

Insight Everyone Can Use

  • Metrics and logs from Kubernetes and the on-prem Linux and Windows hosts in one place
  • AI and RAG over that data, so reading what is happening is no longer only an expert's job
  • Faster troubleshooting, and shared understanding across the team

A Platform You Own

  • Self-hosted Kubernetes on bare metal with Talos and Omni
  • No lock-in to someone else's managed operations or a third party you cannot see into
  • Immutable base, no SSH, least-privilege throughout, and secure because of how it is built

Everything as Code

  • Every change is a reviewed commit, not a console click
  • Roll anything back and rebuild it the same way every time
  • You always know what is running and why, with nothing stuck in one person's head

Technical Domains

Let's Connect

I'm open to conversations about AI platform engineering, AI engineering, and IT infrastructure in general. Reach out if you want to talk.