Sina Moghaddas

Site Reliability Engineer with over a decade of independent practice. I started SRE Together in 2015 to work directly with engineering teams on reliability strategy, infrastructure resilience, and the operational practices that keep services running when things go wrong.

What I Work On

  • Reliability strategy: SLOs, error budgets, and service health targets
  • Incident response: faster detection, clearer coordination, structured learning
  • Platform engineering: automation that reduces toil and deployment risk
  • Observability: actionable metrics, logs, and traces for production systems

How I Approach It

Reliability is a product decision as much as a technical one. I work at the intersection of engineering and business - translating uptime targets into clear priorities, and turning incidents into durable improvements rather than one-time fixes.

I've consulted across startups and established platforms, always with the same goal: make the team more capable, not more dependent.