I build resilient cloud infrastructure that scales. With 19 years of experience, I've gone from being on-call during denial-of-service attacks on LinkedIn's on-premise servers to scaling Slack's cloud-based infrastructure during a global pandemic. Currently, I'm on the ML Infra team responsible for the infrastructure that powers Slack's AI features.
I take pride in contributing to team culture through mentoring, sharing learnings from production incidents (we all have stories), and giving code reviews that make engineers betterβnot just code better.
Technology is 100% guaranteed to fail: One thing I've learned throughout my years in the industry is that bugs, defects, and failures are inevitable. The challenge is building systems that are resilient, reliable, and repeatable.
My approach to system design is security-first, leveraging infrastructure-as-code tools like Terraform for elegant, reproducible deployments. I'm passionate about creating sustainable operations by transforming on-call rotations from reactive firefighting into proactive system stewardship through comprehensive monitoring, intelligent automation, and well-defined SLOs.
Outside of work, you'll find me exploring the Bay Area on my road bike, playing with Cowboy (my cat), or overthinking the optimal way to serve websites for organizations I care about like Lavender Phoenix (lavenderphoenix.org) and Bay Area Transformative Justice Collective (batjc.org).