#Incident Management

Posts on production incidents, response, coordination, and learning after failure.

October 5, 20253 min read

Managers Have Been Vibe Coding All Along

Everyone’s been talking about vibe coding lately. I’ve been doing it myself. two projects. and . It’s the kind of work where you don’t analyze, architect, or overthink. You star...

October 6, 20249 min read

Balancing Act of Reliability

Software development involves both creating and maintaining systems. Once you put anything into production, reliability becomes critical. When your systems are not reliable, you...

June 5, 20246 min read

Operational Skills Needed

Over the years, I've interviewed many candidates. One crucial skill that often gets overlooked is operational reflexes during oncalls. Surprisingly, few companies test for this,...

March 20, 20226 min read

Update Statements on Production

Executing update statements on a production database is always a big challenge. It’s one of those tasks that looks deceptively simple until something breaks in ways you didn’t i...

March 20, 20228 min read

Engineering Roles and Responsibilities

Engineering organizations have roles and responsibilities either explicitly or implicitly. When it’s explicit, one or more people exercise the engineering role, and the responsi...

February 21, 20226 min read

Essential Engineering Principles

Engineering principles give teams a practical foundation for how to build and operate software. They guide decisions, shape behaviours, and help groups stay aligned even as syst...

February 13, 20226 min read

Addressing Technical Debt

Tech debt occurs when we solve a software problem with our limited understanding of the business at the time. We start building a solution to get feedback as early as possible....

December 29, 202114 min read

Service Overload Strategies

Service overload happens a lot. If you haven't seen one, count yourself lucky. The first time I watched it take a system down, I realized how serious it’s to get the basics righ...

November 26, 202117 min read

Promoting Learnings in Incidents

Incidents are used for the negative consequences of an action. The incident comes from an action that fails to result in the expected outcome. For instance, deploying a code to...

March 25, 20162 min read

Shoulder Ship It

Pair programming is no new topic and has been widely used in the industry. At first, it might seem like itʼs a waste of time because two coders work in the same station. However...

April 12, 20135 min read

Buggy Code on Production, Survived

Areca is the name of the billing engine I am working on for Turk Telekom. Funny enough, it is also the name of the flowers we bought to freshen the office. We wanted the office...