JJ Tang on AI SRE
- Start with your biggest context gaps: usually service ownership and historical knowledge
- Build incrementally: each context source adds value; integration creates exponential benefits
- Measure comprehensively: measure, track, learn, and retain knowledge
- Invest in feedback loops: the best systems learn from every incident
"Assessment Question: When your AI identifies the next critical issue, will it know exactly who to call, what's been tried before, and how to coordinate the response?
"If the answer is no, then operational context is your next competitive advantage.
"The future of reliability engineering belongs to organizations that understand not just what's happening in their systems, but how they respond, learn, and continuously improve."
Comments
Post a Comment
Empathy recommended