Key Responsibilities End-to-end service ownership: design for telemetry, security, resiliency, scalability, and performance; lead sizing/architecture; drive service health reviews and process simplification. Incident management and prevention: lead postmortems/RCAs, coordinate fixes, define repair items, and implement data-driven