Karthik SJ

Karthik SJ is General Manager at LogicMonitor, where he leads AI-powered observability that turns telemetry into decisions and action. A product builder with experience at public companies like SAP and high-growth startups, he focuses on role-ready assistants, managed autonomy with human approvals, and prevention over firefighting. His teams blend internal and external signals into a unified data core, cutting alert noise, accelerating recovery, and proving value with KPIs such as time to detect, time to recover, and tickets avoided. Karthik champions pragmatic governance, transparency, and change management so enterprises can scale AI safely and measurably.

Your Outage Playbook Is Broken. Here Is the AI Fix

Episode Summary

Karthik SJ lays out a clear path from reactive firefighting to proactive prevention. He shows how role-ready assistants cut alert noise, cluster symptoms to root causes, and trigger the right runbooks so engineers spend time fixing issues, not chasing pages. You will hear how managed autonomy keeps humans in charge through approvals, audit trails, and rollback plans, and how to run agents like a real workforce with named owners, logs, and KPIs. We go deep on the data core that catches smoke before fire by fusing metrics, traces, logs, config, change events, cloud status, ISP health, and third-party signals. Expect practical guidance on the metrics that make both your CFO and SREs smile: time to detect, time to recover, ticket deflection, cost per incident, SLA adherence, change failure rate, and energy used per task. This is observability that acts, with guardrails that scale, and a playbook to move from war rooms at 3 a.m. to systems that heal themselves at 3 p.m.
