← Back to results

Incident Manager

Lead incident management efforts at Cockroach Labs, ensuring timely resolution and communication during high-severity incidents.

Location
USA
Compensation
$111k–$147k/yr
Level
senior
Type
full time · Remote OK

AI in the day-to-day

Working knowledge of AI-assisted tools and the ability to apply them effectively to incident analysis, documentation, and process improvement.

Requirements

Experience
5+ years
Education
Bachelor's degree

Benefits

Dental Insurance Stock Options Vision Insurance Flexible Time Off Life and Disability Insurance Paid Holidays Paid Parental Leave Retirement Benefits Paid Sick Days Medical Insurance Mental Wellbeing Benefits Professional Development Funds

Joblaze summary

In the role of Incident Manager at Cockroach Labs, the individual will oversee the entire incident management process, ensuring swift resolution and effective communication across teams during high-severity incidents. Key skills include a strong technical background in operations or site reliability engineering, along with experience in using AI-assisted tools for incident analysis. This position is ideal for someone with over five years of relevant experience who thrives in fast-paced environments and can maintain composure under pressure. Cockroach Labs fosters a collaborative culture, emphasizing the importance of teamwork in addressing complex challenges.

Joblaze insights

Quick facts

Is the Incident Manager role remote?
It's hybrid — Cockroach Labs expects some on-site time in USA.
What's the salary range?
Cockroach Labs lists $111,000–$147,000 for this role.
How much experience is required?
At least 5 years of relevant experience for this Incident Manager role.
Where is the role based?
Cockroach Labs is hiring for this position in USA.
What's the tech stack?
Joblaze extracted these technologies from the posting: JavaScript, cloud infrastructure, bash, Incident Management, Python, SaaS.
What seniority level is this role?
Cockroach Labs targets senior candidates for this position.
Is this full-time or contract?
Full-time for this Incident Manager role at Cockroach Labs.

From the original posting

Category-defining tech. Career-defining work.

The next era of software won’t operate at human scale. Applications will create applications. Systems will coordinate with systems. Data and decisions will move faster than teams can react to them. The infrastructure powering that world cannot be fragile, reactive, or limited by the assumptions of the past.

Cockroach Labs exists to build what comes next — before the world requires it.

We created CockroachDB to survive failures, scale without compromise, and adapt to changing conditions automatically. Now we’re helping define a future where complexity fades into the background and infrastructure simply works, no matter the scale.

This is the kind of challenge that attracts people who want to shape industries, not just participate in them. The work is ambitious, the standards are high, and the impact is real.


The Role

As an Incident Manager at Cockroach Labs, you will lead the coordination and resolution of incidents across internal systems, CockroachDB Cloud, customer-hosted environments, and security/compliance events in the NA region. You will drive structured response efforts, partner with cross-functional teams to identify root causes, and help prevent recurrence in an environment where the pace is fast and the bar is high.

To be eligible for this role, you must be located in the Pacific time zone.

You Will

  • Manage the full lifecycle of incidents from detection through resolution, ensuring adherence to established incident management processes.
  • Lead and coordinate cross-functional response efforts to drive timely and effective incident resolution.
  • Declare and escalate high-severity incidents, mobilizing appropriate stakeholders and leadership as needed.
  • Serve as an escalation point for critical incidents and support crisis response activities.
  • Lead structured root cause analysis and post-incident reviews, ensuring actionable follow-up items are identified.
  • Track corrective actions to completion to reduce repeat incidents.
  • Provide clear, timely communication to technical and non-technical stakeholders, including customer-facing updates when required.
  • Contribute to incident metrics tracking (e.g., MTTR, MTTD, recurrence) and support reporting on trends and areas for improvement.
  • Support ongoing improvements to incident management processes, documentation, and tooling.
  • Participate in a rotational on-call schedule to ensure 24x7 coverage for high-severity incidents.

The Expectations

In your first 30 days, you will familiarize yourself with CockroachDB, our customers, and our company. We will provide some self-guided onboarding with reading and hands-on material to familiarize yourself with the company and some of the responsibilities of the role. During this period, you will also start to get acquainted with our incident management protocols and tools, and begin shadowing incident response activities to observe and learn from other team members with an eye to future improvements and optimizations.

After 3 months, you will be integrated into the company and will be familiar with the various systems we use. You will be able to manage incidents from both internal and customer environments and will be actively contributing to the Incident Management program. You will be leading incident response efforts, conducting root cause analyses, and participating in post-incident reviews.

You Have

  • 5+ years of experience in technical operations, SRE, support, or incident management roles, including at least 2 years of direct Incident Management experience leading high-severity incidents.
  • Prior experience working in a highly technical, fast-paced environment such as a cloud infrastructure, SaaS, or enterprise software company.
  • Working knowledge of AI-assisted tools and the ability to apply them effectively to incident analysis, documentation, and process improvement.
  • Strong troubleshooting and analytical skills in a 24x7 operational environment.
  • Excellent written and verbal communication skills across technical and non-technical audiences.
  • Working knowledge of AI-assisted tools and the ability to apply them effectively to incident analysis, documentation, and process improvement.
  • Ability to remain calm and structured during high-pressure situations.
  • Proven ability to assume command during high-severity incidents, bringing structure, clarity, and decisive direction in fast-moving, ambiguous situations
  • Flexibility in working hours to support business needs, including participation in an on-call rotation coverage.
  • Bachelor’s degree in Computer Science, Information Technology, or equivalent experience.

Preferred Qualifications

  • Experience leading incident response calls and driving cross-team coordination.
  • Strong influencing skills when working across teams without direct authority.
  • Familiarity with IT service management principles (ITIL, Incident, Change, Problem Management).
  • Experience with incident management tooling.
  • Exposure to security or compliance-related incident response.
  • Basic scripting skills (Bash, Python, JavaScript) to support operational improvements.
  • Relevant technical or ITIL certifications.

Cockroach Labs is proud to be an Equal Opportunity Employer building a diverse and inclusive workforce. If you need additional accommodations to feel comfortable during your interview process, please email us at accessibility@cockroachlabs.com.

Cockroach Labs has a hybrid work model, with Roachers that are local to one of our offices coming in on Mondays, Tuesdays, and Thursdays and working flexibly the rest of the week. While we’ve learned valuable lessons working remotely, nothing can replace the connection, creativity, and fun that occurs when Roachers get together and we are committed to fostering a workplace that encourages collaboration and allows us all to do our best work.


Benefits

  • Stock Options
  • Medical Insurance
  • Vision Insurance
  • Dental Insurance
  • Life and Disability Insurance
  • Professional Development Funds
  • Flexible Time Off
  • Paid Holidays
  • Paid Sick Days
  • Paid Parental Leave
  • Retirement Benefits
  • Mental Wellbeing Benefits
  • And more!

The annual anticipated base salary range for U.S. candidates for this role is listed in USD below. Salary is one component of the Cockroach Labs’ Total Rewards package, which also includes, for each employee: stock options, medical insurance, vision insurance, dental insurance, life and disability insurance, funds towards professional development resources, flexible paid time off, 11 paid holidays a year, 10 paid sick days a year, paid parental leave, a 401(k) plan, and wellbeing benefits.

We set standard ranges for all U.S.-based roles based on function, level, and geographic location, benchmarked against similar stage growth companies. Actual salaries may vary and fall outside of this range depending on factors such as a candidate’s qualifications, geographic location, skills, experience, and competencies. In addition, we are often open to a wide variety of profiles, and recognize that the person we hire may be less experienced (or more senior) than this job description as posted.

Salaries for candidates outside the U.S. will vary based on local compensation structures.

This position will remain posted until filled. Applicants should apply via our Careers Page.

Annual Anticipated Base Salary Range (U.S)
$111,000$147,000 USD

Similar positions

Cockroach Labs
Incident Manager, EMEA
Cockroach Labs · London, UK
Cockroach Labs
Cockroach Labs
Cockroach Labs
Cockroach Labs
Sr. Customer Success Manager- NYC or Toronto
Cockroach Labs · Remote, United States