OneStream Software

Escalation Manager

Posted 3 Hours Ago

Be an Early Applicant

Remote

Hiring Remotely in United States

104K-130K Annually

Senior level

Remote

Hiring Remotely in United States

104K-130K Annually

Senior level

The Escalation Manager oversees critical customer issues in the OneStream Cloud platform, managing incidents, coordinating teams, and driving process improvements.

The summary above was generated by AI

Escalation Manager

Location: Remote, USA

Employment Type: Full-Time

Compensation: $104,000.00 - $130,000.00 (Range applies to US candidates only) + Benefits/Variable Comp/Equity - Range may vary based on experience. 

Benefits Offered: Vision, Medical, Life, Dental, 401K

Summary

The Escalation Manager is responsible for overseeing the resolution of critical customer-impacting issues across the OneStream Cloud platform. This role serves as the operational incident leader for high-severity events, ensuring incidents are managed with urgency, clear ownership, structured coordination, and transparent communication.

The Escalation Manager acts as the central coordination point during major incidents, partnering closely with Cloud Operations, Support, Platform Engineering, Cloud Engineering & Development, and Customer Success to drive rapid resolution and maintain customer confidence during complex situations.

In addition to incident coordination, this role drives continuous improvement of escalation and incident management processes. The Escalation Manager helps mature operational practices by improving incident response frameworks, strengthening root cause analysis discipline, and identifying systemic reliability improvements that reduce recurring incidents.

The ideal candidate brings strong technical operations knowledge, excellent communication skills, and experience leading incident response in high-availability SaaS or cloud environments. A passion for operational excellence, customer experience, and data-driven improvement is essential for success.

Primary Duties and Responsibilities

Lead the operational management of high-severity incidents and customer escalations across the OneStream Cloud platform.
Serve as the central coordination point during critical incidents, ensuring appropriate teams are engaged and resolution efforts remain focused and efficient.
Act as the incident manager during major incidents, maintaining situational awareness, coordinating response activities, and ensuring accountability for resolution actions.
Facilitate incident response calls, coordinate technical teams, and maintain executive-level communication during major incidents.
Clearly identify and assign resolution ownership to reduce ambiguity during incidents.
Ensure customers receive timely updates, clear communication, and strong ownership throughout the escalation lifecycle.
Own the operational incident lifecycle including incident declaration, coordination, escalation, communication, and post-incident review.
Drive root cause analysis (RCA) processes and ensure corrective and preventative actions are implemented and tracked to completion.
Track and manage escalated issues to resolution while identifying patterns, systemic risks, and recurring operational gaps.
Develop and improve incident management frameworks, escalation procedures, severity definitions, and operational runbooks.
Partner with cross-functional teams to reduce recurring incidents through automation, resiliency improvements, and architectural enhancements.
Monitor escalation metrics and operational KPIs including MTTR, incident frequency, and customer impact.
Lead post-incident reviews and drive accountability for operational improvements.
Own and drive measurable incident outcomes, including reduction in MTTR and reduction of recurring incidents.

Secondary Responsibilities

Collaborate with Customer Support, Cloud Operations, and Engineering teams to improve the customer experience during major incidents.
Maintain and evolve documentation for incident response procedures, escalation workflows, and communication templates.
Identify opportunities to improve monitoring, operational tooling, and incident coordination practices.
Contribute to reliability and operational maturity initiatives aligned with Site Reliability Engineering (SRE) practices.
Provide guidance and mentorship to engineers and support personnel on escalation and incident management best practices.

Required Education and Experience

Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related technical field, or equivalent professional experience.
5+ years of experience in cloud operations, incident management, site reliability engineering, or technical escalation management.
Proven experience coordinating and managing high-severity incidents in production cloud or SaaS environments.
Strong understanding of cloud infrastructure, distributed systems, networking fundamentals, and enterprise SaaS operations.
Experience coordinating cross-functional technical teams during complex production incidents.
Demonstrated experience operating incident management platforms used to coordinate major incident response (e.g., PagerDuty, Opsgenie, ServiceNow, or similar).
Experience using observability and monitoring tools to support incident diagnosis and response.
Demonstrated ability to communicate effectively with both technical teams and executive stakeholders during high-impact situations.
Strong analytical and problem-solving skills with the ability to drive root cause analysis and systemic resolution of operational issues.

Preferred Education and Experience

Experience working in enterprise SaaS, cloud-hosted application environments, or managed service providers (MSP/CSP).
Experience operating within Microsoft Azure environments.
Familiarity with incident management and problem management frameworks such as ITIL or SRE practices.
Experience working with observability platforms such as Datadog, New Relic, Prometheus, Grafana, or similar monitoring ecosystems.
Experience contributing to reliability engineering initiatives focused on improving service availability and operational maturity.
Relevant certifications such as Azure Fundamentals, Azure Administrator, ITIL Foundation, or reliability engineering certifications.

Knowledge, Skills, and Abilities

Strong leadership and coordination skills during high-pressure incident situations.
Ability to rapidly assess technical situations and drive structured incident response under pressure.
Strong situational awareness and decision-making during complex operational events.
Exceptional communication skills, including executive-level incident communication.
Customer-first mindset with strong ownership of outcomes.
Ability to manage multiple priorities in a fast-paced cloud operations environment.
Strong operational judgment and decision-making ability.
Experience analyzing operational metrics to drive improvement initiatives.
Ability to influence cross-functional teams without direct authority.
Commitment to continuous improvement, operational maturity, and service reliability.

Who We Are

OneStream is how today’s Finance teams can go beyond just reporting on the past and Take Finance Further™ by steering the business to the future. It’s the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions and empowering the CFO to become a critical driver of business strategy. To learn more visit www.onestream.com.

Why Join The OneStream Team

Transparency around corporate structure, salary, and benefits
Core value of customer success
Variety of project work (not industry-specific) 
Strong culture and camaraderie
Multiple training opportunities

Benefits at OneStream
OneStream employees are passionate, hardworking individuals who go above and beyond to keep our customers happy and follow through on our mission statement. They consistently deliver the best and in turn, we make every effort to keep them cared for and happy. A sample of the benefits we provide are:

Excellent Medical Plan
Dental & Vision Insurance
Life Insurance
Short & Long Term Disability
Vacation Time
Paid Holidays
Professional Development
Retirement Plan

All candidates must be legally authorized to work for any company in the country where this position is located without sponsorship.

OneStream is an Equal Opportunity Employer.

#LI-TO1 #LI-Remote

Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities
This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.

Top Skills

Cloud Operations

Datadog

Grafana

Incident Management Platforms

Itil

Azure

New Relic

Observability Tools

Opsgenie

Pagerduty

Prometheus

SaaS

Servicenow

Similar Jobs

MongoDB

Escalation Manager, FedRamp - Swing Shift

7 Days Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

101K-198K Annually

Mid level

101K-198K Annually

Mid level

Big Data • Cloud • Software • Database

The Escalation Manager coordinates resolution of critical technical issues, collaborates cross-functionally, manages customer communications, and enhances incident management processes.

Top Skills: AWSAzureGCPLinuxNoSQLPager Duty

MongoDB

Escalation Manager, FedRamp - 2nd Shift

10 Days Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

101K-198K Annually

Mid level

101K-198K Annually

Mid level

Big Data • Cloud • Software • Database

The Escalation Manager coordinates resolution of customer escalations, ensures communication among stakeholders, and drives issue resolution within technical teams.

Top Skills: AWSAzureGCPLinuxMongoDBNoSQL

MongoDB

Escalation Manager, FedRAMP - 3rd Shift

10 Days Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

101K-198K Annually

Mid level

101K-198K Annually

Mid level

Big Data • Cloud • Software • Database

Manage technical escalations for customers, ensuring resolution by coordinating with internal teams and maintaining clear communication. Lead incident analysis and prevention efforts while monitoring escalation trends for continuous improvement.

Top Skills: AWSAzureGCPLinuxMongoDBNoSQL

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories