OneStream Software Logo

OneStream Software

Escalation Manager

Posted 3 Hours Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
104K-130K Annually
Senior level
Remote
Hiring Remotely in United States
104K-130K Annually
Senior level
The Escalation Manager oversees critical customer issues in the OneStream Cloud platform, managing incidents, coordinating teams, and driving process improvements.
The summary above was generated by AI

Escalation Manager 


Location: Remote, USA 

Employment Type: Full-Time 

Compensation: $104,000.00 - $130,000.00 (Range applies to US candidates only) + Benefits/Variable Comp/Equity - Range may vary based on experience.  

Benefits Offered: Vision, Medical, Life, Dental, 401K 


Summary 

The Escalation Manager is responsible for overseeing the resolution of critical customer-impacting issues across the OneStream Cloud platform. This role serves as the operational incident leader for high-severity events, ensuring incidents are managed with urgency, clear ownership, structured coordination, and transparent communication. 

The Escalation Manager acts as the central coordination point during major incidents, partnering closely with Cloud Operations, Support, Platform Engineering, Cloud Engineering & Development, and Customer Success to drive rapid resolution and maintain customer confidence during complex situations. 

In addition to incident coordination, this role drives continuous improvement of escalation and incident management processes. The Escalation Manager helps mature operational practices by improving incident response frameworks, strengthening root cause analysis discipline, and identifying systemic reliability improvements that reduce recurring incidents. 

The ideal candidate brings strong technical operations knowledge, excellent communication skills, and experience leading incident response in high-availability SaaS or cloud environments. A passion for operational excellence, customer experience, and data-driven improvement is essential for success. 


Primary Duties and Responsibilities 

  • Lead the operational management of high-severity incidents and customer escalations across the OneStream Cloud platform. 
  • Serve as the central coordination point during critical incidents, ensuring appropriate teams are engaged and resolution efforts remain focused and efficient. 
  • Act as the incident manager during major incidents, maintaining situational awareness, coordinating response activities, and ensuring accountability for resolution actions. 
  • Facilitate incident response calls, coordinate technical teams, and maintain executive-level communication during major incidents. 
  • Clearly identify and assign resolution ownership to reduce ambiguity during incidents. 
  • Ensure customers receive timely updates, clear communication, and strong ownership throughout the escalation lifecycle. 
  • Own the operational incident lifecycle including incident declaration, coordination, escalation, communication, and post-incident review. 
  • Drive root cause analysis (RCA) processes and ensure corrective and preventative actions are implemented and tracked to completion. 
  • Track and manage escalated issues to resolution while identifying patterns, systemic risks, and recurring operational gaps. 
  • Develop and improve incident management frameworks, escalation procedures, severity definitions, and operational runbooks. 
  • Partner with cross-functional teams to reduce recurring incidents through automation, resiliency improvements, and architectural enhancements. 
  • Monitor escalation metrics and operational KPIs including MTTR, incident frequency, and customer impact. 
  • Lead post-incident reviews and drive accountability for operational improvements. 
  • Own and drive measurable incident outcomes, including reduction in MTTR and reduction of recurring incidents. 

Secondary Responsibilities 

  • Collaborate with Customer Support, Cloud Operations, and Engineering teams to improve the customer experience during major incidents. 
  • Maintain and evolve documentation for incident response procedures, escalation workflows, and communication templates. 
  • Identify opportunities to improve monitoring, operational tooling, and incident coordination practices. 
  • Contribute to reliability and operational maturity initiatives aligned with Site Reliability Engineering (SRE) practices. 
  • Provide guidance and mentorship to engineers and support personnel on escalation and incident management best practices. 

Required Education and Experience 

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related technical field, or equivalent professional experience. 
  • 5+ years of experience in cloud operations, incident management, site reliability engineering, or technical escalation management. 
  • Proven experience coordinating and managing high-severity incidents in production cloud or SaaS environments. 
  • Strong understanding of cloud infrastructure, distributed systems, networking fundamentals, and enterprise SaaS operations. 
  • Experience coordinating cross-functional technical teams during complex production incidents. 
  • Demonstrated experience operating incident management platforms used to coordinate major incident response (e.g., PagerDuty, Opsgenie, ServiceNow, or similar). 
  • Experience using observability and monitoring tools to support incident diagnosis and response. 
  • Demonstrated ability to communicate effectively with both technical teams and executive stakeholders during high-impact situations. 
  • Strong analytical and problem-solving skills with the ability to drive root cause analysis and systemic resolution of operational issues. 

Preferred Education and Experience 

  • Experience working in enterprise SaaS, cloud-hosted application environments, or managed service providers (MSP/CSP). 
  • Experience operating within Microsoft Azure environments. 
  • Familiarity with incident management and problem management frameworks such as ITIL or SRE practices. 
  • Experience working with observability platforms such as Datadog, New Relic, Prometheus, Grafana, or similar monitoring ecosystems. 
  • Experience contributing to reliability engineering initiatives focused on improving service availability and operational maturity. 
  • Relevant certifications such as Azure Fundamentals, Azure Administrator, ITIL Foundation, or reliability engineering certifications. 

Knowledge, Skills, and Abilities 

  • Strong leadership and coordination skills during high-pressure incident situations.
  • Ability to rapidly assess technical situations and drive structured incident response under pressure.
  • Strong situational awareness and decision-making during complex operational events. 
  • Exceptional communication skills, including executive-level incident communication. 
  • Customer-first mindset with strong ownership of outcomes. 
  • Ability to manage multiple priorities in a fast-paced cloud operations environment. 
  • Strong operational judgment and decision-making ability. 
  • Experience analyzing operational metrics to drive improvement initiatives. 
  • Ability to influence cross-functional teams without direct authority. 
  • Commitment to continuous improvement, operational maturity, and service reliability. 

Who We Are 

OneStream is how today’s Finance teams can go beyond just reporting on the past and Take Finance Further™ by steering the business to the future. It’s the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions and empowering the CFO to become a critical driver of business strategy. To learn more visit www.onestream.com. 

Why Join The OneStream Team 

  • Transparency around corporate structure, salary, and benefits 
  • Core value of customer success 
  • Variety of project work (not industry-specific)  
  • Strong culture and camaraderie 
  • Multiple training opportunities 

Benefits at OneStream   
OneStream employees are passionate, hardworking individuals who go above and beyond to keep our customers happy and follow through on our mission statement. They consistently deliver the best and in turn, we make every effort to keep them cared for and happy. A sample of the benefits we provide are: 

  • Excellent Medical Plan 
  • Dental & Vision Insurance 
  • Life Insurance 
  • Short & Long Term Disability 
  • Vacation Time 
  • Paid Holidays 
  • Professional Development 
  • Retirement Plan 

All candidates must be legally authorized to work for any company in the country where this position is located without sponsorship. 

OneStream is an Equal Opportunity Employer. 


#LI-TO1 #LI-Remote

Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities
This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.

Top Skills

Cloud Operations
Datadog
Grafana
Incident Management Platforms
Itil
Azure
New Relic
Observability Tools
Opsgenie
Pagerduty
Prometheus
SaaS
Servicenow

Similar Jobs

7 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
101K-198K Annually
Mid level
101K-198K Annually
Mid level
Big Data • Cloud • Software • Database
The Escalation Manager coordinates resolution of critical technical issues, collaborates cross-functionally, manages customer communications, and enhances incident management processes.
Top Skills: AWSAzureGCPLinuxNoSQLPager Duty
10 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
101K-198K Annually
Mid level
101K-198K Annually
Mid level
Big Data • Cloud • Software • Database
The Escalation Manager coordinates resolution of customer escalations, ensures communication among stakeholders, and drives issue resolution within technical teams.
Top Skills: AWSAzureGCPLinuxMongoDBNoSQL
10 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
101K-198K Annually
Mid level
101K-198K Annually
Mid level
Big Data • Cloud • Software • Database
Manage technical escalations for customers, ensuring resolution by coordinating with internal teams and maintaining clear communication. Lead incident analysis and prevention efforts while monitoring escalation trends for continuous improvement.
Top Skills: AWSAzureGCPLinuxMongoDBNoSQL

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account