Everbridge Logo

Everbridge

Site Reliability Specialist (Observability & Kubernetes)

Posted 4 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
119K-145K Annually
Senior level
Remote
Hiring Remotely in United States
119K-145K Annually
Senior level
The Site Reliability Specialist is responsible for managing Everbridge's observability platform, ensuring reliability, scalability, and visibility into system health through tools like Grafana and Kubernetes.
The summary above was generated by AI

At Everbridge, we build resilient, scalable, and secure cloud platforms that power critical services used by 6,000+ organisations worldwide, especially when it matters most.

We’re looking for a Platform Site Reliability Specialist to take ownership of our enterprise observability platform and help shape how our teams understand, monitor, and improve system reliability at scale.

This is a high-impact role where you’ll drive both technical excellence and strategic direction, ensuring our engineers have deep, real-time visibility into system health, performance, and reliability across a complex, cloud-native environment.

*Please note that this role requires eligibility to obtain secret secret clearance*

 

What you'll do:

    Observability Platform Ownership
    • Head the design, operation, and evolution of Everbridge’s observability stack
    • Build and maintain a highly available, scalable observability platform
    • Standardize instrumentation, dashboards, alerts, and SLOs
    • Support incident response, root cause analysis, and capacity planning
    • Grafana Stack & Telemetry
      • Operate and scale Grafana and technology
      • Grafana Loki (logs)
      • Grafana Mimir (metrics)
      • Grafana Tempo (tracing)
      • Grafana Alerting
      • Kubernetes
        • Maintain reliability and security of EKS clusters running observability
        • Manage cluster lifecycle and upgrades
        • Infrastructure as Code & Automation
          • Terraform for infrastructure provisioning
          • HashiCorp Packer
          • Gitlab CI/CD at Scale

What you'll bring:

    • 6+ years of experience in Site Reliability Engineering or Platform Engineering
    • Strong hands-on experience with the Grafana ecosystem
    • Deep expertise in Kubernetes, especially Amazon EKS
    • Solid proficiency with Terraform and infrastructure as code

Preferred Qualifications:

    • Experience with OpenTelemetry
    • Background in large-scale observability systems
    • Experience with cloud cost optimization

The reasonably estimated salary for this role at Everbridge ranges from $118,700 - $145,000 and may also include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Everbridge offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, disability income benefits, life and AD&D insurance, a 401(k) plan and match, paid time off, and fitness reimbursements.
 
Fair Chance Statement US & Canada
We are committed to providing equal employment opportunities in compliance with all applicable Federal, Provincial/State and Local laws, including the California Fair Chance Act and any local County Fair Chance Ordinance (or local equivalent). Pursuant to these and other relevant regulations, we consider qualified applicants with criminal histories in a manner consistent with the law.
 
For roles subject to background checks, the following material job duties may be affected by an applicant’s criminal history:
- Access to sensitive or confidential information, such as financial records, proprietary data, or client information.
- Management of cash, company funds, or other valuable assets.
- Work in environments requiring heightened security measures.
- Compliance with contractual or regulatory requirements specific to the position.
 
We evaluate each applicant's criminal history individually, considering its nature, timing, and relevance to the specific job duties, while maintaining our commitment to fair hiring practices and promoting workplace equity.

About Everbridge

Everbridge empowers enterprises and government organizations to anticipate, mitigate, respond to, and recover stronger from critical events. In today’s unpredictable world, resilient organizations minimize impact to people and operations, absorb stress, and return to productivity faster when deploying critical event management (CEM) technology. Everbridge digitizes organizational resilience by combining intelligent automation with the industry’s most comprehensive risk data to Keep People Safe and Organizations Running™. For more information, visit www.everbridge.com, read the company blog, and follow on Twitter. Everbridge… Empowering Resilience
 
Everbridge is an Equal Opportunity/Affirmative Action Employer. All qualified Applicants will receive consideration for employment without regard to race, creed, color, religion, or sex including sexual orientation and gender identity, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

Everbridge Burlington, Massachusetts, USA Office

25 Corporate Dr., Burlington, MA, United States, 01803

Similar Jobs

27 Minutes Ago
Easy Apply
Remote or Hybrid
US
Easy Apply
94K-110K Annually
Junior
94K-110K Annually
Junior
AdTech • Enterprise Web • Information Technology • Machine Learning • Marketing Tech • Sales
The Data Analyst will build performance reporting tools, automate data processes, conduct ad-hoc analyses, and provide insights for company initiatives.
Top Skills: Google BigqueryGoogle SuiteMS OfficePythonSQL
30 Minutes Ago
Remote or Hybrid
USA
100K-155K Annually
Mid level
100K-155K Annually
Mid level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Security Advisor II ensures the security posture of Falcon Complete customers through assessment, recommendations, and direct communication to resolve issues.
Top Skills: CybersecurityLinuxmacOSMdrSIEMUebaWindowsXdr
30 Minutes Ago
Remote or Hybrid
TX, USA
85K-120K Annually
Entry level
85K-120K Annually
Entry level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As an Analyst I at CrowdStrike, you'll handle incident responses, perform malware analysis, improve detection processes, and communicate findings to clients.
Top Skills: .NetCC#Forensic Analysis ToolsLinuxmacOSNetwork Analysis ToolsPerlPythonRuby On RailsVbWindows

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account