Mattermost Logo

Mattermost

Site Reliability Engineer (SRE)

Posted 9 Days Ago
Remote
Hiring Remotely in United States
150K-190K Annually
Senior level
Remote
Hiring Remotely in United States
150K-190K Annually
Senior level
The Site Reliability Engineer at Mattermost will design, operate, and enhance the platform's infrastructure, focusing on reliability, performance, and automation.
The summary above was generated by AI
At Mattermost, we build the #1 collaborative workflow solution for defense, intelligence, security, and critical infrastructure organizations. Trusted by governments, financial institutions, and technology companies, our platform enables secure, efficient operations for the world’s most critical teams.
 
We’re dedicated to empowering organizations to operate with confidence, reducing risks, and accelerating productivity. Guided by our core values of Customer Obsession, Earn Trust, Self Awareness, Ownership and High Impact, we collaborate closely with our customers to deliver solutions that meet complex needs and drive success.
 
To learn more, visit www.mattermost.com

Mattermost is seeking a highly skilled Site Reliability Engineer (SRE) to help design, operate, and improve the infrastructure powering our secure, mission-critical collaboration platform. As part of our globally distributed Engineering team, you will focus on reliability, scalability, performance, and automation across cloud and hybrid environments. 

You will play a key role in ensuring our systems are observable, resilient, and efficient, working closely with development, security, and operations teams to deliver exceptional uptime and performance to our customers in defense, government, and critical infrastructure sectors. 

Responsibilities Include:

  • Build, maintain, and optimize containerized workloads for production environments 
  • Implement infrastructure-as-code for repeatable and reliable deployments 
  • Implement and maintain compliant cloud environments to meet regulatory and security requirements for customers in highly regulated domains (e.g., FedRAMP, DoD). 
  • Establish and maintain observability solutions for monitoring, alerting, and performance tuning 
  • Perform incident response for production systems, including root cause analysis and remediation 
  • Drive automation to reduce manual operations and improve system reliability 
  • Collaborate across teams to design scalable, secure, and compliant architectures 
  • Participate in an on-call rotation for production systems 

 Requirements:

  • BS in Computer Science, Cybersecurity, Software Engineering, or a related technical field, or equivalent experience, with 5+ years of relevant experience in site reliability engineering, DevOps, or cloud infrastructure roles. 
  • Strong background in container orchestration platforms, ideally Kubernetes 
  • Proven experience with infrastructure-as-code tooling, ideally Terraform 
  • Proven experience with cloud service providers, ideally AWS 
  • Experience designing and maintaining monitoring and alerting solutions 
  • Strong skills in troubleshooting and performance tuning for distributed systems 
  • Proficiency in at least one scripting or programming language for automation 
  • Excellent communication skills and ability to work in distributed teams 
  • For candidates residing in the U.S.: This role may require the ability to obtain and maintain a U.S. government security clearance in the future. As such, U.S. applicants must be U.S. citizens and eligible under applicable clearance requirements.  
  • Applicants must meet eligibility requirements for access to export-controlled information as defined by U.S. export control laws, including EAR and ITAR. 

 Preferences:

  • Familiarity with observability stacks such as Grafana and Prometheus 
  • Knowledge of high-availability, disaster recovery, and scaling strategies 
  • Experience in highly regulated industries such as defense, finance, or critical infrastructure 
  • Experience with U.S. federal compliance frameworks and authorization processes, including FedRAMP, DoD ATO, NIST 800-53, and related government standards. 
  • Experience preparing, delivering, and maintaining software offerings through AWS Marketplace and other cloud provider marketplaces (e.g., Azure Marketplace, Google Cloud Marketplace), including packaging, compliance validation, and ongoing operational support. 
  • Open-source contributions related to reliability or infrastructure tooling 
  • Certifications in cloud infrastructure, reliability, or DevOps engineering (e.g., CKA, CKAD, AWS Certified Solutions Architect) 

Mattermost takes a market-based approach to pay and pay may vary depending on your location. The successful candidate’s starting pay will be determined based on job-related skills, experience, qualifications, work location, and market conditions. These ranges may be modified in the future.

 

Salary Range
$150,000$190,000 USD
Mattermost is an EEO Employer, we are a remote-first, open-source company.
 
We are continually working to expand our hiring in more countries and regions, ensuring compliance with local laws and regulations, which takes time.
 
Mattermost values your unique perspective—we welcome all applicants. We encourage individuals from all backgrounds to apply and are committed to assessing candidates based on their skills and qualifications. We do not tolerate discrimination against staff or applicants based on race, religion, national origin, age, disability, pregnancy status, veteran status, or other personal characteristics.
 
If you require accommodations during the interview process, please let us know—we’re happy to assist.

Top Skills

AWS
Grafana
Kubernetes
Prometheus
Terraform

Similar Jobs

2 Days Ago
In-Office or Remote
2 Locations
141K-212K
Mid level
141K-212K
Mid level
Artificial Intelligence • Productivity • Software • Automation
The Site Reliability Engineer will enhance reliability, observability, and incident response at Zapier, developing platform tooling, automating operations, and mentoring others.
Top Skills: ArgocdAWSDatadogGitlabGoGrafanaKafkaKubernetesOpensearchPrometheusPythonRedisSentryTerraformTypescript
8 Days Ago
Remote or Hybrid
New York, NY, USA
130K-180K Annually
Senior level
130K-180K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Oversee operational support for SAP BTP applications, lead troubleshooting efforts, manage relationships with teams, and ensure high system performance and availability.
Top Skills: Abap ProxiesCapmEncryptionIdentity ManagementIdocJSONMessage QueuesOauthOdataRestSAMLSap AribaSap BtpSap C4CSap CallidusSap CpiSap Success FactorsSfapiSftpSoapWorkdayXML
9 Days Ago
Remote or Hybrid
2 Locations
205K-257K Annually
Senior level
205K-257K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Lead diverse technology projects as a Site Reliability Engineer to optimize and automate business-critical services, focusing on cloud-based solutions and advanced technologies.
Top Skills: AWSCassandraDockerKafkaNode.jsOpensearchPostgres

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account