Coalfire Logo

Coalfire

Director of Site Reliability Engineering

Posted 15 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
Lead and mentor a team in managing cloud operations, focusing on service delivery, incident response, and strategic organizational growth.
The summary above was generated by AI
About Coalfire

Coalfire is on a mission to make the world a safer place by solving our clients’ hardest cybersecurity challenges. We work at the cutting edge of technology to advise, assess, automate, and ultimately help companies navigate the ever-changing cybersecurity landscape. We are headquartered in Denver, Colorado with offices across the U.S. and U.K., and we support clients around the world.

But that’s not who we are – that’s just what we do.
 
We are thought leaders, consultants, and cybersecurity experts, but above all else, we are a team of passionate problem-solvers who are hungry to learn, grow, and make a difference.

Position Summary
We are seeking a technically adept and operationally focused leader to oversee our Cloud Operations group,  a growing team responsible for managing client infrastructure across AWS, Azure, and GCP. Our clients rely on us to operate secure, high-performing cloud environments that support regulated workloads and long-term service stability.
 
As Director of Cloud Operations, you will provide technical and operational leadership to a U.S.-based team of cloud and systems engineers and administrators, guiding them through the implementation of scalable processes, standards, and tooling that improve quality, reliability, and customer satisfaction. You’ll also mentor frontline leaders and help shape a team culture built on ownership, communication, and operational discipline.
 
This is a technical management role, while you won’t be hands-on, you will be expected to engage deeply with cloud architecture, infrastructure operations, automation frameworks, and service delivery workflows. Your success will be measured by your ability to improve execution, reduce operational risk, and build a high-performing team culture centered on accountability, transparency, and excellence.

What You'll Do

  • Team Leadership and Development
  • Lead and mentor 5+ direct managers and 20–30 indirect reports across cloud operations and systems engineering functions.
  • Build a team culture of accountability, urgency, and client ownership.
  • Support overall performance management, and long-term career development practices.
  • Act as an escalation point for technical and operational blockers impacting delivery or customer satisfaction.

  • Operational Excellence & Service Delivery
  • Drive improvements in incident response, ticket handling, change management, and patch compliance.
  • Standardize runbooks, monitoring, escalation paths, and documentation across client environments.
  • Identify and track key operational metrics such as MTTR, SLA adherence, and customer satisfaction.
  • Partner with internal teams to create more proactive service models that anticipate client issues before escalation.

  • Strategic and Organizational Growth
  • Collaborate with leadership to expand technical capabilities and develop new professional service offerings.
  • Evaluate emerging technologies and trends to guide innovation within the team’s technical practices.
  • Support organizational growth by creating scalable frameworks for service delivery and team expansion.
  • Participate in strategic planning sessions to align technical direction with business objectives.

  • Cross-Functional Collaboration
  • Collaborate with other departments to ensure alignment between professional services and broader business goals.
  • Partner with the Security Director on shared concerns such as incident containment, vulnerability remediation, and tooling integration.

What You'll Bring

  • Proven leadership experience with technical operations teams in a managed services or MSP context.
  • Deep knowledge of cloud infrastructure in AWS, Azure, and GCP environments.
  • Familiarity with infrastructure-as-code tools like Terraform, Ansible, GitHub/GitLab pipelines.
  • Strong communication skills with the ability to manage both internal teams and client expectations.
  • High emotional intelligence and situational awareness during client escalations and internal performance issues.
  • Experience leading operational maturity or ITSM process rollouts (e.g., incident/change/problem management).
  • Familiarity with SRE principles, but adaptable to operationally heavy environments.
  • Metric and KPI management
  • 8+ years of technical leadership experience, ideally within a managed services or multi-client environment.
  • Proven success in scaling technical organizations and driving operational excellence in a professional services environment.
  • Experience managing key operational metrics such as utilization, margins, and capacity.

Bonus Points

  • Direct experience leading cloud-focused teams or organizations.
  • Background in customer-facing roles, with experience in client escalations or high-level technical discussions.
  • Experience leading operational maturity or ITSM process rollouts (e.g., incident/change/problem management).
  • Familiarity with SRE principles, but adaptable to operationally heavy environments.
  • Relevant certifications in cloud platforms (AWS, Azure, GCP) or IT frameworks (ITIL, TOGAF) are preferred.

Why You’ll Want to Join Us

At Coalfire, you’ll find the support you need to thrive personally and professionally. In many cases, we provide a flexible work model that empowers you to choose when and where you’ll work most effectively – whether you’re at home or an office.

Regardless of location, you’ll experience a company that prioritizes connection and wellbeing and be part of a team where people care about each other and our communities. You’ll have opportunities to join employee resource groups, participate in in-person and virtual events, and more. And you’ll enjoy competitive perks and benefits to support you and your family, like paid parental leave, flexible time off, certification and training reimbursement, digital mental health and wellbeing support membership, and comprehensive insurance options.

At Coalfire, equal opportunity and pay equity is integral to the way we do business. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. Coalfire is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities. To request reasonable accommodation to participate in the job application or interview process, our Human Resources team at [email protected].

Top Skills

Ansible
AWS
Azure
GCP
Git
Gitlab
Terraform

Similar Jobs

15 Hours Ago
Remote
3 Locations
185K-205K Annually
Senior level
185K-205K Annually
Senior level
Healthtech • Software
The Director of Site Reliability Engineering leads cloud service management, incident response, and automation, mentoring SRE teams and optimizing cloud infrastructure for reliability and cost efficiency.
Top Skills: AWSCloud InfrastructureDeployment ToolsIncident Management ToolsMonitoring ToolsSecurity Tools
11 Hours Ago
Remote
Hybrid
New York, NY, USA
125K-155K Annually
Senior level
125K-155K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Staff Cyber Security Engineer will lead security analysis for technology deployments, ensuring secure design and compliance with best practices, while collaborating with various teams.
Top Skills: Application SecurityCis ControlsCloud SecurityCyber SecurityEdrMitre Att&CkNetwork SecurityNist CsfOwasp
15 Hours Ago
Remote
Hybrid
San Diego, CA, USA
127K-215K Annually
Senior level
127K-215K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead a team of Site Reliability Engineers to ensure reliable operations, automate processes, and improve system performance for federal clients.
Top Skills: AzureCloud OperationsCodingDatabasesItil V3LinuxMonitoring Solutions

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account