Axiom (axiom.co)

Site Reliability Engineer

Reposted 23 Days Ago

Remote

Hiring Remotely in United States

Mid level

Remote

Hiring Remotely in United States

Mid level

Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.

The summary above was generated by AI

Site Reliability Engineer (SRE)

Global (UTC-3 preferred)

Axiom’s mission is to empower developers to get the best insights into their data, as fast as possible. We are a remote-first and globally distributed team building a cloud native, serverless data analytics platform. Axiom completely changes the way in which developers and organizations think about their data: they can now send unlimited data with cost-effective storage and lightning-fast querying.

As a Site Reliability Engineer at Axiom, you will be pivotal in upholding our promise of superior reliability and performance to our customers. Collaborating with backend engineers and product teams, you will emphasize creating and operating scalable and reliable systems. Axiom's emphasis on SREs revolves around automating, measuring, and continuously improving the reliability and efficiency of our systems.

Your primary responsibilities:

Engineer and maintain a robust, secure, and scalable infrastructure for Axiom Cloud.
Collaborate with engineering teams to define and refine service level objectives.
Contribute to disaster recovery planning, capacity engineering, performance analysis, and system tuning.
Foster best practices for code deployments, aiding in the education of the broader development team.
Roll out tooling and solutions that improve system reliability and reduce manual toil.
Address and remediate service incidents and contribute to postmortems and root cause analyses.
Foster a culture of monitoring, alerting, and observability across the organization.

You are an ideal candidate if:

You have over two years of experience in a reliability-focused engineering environment.
You are passionate about system reliability, latency, performance, and efficiency.
You're familiar with AWS tools and technologies.
You have hands-on experience with Docker, Kubernetes, and Amazon EKS.
You understand infrastructure-as-code tools such as Terraform/Pulumi.
You possess strong networking knowledge and are adept with Linux systems.
Familiarity with CI platforms like GitHub Actions, GitLab, CircleCI or others.
You can efficiently use LLMs.
Experience with monitoring, alerting, and observability tools.

Bonus skills and experiences:

Proven track record of maintaining production systems at scale.
A software engineering background with expertise in Golang.

We provide:

Flexibility to work from wherever suits you best. For this role, we are considering individuals based in the timezone range UTC-5 (EST) to UTC +2.
Budget to build your home office set-up.
Monthly budget to support mental and physical wellness.
A focus day each week with no meetings, Slack or Zoom. Uninterrupted time to focus on work.
Uncapped vacation to unplug and rejuvenate.
Generous and flexible family leave for everyone.

Top Skills

Amazon Eks

AWS

CircleCI

Docker

Github Actions

Gitlab

Kubernetes

Linux

Llms

Monitoring And Observability Tools

Pulumi

Terraform

Similar Jobs

CrowdStrike

Senior Software Engineer

7 Days Ago

Remote or Hybrid

140K-215K Annually

Senior level

140K-215K Annually

Senior level

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity

As a Senior Engineer in the Embedded Reliability team, you will develop and optimize distributed systems, improve reliability practices, and mentor other engineers, focusing on hands-on systems engineering and complex problem-solving.

Top Skills: AWSCassandraGoKafkaKubernetesOpensearch

MongoDB

Site Reliability Engineer

8 Days Ago

Easy Apply

Remote or Hybrid

Easy Apply

127K-249K Annually

Senior level

127K-249K Annually

Senior level

Big Data • Cloud • Software • Database

The Senior Site Reliability Engineer will develop and support distributed storage services, ensuring reliability and operational safety, with a focus on automation and efficiency.

Top Skills: AWSAzureDnsGoGoogle Cloud PlatformKubernetesLinuxPythonTcp/IpTls

MongoDB

Site Reliability Engineer

8 Days Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

127K-249K Annually

Expert/Leader

127K-249K Annually

Expert/Leader

Big Data • Cloud • Software • Database

Seeking a Site Reliability Engineer with expertise in networking and distributed systems for building secure multi-cloud infrastructure. Responsibilities include maintaining network architecture and ensuring reliable service-to-service communication, involving a 24/7 on-call rotation.

Top Skills: AWSAzureBgpDnsGCPIpv6KubernetesLoad BalancingMtlsService MeshTcp/IpTlsVpcsVpns

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories