AuthZed Logo

AuthZed

Sr. Site Reliability Engineer

Posted 6 Days Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
As a Site Reliability Engineer, you will design, implement, and maintain scalable infrastructure, ensure system reliability, automate processes, and collaborate with engineering teams.
The summary above was generated by AI
About AuthZed:

We are the creators and maintainers of SpiceDB and the authorization infrastructure that companies around the world depend on to keep their engineering teams focused on what matters most - their own product.

We are a Series A company, fixing broken access control with products that eliminate complex permission management while delivering enterprise-scale performance and consistent access control.

AuthZed is a fully remote company with employees across the US, Canada, and Europe. We’re a hardworking and close-knit group with a software-driven culture (yep, even our GTM team understands and loves this technology)! We bring integrity to all our interactions, fostering confidence in decision making - trusting and respecting each voice on our team, every day.

Company Values:
  • Agency: Everyone should have the capability, freedom, and confidence to bring about changes to our business and product. Organizational processes exist to clearly define our goals, but not restrict how progress is made.

  • Collaboration: Success is defined in various dimensions and no single person can be an expert in all of them. Without valuing the opinions of others, finding compromises, and sharing mutual trust and respect, you cannot arrive at the best possible solution.

  • Open-mindedness: Without asking questions, testing assumptions, and questioning our pre-existing biases we risk operating within an echo-chamber. We celebrate the representation of diverse perspectives and backgrounds as a catalyst for creating an inclusive work environment that everyone can appreciate.

About the Role:

As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for designing, implementing, and maintaining scalable infrastructure solutions to support our growing customer base. This is an exciting opportunity to work in a fast-paced environment and contribute to the success of a company bringing a Google-inspired authorization system to companies around the globe.

What you’ll own:
  • Design, implement, and maintain highly available and scalable infrastructure solutions for our projects, products, and customers.

  • Monitor and analyze system performance, identifying and resolving bottlenecks and issues to ensure optimal performance and reliability.

  • Automate infrastructure deployment and configuration management processes.

  • Continuously improve system reliability, security, and efficiency through proactive monitoring, capacity planning, and performance tuning.

  • Troubleshoot and resolve complex infrastructure and application issues in production and test environments.

  • Collaborate with software engineering teams to design and implement systems that are resilient, scalable, and secure.

  • Participate in on-call rotation and respond to production incidents in a timely manner.

  • Document system configurations, troubleshooting procedures, and operational guidelines.

What you bring:
  • Proven experience as a Site Reliability Engineer or in a similar role.

  • Strong understanding of networking, operating systems, and cloud infrastructure.

  • Experience with Site Reliability Engineering, System Design, and Distributed Computing.

  • Experience in various programming languages — we currently have SDKs for NodeJS, Java, Python, Ruby, and Go.

  • Experience with containerization technologies such as Docker and Kubernetes.

  • Knowledge of infrastructure-as-code tools like Terraform and Pulumi.

  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).

  • Experience with lower-level implementation details of relational databases (bonus if you have have experience with distributed SQL databased like Google Cloud Spanner or CockroachDB).

  • Experience working with Git and GitHub.

  • Experience with continuous integration and deployment systems.

  • Strong problem-solving and troubleshooting skills.

  • Excellent communication and collaboration abilities.

Extra shine:
  • Experience with Authorization systems.

Life at AuthZed:
  • Opportunity to work with cutting-edge technology in a rapidly growing sector.

  • A supported environment where your ideas lead to real impact.

  • Competitive salary based on experience.

  • Stock options at an early-stage startup.

  • Comprehensive benefits including healthcare (US-based) and other insurance.

  • A full remote and flexible schedule to accommodate different timezones

  • Twice-yearly travel for team offsites focused on team bonding, collaboration, and having fun!

Similar Jobs

Yesterday
Remote
110K-160K Annually
Senior level
110K-160K Annually
Senior level
Software
Operate and maintain production AWS/EKS Kubernetes clusters; design and ship infrastructure-as-code with Terraform; manage Helm charts and ArgoCD GitOps for multi-region SaaS; maintain observability (Grafana, alerting, logs); improve CI/CD pipelines; remediate container and infrastructure CVEs; support compliance (FedRAMP/SOC2/NIST); create runbooks and lead incident response and post-incident reviews.
Top Skills: Amazon EksArgocdAWSCi/CdClaudeDockerGitopsGrafanaHelmKubernetesTerraform
Yesterday
Remote
108K-125K Annually
Senior level
108K-125K Annually
Senior level
Internet of Things
Operate and evolve an EKS-based Kubernetes platform, design CI/CD pipelines (GitHub Actions, OIDC), maintain infra-as-code (Pulumi/Terraform/OpenTofu) across AWS accounts, run observability stack, enforce security best practices, diagnose incidents and lead postmortems, participate in on-call rotation, and produce runbooks and documentation.
Top Skills: Amazon EksAWSAws IamAws Secrets ManagerExternal Secrets OperatorGithub ActionsGrafanaKubernetesOidcOpentofuPulumiTerraformVectorVictorialogsVictoriametrics
18 Days Ago
In-Office or Remote
Senior level
Senior level
Information Technology • Software
The Senior Site Reliability Engineer will manage cloud infrastructure, enhance developer productivity, improve operational reliability, and mentor engineering teams.
Top Skills: AutomationAWSAzureCi/CdKubernetesScriptingTerraform

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account