Netskope

Staff Site Reliability Engineer

Reposted Yesterday

Remote

Hiring Remotely in United States

111K-226K Annually

Expert/Leader

Remote

Hiring Remotely in United States

111K-226K Annually

Expert/Leader

The Staff Site Reliability Engineer will enhance AI/ML infrastructure, manage CI/CD pipelines, ensure system reliability, and troubleshoot applications, focusing on cloud-based operations.

The summary above was generated by AI

About Netskope

Today, there's more data and users outside the enterprise than inside, causing the network perimeter as we know it to dissolve. We realized a new perimeter was needed, one that is built in the cloud and follows and protects data wherever it goes, so we started Netskope to redefine Cloud, Network and Data Security.

Since 2012, we have built the market-leading cloud security company and an award-winning culture powered by hundreds of employees spread across offices in Santa Clara, St. Louis, Bangalore, London, Paris, Melbourne, Taipei, and Tokyo. Our core values are openness, honesty, and transparency, and we purposely developed our open desk layouts and large meeting spaces to support and promote partnerships, collaboration, and teamwork. From catered lunches and office celebrations to employee recognition events and social professional groups such as the Awesome Women of Netskope (AWON), we strive to keep work fun, supportive and interactive. Visit us at Netskope Careers. Please follow us on LinkedIn and Twitter@Netskope.

About the role

We are a team of software engineers focused on improving reliability, availability, latency, performance, efficiency, monitoring, emergency response, and capacity planning of the engineering stacks. If you are passionate about solving complex problems and developing cloud services at scale, we would like to speak with you.

As a SRE, you will be writing software to solve operational problems and drive cutting edge reliability and observability practices. Your expertise will also extend to setting up and maintaining monitoring, logging, and alerting systems to oversee extensive training runs and client-facing APIs. You will ensure that training environments are optimally available and efficiently managed across multiple clusters, enhancing our containerization and orchestration systems with advanced tools like Docker and Kubernetes.

Partner closely with service owners and engineers to develop reliable services driven by best practices
Develop software and tools to solve a variety of problems across service and infrastructure
Set up and manage monitoring, logging, and alerting systems for extensive training runs and client-facing APIs.
Ensure training environments are consistently available and prepared across multiple clusters.
Develop and manage containerization and orchestration systems utilizing tools such as Docker and Kubernetes.
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
Provide primary operational support and engineering for multiple large-scale distributed software applications

Is this you?

Someone who works with a sense of ownership
Takes pride in building and operating scalable, reliable, secure systems
Are comfortable with ambiguity and change
You have a knack for troubleshooting complex systems and enjoy solving challenging problems
Proactive in identifying problems, performance bottlenecks, and areas for improvement
Has experience in working and collaborating with teams based across different geographies and time zones

Required skills and experience:

Software programming experience in any programming language
Good understanding of principles of distributed systems
Deep understanding of Kubernetes and Docker
Understanding of data technologies like Kafka, Yugabyte, Redis etc
Good understanding of AWS ecosystem
Basic understanding of networking
Exposure to Infrastructure as code tools like Terraform
Familiar with monitoring tools such as Prometheus, Grafana, or similar
8+ years building core infrastructure
BSCS or equivalent required, MSCS or equivalent strongly preferred

Nice to have experience

Experience in operating and monitoring services communicating across AWS and private clouds
Experience operating Kubernetes at scale

#LI-SC1

Compensation:

At Netskope, salary is one component of our competitive total rewards package. The salary range for this position is as listed below. This is a national range. For purposes of complying with applicable laws, the range applies to candidates in California, Colorado, Illinois, Maryland, New York, Washington, and other states.

The successful candidate’s starting pay will also be determined based on job-related skills, experience, qualifications, location, and market conditions.

For all sales roles, the posted salary range is the On Target Earnings (OTE) range for the role, which is the sum of base salary and target commission amount at 100% goal achievement.

In addition to salary, candidates may be eligible for other forms of compensation such as participation in a bonus plan (for non-sales roles) and a stock award program. Candidates may also be eligible for a comprehensive health plan and other benefits that can be reviewed at Netskope Benefits site.

Salary Range

$111,000—$225,500 USD

Netskope is committed to implementing equal employment opportunities for all employees and applicants for employment. Netskope does not discriminate in employment opportunities or practices based on religion, race, color, sex, marital or veteran statues, age, national origin, ancestry, physical or mental disability, medical condition, sexual orientation, gender identity/expression, genetic information, pregnancy (including childbirth, lactation and related medical conditions), or any other characteristic protected by the laws or regulations of any jurisdiction in which we operate.

Netskope respects your privacy and is committed to protecting the personal information you share with us, please refer to Netskope's Privacy Policy for more details.

The application window for this position is expected to close within 50 days. You may apply by filling out the below information, or visiting our Netskope Careers site.

Top Skills

AWS

Azure

Bash

Docker

Git

GCP

Grafana

Huggingface Transformers

Kubernetes

Llm

Prometheus

Python

PyTorch

Tensorrt

Terraform

Similar Jobs

Zscaler

Site Reliability Engineer

4 Days Ago

Easy Apply

Remote or Hybrid

San Jose, CA, USA

Easy Apply

119K-170K Annually

Senior level

119K-170K Annually

Senior level

Cloud • Information Technology • Security • Software • Cybersecurity

As a Staff Site Reliability Engineer, you'll oversee Zscaler production data center services, optimize code, and ensure cloud service availability and performance. Collaborate with cross-functional teams to improve processes and resolve escalated issues.

Top Skills: BashDnsFirewallsGrafanaHTTPIcmpLoad BalancingNagiosOsi ModelPrometheusPythonTcp/Ip

Thrive Market

Site Reliability Engineer

4 Days Ago

In-Office or Remote

180K-225K Annually

Senior level

180K-225K Annually

Senior level

Consumer Web • eCommerce • Food • Healthtech • Natural Language Processing • Social Impact

Lead and define the DevOps strategy, oversee migration and architecture of Kubernetes-based platforms, and mentor engineering teams.

Top Skills: AnsibleAWSBashChefCloudFormationDatadogGoGrafanaKubernetesPrometheusPuppetPythonRubyTerraform

Ping Identity

Site Reliability Engineer

5 Days Ago

Easy Apply

Remote or Hybrid

USA

Easy Apply

136K-170K Annually

Senior level

136K-170K Annually

Senior level

Cloud • Security • Software

As a Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, improve deployment processes, and collaborate across teams.

Top Skills: Ci/CdDockerGoKubernetes

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories