Lytx Logo

Lytx

Staff, Site Reliability Engineer

Posted 17 Days Ago
Remote
Hiring Remotely in USA
184K-233K Annually
Senior level
Remote
Hiring Remotely in USA
184K-233K Annually
Senior level
Leads SRE practices, manages system reliability, architects solutions, oversees incident management, and collaborates on infrastructure deployment strategies.
The summary above was generated by AI

Why Lytx:

We are a team of Hungry, Low ego and capable engineers that design and support our IOT Infrastructure. Are you interested in "Operations as Code", "Infrastructure as Code" and infrastructure automation solutions? If so keep reading....

Site Reliability Engineering team is responsible for the availability, reliability, observability and resilience of Infrastructure and related automation of the entire fleet of servers on-prem and the expanding cloud posture of the organization. This team’s responsibilities are very critical to the continuity of business of the organization. If you love crafting new solutions and building a scalable cloud and on-prem infrastructure, then this role may be an excellent match for you!

What you'll get to do:

  • Strategic Leadership: Define and drive the strategic direction for SRE practices and reliability engineering within the organization, influencing both technical and operational strategies.
  • Advanced System Architecture: Architect and implement complex systems and solutions, addressing high-impact and cross-team challenges with a focus on scalability, reliability, and performance.
  • High-Level Incident Management: Lead major incident response efforts and postmortem analyses, ensuring thorough investigations and comprehensive resolution strategies to improve overall system resilience.
  • Cross-Functional Collaboration: Partner with engineering, operations, and product teams to embed reliability and performance best practices into all aspects of system design and development.
  • Innovation and Improvement: Drive innovation in reliability engineering practices, introducing new tools, technologies, and methodologies to enhance system performance and operational efficiency.
  • Strategic Capacity Planning: Oversee long-term capacity planning and forecasting, aligning resource allocation with business goals and scaling needs to ensure continuous service reliability.
  • Mentorship and Leadership: Provide guidance and mentorship to senior and junior SREs, fostering a culture of learning and professional development within the SRE team.
  • Organizational Impact: Contribute to and influence organizational policies, procedures, and best practices related to system reliability, ensuring alignment with broader business objectives and industry standards.

What you'll need:

  • 8+ years of experience as an SRE in AWS environments within medium to large-scale organizations.
  • 8+ years of hands-on experience with observability tools, including Prometheus, New Relic, Grafana, or similar.
  • Exceptional proficiency in programming, with expertise in Python, Go, PowerShell, YAML, Node.js and Bash.
  • Extensive experience managing database technologies, both SQL and NoSQL.
  • 5+ years of experience in designing and building infrastructure deployment pipelines using Git, GHA, Terraform, Helm, or similar tools.
  • Advanced expertise in designing and managing production environments in AWS, including services such as VPCs, EKS, IAM, AMI, EC2, CloudWatch, CloudTrail, Control Tower, GuardDuty, MSK, S3, Glacier, Gateways, Direct Connect, Route 53, RDS, ALBs, Autoscaling, and more.
  • Deep knowledge of Linux systems and a range of protocols and technologies, including HTTP, REST, TCP/IP, SSL, DNS, SMTP, SSH, NTP, Load Balancing, SQL/NoSQL, Message Brokers, Nginx, Vault, ELK, and others.
  • Expert level experience with Kubernetes and a variety of container and cloud-native technologies.
  • Proven ability to manage 24/7 on-call rotations, develop runbooks, establish support procedures, and proactively monitor systems across multiple geographic locations.
  • Ability to excel under pressure in complex, high-stakes environments.

Benefits:

  • Medical, dental and vision insurance 
  • Health Savings Account
  • Flexible Spending Accounts
  • Telehealth
  • 401(k) and 401(k) match
  • Life and AD&D insurance
  • Short-Term and Long-Term Disability
  • FTO or PTO
  • Employee Well-Being program
  • 11 paid holidays plus 1 inclusive holiday per year
  • Volunteer Time Off
  • Employee Referral program
  • Education Reimbursement Program
  • Employee Recognition and Appreciation program
  • Additional perk and voluntary benefit programs

Salary is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.  This position is also eligible for an incentive compensation plan.  The expected hiring salary for this position is:

$183,500.00 - $232,500.00

Innovation Lives Here

You go all in no matter what you do, and so do we. At Lytx, we’re powered by cutting-edge technology and Happy People. You want your work to make a positive impact in the world, and that’s what we do. Join our diverse team of hungry, humble and capable people united to make a difference.

Together, we help save lives on our roadways!

Lytx, Inc. is proud to be an equal opportunity employer. We’re committed to building a diverse and inclusive workforce and do not discriminate based on race, color, religion, sex, sexual orientation, gender identity or expression, gender, genetic information, uniformed service, national origin, age, veteran status, disability, pregnancy, or any other status protected by federal or state law. We are committed to providing reasonable accommodation for candidates with disabilities who need assistance during the hiring process. To request a reasonable accommodation, please email [email protected].  Lytx conducts background checks on applicants who receive a conditional offer of employment in accordance with applicable local, state, federal and regional laws. Qualified applicants with arrest or conviction records will be considered. Background check results may potentially result in the withdrawal of a conditional offer of employment and will be made in accordance with all applicable local, state, federal and regional laws. 

Top Skills

AWS
Bash
Dns
Elk
Gha
Git
Go
Grafana
Helm
HTTP
Kubernetes
Linux
Load Balancing
Message Brokers
New Relic
Nginx
Node.js
NoSQL
Ntp
Powershell
Prometheus
Python
Rest
Smtp
SQL
Ssh
Ssl
Tcp/Ip
Terraform
Vault
Yaml

Lytx Framingham, Massachusetts, USA Office

492 Old Connecticut Path, 601, Framingham, MA, United States, 01701

Similar Jobs

10 Days Ago
Remote
United States
Senior level
Senior level
Healthtech • Other • Social Impact • Software • Telehealth
The Staff SRE & DevOps Engineer at Rula will enhance system robustness and scalability, promote observability, and adopt SRE best practices while collaborating with engineering teams.
Top Skills: AWSDevOpsKubernetesSre
10 Days Ago
Remote or Hybrid
New York, NY, USA
130K-180K Annually
Senior level
130K-180K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Oversee operational support for SAP BTP applications, manage incidents, collaborate on engineering strategy, lead integration development, and ensure system performance.
Top Skills: AbapCapmIdentity ManagementIdocJSONMessage QueuesOauthOdataRestSAMLSap Api Business HubSap AribaSap BtpSap C4CSap CallidusSap CpiSap Success FactorsSfapiSftpSoapWorkdayXML
4 Days Ago
Remote or Hybrid
Orlando, FL, USA
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Staff Production Service Engineer will maintain cloud infrastructure, drive reliability improvements, troubleshoot issues, mentor team members, and utilize software development and systems engineering skills.
Top Skills: AnsibleAWSAzureBashDockerGCPGrafanaJavaJavaScriptKafkaKubernetesLinuxMariadbMySQLNginxOpenstackOraclePostgresPrometheusPuppetPythonSplunkTerraform

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account