Spreedly Logo

Spreedly

Senior Software Engineer, Site Reliability Engineering (SRE)

Posted 13 Days Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
Enhance the reliability and performance of Spreedly's payments platform by implementing monitoring, incident management, and optimizing application performance.
The summary above was generated by AI
About Us:

Spreedly is the world's leading Open Payments Platform, sitting at the center of a network processing more than $50b of GMV annually. Spreedly's Payments Orchestration platform enables and optimizes digital transactions with the world’s most complete payment services marketplace. Built on Spreedly’s PCI-compliant architecture, our Advanced Vault solution combines a modern feature-set with rule-based configurations to optimize the vaulting experience for all stored payment methods. Global enterprises and hyper-growth companies grow their digital business faster by relying on our payments platform. Hundreds of customers worldwide secure card data in our PCI-compliant vault and use tokenized card data to enable and optimize over $45 billion of annual transaction volumes with any payment service.

Our vision is that the world is better with a diversified, inclusive payment ecosystem. Our mission is to accelerate commerce with an open, secure, and flexible payment platform that welcomes all payment participants. Our employees help us execute our vision by building a culture focused on autonomy, transparency, and collaboration in a dynamic, high-growth organization.

Product Offering: 

Spreedly provides an open payments platform. The platform’s connectivity provides payments performance. Key products and services include:

Payment Gateway Integration: Connects merchants, platforms, and marketplaces to multiple payment gateways and payment services.
Tokenization: Securely stores and manages payment data with a universal tokenization service.
Transaction Routing: Enables intelligent routing of transactions to optimize success rates and costs.
Payment Vault: A secure storage solution for sensitive payment information.
Fraud Tools Integration: Integrates with various fraud prevention tools to enhance transaction security.


About the Role:

As a Senior Software Engineer, Site Reliability Engineeringing (SRE) at Spreedly, you will enhance the reliability, performance, and scalability of our globally distributed payments platform. You'll focus heavily on the application layer, collaborating with product and platform engineers to implement effective monitoring, resolve performance issues, and raise system resiliency. This is a high-leverage technical role with visibility across engineering.

Responsibilites:

  • Application Observability & Monitoring: Design, implement, and improve observability systems using Datadog, OpenTelemetry, and other tools to proactively detect and resolve system issues.
  • Incident Management: Lead root cause analysis, incident resolution, and response rotation (~every 10–12 weeks), with a bias toward prevention and measurable reliability improvements.
  • Performance Engineering: Diagnose and resolve application-level bottlenecks in Ruby on Rails and Elixir codebases, and partner with engineering teams to deliver SLIs/SLOs.
  • Database Optimization: Identify and fix query and indexing inefficiencies in PostgreSQL and CockroachDB.
  • Cross-Team Collaboration: Serve as a reliability partner to product and infrastructure teams, coaching on reliability principles and embedding SRE best practices.
  • Tooling & Automation: Build developer tools to automate deployment, monitoring, and diagnostics across production systems.

Requirements:

  • 5+ years in SRE or related software engineering roles, with direct experience supporting production services at scale
  • Proficiency in a modern programming language (Ruby, Rails, and Elixir experience are preferred)
  • Hands-on experience with observability tooling (Datadog, OpenTelemetry, Sentry, etc.)
  • Experience with AWS services, such as EC2 (Ubuntu Linux), S3, and RDS.
  • Knowledge of relational databases (e.g., CockroachDB, PostgreSQL, Riak) with experience in performance optimization and query tuning. Experience with Kafka is a plus
  • Experience supporting incident response and postmortems in high-stakes environments
  • Prior work developing and improving SLIs/SLOs and leading uptime initiatives in customer-facing systems
  • Understanding of software design patterns to support scalability and fault-tolerance
  • Experience mentoring other engineers and advocating for best practices
  • Application-focused SRE who has worked on monoliths and complex service architectures

We Offer US-based Employees:

  • Competitive salary + Equity
  • Outstanding Medical and Dental benefits, including 100% employer-paid options
  • Company-paid Life and Disability insurance
  • Optional vision and supplemental insurance options, and various Flexible Spending Accounts (FSA)
  • Open Paid Time Off policy + 12 weeks of paid leave for new parents
  • Matching 401(k) plan (5% up to $5,000 yearly)
  • $1,000 annual professional development stipend
  • Monthly home working/digital lifestyle stipend, new MacBook, and one-time accessory reimbursement
  • Access to company-paid professional coaching service
  • Visits to HQ in Durham, North Carolina for remote employees

#LI-AE1

Spreedly is an equal opportunity employer. We are committed to fostering, cultivating, and preserving a culture of diversity, equity, inclusion, and belonging. We actively work to drive out even unintentional discrimination in our hiring processes via practices like blindly graded work samples, structured interviews, and diversity awareness training.

Due to the sensitive nature of what Spreedly does - handling payment data - finalist candidates must complete a successful background and reference check.

At this time Spreedly is unable to provide sponsorship for employment, and we are not set up to support remote employees who reside in California or New York. In order to be considered for employment, applicants must be currently legally authorized to work in the job location country and not require future sponsorship in order to continue working in that country.

We appreciate your interest in our company. Because of the high volume of resume flow, we may only respond to those candidates that we think will be a potential fit.

Top Skills

AWS
Cockroachdb
Datadog
Elixir
Opentelemetry
Postgres
Ruby on Rails
Ruby

Similar Jobs

40 Minutes Ago
Remote
United States
249K-336K Annually
Expert/Leader
249K-336K Annually
Expert/Leader
Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
The Director of Engineering will lead teams building foundational products at Dropbox, ensuring engineering excellence and alignment with product initiatives while nurturing talent.
Top Skills: Ai/Ml TechnologiesDistributed StorageFull-StackOs-Level IntegrationsWeb Technologies
43 Minutes Ago
Remote or Hybrid
US
105K-148K Annually
Senior level
105K-148K Annually
Senior level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
The Sr. Fullstack Observability Engineer will design, integrate, and deliver technical solutions, mentor engineers, and serve as a technical advisor for clients.
Top Skills: Azure DevopsCi/Cd PipelinesCisco ObservabilityContainersDynatraceKubernetesLogicmonitorOpen Telemetry
45 Minutes Ago
Remote or Hybrid
Moline, IL, USA
145K-155K Annually
Senior level
145K-155K Annually
Senior level
Artificial Intelligence • Cloud • Internet of Things • Machine Learning • Analytics • Industrial
Lead the design and implementation of battery management system algorithms, interface with testing teams, and explore advanced techniques for battery performance and safety.
Top Skills: Matlab/SimulinkPython

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account