Topstep Logo

Topstep

Staff Platform Engineer

Posted Yesterday
Be an Early Applicant
Remote
Hiring Remotely in United States
205K-235K Annually
Senior level
Remote
Hiring Remotely in United States
205K-235K Annually
Senior level
The Staff Platform Engineer is responsible for leading the technical direction of infrastructure, reliability, and operational excellence, focusing on AWS, observability, and team mentorship.
The summary above was generated by AI

Summary 

Are you a systems-minded engineer who thrives on building resilient infrastructure, driving operational excellence, and enabling teams to move fast with confidence?

As a Staff Platform Engineer at Topstep, you’ll set the technical direction for how we approach infrastructure, reliability, and operational excellence across the engineering organization. This is a role that blends deep infrastructure expertise with SRE leadership you’ll be as comfortable tuning Terraform modules and Kubernetes manifests as you are defining SLOs, shaping incident response culture, and mentoring engineers on production ownership. 

You’ll own the strategy for our AWS infrastructure, observability stack, and platform tooling while driving the practices that let product teams ship with speed and confidence. You’ll influence architectural decisions across teams, close the gaps that prevent fast diagnosis of production issues, and build the foundations of a mature, unified platform engineering function.

This role is ideal for someone who brings both hands-on technical depth and a builder’s mindset, someone excited to define best practices from the ground up, embed reliability into engineering culture, and shape what operational excellence looks like at a fast-growing fintech company.

Key Responsibilities 

  • Provide technical leadership for infrastructure, reliability, and observability, driving architectural decisions and platform standards.
  • Build and mature the platform engineering practice defining SLOs, incident response protocols, on-call standards, and operational runbooks.
  • Own the observability stack using Datadog (metrics, APM, logging, distributed tracing) and CloudWatch, instrumenting systems and closing gaps that currently prevent fast diagnosis of production issues.
  • Design and evolve AWS infrastructure (EKS, Aurora, ElastiCache, SQS, CloudFront) for reliability, security, scalability, and cost efficiency.
  • Own and evolve CI/CD pipelines, deployment strategies, and release engineering practices across the organization.
  • Drive infrastructure-as-code strategy with Terraform across a multi-account AWS environment, ensuring consistency and repeatability.
  • Lead incident response and blameless post-mortems, turning outages into opportunities for systematic improvement.
  • Partner with product engineering teams to embed reliability principles early in the design process and improve system resilience.
  • Mentor engineers across the organization on infrastructure, reliability practices, operational thinking, and production ownership.
  • Champion a culture of transparency, continuous improvement, and shared ownership of production systems.

Required Qualifications and Key Competencies

  • 7+ years of professional experience in Platform Engineering, SRE, or Infrastructure Engineering, with demonstrated impact building practices that scaled across multiple teams.
  • Proven track record either starting a platform/SRE function from scratch or scaling an existing practice with measurable improvements to MTTR, MTTD, change failure rate, or availability.
  • Deep expertise with AWS infrastructure (EKS, EC2, RDS/Aurora, VPC, ALB/NLB, CloudFront, SQS) running production services at scale.
  • Strong proficiency with Datadog for end-to-end observability (metrics, APM, logs, distributed tracing) and building alerting that catches real issues without causing fatigue.
  • Hands-on experience building and maintaining CI/CD pipelines (GitHub Actions, CodePipeline, or similar), writing automation (Bash, Python), and contributing to platform tooling.
  • Strong proficiency with Kubernetes in production cluster operations, networking, security, scaling strategies, and GitOps workflows.
  • Solid foundation in distributed systems, networking, database performance, and debugging complex system failures across service boundaries.
  • Deep familiarity with Terraform for multi-account, multi-environment infrastructure management.
  • Track record of influencing engineering culture through documentation, tooling, mentorship, and technical leadership.
  • Excellent communication skills with the ability to explain complex system behavior, trade-offs, and pragmatic decisions between long-term platform vision and immediate business needs to varied audiences.

Preferred Qualifications

  • Experience with GitOps tooling such as ArgoCD or Flux.
  • Familiarity with event-driven architectures and message brokers (NATS, Kafka, or similar).
  • Experience with Google Cloud Platform (GCP) services and multi-cloud infrastructure management.
  • Experience with advanced deployment patterns (blue/green, canary, progressive delivery) and release engineering at scale.
  • Working knowledge of JavaScript/TypeScript and common Node.js frameworks (Express, Fastify, NestJS).
  • Experience with disaster recovery planning, multi-region architectures, and high-availability design patterns.
  • Background in infrastructure security, compliance frameworks, and WAF management.
  • Experience building financial, trading, or fintech platforms where data consistency, performance, and reliability are mission-critical.
  • Experience defining and driving SLO/SLI frameworks that product teams actively use.

Company Culture & Perks

  • Topstep is an engaging working environment that ranges from fully remote to hybrid. We foster a culture of collaboration by keeping cameras on during meetings and maintaining a robust Slack environment for communication. 
  • 7 Company-paid Holidays and generous Family Leave. Paid time off is front-loaded.
  • Competitive 401(k) matching, health, dental, and vision insurance are offered for full-time employees. 
  • Vacations are encouraged with a bonus for taking 5 consecutive days. Topstep offers a food and groceries budget and contributes towards health and wellness. 

New Hire Base Salary Range 

  • $205,000 - $235,000.
  • The compensation offered will take into account the internal compensation structure and may vary depending on the candidate's geographic region, job-related knowledge, skills, and experience, among other factors.
  • The compensation offered will take into account internal compensation structure and may vary depending on the candidate's geographic region, job-related knowledge, skills, and experience among other factors.

Equal Opportunity Employer

Topstep is an Equal Opportunity Employer. We are committed to fostering an inclusive environment where all employees and applicants are valued. All qualified candidates will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, age, disability, or veteran status, in compliance with applicable federal, state, and local laws.

Interested in the role? Apply today with your resume!

Similar Jobs

3 Days Ago
Easy Apply
Remote
United States
Easy Apply
185K-210K Annually
Expert/Leader
185K-210K Annually
Expert/Leader
Healthtech • Software
As a Staff Software Engineer, you will enhance platform engineering by improving deployment strategies and developer experiences, establishing reliable systems and mentorship across teams.
Top Skills: AWSCi/CdEcsJavaKafkaKubernetesSpringTerraform
3 Days Ago
Easy Apply
Remote
United States
Easy Apply
191K-265K Annually
Senior level
191K-265K Annually
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
The Staff Analytics Engineer will own the Financial Subledger Data Platform, build dbt models, implement data quality controls, and mentor a junior engineer, ensuring high operational reliability and cross-functional collaboration.
Top Skills: AWSDbtPythonSnowflakeSQL
12 Days Ago
Easy Apply
Remote
USA
Easy Apply
218K-257K Annually
Senior level
218K-257K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Lead the Identity Accounts team at Coinbase, responsible for authentication and authorization systems, ensuring high uptime, and collaborating with cross-functional teams to build scalable platform solutions.
Top Skills: DatadogGoGrpcKafkaKubernetesPostgresReact

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account