Upgrade, Inc.

Principal DevOps Engineer, Infrastructure Performance

Reposted 17 Days Ago
Remote or Hybrid
Hiring Remotely in United States
Senior level
Design and build a cloud-based observability platform, troubleshoot performance issues, improve monitoring tools, and scale infrastructure. Lead operational improvements in a collaborative environment.
The summary above was generated by AI

Upgrade helps customers move in the right direction with affordable and responsible financial products. Since 2017, we’ve helped over 7 million customers access over $40 billion in consumer credit. With a relentless focus on improving our customers' financial well-being, we build products that put more money in their pocket and support their journey toward a better financial future. We’re backed by some of the most prominent technology investors and were most recently valued at $6.3B.

We’re consistently recognized for our collaborative and inclusive culture. Most recently, we were named one of the World’s Top Fintech Companies by CNBC, Best Places to Work by Built In, Best Places to Work by the San Francisco Business Times, America’s Greatest Workplaces by Newsweek, Best Startup Employer by Forbes, and Healthiest Employers by Phoenix Business Journal. 

We’re looking for new team members who get excited about designing and delivering new and better products. Come join us and help build a better financial future for millions of people.


What You'll Do:
  • Build a resilient, secure, and efficient cloud-based observability platform.
  • Monitor and troubleshoot platform issues, including finding solutions to reduce known issues.
  • Build and scale the observability infrastructure to meet rapidly increasing demand.
  • Develop and improve operational practices and procedures.
  • Sample projects:
    • Improve database monitoring: develop custom Prometheus exporters in Go for use cases that go beyond what is possible with SQL exporter. Create Grafana dashboards and alerts for these new metrics.
    • MCP servers for observability: deploy an MCP server to integrate our observability stack with our LLM tools.
What We Look For:
  • 8+ years of relevant production-level experience.
  • Experience with VictoriaMetrics.
  • Experience with Sumo Logic.
  • Experience with tracing tools (e.g. OpenTelemetry, Honeycomb, Tempo).
  • Experience with profiling tools (e.g. Pyroscope).
  • Knowledge of cloud monitoring, logging and cost management tools.
  • Programming/scripting knowledge (Go, Java, or Python) and understanding of JVM concepts.
  • In-depth knowledge of AWS services and hands-on experience provisioning AWS resources with Terraform.
  • Experience with containerized applications and Kubernetes / EKS, including creating and maintaining Helm charts.
  • Understanding of microservices architecture and debugging/investigation techniques.
  • Strong understanding of systems, networking and troubleshooting techniques.
  • Experience with automated build pipelines, continuous integration, and continuous deployment.
  • Ability to operate in an agile, entrepreneurial start-up environment.
  • Experience with running Linux in production.
Our Tech Stack:
  • Monitoring: VictoriaMetrics, Grafana, Prometheus, OpenTelemetry, Honeycomb, Sumo Logic.
  • Infrastructure as code: Terraform.
  • CD: GitOps, Argo CD, Argo Rollouts.
  • CI: Tekton.
  • Scripting: Bash.
  • Programming: Golang (preferred).
  • AWS: EKS, CloudWatch, S3, DynamoDB, RDS, SNS, SQS, Lambda.


What We Offer You: 

  • Competitive salary and stock option plan
  • 100% paid coverage of medical, dental and vision insurance 
  • Flexible PTO
  • Competitive 401(k) and RRSP program
  • Opportunities for professional growth and development 
  • Paid parental leave
  • Health & wellness initiatives

For California residents: Upgrade's California Notice at Collection and Privacy Policy describes our practices regarding the collection, use, and disclosure of the personal information of job applicants.

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Upgrade does not accept unsolicited resumes from staffing agencies, search firms, or any third parties. Any resume submitted to any employee of Upgrade without a prior written agreement in place will be considered the property of Upgrade, and Upgrade will not be obligated to pay any referral or placement fee. Agencies must obtain advance written approval from Upgrade's Talent Acquisition department to submit resumes and only in conjunction with a valid, fully executed agreement. English is required for all positions, as they involve interacting with staff at Upgrade's offices worldwide.

Top Skills

Argo CD
Argo Rollouts
AWS
Bash
EKS
GitOps
Go
Grafana
Honeycomb
Java
Kubernetes
OpenTelemetry
Prometheus
Pyroscope
Python
Sumo Logic
Tekton
Tempo
Terraform
VictoriaMetrics
