QuickNode Logo

QuickNode

Technical Operations Engineer, Core

Posted 3 Days Ago
Be an Early Applicant
Remote
5 Locations
74K-157K
Senior level
Remote
5 Locations
74K-157K
Senior level
The Technical Operations Engineer will manage and optimize blockchain infrastructure, handle incident management, and support operational excellence, leveraging deep technical knowledge in Web3 technologies.
The summary above was generated by AI

QuickNode is a cloud-based infrastructure company that powers the blockchain ecosystem.

Our mission is to be the indispensable utility that empowers companies and innovators globally to build next-generation, Web3 enabled businesses & applications using blockchain technology. QuickNode is backed by some of the world's best investors including Tiger Global, Y Combinator, SoftBank, and the Seven Seven Six Fund. The QuickNode team has over 120 people maintaining high performance global data infrastructure for amazing customers serving billions of requests daily.

 We are a global remote company with an HQ in Miami, Florida.

The Role

We’re seeking a seasoned Technical Operations Engineer to ensure the stability, reliability, and performance of our production systems. In this key role, you’ll leverage deep technical expertise, particularly in Web3/blockchain technologies, to manage, optimize, and enhance our platform infrastructure. You’ll drive operational excellence through proactive monitoring, meticulous incident management, innovative problem-solving, and collaborative cross-team initiatives.

What You'll Do
  • Blockchain Network Management: Lead the deployment, optimization, and operational management of new blockchain networks. Conduct thorough testing, benchmarking, and continuous improvement of chain reliability and performance.

  • Complex Web3 Issue Resolution: Address high-impact Web3 incidents through rigorous troubleshooting, detailed log analysis, JSON-RPC response debugging, and direct coordination with blockchain foundations and ecosystem partners.

  • Proactive System Monitoring: Develop and maintain comprehensive monitoring and alerting solutions using advanced dashboards (e.g., Grafana, DataDog), identifying trends, anomalies, and performance bottlenecks before they become critical.

  • Incident & SLO Management: Define, implement, and enforce service-level objectives (SLOs) and agreements (SLAs), ensuring measurable standards of system reliability and performance are consistently met.

  • Automation & Optimization: Implement and maintain automation solutions (Ansible, Terraform, Kubernetes) to streamline deployments, reduce manual tasks, and optimize cloud infrastructure cost and efficiency.

  • Technical Collaboration: Actively collaborate with Tier-1 support, infrastructure, and development teams, ensuring alignment on system improvements, rapid issue resolution, and operational knowledge sharing.

  • On-Call Support: Participate in a rotating 24/7 on-call schedule to swiftly address critical system incidents, maintain continuous service delivery, and uphold customer trust.

What You'll Bring
  • Minimum of 5 years in Technical Operations, Site Reliability Engineering (SRE), or related roles. Proven Linux/Unix system administration and advanced troubleshooting capabilities.

  • Deep experience managing complex Web3 infrastructures (RPC services, validator setups, node operations). Skilled in interpreting blockchain logs, JSON-RPC responses, and debugging intricate Web3 protocol issues.

  • Solid hands-on experience with configuration management and infrastructure automation tools (Helm, Terraform, Ansible, Consul), including containerization expertise (Docker, Kubernetes), managing and scaling services in cloud environments.

  • Competency in scripting/programming languages (Python, Go, JavaScript).

  • Advanced proficiency in monitoring and analytics platforms (Grafana, DataDog), enabling proactive and data-driven operational decision-making.

  • Demonstrated ability to identify performance patterns, forecast potential issues, and implement preventive solutions.

  • Strong track record defining, measuring, and maintaining SLAs/SLOs, and experienced with incident response tooling and processes (PagerDuty), ensuring quick resolution and systematic root-cause analyses.

  • Willing to travel on a limited basis for conferences, offsites and/or meetings, generally less than 10 days per year.

  • Exceptional interpersonal and communication skills, with a proven ability to collaborate effectively across multiple teams and stakeholders.

  • Self-motivated, solution-oriented, and consistently striving for operational improvements, quality enhancements, and reduced technical debt.

  • Solid professional attributes, committed to transparency, accountability, and ethical behavior. Capable of managing complexity and staying adaptable under pressure, and able to demonstrate continuous learning and comfort evolving within a rapidly changing technical landscape.

  • Self-starter driven by curiosity and initiative, proactively identifying opportunities, addressing gaps, and implementing solutions autonomously.

  • Thrives in dynamic environments and committed to maintaining industry leadership through close collaboration with the most innovative and talented minds in Web3.

Performance Metrics

Success in this role will be measured by:

  1. Proactively monitor, rapidly respond to, and diligently resolve high-severity platform incidents during on-call and shift hours, ensuring ≥99.99% uptime (less than 4min 30sec downtime per month) across all Core Platform RPC services and validators.

  2. Actively seek opportunities to enhance operational efficiency through automation and streamlined processes, aiming to automate a minimum of two critical operational tasks or deployments per quarter, resulting in at least a 25% reduction in manual interventions and measurable improvements in deployment velocity.

  3. Autonomously tackle research, rapid operationalization, and rigorous maintenance for new L1/L2 chain deployments, achieving stable production readiness within 14 days, proactively ensuring ≥99.99% uptime post-launch, and effectively onboarding initial traffic for shared and public endpoint services.

The US base salary range and level for this position are $156,510 - $`73,900 per year and level P3. International ranges, in local currency, will be discussed during the hiring process with applicable candidates. This role is eligible for a quarterly bonus tied to company and individual goal achievement. We consider years of experience, level of proficiency in job function, the technical competencies required and location when determining base salary ranges for positions and levels.

The QuickNode compensation philosophy includes pillars to ensure fair and unbiased compensation for all employees. To design and deliver total reward offerings that are employee-centric. To offer a competitive benefit package in all locations where we operate. To prioritize attracting and retaining the best talent globally. To maintain a high-performing and flexible way of working.

 During the hiring process, we are committed to discussing compensation openly and honestly. We encourage candidates to share their salary expectations and requirements early, allowing for an individualized discussion. We know that our total rewards practices impact the lives and wellbeing of our employees. Therefore, we will never stop learning about the market, our business, your needs, and how best to achieve our goals through thoughtful and data-driven practices. If you have any questions or require further information about the compensation for this position, please don't hesitate to reach out to your Recruiter. 

We at Quicknode are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Top Skills

Ansible
Blockchain
Datadog
Docker
Go
Grafana
JavaScript
Json-Rpc
Kubernetes
Linux/Unix
Python
Terraform
Web3

Similar Jobs

10 Hours Ago
Remote or Hybrid
14 Locations
200K-230K Annually
Senior level
200K-230K Annually
Senior level
Marketing Tech • Real Estate • Software • PropTech • SEO
Lead the design and implementation of scalable systems, mentor engineers, and collaborate with product and AI teams to launch innovative features.
Top Skills: ApolloAWSDynamoDBElasticsearchJavaScriptKafkaKubernetesLambdaLangchainLangfuseMastraNode.jsOpenrouterPostgresPythonReactSqsTailwindTypescript
10 Hours Ago
Remote or Hybrid
12 Locations
Senior level
Senior level
Marketing Tech • Real Estate • Software • PropTech • SEO
Lead Luxury Presence's QA strategy, overseeing QA engineers to ensure exceptional quality across web, integrations, and mobile platforms while embedding quality practices throughout the software development lifecycle.
Top Skills: AppiumAWSCi/CdCypressPlaywrightSelenium
10 Hours Ago
Remote or Hybrid
14 Locations
150K-180K Annually
Senior level
150K-180K Annually
Senior level
Marketing Tech • Real Estate • Software • PropTech • SEO
The role involves designing and building cloud-native APIs and services, mentoring, driving key architectural decisions, and fostering collaboration across teams.
Top Skills: AWSDynamoDBElasticsearchJavaScriptKafkaLangchainLangfuseMastraNode.jsOpenrouterPostgresPythonReactSqsTypescript

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account