Systems Engineering - Metrics and Alerting

Posted 12 Days Ago
Remote
130K-180K Annually
3-5 Years Experience
Information Technology • Security • Cybersecurity
The Role
Design, deliver, and operate software that progresses Cloudflare's Observability competency. Solve scaling bottlenecks in critical services in our Logging pipeline. Work on highly distributed and scalable systems. Participate in the constant cycle of knowledge sharing and mentoring. Participate in the global on-call rotation for the services your team owns. Research and introduce cutting-edge technologies. Contribute to open-source.
Summary Generated by Built In

Available Locations: Amsterdam, or Remote Netherlands
About the Department
Production Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behavior is and we are capable of determining and exposing anomalous behavior.
The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.
In this role, you can expect to:

  • Design, deliver, and operate software that progresses Cloudflare's Observability competency
  • Solve scaling bottlenecks in critical services in our Metrics & Alerting pipeline
  • Work on highly distributed and scalable systems
  • Participate in the constant cycle of knowledge sharing and mentoring
  • Participate in the global on-call rotation for the services your team owns
  • Research and introduce cutting-edge technologies
  • Contribute to open-source


We are a small team, well-funded, growing and focused on building an extraordinary company. This is a systems engineering role and is a superb opportunity to be part of a high performing team to help to support Cloudflare's mission and help build a better internet.
You may be a good fit for our team if you have:

  • Proficiency in distributed Linux environments
  • Proficiency in designing high-scale distributed systems
  • Proficiency in high-level programming languages (e.g., Golang)
  • Proficiency in Prometheus, Alertmanager, Thanos
  • Proficiency in networking protocols Layer 2-7 of the OSI model
  • Experience working in a fast, high-growth environment
  • Experience working in a 24/7/365 service environment
  • Exquisite written and verbal communication skills
  • Familiarity with Internetworking and BGP
  • Strong bias for action


Bonus points if you have:

  • Experience with high-bandwidth transit Internetworking and routing
  • Passion for code simplicity and performance

Top Skills

Go
The Company
Boston, MA
3,300 Employees
Hybrid Workplace
Year Founded: 2010

What We Do

Cloudflare, Inc. is on a mission to help build a better Internet. Cloudflare’s suite of products protect and accelerate any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare have all web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was awarded by Reuters Events for Global Responsible Business in 2020, named to Fast Company's Most Innovative Companies in 2021, and ranked among Newsweek's Top 100 Most Loved Workplaces in 2022.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Cloudflare Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We are committed to developing a global team that is distributed with a flexible working approach. Doing this equitably and inclusively is essential to our success. Visit our careers site for more on 'How & Where We Work.'

Typical time on-site: Flexible
Boston, MA

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account