NVIDIA

Senior Solution Engineer, Networking

Posted 3 Hours Ago
In-Office or Remote
6 Locations
136K-265K
Senior level
The role involves troubleshooting and developing solutions for networking issues in AI clusters, leveraging extensive software engineering knowledge and direct customer interaction.
The summary above was generated by AI

The NVIDIA Enterprise Experience (NVEX) Solutions Engineering team is looking for a senior Computer or Software Engineer who is ready to become an authority in the groundbreaking network technology used in AI clusters. Our team of software engineers bridges the gap between customer support teams and R&D, focusing on resolving tough problems from the front lines and providing the highest level of support for the InfiniBand, NVLink, and Spectrum-X network systems that interconnect GPUs and AI compute infrastructure.

Candidates must have a software development background in the networking industry, working for either a network hardware manufacturer or a software integrator. It is essential to have a proven grasp of in-field, production network operations and experience root-causing customer-found issues down to the source code level, primarily in C and Python. Breadth of experience is key. We want to see experience in multiple areas such as network operating systems (NOS), Linux network drivers and internals, network hardware, NIC software, SmartNICs, DPUs, embedded firmware, Software Defined Networking, and infrastructure management technologies. IPC, race conditions, finite state machines, event processing loops, queue management, network traffic and flow analysis, and software design gaps will be common areas of focus. The individual will work across many NVIDIA teams and often interact with both internal and external customers, so superb interpersonal and communication skills are essential. Candidates will need to understand, root cause, and resolve complex issues, and provide detailed explanations of what they find.

What you will be doing:

  • Assist various network and AI cluster support teams in reproducing, resolving, and root causing sophisticated customer issues

  • Work with R&D teams to develop bug fixes, workarounds, and solutions for critical customers using NVIDIA’s network technologies

  • Become an authority in NVIDIA network technologies used in AI clusters such as InfiniBand, NVLink, and Spectrum-X

  • Analyze network performance metrics and make tuning recommendations for high-performance, lossless networks

  • Develop support and analysis tools to help analyze and root cause field issues

  • Use groundbreaking AI tools daily for software development, log and trace analysis, and source code debugging

  • Occasional work on weekends or holidays to support customers

What we need to see:

  • Minimum of a BS in Computer, Electrical, or Software Engineering (or equivalent experience)

  • 5-10 years of experience in C programming in Linux and embedded systems

  • Proficiency in Python

  • At least 5 years of experience developing software for one or more of the following:
    Linux NIC drivers, switch ASICs and SDKs, embedded network device firmware, Linux-based network equipment (routers, switches, gateways, etc.), network operating systems, virtual routers, SDN stacks, virtual switching, DPDK, SR-IOV stacks

  • At least 5 years of experience directly supporting end-customers, partners, or integrators for network equipment and infrastructures

  • Strong system software (firmware, BIOS, kernel, driver, operating system) expertise

  • Experience with container environments (K8s and Docker)

  • Professional-level communication skills, including adjusting communication to the technical level of the audience and staying calm and focused in difficult situations

  • Passion for learning innovative tech and motivation to work hard on ground-breaking products

Ways to stand out from the crowd:

  • Background with AI infrastructure and HPC networking

  • Experience programming switch and NIC ASICs and SDKs

  • Experience with InfiniBand or other non-Ethernet network technologies

  • Experience developing or supporting DPUs or SmartNICs

  • Knowledge of HPC performance test tools and NVIDIA AI stacks (NCCL, MPI, DOCA, CUDA)

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 212,750 USD for Level 3, and 168,000 USD - 264,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until August 11, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C
Docker
DPDK
InfiniBand
Kubernetes
Linux
NVLink
Python
SmartNICs
Spectrum-X
SR-IOV
