Senior Software Engineer, Matrix Multiplication

NVIDIA

Reposted 11 Hours Ago

In-Office or Remote

7 Locations

184K-288K Annually

Senior level

Develop AI systems for efficient inference, design and optimize kernels, and build domain-specific compilers and runtimes while collaborating with engineers.

The summary above was generated by AI

We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate AI inference. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtime components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads.

What you'll be doing:

Innovating and developing new AI systems technologies for efficient inference
Designing, implementing, and optimizing kernels for high-impact AI workloads
Designing and implementing extensible abstractions for LLM serving engines
Building efficient just-in-time, domain-specific compilers and runtimes
Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
Contributing to open source communities like FlashInfer, vLLM, and SGLang

What we need to see:

Master's degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience); PhD preferred
6+ years of academic or industry experience with ML/DL systems development preferred
Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
Strong Python and C/C++ programming skills
Strong experience in GPU kernel development and performance optimization (especially using CUDA C/C++, cuTile, Triton, or similar), including hands-on experience with matrix multiplication

Ways to stand out from the crowd:

Background in domain-specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
Expertise in inference engines like vLLM and SGLang
Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
Open source project ownership or contributions

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 18, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
