NVIDIA

Senior Software Engineer, Agentic AI

Posted 8 Days Ago
In-Office or Remote
6 Locations
148K-288K
Senior level
As a Senior Software Engineer, you will develop the Agent Intelligence toolkit for AI agents, optimize applications for efficiency, and collaborate with engineers and data scientists to enhance GenAI workflows.
The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

NVIDIA is seeking a Senior Software Engineer to help build the Agent Intelligence (AIQ) toolkit, an open-source library for connecting enterprise agents to data sources and tools across any framework. In this role, you’ll be at the forefront of agentic application development, working with the latest LLM frameworks and libraries to create a powerful toolkit that enables large-scale AI agents for modern enterprises. You’ll design tracing and profiling tools to help scale these applications and collaborate with experts across domains to optimize performance, using the full power of the NVIDIA stack. Together, we’ll push the boundaries of NVIDIA’s core frameworks, revolutionizing AI applications for our enterprise customers!

What you'll be doing:

  • Implementing new features of our GenAI SDKs that enable LLM agents to expand to new, more demanding use cases and larger deployment configurations.

  • Crafting proof-of-concept workflows rooted in first principles that apply modern data science techniques to GenAI use cases.

  • Collaborating with other engineers to develop new optimizations for agentic applications across the entire data center, focused on improving accuracy, reducing latency, and increasing efficiency.

  • Building integrations between the AIQ toolkit and other NVIDIA products and services, such as the NeMo Framework, NIMs, and NVIDIA Blueprints.

  • Working with data scientists and ML/DL engineers to move from proof-of-concept analysis and modeling to production-ready pipelines and deployments.

What we need to see:

  • BS in Computer Engineering, Computer Science, Data Science, or other closely related field (or equivalent experience).

  • Proficient in Python, with at least 5 years of experience building Python libraries or applications for enterprise customers.

  • Experience with GenAI application development using LLM frameworks (such as LangChain, LlamaIndex, or AutoGen), evaluation systems (such as RAGAs), and observability platforms (such as Arize Phoenix, W&B Weave, or LangSmith).

  • Understanding of different agent architectures, RAG systems, and communication protocols (such as MCP or Google A2A).

  • Deep desire to solve complex engineering challenges with efficiency as a priority.

  • Ability to quickly learn and apply new technologies and libraries.

  • Self-starter with a proactive attitude, capable of working independently and effectively within a distributed team.

  • Excellent communication skills, essential for collaboration with multi-functional teams.

Ways to stand out from the crowd:

  • MS, PhD or equivalent experience in Computer Engineering, Computer Science, Data Science, or other closely related field.

  • Experience developing for GPU platforms and familiarity with NVIDIA technologies (e.g., CUDA, TensorRT, Triton, NeMo) and LLM serving frameworks (e.g., Dynamo, vLLM, SGLang).

  • Proficient in distributed systems and communication frameworks (e.g., Ray, Dask, Spark, gRPC, Kafka, nats.io).

  • Proven ability to prototype and productionize features, including deploying large-scale agentic applications with high concurrency.

  • Track record of contributing to open-source Python projects.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 8, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Arize Phoenix
AutoGen
CUDA
Dask
gRPC
Kafka
LangChain
LangSmith
LlamaIndex
LLM Frameworks
nats.io
NeMo
Python
Ragas
Ray
Spark
TensorRT
Triton
W&B Weave
