Apollo GraphQL Logo

Apollo GraphQL

Senior Software Engineer, AI Runtime

Posted 8 Hours Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
Seeking a Senior Software Engineer to enhance AI workflows with scalable server architecture, develop multi-agent systems, and ensure high performance and reliability.
The summary above was generated by AI

We’re seeking a Senior Software Engineer to help power the future of agentic AI workflows. You’ll take our MCP Server to the next level, turning it into an enterprise-grade service that lets diverse tools and systems be exposed effortlessly to AI agents. Looking ahead, you’ll also help architect the MCP Gateway—a new layer that will route requests across tools, enforce policies, and provide the runtime foundation for scalable multi-agent systems. Along the way, you’ll tackle challenges in scalability, performance, and developer experience to ensure our platform feels seamless, powerful, and enterprise-ready.

🚀 About the Team

The Graph DX AI Runtime Team builds and maintains the MCP Server and Gateway—the backbone of agent-to-tool communication and the routing layer that keeps everything flowing. We make it simple for developers to wire up agents, orchestrate workflows, and scale interactions reliably. Our focus is on speed, security, and seamless integration, so teams can spend less time managing infrastructure and more time building intelligent experiences.

🔧 What You'll Do
  • Scale an enterprise AI/MCP Server and Gateway that powers multi-agent workflows across Apollo, including routing, orchestration, and integration boundaries.

  • Implement robust server infrastructure to ensure reliability, performance, and security at scale.

  • Build and maintain tools for agent discovery, communication, and coordination.

  • Define deployment strategies and runtime optimizations to maximize efficiency and minimize operational overhead.

  • Develop frameworks and patterns that enable seamless multi-agent collaboration and AI-driven orchestration.

  • Integrate observability, logging, and monitoring for full visibility into server and agent behavior.

  • Explore and implement AI-enhanced developer workflows to optimize orchestration and agent interactions.

  • Collaborate with teams within our org to ensure the MCP Server meets evolving product and developer needs.

🧠 Technical Challenges You’ll Tackle
  • Build and scale the MCP Gateway—Apollo’s routing layer for agentic workflows—ensuring tools and services can be discovered, invoked, and orchestrated reliably across diverse environments.

  • Design and implement high-performance routing infrastructure with reliability, scalability, and security at its core.

  • Build and maintain routing patterns and coordination mechanisms that let agents interact with the right tools at the right time.

  • Define deployment strategies and runtime optimizations to minimize latency and operational overhead.

  • Explore and implement AI-driven routing strategies to optimize context retrieval, reduce cost, and improve decision accuracy.

  • Collaborate with teams across Apollo to ensure the MCP Server and Gateway integrates seamlessly with Apollo’s control plane for AI tools.

  • Integrate observability and monitoring into the routing layer to provide full visibility into traffic flows, tool availability, and agent interactions.

✅ What We’re Looking For

Required Skills

  • Expertise in agent-to-tool orchestration, routing, and coordination in scalable, fault-tolerant systems.

  • Deep expertise in Rust programming language.

  • Strong background in distributed systems, server architecture, and high-performance backend development.

  • Proven experience with protocol design, message routing, and server-side orchestration frameworks.

  • Experience building and maintaining robust runtime infrastructure that supports AI-driven workflows and enables reliable agent-to-tool interactions.

  • Proven experience with protocol design, message routing, and building server-side frameworks that enable scalable, reliable multi-tool agent workflows.

  • Hands-on experience with observability, monitoring, and debugging frameworks for complex systems.

  • Passion for clean, maintainable code, high system reliability, and scalable architecture.

  • Experience in strategic system design, making architectural trade-offs, and planning for long-term scalability and maintainability.

  • Strong technical leadership and mentorship, including guiding junior engineers and driving engineering best practices across teams.

  • Ability to influence cross-team architecture decisions and align engineering efforts with product and business objectives.

  • Production ownership experience: leading incident response, debugging, and performance optimization in high-impact backend systems.

Bonus

  • Exposure to AI/ML-enabled developer tooling or autonomous system orchestration.

  • Familiarity with cloud-native architectures, containerization, or orchestration frameworks.

  • Experience with performance optimization and cost-efficient scaling of high-throughput distributed systems.

Top Skills

Rust

Similar Jobs

An Hour Ago
Remote
United States
240K-300K Annually
Expert/Leader
240K-300K Annually
Expert/Leader
Fintech • Financial Services
Lead the data science team in developing and deploying advanced machine learning models to improve decision-making and efficiency. Collaborate cross-functionally while mentoring staff and managing the data science function.
Top Skills: ArizeAWSDatabricksGitMetaflowPythonSagemakerSnowflakeSQLTaktileTecton
An Hour Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
111K-193K Annually
Senior level
111K-193K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The Senior Customer Lifecycle Marketing Manager will design and implement lifecycle campaigns, optimize customer journeys, and drive cross-functional alignment to improve customer engagement and retention.
Top Skills: B2B SaasCSSHTMLIterableMarketing Automation
An Hour Ago
Easy Apply
Remote or Hybrid
US
Easy Apply
104K-144K Annually
Mid level
104K-144K Annually
Mid level
Fintech • Machine Learning • Mobile • Security • Software
Oversee training governance systems, ensuring quality training delivery across BPO partners, while collaborating with teams to drive continuous improvement and excellence in training implementation.
Top Skills: Learning Management Systems (Lms)Reporting Tools

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account