Private Health Management Logo

Private Health Management

AI Infrastructure Operations Engineer

Posted 10 Days Ago
Remote
Hiring Remotely in USA
120K-140K Annually
Mid level
Remote
Hiring Remotely in USA
120K-140K Annually
Mid level
Operate and scale an Azure-based AI platform (Companion) by maintaining AKS infrastructure, improving observability, handling incidents, ensuring security and operational hygiene, and building runbooks and processes to support production AI agent workloads and deployments.
The summary above was generated by AI

AI Infrastructure & Operations Engineer 

Location: Remote (U.S.) 
Reports To: Juan Sandoval-Tobias 

About Private Health Management 

Private Health Management (PHM) supports people with serious and complex medical conditions, helping them obtain the best possible medical care. We guide individuals and families to top specialists, advanced diagnostics, and personalized care. Trusted by healthcare providers and businesses, PHM offers independent, science-backed insights to help clients make informed decisions and access the best care. 

About the Role 

PHM is building and scaling Companion, an AI-enabled clinical platform operating in a high-trust healthcare environment where reliability, observability, and security are foundational requirements. The platform includes headless AI agents designed to support clinical and operational professionals by acting as intelligent workstations that integrate with enterprise applications and workflows. 

The AI Infrastructure & Operations Engineer will operationalize the platform so it runs reliably at production scale, helping ensure the systems behind Companion are observable, recoverable, secure, and maintainable as adoption grows. 

This role sits at the intersection of Kubernetes operations, AI platform reliability, observability engineering, and operational security. You will help evolve and maintain the Azure-based infrastructure stack while partnering closely with technology leadership, AI architects, and security stakeholders. This is a high-ownership role for someone who thrives in fast-moving environments, is comfortable operating with incomplete information, and enjoys building operational discipline around emerging AI systems. 

What You’ll Accomplish 

  • Establish operational reliability for Companion across AKS infrastructure, AI agent workloads, monitoring systems, and deployment pipelines.  
  • Build meaningful observability practices that help PHM understand platform behavior, usage trends, and operational risks before they become incidents.  
  • Create sustainable operational hygiene around patching, CVE remediation, secrets rotation, dependency management, and cloud maintenance cycles.  
  • Strengthen platform resilience, documentation, and operational processes so the environment can scale without relying on tribal knowledge.  

How You’ll Spend Your Days 

Operate and Improve Platform Reliability 

  • Monitor and maintain AKS infrastructure, AI agent workloads, deployment pipelines, and support Azure services.  
  • Investigate incidents, troubleshoot production issues, and improve platform resilience through better operational patterns and tooling.  
  • Support release operations and help ensure deployments remain stable, observable, and recoverable.  

Build Observability and Operational Insight 

  • Develop dashboards, alerts, logging patterns, and operational baselines using Azure Log Analytics and Application Insights.  
  • Identify system trends, performance bottlenecks, and emerging operational risks across infrastructure and AI workloads.  
  • Improve visibility into AI agent behavior, enterprise workflow integrations, latency patterns, and system health under real user load.  

Strengthen Security and Operational Hygiene 

  • Maintain operational cadence for dependency updates, CVE remediation, image signing, secrets rotation, and cluster patching.  
  • Support security-first infrastructure practices across Kubernetes, CI/CD pipelines, and Azure environments.  
  • Partner with security and engineering stakeholders to maintain compliance-aware operational practices in a HIPAA-regulated environment.  

Collaborate Across a Small, High-Ownership Team 

  • Work closely with technology leadership, platform engineers, security stakeholders, and AI architects to evolve the operational maturity of Companion.  
  • Contribute documentation, operational runbooks, and shared knowledge that reduce platform fragility over time.  
  • Help establish practical operational patterns for AI systems where industry best practices are still emerging.  

What You Bring to the Table 

Required 

  • Strong hands-on Kubernetes operations experience, including troubleshooting workloads, admission controllers, cluster networking, and production incidents.  
  • Experience supporting cloud-native infrastructure in Azure environments, particularly AKS and related operational tooling.  
  • Demonstrated strength in monitoring, observability, and incident response using structured logging and metrics platforms.  
  • SRE mindset with experience handling on-call responsibilities, operational prioritization, and post-incident analysis.  
  • Comfort operating in fast-moving environments with incomplete documentation, evolving processes, and broad ownership areas.  
  • Strong communication and collaboration skills with the ability to explain technical issues clearly across technical and non-technical audiences.  

Nice to Have 

  • Experience with CI/CD pipeline tooling including GitHub Actions, Kaniko, cosign, image signing, or Actions Runner Controller.  
  • Familiarity with Infrastructure as Code practices using Bicep or Azure resource automation tooling.  
  • Exposure to HIPAA, SOC2, or other compliance-aware operational environments.  
  • Experience supporting AI or LLM-backed applications in production environments.  

Compensation 

The target base salary for this position is $120000 - $140000 

This base salary is only a part of a total compensation package that also includes health/dental/vision benefits, annual cash incentive program, 401k with match, flexible PTO, PHM for PHM — our services for you and your dependents — and other benefits. Individual pay may vary from the target range as several factors including market forces, experience, location, disparities in market data, and other relevant business considerations may all factor into final compensation. 

Location 

This is a remote role requiring that you live in and physically perform all work in the United States. 

Next Steps 

Private Health Management is a remote company with employees around the United States. We’re committed to providing a thoughtful, transparent interview experience and meaningful opportunities to get to know our company, mission, and wonderful teammates through fully remote interviews. 

If your application is selected for interviews, you’ll hear from a member of our recruiting team to schedule next steps. Interviews will also include the hiring manager, peers, and often an executive from the department. 

PHM uses AI-enabled tools at certain points in the recruiting process to help identify and evaluate top talent; however, all hiring decisions are made by human reviewers. 

Have a quick question about the role? Email [email protected] or simply apply here. 

Anticipated Pay Range
$120,000$140,000 USD

Similar Jobs

9 Minutes Ago
Remote
United States
164K-216K Annually
Expert/Leader
164K-216K Annually
Expert/Leader
Artificial Intelligence • HR Tech • Information Technology • Software • Business Intelligence
Lead and grow a global partner marketing team to design and execute partner programs for resellers, GSIs, and technology partners. Drive partner-ready demand generation, joint solution messaging, collateral development, partner enablement and certification, and manage global MDF processes. Collaborate with field marketing, channel management, product marketing, and sales to accelerate partner-sourced pipeline and global adoption.
16 Minutes Ago
Remote or Hybrid
United States
182K-227K Annually
Senior level
182K-227K Annually
Senior level
Healthtech • Information Technology • Security • Software • Cybersecurity
Senior Solutions Engineer provides pre-sales technical leadership: discovery, customized demos, and proof-of-concept evaluations; authors SOWs, responds to RFI/RFPs, enables channel partners, and transitions accounts to services. Works with sales teams to meet territory revenue goals and requires up to 50% travel.
Top Skills: Active DirectoryAzureAzure Virtual DesktopCitrixDnsEnterprise Saas SecurityEntra IdGCPHTTPIdentity ManagementLdapMicrosoft Terminal ServicesWindowsMtireNistSingle Sign-OnSmtpVmware ViewVmware Workstation
19 Minutes Ago
Remote or Hybrid
Texas, USA
109K-184K Annually
Mid level
109K-184K Annually
Mid level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Software Sales Executive will sell Identity Security Solutions, exceed revenue goals, engage with customers, and collaborate with partners while ensuring excellent customer service and account management.
Top Skills: Salesforce

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account