Join a new Mexico City SRE center to build reliability for payment-critical systems. Develop observability, alerts, runbooks, and automation using Python, Java, and shell scripting across on-prem and AWS environments. Troubleshoot production incidents, participate in on-call rotations, automate operational processes, manage secrets, and deliver CI/CD-driven solutions that improve MTTR and settlement reliability.
WeWork Reforma Latino (97001), Mexico, Ciudad de Mexico, Ciudad de Mexico
Principal Associate SRE
We're building a Site Reliability Engineering center in Mexico City and hiring Principal Associate SREs to join one of our founding teams. You'll work on payment-critical systems across the Discover Network, Diners Club International, and PULSE - contributing to settlement reliability, alert quality, observability, and automation that directly impact millions of transactions daily.
This is a ground-floor opportunity. You'll be part of the first cohort of engineers in CDMX, working alongside experienced SRE leaders to build the operational muscle that allows Mexico City to own reliability outcomes independently. Depending on team placement, you'll focus on one of the following areas:
- Settlement - ensuring batch settlement cycles complete accurately, on time, and in compliance with regulatory requirements across domestic credit/debit and international cross-border networks
- Alert Signal & Observability - reducing alert noise, building automated severity classification, and creating customer impact dashboards that make incident response faster and more decisive
- Reliability Automation & Platform Convergence - building automated runbooks, driving Capital One platform adoption, and developing AI-powered remediation workflows
What You'll Do
- Build and maintain reliability tooling - observability dashboards, automated alerts, runbooks, and remediation scripts that reduce toil and improve mean time to recovery
- Develop automation solutions - using Python, Java, and shell scripting to eliminate manual operational processes, from certificate rotation to compliance artifact generation
- Troubleshoot and debug complex production issues - diagnose failures across distributed systems spanning on-prem data centers and AWS, identify root causes, and implement durable fixes
- Contribute to observability - configure and tune monitoring in Datadog and Observe, build dashboards that surface actionable signals, and reduce unactionable alert volume
- Support incident response - participate in on-call rotations, respond to production incidents, drive diagnosis, and contribute to blameless postmortems
- Leverage AI tools to accelerate engineering - use agentic AI automation (Claude Code and others) to develop solutions, generate runbook drafts, and build automation agents
- Manage secrets and certificates - automate rotation and provisioning, ensuring security posture without manual toil
- Deliver through CI/CD pipelines - build, test, and deploy automation via continuous integration and API automation frameworks
What Success Looks Like
- Independently troubleshooting and resolving production issues within your domain without escalation
- At least one operational process fully automated and running in production
- Contributing measurably to team OKRs - whether that's alert noise reduction, MTTR improvement, or settlement cycle reliability
- Producing or improving runbooks and dashboards that your teammates and partner teams actively use
The Environment
You'll work across hybrid on-prem and cloud infrastructure supporting real-time and batch financial transaction systems at global scale. The tech stack includes Python, Java, shell scripting, AWS, Kubernetes, OpenShift, CI/CD pipelines, and API automation frameworks. Observability runs on Datadog and Observe with extensive dashboard configuration. Secret management uses HashiCorp Vault. You'll use agentic AI tools (Claude Code and others) to develop automation solutions and accelerate your engineering output. The systems span three on-prem data centers and AWS, with both modern cloud-native services and legacy payment platforms. Strong troubleshooting and debugging skills are essential.
Basic Qualifications
- Professional English fluency
- Bachelor's degree
- Background in SRE, production operations, or reliability engineering
- At least 4 years of experience in DevOps Engineering (internship experience does not apply)
- 4+ years of experience in at least one of the following: Java, Python, Go
- At least 2 years of experience with Cloud Native technologies (Amazon Web Services, Microsoft Azure, Google Cloud Platform)
- 2+ years of experience with container orchestration services including Docker or Kubernetes
- Experience with Shell or Bash scripting
- At least 2 years of Unix or Linux system administration experience
Preferred Qualifications
- Experience developing automation solutions using agentic AI tools (Claude Code, Copilot CLI)
- Troubleshooting and debugging skills across distributed systems
- Familiarity with payments, financial services, or other regulated high-availability domains
- Knowledge of or experience with networking concepts (TCP, DNS, TLS)
At Capital One, we respect individual differences in culture, religion, and ethnicity. Likewise, we promote equal opportunities and development for all personnel. In the hiring process, we seek to provide equal employment opportunities to candidates, regardless of race, color, religion, gender, sexual orientation, marital or civil status, national origin, disability, or any other situation protected by federal, state, or local laws.
For technical support or questions about Capital One's recruiting process, please send an email to [email protected]
Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.
Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe, any position posted in the Philippines is for Capital One Service Corp (COPSSC), and any position posted in Mexico is for Capital One Technology Labs Mexico.