WHO WE ARE
Apex Fintech Solutions (AFS) powers innovation and the future of digital wealth management by processing millions of transactions daily, to simplify, automate, and facilitate access to financial markets for all. Our robust suite of fintech solutions enables us to support clients such as Stash, Betterment, SoFi, and Webull, and more than 20 million of our clients' customers.
Collectively, AFS creates an environment in which companies with the biggest ideas in fintech are empowered to change the world. As a global organization, we have offices in Austin, Dallas, Chicago, New York, Portland, Belfast, and Manila.
If you are seeking a fast-paced and entrepreneurial environment where you'll have the opportunity to make an immediate impact, and you have the guts to change everything, this is the place for you.
AFS has received a number of prestigious industry awards, including:
2021, 2020, 2019, and 2018 Best Wealth Management Company - presented by Fintech Breakthrough Awards
2021 Most Innovative Companies - presented by Fast Company
2021 Best API & Best Trading Technology - presented by Global Fintech Awards
ABOUT THIS ROLE
The Incident Practice Manager is a critical member of our Platform organization, specializing in both rapid incident response and proactive measures to ensure service reliability and minimize downtime. This role focuses on processes and practices of handling service disruptions, effective incident handling, and continuously improving incident processes to maintain a high standard of service availability and resilience.
Duties/Responsibilities
Conduct ongoing training and education sessions; Ensure firm-wide understanding and adherence to the Incident Management processes and procedures
Utilize and improve automation tools and scripts to streamline incident response and remediation, ensuring rapid and consistent handling of common problems.
Lead practices for how to conduct effective RCA investigations; Guide teams on how to identify underlying or systemic causes, and document lessons learned to prevent recurrence.
Implement incident response practices such as game-day exercise or chaos engineering to test and address our response to potential failures before they affect production
Assess when and how to escalate incidents, ensuring the appropriate responders, teams, owners and stakeholders and are engaged for effective resolution.
Facilitate comprehensive post-incident reviews, compiling insights and recommendations for process, tooling, or systems improvements.
Develop, document, and regularly update incident management procedures and playbooks to support continuous improvement and onboarding of new staff.
Education and/or Experience
Bachelor's degree in Computer Science, Information Systems, or a related field; advanced degree is a plus
5+ years' experience leading incident investigations, root cause analysis and remediations
2+ years of experience as a people manager
Experience as an SRE or working extensively in an SRE environment preferred
Proven experience with relevant automation tools (e.g., scripting, runbooks, orchestration platforms) used in SRE and incident response.
Extensive experience with ITSM, ITOM (e.g., ServiceNow, Jira Service Management) and Observability tools (PagerDuty, OpsGenie, Datadog, Prometheus, Grafana, etc.) preferred
Certification or training in incident management or IT service management (e.g., ITIL, ICS) preferred
Required Skills/Abilities
Deep understanding of cloud systems, microservices, distributed systems, and the technical landscape of the organization.
Exceptional analytical skills with the ability to quickly diagnose and resolve complex technical issues in high-pressure environments.
Excellent verbal and written communication skills, enabling you to inform and coordinate across technical and non-technical stakeholders.
Hands-on familiarity with incident management methodologies and frameworks, such as the Incident Command System (ICS).
Strong knowledge of monitoring, logging, and tracing best practices to measure and interpret system behavior and performance.
Advanced cybersecurity knowledge preferred
Familiarity with chaos engineering, SLO/SLA management, and the principles of site reliability engineering preferred
Work Environment
This job operates in a hybrid, office environment 3 days per week
#IT #mid-senior #full-time #LI-SD1 #APEX
Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.
Our Rewards
We offer a robust package of employee perks and benefits, including healthcare benefits (medical, dental and vision, EAP), competitive PTO, 401k match, parental leave, and HSA contribution match. We also provide our employees with a paid subscription to the Calm app and offer generous external learning and tuition reimbursement benefits. At AFS, we offer a hybrid work schedule for most roles that allows employees to have the flexibility of working from home and one of our primary offices.
EEO Statement
Apex Fintech Solutions is an equal opportunity employer that does not discriminate on the basis of race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, veteran status, marital status, or any other protected characteristic. Our hiring practices ensure that all qualified applicants receive fair consideration without regard to these characteristics.
Disability Statement
Apex Fintech Solutions is committed to creating an inclusive and accessible workplace for all candidates, including those with disabilities. We are dedicated to ensuring equal employment opportunities and providing reasonable accommodations to qualified individuals with disabilities. If you require reasonable accommodations to participate in the application or interview process, please submit your request via the Candidate Accommodation Requests Form. We will work with you to provide the necessary accommodations to ensure your full participation in our hiring process.
Top Skills
Similar Jobs at Apex Fintech Solutions
What you need to know about the Boston Tech Scene
Key Facts About Boston Tech
- Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
- Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
- Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
- Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories