Senior Infrastructure Engineer
SmarterTravel is a fast-growing startup with a mission to operate the best messenger-focused travel concierge product in the world, giving users the personalization, recommendations, and discounts of a personal travel agent along with the ease of use and accessibility of an online site.
The Shared Infrastructure team's mission is to deliver infrastructure solutions that meet the needs of the software engineering and analytics teams across SmarterTravel. Members of this team use a combination of software engineering and DevOps skill sets. These skills are leveraged to build, deploy, and maintain our physical and virtual hardware and a core set of services used by many teams. These services include functions like data warehousing, interacting with data storage, monitoring, alerting, logging, and instrumentation.
The Shared Infrastructure team at SmarterTravel takes an Infrastructure as Code approach to operations, using tools like Terraform, Ansible, Helm, and Kubernetes to build and modify our infrastructure. An ideal candidate will be able to align day-to-day activities with business objectives to empower other teams to operate with efficiency and confidence in a fast-paced, ever-evolving environment with aggressive business and revenue goals.
What you'll do:
- Participate in an on-call rotation to provide ongoing support and monitoring for all critical infrastructure
- Continually review existing infrastructure and find ways to improve scalability, availability, performance, and efficiency
- Maintain and support our Kubernetes clusters and AWS services, providing assistance and training to other engineering teams in deploying their projects
- Maintain and improve our CI/CD pipelines (primarily GitHub Actions) for various microservices and internal tools
- Leverage tools like Grafana, Loki, Prometheus, and Nagios to monitor systems and services
- Audit existing security protocols and propose improvements
- Share DevOps knowledge with other members of the team via pairing, presentations, demos, etc.
- Strike a balance between pragmatic, timely short-term solutions and technical debt avoidance
- Foster a healthy agile development environment both inside and outside the team
- Implement Infrastructure as Code using Terraform/Terragrunt
Ideal candidates will have:
- 3+ years of relevant work experience
- The ability to plan a process and execute it day to day, with an emphasis on achieving measurable results at the organization level
- Experience acting as a mentor and technical leader helping other engineers grow
- An entrepreneurial spirit and excitement about improving the company's bottom line and building its value
- The ability to identify gaps in process and work with the team and stakeholders to implement new processes or tooling
- Some experience with Kubernetes, Terraform, Helm, AWS, GitHub Actions, and Python (experience with every technology used in our environment is not strictly required)
Bonus points if you have any of the following degrees or certifications:
- B.A./B.S. in Computer Science or a related field
- AWS Certified DevOps Engineer
- AWS Certified Solutions Architect
- AWS Certified Database - Specialty or similar DBA certification
- AWS Certified Security - Specialty
- CNCF Certified Kubernetes Administrator (CKA)
- CNCF Certified Kubernetes Security Specialist (CKS)