Site Reliability Engineer
Sorry, this job was removed at 7:23 p.m. (EST) on Monday, January 21, 2019
By clicking Apply Now you agree to share your profile information with the hiring company.
Site Reliability Engineer
Waltham, MA, Boston, MA, Boulder, CO or Remote
11 AM - 7 PM Eastern Time
We’re looking for a Site Reliability Engineer who will perform operations and development support for our cloud product line. This individual will work with development and have responsibility for the health of our services.
If you are:
- Ready for your next challenge
- Experienced as operations engineer
- Able to maintain the delicate balance between quality, speed, user experience and customer expectations in a 24x7 operations environment
- Apt to take the stairs… and questioning the efficiency of doing so three times a week
Then you’re exactly the person we need. Join us in the battle to secure the world’s intellectual property.
What You’ll Do
- Share responsibility for health, scalability and availability of our cloud services.
- Define scope and acceptance criteria for automation.
- Participate in on-call rotation for production issues.
- Work with the team to ensure cloud architecture meets scalability, availability and cost requirements.
- Follow good operational practices such as use of playbooks, and upkeep of documentation.
What You’ll Bring
- S. in Computer Science or related fields or commensurate experience
- 2-5 years of experience managing cloud infrastructure, in a 24x7 uptime environment
- Minimum of 2 years of experience with technical operations and software development support that worked on enterprise scale, mission critical, highly available Linux systems
- Experience with configuration management tools and cloud management automation, e.g. Cloud Formation, Salt Stack is a plus
- Working knowledge of monitoring such as Splunk, CloudWatch, and Grafana
- Good understanding of web services, databases and related infrastructure
- Solid understanding of backup/restore best practices in the cloud
- Ability to manage using a modern scripting language
- Excellent troubleshooting skills, for when the playbook just doesn’t cover it
- Thrive in a fast-paced, results driven environment
- Ability to work independently and take specific instruction, switching as required
- Knowledge of AWS APIs a plus
- Security Experience a plus
Read Full Job Description