08 Jan
Sr. Site Reliability Engineer (SRE)
Vacancy expired!
- Triage, troubleshoot, and fix production problems in every layer of the stack
- Design, develop, improve, and tune logging, monitoring, and alerting
- Identify manual work, document the fix in the form of a runbook, then automate it away
- Write software to improve reliability and recoverability of production systems
- Perform and automate system administration tasks
- Participate in on-call rotation supporting production systems
- Mentor junior and mid-level members of the team
- Drive large projects from a technical perspective
- Bachelors degree in Computer Science or related field, or equivalent work experience
- 6+ years of software development experience
- 6+ years of Linux system administration experience
- 6+ years of performance engineering experience
- Strong understanding of SRE concepts and DevOps principles
- Strong understanding of microservice environments and distributed systems
- Experience with containerization and container orchestration
- Experience troubleshooting complex systems
- Experience with application performance monitoring
- Experience with relational databases and SQL
- Familiarity with front-end technologies
- Ability to clearly communicate technical concepts
- A strong desire to make things better
- Datadog
- Opsgenie
- Atlassian Suite (Jira, Confluence, BitBucket)
- Java/Spring
- Python
- JavaScript/React
- SQL
- Ansible
- Jenkins
- Tomcat
- Git
- Redis
- RabbitMQ
- Splunk/Kibana
- Terraform
Vacancy expired!