Lead Cloud Site Reliability Engineer
Vacancy expired!
At Northwestern Mutual, we are strong, innovative and growing. We invest in our people. We care and make a positive difference. Northwestern Mutual is looking for a talented engineer to join our growing Cloud Platform team. The Cloud Platform team develops, maintains, loves, and appreciates all things cloud and containers. We enable hundreds of developers to harness the power of Kubernetes and the cloud to deploy world-class apps and infrastructure on their own, thousands of times a day. This is a fun, fast-paced experience that exposes you to cutting-edge cloud technologies and encourages personal learning and development.

What's the role?
As a Site Reliability Engineer, you will be responsible for leading efforts to implement stability and observability improvements to our Kubernetes container and cloud platforms. Key to the role will be your ability to mentor and educate other engineers and establish strong relationships with application development customers. You will be focused on SLI development, automation, toil elimination, incident response, root cause analysis, and monitoring enhancements. You should have the aptitude and enthusiasm for building and servicing highly distributed, scalable, and mission-critical systems. You should have a passion for automation and creating self-service mechanisms for customers.

Experience/Skills:
- Bachelor's Degree or equivalent experience
- 6+ years of experience with networking, Linux-based platforms, and modern programming and scripting languages (Python, Go, JavaScript)
- 6+ years of experience with performance tuning and operations of application stacks, OSs, DBs, etc.
- 3+ years of experience with AWS Cloud Services (AWS Certified Preferred), containerized applications, and container orchestration (Docker, Kubernetes - CKA Preferred)
- 3+ years of experience in DevOps or SRE roles
- Strong experience with monitoring and performance management/tuning of systems
- Strong experience with Prometheus, Dynatrace, New Relic, or other APM solutions with a focus on observability and alerting.
- Experience with Infrastructure-as-Code frameworks (Terraform, CloudFormation)
- Experience working with DevOps, CI/CD, GitOps, and Agile methodologies.
- Experience with CI/CD pipelines and automation, and with applying them using services such as GitLab CI, Jenkins, CodePipeline, or CircleCI.
- Strong written and verbal communication skills.
- Problem solver who enjoys learning on the job and thinking outside of the box.