02 Jan
Azure Site Reliability Engineer
Virginia, Arlington , 22201 Arlington USA

Azure Site Reliability Engineer This company is a fast-growing organization looking for an experienced people manager to lead our Developer Platform team. Developer Platform enables both internal and partner developers to add to, or extend, the Swiftly platform. Our tech stack?is composed primarily of?Kotlin?microservices running in?Kubernetes?on Azurebut we are more interested in engineering talent that is?versatile and has the intellectual curiosity to learn new things and find the best tools to solve our problems. This company is located in the D.C. Metro Area and will remain 100% Remote.

What You Will Be Doing:
  • Design and drive consensus for the operational infrastructure of our platform team. This includes how our microservices are provisioned, how they will run in test and production, how configurations and networking is managed per environment, and how all systems are monitored, supported, and scaled in production.
  • Partner with Platform and Professional Services engineering stakeholders to identify operational needs and deliver solutions as part of the infrastructure.
  • Write, deliver, and maintain infrastructural services that can be used by directly by development teams, including documentation and support materials.
  • Establish and reinforce healthy software engineering practices - including code quality, continuous delivery, and automated testing.
  • Aggressively automate all the things and make it easy to do the right thing when it comes to security, reliability, and resource management using GitOps principles. Drive the automation from build Pipelines (Azure DevOps/ GitHub Actions), Kubernetes operators (Flux/Argo) and scaling via KEDA.
  • Mentor and force multiply other members of the engineering team.
  • Ensure the uptime and scalability or systems through best practices such as auto[1]healing and circuit breaking.
  • Be a trusted-on call support engineer as part of a rotation Required Qualifications.

Required Skills & Experience:
  • BS or MS in Computer Science / Engineering or relevant work experience.
  • 8+ years of professional experience in software development.
  • Deep experience with distributed systems and highly available backend services.
  • Demonstrated ability to be analytical and have strong problem-solving skills.
  • Excellent communication, collaboration skills and a strong teamwork ethic with both technical and non-technical audiences.
  • Experience working with Azure or Similar Cloud Environments.
  • Strong understanding of modern software engineering practices, including logging, monitoring, continuous integration/deployment, infrastructure as code and automated testing practices.
  • Proven Success operating Azure or similar cloud-native deployment and operational contexts, including Kubernetes, Pulumi/Terraform, and Open telemetry/New Relic/Datadog Preferred Qualifications.
  • Experience working in an organization that embraces modern infrastructure tools such as Kubernetes, service mesh, GitOps, and Infrastructure as Code.
  • E-commerce or Retail experience.
  • Experience designing and deploying infrastructure automation for use by multiple application development teams.
  • Experience running Kubernetes workloads in multiple cloud regions.
  • Ability to collaborate with senior staff and mentor juniors.
Applicants must be currently authorized to work in the United States on a full-time basis now and in the future. This position doesn't provide sponsorship.


Related jobs

Report job