14 Mar
Engineer - Service Reliability Engineering (SRE) (100% Remote)
California, Mountainviewca 00000 Mountainviewca USA

Vacancy expired!

Role: Service Reliability Engineer (SRE) Location: Remote/Mountain View, CA Duration: Fulltime Visa: Any visa is fine including transfers. Responsibilities:

  • Implement tools and processes necessary to achieve required SLOs for Client's Platform.
  • Define and implement CI/CD pipelines.
  • Automate delivery of platform services using infrastructure-as-a-code. Build self-service playbooks for platform which can be consumed across globally distributed teams at Client's.
  • Define and implement incident response management process, deploy necessary tools.
  • Fix support and escalation issues.
  • Conduct post-incident reviews.
  • Collaborate with application and business stakeholders to ensure high-quality product is developed and deployed in production. Work diligently with other engineering teams to ratify release processes necessary to meet business goals.
  • Drive continuous improvement process.
Required Knowledge and Skills:
  • Expert knowledge of one of the major public cloud platforms (Azure, AWS, Google Cloud Platform).
  • Hands-on programming experience in Python or other object-oriented programming languages.
  • Expert knowledge of Infrastructure and Application Monitoring tools: Prometheus, Grafana, DataDog, etc.
  • Experience implementing IaC concepts using Terraform, Chef, Puppet.
  • Experience with Elasticsearch, Kibana.
  • Experience administering Databases.
  • Expert in Linux administration.
  • Expert knowledge of Docker, Helm.
  • Experience implementing CI/CD for cloud native applications.
  • Experience with deploying applications that utilize Service Mesh.
  • Experience administering Kubernetes clusters.
  • Experience defining and implementing incident response management processes.
Basic Requirements:
  • Bachelor's degree preferred; may consider relevant experience in lieu of a degree.
  • 8+ years' experience in software engineering with a degree; 12+ years' experience in software engineering in lieu of a degree.
Preferred Knowledge and Skills:
  • Master's degree
  • Understanding of GitOps principals.
  • Experience implementing secure and compliant Kubernetes platforms.
  • Experience deploying and managing stateful distributed service in Kubernetes.
  • Experience with security scanning tools.
  • Experience with intrusion detection systems.
  • Experience with various messaging systems, such as Kafka or RabbitMQ.
  • Working knowledge of Databricks, Team Foundation Server, TeamCity, Octopus deploys and DataDog.
Work Conditions:
  • Corporate office/lab environment.
  • Ability to travel 10% of the time.

    Lokesh Gurgela

    NexGen IOT Solutions, LLC

    Email: lokesh(at)nexgeniots(dot)com

    Vacancy expired!


    Report job