27 Jan
Systems Reliability Engineer
Vacancy expired!
One of our clients in Los Angeles is looking for a Systems Reliability Engineer with the following skills and experience:
Description/Comment:We're looking for engineers who love learning new technologies at a rapid pace. You should be intimately familiar with large scale data center infrastructure as well as public cloud environments. Lean, agile, self-sufficient teams is how we operate. We value a cloud first approach where we develop infrastructure-as-code.Basic Qualifications:- Evaluate, architect, and build high-quality systems
- Lead the design and deployment of public cloud services across engineering organizations
- Design systems promoting rapid development, high availability, and clear observability
- Improve the reliability and operability of public and private cloud services
- Collaborate on tool creation and selection, leverage open source, and automate tasks with resiliency and repeatability
- Serve as an escalation resource to operations groups for troubleshooting and optimization
- Lead and collaborate with engineers to ensure services are designed to be cloud-native (where applicable), scalable, and easily operated
- Identify & evangelize new technologies, patterns, solutions, and best practices
- Lead in the creation of self-healing infrastructure-as-Code. Automate everything
- Ensure security best practice is embedded in the DNA of all designs
- Influence the adoption of automation and code-driven solution design in the media systems spaceTechnical Requirements
- Adept at leveraging wide variety of modern public and private cloud service provider resources
- Experience and ability to differentiate service offerings from three major public cloud service providers, AWS, Azure, and Google
- Strong experience with Infrastructure as Code technologies (e.g. Terraform, Ansible, CloudFormation)
- Skilled in PaaS/IaaS/SaaS offerings and their use by developers, studio and commercial applications
- Experience with cloud service storage offerings (e.g. S3, Glacier, Nearline, Blob)
- Strong experience moving and storing large data sets in cloud service providers
- Strong experience architecting and deploying with both in-house and third-party development teams
- Strong experience with a wide variety of automation technologies and techniques
- Experience working in media production environments
- Experience with thin client and virtualization technologies (e.g. Teradici, VDI)
- Experience writing software on, or operating, Linux platforms
- Expertise in multiple scripting languages (e.g. Python, GO, Ruby, or Swift), with ability to build test coverage for all code being developed
- Strong written and verbal communication skills
- Strong knowledge in system configuration management languages (e.g. Ansible, Chef, Puppet)
- Experience with operating systems and systems management (e.g. RHEL/CentOS, Amazon Linux, Ubuntu, Windows)
- Expertise in Software Development Continuous Integration (CI) Pipeline knowledge (e.g. Jenkins, Gitlab CI) and Source Control Management (e.g. Git)
- Expertise with Development and Container Platforms (e.g. Docker, Kubernetes, EKS, GKE, AKS, Openshift, Fargate)
- Knowledge of Local and Networked File Systems (e.g. SMB, SMBv3, NFS, CIFS)
- Experience developing data center, network, and application architectures
- Deployment experience with high performance data storage solutions (e.g SAN, NAS, or Clustered)
- Experience with secure high speed transport solutions (e.g. Aspera, Signiant)
- Systems Security (e.g. key management, encryption, vulnerability management)
Vacancy expired!