Site Reliability Engineer (SRE) Level 3 - (Direct Hire with Client) : 2021-04-10

Irvine, California 92602

Job ID 2021-5299

Solugenix is assisting a client, a prestigious cloud-based company, in their search for a Site Reliability Engineer (SRE) Level 3. This is a direct-hire opportunity based in Irvine, CA.

The Site Reliability Engineering team at the client provides leadership, direction, and accountability for platform architecture, system design, and end-to-end implementation to meet and exceed the product's non-functional requirements, including quality, security, reliability, availability, and performance. SREs allow Product Development teams to focus on shipping products with reliable velocity.

The Site Reliability Engineer (SRE) Level 3 on our dynamic SRE team focuses on driving the SRE charter by using software engineering to enable automation and efficiency in all aspects of platform change management and operations. The main responsibilities include, but are not limited to, optimizing day-to-day activities through automation to reliably support product rollout and operation, and mentoring other SRE staff to adopt and implement the DevOps culture.

Qualifications:

7+ years of experience in infrastructure, system engineering, and QA/testing automation.
Demonstrable subject-matter expertise in testing methodology and testing automation frameworks.
Advanced level of Linux/Unix experience.
Full-stack software test engineer with advanced experience in at least two or three of the following technologies: React, Go, Git, Bitbucket, Python, SQL or NoSQL databases, Docker, Kubernetes, and monitoring tools like New Relic and Stackdriver.
Strong experience in at least 2 of the following sets of logging and monitoring tools: ELK stack, Prometheus, Stackdriver, New Relic, Datadog, Dynatrace.
Intermediate-level knowledge of software release tooling, including but not limited to Bitbucket, Jenkins, Cloud Build, Spinnaker, and GitLab.
Experience in designing, analyzing, scaling, and troubleshooting medium-scale distributed systems.
Well versed in SRE methodologies and passionate about solving operational problems through automation and software engineering.
Ability to communicate effectively vertically and horizontally within the organization via demonstrated written and verbal communication skills.
Intermediate-level knowledge of Docker technologies, including experience in optimizing Docker images and managing the Docker image lifecycle.
Experience working with Google Cloud is preferred, but experience with any other public cloud provider will be considered.
API and front-end testing automation.
Microservices lifecycle management (integration, testing, deployment).
A systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.

Responsibilities:

Identify opportunities to design, build, and implement innovative solutions to unique platform and infrastructure problems, enhancing developer workflow and production stability for the products.
Collaborate with other senior team members to evangelize the SRE mindset and system design to implement technology solutions that will maximize the performance and availability of our environment.
Design and implement orchestration and tooling solutions to ensure that repetitive administration tasks are performed efficiently and free of defects.
Design and implement monitoring and recovery tools to provide for site high availability (HA) and disaster recovery (DR).
Design and develop highly available infrastructure and platform components to meet the needs of our growing and evolving product lines.
Design and implement security engineering best practices in all our deployed platforms and environments.
Triage alerts, diagnose and resolve critical issues, and manage the implementation of changes.
Manage the coordination, documentation, and tracking of critical incidents, ensuring rapid and complete issue resolution and appropriate closed-loop communication with customers and other key stakeholders.
Develop a continuous integration/continuous deployment orchestration system to reduce friction for software delivery to production.
Evangelize the SRE mindset and mentor others on reliability and SRE best practices.
Identify opportunities for automation, signal noise reduction, and elimination of recurring issues, and work with engineering to implement them, reducing the time to mitigate service-impacting events and increasing the productivity of cloud operations and development resources.
Maintain a strong understanding of IaaS, PaaS, and SaaS offerings while building and maintaining a state-of-the-art, cloud-based environment for massive-scale data processing.
Ensure that each implementation and solution is fully documented and that each solution is deployed with fully operationalized processes to support the solution lifecycle.
Other tasks as assigned.

About the Client

This position is with one of our prestigious clients, a leading cloud-based company based in Irvine, CA.

About Solugenix

For over 50 years, Solugenix has been a global technology development and services firm with locations in California, Arizona, India, and the Dominican Republic. As a pioneer in Professional Staffing and IT Consulting, we have partnered with some of the biggest global corporations across many industries. Our history was built on a foundation of partnerships with global brands like McDonald’s, Microsoft, CIT Group, Johnson & Johnson, Herbalife, Sony Pictures Entertainment, and many others who look to Solugenix to be their trusted partner in providing professional staffing, non-IT, and IT solutions.

We live our core values in everything that we do, starting with “doing the right thing” for our employees/contractors and “committing to client success”. This is a big part of how we continue to make lists like “2019 Forbes Small Giants”. We also forge strategic partnerships with vendors and corp-to-corp candidates (C2C) that share our core values and encourage you to partner with us.

In addition to generating ground-breaking, industry-defining solutions for our clients and our own projects, we partner with clients with whom we share core values and a common professional culture to help them find talent for their valuable opportunities. At Solugenix, we invest in the personal development and growth of every individual. While this is a position with one of our esteemed clients, Solugenix will continue to invest in your personal growth and development, providing you with a successful career as well as ensuring client success.

ID	#12095018
State	California
City	Irvine
Source	Solugenix Corporation
Job type	Permanent
Salary	$120,000 - $145,000
Showed	2021-04-10
Date	2021-04-02
Deadline	2021-06-01
Category	Et cetera
