13 Nov
Site Reliability Engineer
Michigan, Remote , 10001 Remote USA

Vacancy expired!

ROLE: Senior Site Reliability EngineerLOCATION: remoteDURATION: Projected 12 month contract-to-hireNOTE: ONLY W2Our notes per conversation with hiring manager: Main role is to keep the system up and running; End to end support python - scripting language, this person won't do development but would need this experience/background knowledge to work with dev teams - implementation sub solutions 2+ years in SRE 2+ years of Architect (technical architecture) open to systems engineer that moved into an SRE role/Devops automation engineer CI/CD - container technologies, kubernetes and openshift monitoring tools - splunk, grafana, prometheus, Dynatrace need enterprise level experience health care experience optional Required Tech: cloud technologies Container tech (kubernetes/openshift) Microservices (Kafka, API s) Networking technologies (F5, datatrace, palo alto) not from an implementation perspective but from a troubleshooting perspectiveJob Description:This role will be a critical member of the Site Reliability Engineering (SRE) program at Optum. The ideal candidate has multiple years of system administration or development experience, is a self-starter, innovative, and not afraid to challenge the status quo. Successful candidates have a proven ability to network and lead without direct authority. The responsible person will be part of the operations team (India) and help the team in maintaining high availability, improve monitoring, improve checkouts post deployment, troubleshoot complex infrastructure problems etcResponsibilities: Drives reliability into systems across the enterprise. Utilize scripting & development skills to reduce operational man-hours and reduce time to restore for incidents. This should include implementing practices that support Agile and Continuous Integration/Continuous Delivery (CI/CD) principles. Perform Information Technology Service Management (ITSM) tasks without supervision, and able to provide direct feedback to improve ITSM processes. Create complex architecture design models that describe enterprise solutions that both conform to enterprise standards and advocates for modern technology and best practices Provide end-to-end support and leadership of multiple platforms. Leverage modern tools and instrumentation to drive reliability and meet Service Level Objectives (SLOs) Demonstrable understanding of key networking technologies, and ability to include this understanding in troubleshooting and problem management. Utilize monitoring tools to track performance and availability of applications, and determine trends. Ability to coach others in learning this technology. Utilize log forwarding technology to troubleshoot problems and identify trends. Ability to coach others in learning this technology Encourage collaboration and cohesiveness within their team Lead teams outside their immediate organization in implementing SRE principles. Complete tasks with minimal guidance and oversight, and review work of others. Demonstrate integrity and ethical behavior by complying with applicable laws, regulations and policies, and requiring the same from others. Leverage diversity and inclusion to bring in the right talent, drive employee engagement and foster teamwork and collaboration.Required Qualifications: 4+ years of professional IT experience (system administration, software development, or a combination ), with steadily increasing responsibilities. 2+ years of experience designing and building highly distributed, reliable systems. 2+ years of experience as a Site Reliability Engineer Experience planning and supporting +99.99% availability against critical applications in production. Experience with Agile & CI/CD methodologies and enabling automation within development teams. Practical experience with one or more programming or scripting languages. Experience with one or more modern monitoring and log forwarding tools. Is proficient with modern software tools, technology, and practices. Previous utilization of agile work management tools. Excellent customer service and communication skills. Clear understanding of security best practices. Grows and maintain knowledge of and leverage cutting edge IT industry technologies and trends to support highly-available distributed systems, and the transformation of legacy systems.Assets: Undergraduate Degree or 3-5 years relevant work experience Health Care industry experience. Proficient in one or more programming or scripting languages. Previous Site Reliability Engineering experience. Mixed skill set of system administration/operations and development. Thanks & Regards,Mohan Sai|Technical RecruiterThoughtwave Software and Solutions1444 N, Farnsworth Ave Suite 302, Aurora, IL , 60505PhoneEmail:Website:www.thoughtwavesoft.comlinkedin: https://www.linkedin.com/in/mohan-badaru-112b581b9/

Vacancy expired!


Related jobs

Report job