14 Jan
Site Reliability Engineer
Michigan, Sanjose , 95134 Sanjose USA

Vacancy expired!

RESPONSIBILITIES:Kforce is immediately seeking an experienced Site Reliability Engineer for their enterprise cloud services and networking client in San Jose, California (CA).Summary:As a Site Reliability Engineer within the WebEx Meetings Platform organization, you will collaborate and help guide a global Operations team that collaborates with various Software Development groups to operate and deliver high availability cloud services to our customers. You will be accountable for ensuring 5-9's service availability, making metrics-driven operational decisions, and balancing the challenges of delivering stable, high quality, services against speed-to-market needs for rapid delivery of new features and releases.Responsibilities include the following: Frequently participate in cross functional collaboration efforts with a wide variety of internal teams, to leverage different functional expertise, and work towards common goals Help drive a paradigm shift towards a DevOps culture. Collaborate with software development groups throughout the development lifecycle, to ensure operational needs are adequately considered and baked into new software releases: Define: KPIs, health-check APIs, monitoring & alarming thresholds; Develop: Infrastructure (network, compute, storage) models; Drive toward: Automated deployments & modern approaches to configuration management Responsible for ensuring uptime & availability of cloud-based audio & video services Setup clear and accurate SLO/SLI for efficient service monitoring Actively participate in the incident management process Full-service handling (analysis, debugging, response, and resolution) of customer tickets and management escalations Engage with external telecom vendors, to evaluate and on-board new third-party circuits & servicesREQUIREMENTS: BS in Computer Science & 5 years of experience, or MS in Computer Science & 3 years of previous experience 3-5 years of development experience in any of the following languages; Java, Python, C Relevant experience in any of the following areas: DevOps, Cloud Operations, SRE, Systems Engineering, or Software Engineering; With an emphasis on creating automation software and automated processes Excellent debugging & technical problem-solving skills Experience operating large complex cloud solutions across private, public and hybrid clouds Demonstrable knowledge of, and tangible experience with the DevOps principles and concepts. Experience building CICD pipelines is a plus Experience managing and operating large voice and video collaboration services & infrastructure In-depth knowledge and experience with PSTN, VoIP technologies and relevant protocols such as SIP, H.323, RTP, etc. Knowledgeable of various packet-based media codecs such as G.711, Opus, H.264, etc. Advanced knowledge of operating systems, and good understanding of various Linux distributions, Windows, etc. You are a Site Reliability Engineer, whom is passionate about running cloud services in a multitude of mixed environments; You can be relied upon in high pressure situations to provide technical guidance and direction to assist your team members, as well as your peers within the extended team Advanced knowledge of networking technologies: TCP, UDP, IPv4, DNS, etc. Daily hands-on experience is preferredKforce is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.

Vacancy expired!


Report job