15 Feb
Production Support and Site Reliability Engineering Manager
Plano, Texas, USA

Vacancy expired!

Plano 1 (31061), United States of America, Plano, Texas

At Capital One, we’re building a leading information-based technology company. Still founder-led by Chairman and Chief Executive Officer Richard Fairbank, Capital One is on a mission to help our customers succeed by bringing ingenuity, simplicity, and humanity to banking. We measure our efforts by the success our customers enjoy and the advocacy they exhibit. We are succeeding because they are succeeding.

Guided by our shared values, we thrive in an environment where collaboration and openness are valued. We believe that innovation is powered by perspective and that teamwork and respect for each other lead to superior results. We elevate each other and obsess about doing the right thing. Our associates serve with humility and a deep respect for their responsibility in helping our customers achieve their goals and realize their dreams. Together, we are on a quest to change banking for good.

Production Support and Site Reliability Engineering Manager

At Capital One, Production Support and Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to manage and support large-scale, massively distributed, fault-tolerant systems hosted in the external cloud environment.

The SRE Engineering Manager ensures that Capital One financial services (both our internally critical and our externally visible systems) have reliability and uptime appropriate to customers' needs and a fast rate of improvement, while keeping an ever-watchful eye on availability, capacity, and performance in a 24/7 environment.

SRE engineers will be engaged in automation and development work: reducing toil, developing self-service capabilities, automating manual tasks, and developing support tools using common scripting languages (e.g., Python).

When incidents do occur, the SRE Engineering Manager is responsible for taking ownership and for consulting, engaging, and partnering with Lines of Business to lead the team toward successful resolution, as well as conducting problem management activities and implementing prevention strategies.

Basic Qualifications:

Bachelor's Degree

At least 3 years of experience in designing, analyzing, and troubleshooting distributed systems (end-to-end troubleshooting of 3-tier applications)

At least 2 years of experience with Unix or Linux operating systems.

At least 2 years of infrastructure experience with networks, load balancers, firewalls, and web application firewalls (WAF)

At least 2 years of experience with scripting languages to debug, optimize code, and automate routine tasks

At least 2 years of experience using and supporting external cloud environments (AWS, Azure, or GCP)

At least 2 years of experience with enterprise monitoring (Splunk, Datadog, PagerDuty, or New Relic)

Preferred Qualifications:

Master's Degree

3+ years of software development

3+ years of automation experience

3+ years of DevOps or SRE experience

At this time, Capital One will not sponsor a new applicant for employment authorization for this position.
