Performance and Monitoring Engineer, Senior
Vacancy expired!
This role will be responsible for the monitoring, analysis, troubleshooting and reporting for Rooms To Go's overall operational performance. This includes but not limited to Infrastructure, Application, Network and Security. This individual will take an active role in driving performance enhancements, and leading targeted process improvement initiatives.The role defines the metrics, data collection methods, and reporting mechanisms as well as implementation of an overall performance management strategy. Ensures the effective capture of all logging and monitoring of all aspects of system and application behavior to facilitate fast detection and resolution of issues. This role will be the SME in troubleshooting all performance issue across the Enterprise. This role will work closely with IT, Application Development, Project Management and external vendors ensuring the consistent tracking and reporting of metrics and performance data across the Enterprise. This role will also be the SME in measuring and reporting the financial performance of the Enterprise in the Cloud, by supporting cost transparency efforts, and helping to develop mature cost metrics and benchmarking. The individual should possess a passion for operational excellence and a high level of interpersonal and soft skills necessary to work collaboratively across the Rooms To Go Enterprise.
This Role Offers: · Industry-leading, paid training· Comprehensive benefits & perks package including 401k + company match, vision, dental, health and life insurance, disability coverage, vacation, holiday pay, award winning wellness & fitness programs, employee discounts on furniture, and more!What you'll be doing: · Define and maintain IT's performance monitoring and reporting strategy (processes, tools, & templates); develop enhanced reporting capabilities through standardization and automation. · Collect, consolidate and validate performance data for inclusion in IT and business communications, including weekly reports, monthly scorecards, and executive presentations · Proactively analyze trends in performance across IT; collaborate with process owners and stakeholders to identify and implement process improvements to increase operational efficiency and customer satisfaction. · Analyze and recommend performance improvements for Rooms To Go Enterprise for capacity, availability, performance, support and security. · Participate in efforts to improve cost transparency; develop more robust cost metrics and benchmarking capabilities; assist in efforts supporting IT cost leadership. · Stays informed of production changes that could affect functionality and alerting. · Ability to coordinate across teams, working closely with peers to ensure the appropriate focus and sense of urgency is applied to all issues · Troubleshooting using logs, alerts and external data sources to determine network, application, or security issues. The ability to corelate data to determine root cause. · Network monitoring and management which includes discovering network components and software, analyzing network traffic, monitoring network equipment for indications of network congestion, faulty network interfaces, and faulty transmission media. The role will also include responsibility for security operations on the network including detecting, monitoring, analyzing threats and reporting security incidents. · Work with application architects to improve scalability and performance, and MBTF. · Stays informed of emerging cloud technologies and evaluates the value to Rooms To Go operations. · Accurately troubleshoots, reproduces, and documents issues and other pertinent information in Incident or Problem tickets. · Handles incident queue and performs various tasks as assigned and determines business impact. · Handles ad hoc requests and take on new procedures as required. · When working on projects, identify and track project issues and dependencies, ensure follow-through, and appropriate actions are taken to complete project on time.- Recommend, implement and manage cloud Automation using native Cloud tools.
- Design and implement load testing and application performance monitoring for applications
- Lead troubleshooting calls for performance issues.
- Provide runbooks for other departments to execute.
- Recommend ideas to streamline operations, improve operations, create processes to proactively determine potential issues.
- Provide training and mentoring of other engineers.
- Drive overall improvement of operation results for Infrastructure, Application, Network and Security, due to application of application performance monitoring tools and techniques.
- Drive overall cost management of infrastructure in the Cloud.
- Drive overall improvement in business and operational results due to increases in speed and uniformity of automated service creation.
- Bachelor's degree in computer science or information systems (Master's Degree preferred) or an equivalent combination of education, work experience and/or applicable certifications.A minimum of five years of experience related to Performance analysis and monitoring across multiple areas including Infrastructure, Application, Network and Security for medium to large scale companies.
- Expert knowledge of IT performance metrics. Experience with data management, report design, data visualization and presentation techniques
- Experience with one or more Cloud platforms; Microsoft Azure, Amazon Web Service (AWS), Google Cloud or IBM Cloud as it relates to performance, monitoring and cost management.
- Expert experience with Application and Network Performance Management Tools
- Experience with Network or Security Operations Center.
- Experience with BI reporting tools such as MS PowerBI
- Familiarity with financial data.
- Candidate must have expert knowledge of at least one scripting language such as JavaScript, bash, or PowerShell.
- Expert experience with networking fundamentals, including TCP/IP, UDP, DNS, DHCP, VLANS, routing.
- Expert experience with networking elements such as load balancers, proxies, routers and switches.
- Network Security elements including intrusion protection systems, anti-virus, proxies, and firewalls.
- Datacenter fundamentals, server hardware systems, KVM, UPS.
- Candidate should have advanced knowledge of troubleshooting performance issues with complex large-scale multi-tier and distributed application infrastructures.
- Superior analytical and problem-solving skills.
- Self-motivator with the ability to work effectively with minimal supervision.
- Excellent written and verbal communication.
- Strong organizational skills and attention to detail.
- Ability to lead using informal authority to drive organizational objectives.
- Ability to work with diverse, cross-functional teams, including external vendors.
- Time management in relation to assigned projects.
- Demonstrates commitment to RTG and its customers.
- Takes personal responsibility for words and actions.
- Maintains consistency between words and actions.
- Acts in compliance with department, company, and industry standards.
- Holds self and others accountable.
- Demonstrates drive to excel.
- Exhibits mature self-confidence.
- Achieve Results
- Exhibits customer service orientation.
- Demonstrates flexibility and change.
- Demonstrates analytical thinking.
- Demonstrates conceptual thinking.
- Exhibits teamwork and collaboration.
- Communicates effectively.
- Understands and influences others.
- Communicating and Influencing.
- After-hours or weekend work will be required.
- Employee will be responsible for performance of infrastructure and will be required to be accessible via cell phone outside of normal business hours.
- The physical demands and characteristics of the work environment described here are representative of those occurring in the performance of the essential functions of this job.
- Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
- While performing the essential functions of this job, the employee is frequently required to stand; walk; sit; use hands to finger, handle, or feel objects, tools, or controls; and talk or hear. The employee is occasionally required to reach with hands and arms and stoop, kneel, crouch, or crawl.
- The employee must occasionally lift and/or move up to 20 pounds. Specific vision abilities required by this job include close vision, distance vision, peripheral vision, depth perception, and the ability to adjust focus.
- This position works in an office, and the noise level in the work environment is usually low to moderate. While performing the duties of this job, the employee is occasionally exposed to toxic or caustic chemicals, i.e. copier toner.
- ?This position description generally describes the principle functions of the position and the level of knowledge and skills typically required. It does not constitute an employment agreement between the employer and employee, and it is subject to change as the needs of the employer and the requirements of the job change.
Vacancy expired!