Enterprise Monitoring Transformation Lead

Job Description: The role will require influencing the full lifecycle of technology development, from design through build up to and beyond onboarding into operations, and optimization thereafter. This will be accomplished in conjunction with Monitoring Architects, Tools Engineering, Automation Teams and Operations teams, to consider monitoring as a key early requirement, with a service health and quality based outcome, driving for and delivering on standardized and automated monitoring onboarding, and collaborating with process experts to be compliant with relevant standards. The goal of the Monitoring Transformation lead is to standardize and enable the ability to reliably and consistently deliver applications and infrastructure that are operationally ready, in order to minimize significant service disruptions, with a minimum bar of reducing mean-time-to-repair, but striving to deliver the capability to predict and prevent service impact to the customer. Developing and delivering a standardized automated workflow to self-serve the selection and instantiation of standard monitoring capabilities, through loosely coupled standard tools, will be a key deliverable. When business needs dictate custom solutions, a thoughtful process must be established to capture the risk vs. reward justifications, and ensure that due diligence is done to limit proliferation of similar custom solutions. A time-to-market driven custom solution must have a roadmap to comply with and deliver on one of the above approaches to control custom sprawl. In doing so, the individual in this role will implement the vision of a common/shared self-service and automated by default and custom by exception monitoring onboarding process for CTO Service Operations. Main Responsibilities Identify monitoring capability gaps against operational requirements, contributes to and influences strategies for Monitoring across all platforms. Influence architectural strategy for monitoring in collaboration with service owners, business owners, performance engineering and operations teams Analyzes and helps bridge the gap between current monitoring to target state. Develop and manage strategic programs that drive monitoring solutions adoption and migration activity. Collaborate with shared tools engineering, L1/2/3 support leadership, and line of business application operators to evolve the support model. Plans and directs tasks in a measurable way that aligns short term goals and long term initiatives. Enables adoption of standard solutions with minimal customizations. Enables integration within diverse solution components across multiple platforms. Understands, seeks and encourages automation techniques. Establishes monitoring onboarding procedures compliant with operational excellence standards to ensure technology meets operational non-functional requirements Provide technical leadership and accountability for procedures for production monitoring Drive the transition of non-standard to standard solutions Drive a service and user-experience first, automated approach to monitoring requirements Solve intractable problems with previous domain experience and automation Establish a disruptive culture of agile time-to-market, user experience focus to owning monitoring requirements within operations. Establish close relationships with Tools Architecture, engineering, CIOs and Production services leads for continual service improvement to monitoring and event management Required Skills: ? Have a deep understanding of technology operational monitoring excellence. ? Good understanding of industry leading monitoring tools ? Good grasp of monitoring approaches/tools/concepts ? Excellent understanding of Software Engineering methodologies and development cycle (including Open Source development) ?Experience in programming and working knowledge of one or more of the following C, C++, Java, and Shell, Perl, GO or Python ? Strong knowledge on Service Oriented Architecture design patterns ? Identifying, troubleshooting, and resolving system level issues on large scale complex systems, and across the entire stack - hardware, software, application, and network. ? Good understanding of security information and event management technologies ? Ability to use a wide variety of open source technologies and tools. ?Experience with large scale production systems and IT operations ? Expertise in implementing, supporting or operating in one or more of the following with working knowledge/broad experience in others      Have a deep understanding of technology operational monitoring excellence      Knowledge in relational DB (Oracle, MySQl,)   Soft skills: ? Strong focus on business outcomes. ? Strong sense of collaboration, open communication and reaching across functional borders. ? Provide hands-on engineering, administration and technical support. ? Ability to document current and future configuration processes and policies. ? Proactive thought leadership for creative and efficient technology solutions. ? Drive continuous improvement to the service delivered to customer (agility, stability ...) ? Process reengineering and optimization ? Drive the enforcement and definition of operational requirements / non-functional requirements in collaboration with application owners, engineering organizations and production services ? Commitment to continuously improving services, feeding back into any part of the organization that needs to engage and holding them responsible to accomplish results. Relevant Job Experience: Minimum 5-10 years in systems administration/Software Engineering/DevOps, networking in a large environment. Minimum 3-5 years' experience of operations experience, including application and infrastructure build and release engineering, operational readiness and support roles. Qualifications: University degree in Computer Sciences, Software Engineering background, hands on Operations experience in large scale technology estate. Job number: 18061408
Salary Range: NA
Minimum Qualification
5 - 7 years

Don't Be Fooled

The fraudster will send a check to the victim who has accepted a job. The check can be for multiple reasons such as signing bonus, supplies, etc. The victim will be instructed to deposit the check and use the money for any of these reasons and then instructed to send the remaining funds to the fraudster. The check will bounce and the victim is left responsible.