Description

Description

JOB PURPOSE

Deliver the monitoring platform as a service (troubleshoot, maintain).
Establish clear, well-defined monitoring standards across all service delivery teams.
Establish guidelines around system alerting and alert routing/escalation to ensure efficient and timely response to infrastructure\system issues which impact production environments.
24x7 monitoring of Infrastructure.
Enforce best practices around data collection and storage of collected metrics to ensure that team has the appropriate data required for real-time problem detection, rapid problem resolution, and effective post-mortem analysis.
Implement automation for monitoring implementation and an audit process to ensure that systems are being monitored as prescribed.
Create a “Single Pane of Glass” monitoring hub which correlates multiple streams of monitored data into a centralized location for more efficient and effective drill down of monitored data.

Skills

Supporting and maintaining enterprise monitoring tools for networks, server and storage infrastructures application monitoring and performance.
Intermediate understanding of the Compute & Network infrastructure
Alert and event troubleshooting knowledge.

Qualifications & Experience

Bachelor’s degree in computer science, and/or equivalent work experience.

Min of 2-3 years of experience in IT infrastructure

Experience with ITSM/ITIL methodologies.

Ability to identify and analyze problems quickly, recommend and implement flexible creative permanent solutions.

Education

Bachelor’s degree in computer science