Title: APM Administrator Job Requirements TECHM-JOB-22746
Location: Atlanta, Hybrid
Full Time
Skill: Linux/Unix Shell Scripting
Experience: 10+ years
- We are looking for an experienced APM (X-Ray & NewRelic) with a DevOps and Site Reliability background . you will provide proactive monitoring to ensure end-to-end insights and help the teams address their challenges in performance monitoring.
- Roles and Responsibilities:
- Define and maintain performance monitoring and reporting strategy for the organization .
- Collaborate with application teams to onboard services and applications into the APM platform.
- Ensure effective monitoring of critical business applications ,
- Provide guidelines to application teams to create dashboards, reports and alerts.
- Introduce best practices to teams in root cause analysis and resolution.
- Help teams to Analyze performance data and assist them with troubleshooting / tuning of applications, architecture, procedures and practices.
- Capture of all logging and monitoring of all aspects of system and application behavior to facilitate fast detection and resolution of issues.
- Define metrics, data collection methods, and reporting mechanisms to standardize processes.
- Identify gaps and provide hands-on development and enablement assistance to application teams technical users.
- Develop automation to on-board application to APM platform using Devops practices.
- Develop more robust cost metrics and benchmarking capabilities to assist in efforts supporting IT cost management.
- Experience:
- 3-5 years of experience in Observability/Application Performance Monitoring Tools, specifically New Relic and AWS X-Ray.
- Knowledge of requirement gathering and rollout monitoring and observability solutions.
- 2+ years of working experience in NRQL
- Working Knowledge of Python and any databases (SQL/NoSQL).
- Experience with gathering and organizing large amounts of data to use for instrumentation into an Enterprise monitoring solution.
- Experience related to Performance analysis and monitoring across multiple areas including Infrastructure, Application, Network and Security.
- Experience with creating technical documentation.
- Experience with performance metrics ,data management, report design, data visualization .
- Experience with recommending baseline monitoring thresholds and performance monitoring KPIs and SLAs
- Develop enhanced reporting capabilities through standardization and automation.
- Analyze and recommend performance improvements to teams for capacity, availability, performance, support and security.
- Experience with Amazon Web Service (AWS) performance, monitoring and cost management.
- knowledge of troubleshooting performance issues with complex large-scale multi-tier and distributed application infrastructures.
- Providing health and performance reports, developing AIOps rules, creating alerts, creating custom dashboards.
- Experience in integrating APM with other tools such as Splunk-OnCall, PagerDuty, Slack, webhook, and JIRA.
- Experience in creating workloads and user onboarding.
- Experience with DevOps tools like Jenkins, Artifactory, Ansible, Splunk and other automation tools.