Must-Have Skills:
- 4+ years experience with Splunk and Dynatrace
- Deep understanding of software stacks and how applications interact with underlying infrastructure
- Proven ability to design and build monitoring solutions using Dynatrace and Splunk
- Strong understanding of application architecture and components
Nice-to-Have Skills:
- Experience working with Site Reliability Engineering (SRE) teams
Job Description:
This role bridges the gap between Business, Application Development, Support teams, and Monitoring Tools teams. You will define monitoring requirements and implement comprehensive monitoring solutions to ensure application health and performance.
Key Responsibilities:
- Lead the rollout of strategic monitoring tools (Splunk and Dynatrace) across Banking Client applications
- Act as a subject matter expert (SME) for these monitoring tools
- Set up monitoring for applications deployed on cloud and containerized infrastructure
- Conduct demos, training sessions, and create documentation to drive tool adoption
- Collaborate with Client teams on deployment planning and scheduling
- Assist Client teams in configuring monitoring events and dashboards
- Prepare reports on tool adoption and other relevant metrics
- Participate in global discussions on the evolution of monitoring strategies
Qualifications:
- Education: 4-year college degree required
- Development/Architecture: 2+ years of experience in development or architecture for multi-tiered business applications
- Production Support: 5+ years of experience in architecting and/or supporting production systems
- Monitoring Tools: Proven experience with Dynatrace, Tivoli, Splunk, SiteScope, Catchpoint, or similar tools
- Cloud & Containers: Familiarity with monitoring applications on cloud and containerized infrastructure
- Infrastructure: Knowledge of OS (Windows/Unix) based infrastructure services, processing, monitoring, and shell scripting
- Programming: Proven experience in Java and/or .NET
- Performance Monitoring: Experience in application/network performance and availability monitoring
- N-Tier Architecture: Experience supporting N-Tier architectures with various products (Webserver, Middleware, Database)
- ITIL: Knowledge of ITIL functions and processes
- Communication & Leadership: Excellent communication and influencing skills across multiple levels
- Self-Motivation: A proactive, hands-on, and driven individual
- Strategic Thinking: Ability to connect business context to technical solutions
- Banking Experience: Prior experience in development and/or support of banking applications is preferred
- Incident Management: Experience with Major Incident Management