Primary Responsibilities:
- Working closely with application development, IT Architecture, Application Resiliency Foundation and other key partners ensuring end-to-end Application resiliency in accordance with our policy, procedures and standards
- Capture technical requirements, assessing capabilities and mapping to organizational resiliency principles to determine resiliency characteristics of applications.
- Contribute to strategy discussions and decisions on overall application design and best approach for implementing cloud, and on premises solutions.
- Support the ETE Resiliency services / work such as the NFR Assessments, Failure Mode Analysis, Test Scenarios creation and execution.
- Improving, setting the direction for the integration of new test scenarios for TC applications to the resiliency test automation framework, identifying and updating common reusable artifacts
- Support capability building for resiliency testing targeted toward modernization initiatives, common capabilities framework, reference architectures Recommended Skillsets:
- Experience in testing / architecting and delivering distributed solutions.
- Must have expertise with industry patterns, chaos engineering methodologies, and techniques across the disaster recovery subject areas.
- Chaos Engineering / Resiliency Testing experience for distributed applications using tools like Gremlin or Cavisson NetHavoc
- Enterprise Java technologies, tools and system architectures; Splunk and application monitoring tooling such as Dynatrace / AppDynamics