Description

Description

The successful candidate will work closely with cross-functional teams to maintain and improve our platform, automate processes, and ensure our systems are highly available, reliable, and secure. The candidate should have a minimum of 3 years of experience working with SRE teams, including working in shifts and over weekends. The candidate should also have experience in Splunk, Dynatrace, Python, and Unix Shell Scripting. Experience working with Bigdata, Java, Salesforce, and financial clients such as AMEX will be a plus.

Responsibility

  • Work with cross-functional teams to maintain and improve our infrastructure, ensuring that our systems are highly available, reliable and secure.
  • Develop and maintain tools and automation for monitoring, deploying, and scaling our systems.
  • Perform incident management and participate in an on-call rotation to ensure rapid response to production issues.
  • Identify and remediate performance bottlenecks, security vulnerabilities, and other operational issues.
  • Develop and maintain documentation related to system design, deployment, and operation.
  • Collaborate with development teams to ensure that our systems are designed to be scalable resilient and fault-tolerant.

Requirements

  • A minimum of 3 years of experience working with SRE teams.
  • Experience in Splunk, Dynatrace, Python, and Unix Shell Scripting.
  • Good understanding of the SQL queries.
  • Strong problem-solving and troubleshooting skills.
  • Good understanding of networking, security, and cloud infrastructure.
  • Ability to work in shifts and over weekends as part of an on-call rotation.
  • Good verbal and written communication skills.
  • Experience working with Bigdata, Java, Salesforce/CRM.
  • Experience working for financial clients such as AMEX will be a plus.

Education

Bachelor's degree in Computer Science