Description

Job Description:

  • Utilize containerization technologies (e.g., Docker) and orchestration platforms (e.g., Kubernetes).
  • Knowledge in GCP and New Relic other similar cloud technologies.
  • Possess strong API knowledge, ensuring availability and reliability.

Key Responsibilities:

  • Application Monitoring.
  • Proactively monitor application stability using Splunk and New Relic.
  • Set up alerting and automated responses to minimize downtime.
  • Perform root cause analysis and manage incidents for issue resolution.
  • Monitor system performance, identify bottlenecks, and collaborate on optimizations.
  • User Support.
  • Assist users with UI-related issues and provide effective resolutions.
  • Create and maintain user-friendly documentation for self-service support.
  • Develop and maintain incident response procedures for rapid issue resolution.
  • Enhance troubleshooting tools and processes for improved efficiency.
  • Be proficient in the web application's user interface for user support.
  • Diagnose and resolve complex technical issues affecting the web application.
  • Collaboration with Scrum Team.
  • Collaborate with UI/UX designers and developers to enhance the user experience.
  • Actively participate in Scrum team activities, including stand-ups and sprint planning.
  • Ensure seamless integration of reliability and performance enhancements into development.
  • Collaborate with the team to prioritize and track defect and improvement request progress.
  • Product Continuous Improvement.
  • Maintain open communication with the Product Owner for product alignment.
  • Ensure SRE tasks align with the product's strategic goals.
  • Participate in backlog refinement meetings to prioritize SRE-related work items.
  • Suggest UI improvements based on user feedback and usage patterns.
  • Identify, document, and communicate defects and improvement opportunities.

Education

Any Gradute