Job Description: Site Reliability Engineer (Buffer)
• Bachelor's Degree in Computer Science or related; or equivalent combination of education and experience
• 5~~@~~ yrs overall experience in Software Application Development & Engineering
• 2~~@~~ years of SRE experience
• 1~~@~~ yrs experience in AWS services
• Experience in Typescript, NodeJs, and web development technologies
• Proficient in scripting languages such as Powershell and/or Python
• Knowledge of DevOps methodologies and the tools involved such as CI/CD concepts, CI/CD tools (Jenkins, CodePipeline, etc.), automation and config • Help build a Site Reliability Engineering culture by sharing best practices, approaches, documentation, and code with other engineering teams
• Define and setup KPIs to monitor Error Budgets
• Implement strategies to ensure Error Budgets stay above the defined-acceptance levels
• Define and implement response mechanisms when Error Budget thresholds are breached
• Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually;
• Able to troubleshoot complicated issues handling OS, Networking, Database in a cloud-based SaaS environment and handle live production incidents, debug/troubleshoot infrastructure and application issues, including development and testing
• Monitor application performance, take steps to improve overall application performance and stability and follow through with implementation (design, develop and test);
• Conduct system analysis, configuration management and develops improvements for system software performance, availability and reliability;
• Design, write, ship, and motivate the creation of software and systems to increase observability, product reliability and organizational efficiency;
• Work closely with software engineers and QAs to ensure the system is responding properly to non-functional requirements such as performance, security, and availability;
• Document your system knowledge as you acquire it over time, create runbooks, and ensure critical system information is readily available to those who need it;
• Maintain and monitoring deployment, orchestration, of the servers, docker containers, databases, and general backend infrastructure;
• Design, Develop & Test Terraform based Infrastructure as Code scripts to automate AWS infrastructure setup
• Develop Typescript, NodeJS based REST/JSON Web Services deployed on AWS.
Compensation: 55-64.52 Hourly W2 (Open to C2C)
Any Graduate