Principal Application Reliability Engineer - Singapore - International SOS

International SOS
Singapore

2 weeks ago

Posted by:

Wei Jie

beBee Recruiter


Description

About the role:


Key responsibilities:


  • Be on rotation for availability incidents and provide support for customer service engineers.
  • Proactively develop scripts and tools to prevent incidents from ever happening.
  • Develop a comprehensive set of monitoring and alerting that alerts on symptoms and potential issues to prevent outages.
  • Document every action so findings turn into repeatable actions and then into automation.
  • Improve the deployment process to make it as smooth and effortless as possible.
  • Design, build, and maintain core infrastructure pieces that allow scaling to support enterprise levels of concurrent users.
  • Debug production issues across services and levels of the stack.
  • Plan infrastructure growth and capacity.
  • Provide technical leadership of the SRE team (internal or through Managed Services).
  • Proactively work with development leads, client service leads, solution architects, and infrastructure leads to enhance system reliability, scalability, and robustness.

About you:


Required Skills and Knowledge

  • Systems-level thinking to maintain overall system stability and reliability: edge cases, failure modes, behaviors, specific implementations, etc.

Required Competencies

  • Thorough, detailed, and careful planning, development, and execution
  • Proactively looking for areas to improve
  • Clear communication with all involved parties
  • Calm under pressure
  • Clear sense of ownership and accountability

Required Work Experience

  • 10+ years of hands-on experience with Windows and Linux operating systems and databases (SQL and NoSQL)
  • 5+ years of SRE or closely related experience for large-scale cloud SaaS
  • 10+ years of hands-on technical experience in DevOps, Release Management Engineering, or similar areas

Required Qualifications

  • Extensive knowledge of config management systems
  • Strong programming skills (.NET, Java, React, etc.)
  • B.S. in Computer Science or Software Engineering. M.S. in similar fields preferred.
