- Provision SaaS environments as new clients are onboarded.
- Be part of the on-call rota (during business hours), responsible for resolving alerts generated by proactive monitoring and working closely with CANs to provide L2 support for client-initiated support requests.
- Define and implement the feature roadmap to improve the SaaS platform, for example by implementing self-service functionality, exposing metrics to clients, improving automation and self-healing properties of the system.
- Improving the scalability, security and performance of the SaaS platform, by implementing automated compliance and controls, testing different Kafka and DB setups (e.g. Aurora vs RDS) and running load tests at every level of the stack.
- Implementing and regularly testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform.
- Strong background in Linux/Unix administration, e.g. Ubuntu, Debian
- A strong background in at least one of Go, Python or Java
- A strong background in one of the following: database administration, Kafka, observability tools (such as Prometheus or Zipkin) or infrastructure automation.
- Experience with AWS or GCP is essential
- Experience or knowledge of container orchestration tools, e.g. Kubernetes Desirable
- Experience in supporting production systems
- Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
- Highly competitive salary
- Bonus incentive
- Healthcare
- 25 days holiday and public holidays
- Competitive maternity and paternity leave
- $1,500 SGD per year flexible spend benefit
- All the latest tech you need
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress
-
Reliability Engineer
5 days ago
NTUC Enterprise Nexus Co-operative Limited SingaporeCOMPANY DESCRIPTION · NTUC Enterprise Co-operative Limited is the holding entity and single largest shareholder of the NTUC group of Social Enterprises. We aim to create a greater social force to do good by harnessing the capabilities of the social enterprises to meet pressing so ...
-
Reliability Engineer
2 days ago
Cielo Talent SingaporeThis role is for a multinational company. The Reliability Engineer will ensure effective and efficient service contract operation, principally by providing engineering and reliability support with the objective of improving overall equipment reliability, availability, and capabil ...
-
Engineer Reliability
4 days ago
GLOBALFOUNDRIES Singapore**About GlobalFoundries** · **Introduction** · **Your Job** · - SRAM/Flash/NVM/OTP/MTP/eFUSE/CPI reliability setup & analysis, and handle PRM (Periodic reliability monitoring) · - Work with customer/vendor to design & bring in hardware & software for reliability characterization. ...
-
Reliability Engineer
1 day ago
NE Digital SingaporeCOMPANY DESCRIPTION · NE Digital is the digital, data and technology organization that serve as a center of excellence to drive digital transformation for our group of NTUC Social Enterprises to meet the critical social needs of Singapore's community. Delivering innovative produc ...
-
Reliability Engineer
5 days ago
Evonik SingaporeWhat we offer · Explore a world of opportunities with us. Look ahead with us and help shape innovative solutions to make our world more sustainable and life healthier, more vibrant and more comfortable. At Evonik, you have the chance to explore, thrive, and grow alongside 33,000 ...
-
Reliability Engineer
3 days ago
ANTER CONSULTING PTE. LTD. Singapore**Responsibilities**: · - Collaborating on the enhancement of Reliability strategies and programs, prioritizing process safety, yields, capacity, and uptime performance. · - Providing technical expertise for addressing Reliability-related challenges, aiding in the resolution of p ...
-
Reliability Engineer
2 days ago
Cielo Talent SingaporeThis role is for a multinational company. The Reliability Engineer will ensure effective and efficient service contract operation, principally by providing engineering and reliability support with the objective of improving overall equipment reliability, availability, and capabil ...
-
Engineer Reliability
1 week ago
GLOBALFOUNDRIES Singapore**About GlobalFoundries** · **Introduction** · **Your Job** · - SRAM/Flash/NVM/OTP/MTP/eFUSE/CPI reliability setup & analysis, and handle PRM (Periodic reliability monitoring) · - Work with customer/vendor to design & bring in hardware & software for reliability characterization. ...
-
Reliability Engineer
1 week ago
NTUC Enterprise Nexus Co-operative Limited SingaporeCOMPANY DESCRIPTION · NTUC Enterprise Co-operative Limited is the holding entity and single largest shareholder of the NTUC group of Social Enterprises. We aim to create a greater social force to do good by harnessing the capabilities of the social enterprises to meet pressing so ...
-
Reliability Engineer
1 week ago
ONE STOP ENGINEERING PTE. LTD. SingaporeTitle**:Reliability Engineer · Purpose Statement (2-3 Sentences): · - Ensures reliability and maintainability of equipment, processes, utilities, facilities and controls with an objective to constantly improve site production and cost performance. · - Develops engineering solutio ...
-
Reliability Engineer
1 week ago
ATR EASTERN SUPPORT PTE LTD SingaporeAvions de Transport Regional (ATR) GIE Founded in 1981. ATR has become world leader on the market for regional aircraft with 90 seats or less. Since its creation, ATR has sold over 1,500 aircraft to over 200 operators based in more than 100 countries. ATR planes have totaled over ...
-
Reliability Engineer
1 week ago
Smiths Group SingaporeREF: · - JOHNCRANEAPAC02004- DIVISION: · - John Crane- JOB FUNCTION: · - OperationsAbout Us · Founded in 1917, John Crane is a global leader in the design, manufacturing, and engineering of mission critical flow control solutions for increased efficiency, emission reductions and ...
-
Reliability Engineer
1 week ago
RMA CONTRACTS PTE. LTD. Singapore**Our Client is a crude oil processing refinery in Jurong Island. They are looking for highly-motivated and achievement-oriented individuals who thrive in a team-based work environment. · **The Annual Gross Salary Range is from $76,000 to $135,000. · **Scope of Work** · - Investi ...
-
Reliability Engineer
1 week ago
John Crane Singapore**About Us** · Founded in 1917, John Crane is a global leader in the design, manufacturing, and engineering of mission critical flow control solutions for increased efficiency, emission reductions and energy transformation. Our products include mechanical seals and systems, coupl ...
-
Engineering and Reliability Engineer
1 week ago
Singapore Technologies Engineering Ltd Singapore**Date**:17-Feb-2023 · **Location**: Singapore, SG · **Company**:ST Engineering Group · **Engineering & Reliability Engineer / Executive** · **Position Purpose**: · Provide component reliability support in relation to group functions, internal departments, and external customers, ...
-
Reliability Engineer
4 days ago
Marvell SingaporeAbout Marvell · At Marvell, we believe that infrastructure powers progress. That execution is as essential as innovation. That better collaboration builds better technology. Trusted by the world's leading technology companies for 25 years, we move, store, process and secure the w ...
-
Engineering and Reliability Engineer
1 week ago
ST Engineering Group Singapore**Engineering and Reliability Engineer**: · **Date**:17-Feb-2023 · **Location**: Singapore, SG · **Company**:ST Engineering Group · **Engineering & Reliability Engineer / Executive** · **Position Purpose**: · Provide component reliability support in relation to group functions, i ...
-
Lead Reliability Engineer
3 days ago
John Crane SingaporeThe Lead Reliability Engineer will ensure effective and efficient service contract operation, principally through providing engineering and reliability support with the objective of improving overall equipment reliability, availability and capability. The person is responsible fo ...
-
Platform Reliability Engineer
1 week ago
ARYAN SOLUTIONS PTE. LTD. SingaporeBachelor's degree in information technology, Computer Science, Engineering, or similar areas. · - Working experience as a Platform Reliability Engineer or as a Site Reliability Engineer in a cloud operating environment is required. · - Strong experience in Kubernetes and Docker. ...
-
Site Reliability Engineering
3 days ago
BYTEDANCE PTE. LTD. Singapore**About ByteDance** · Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance h ...
Site Reliability Engineer - Singapur, Singapore - Thought Machine
Description
Description
Thought Machine's mission is bold – to properly and permanently rid the world's banks of legacy technology. To achieve this, we have developed the foundations of modern banking through core and payments technology which run natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.We have grown rapidly in the past few years – growing our team to more than 550 individuals across offices in London, New York, Singapore and Sydney. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Temasek, Standard Chartered Ventures, Molten Ventures, Eurazeo, Intesa Sanpaolo, Nyca Partners, JPMorgan Chase Strategic Investments, and more.We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named in the IDC list of top 100 fintechs, and the Singapore HR Awards awarded us Gold and Silver for our workplace culture and employee experience. We are spinning up a new regional SaaS platform team responsible for providing a world-class SaaS offering, by continuously improving and maintaining our SaaS platform. The team will be geographically distributed across our two main hubs: UK, SG.Joining this team is an excellent opportunity to get exposure to how mission-critical systems are run in production. You will be part of a team that owns the system end-to-end and have a deeper understanding of exactly how our clients use the system (for example by extracting usage analytics).The team will own the platform end-to-end, making use of existing infrastructure, improving core Terraform modules, as well as developing operators, tooling and additional infrastructure where appropriate. They will also be responsible for L2 support (for client-initiated support requests) and L1 (for alerting-based incidents). Support will be provided during working hours, with a follow-the-sun model and handovers happening between the 3 regions.Definition and development of the SaaS roadmap is another critical responsibility of this team. Alongside the Product Management function, they will define technical requirements, features and implement them with the goal of offering an excellent SaaS experience to our clients. DutiesRequirements
EssentialBenefits