Site Reliability Engineering - Singapore - GEOTAB (ASIA) PTE. LTD.

    GEOTAB (ASIA) PTE. LTD.
    GEOTAB (ASIA) PTE. LTD. Singapore

    2 weeks ago

    Default job background
    Description
    Roles & Responsibilities

    What you'll do

    As a part of Site Reliability Engineering, your key area of responsibility will be to provide stellar escalated engineering support and customer service to our core MyGeotab applications. You will support both the MyGeotab software application and the Geotab GO devices, IOX, and other hardware. Geotab receives detailed data and metrics for hundreds of thousands of vehicles every second, which is processed and interpreted to create actionable information for our customers. You will understand the business impact of incoming tickets and will troubleshoot effectively to resolve technical issues. You will also collaborate with Geotab engineers to resolve software bugs and firmware issues. To be successful in this role you will be a self-starter with strong written and verbal communication skills, and have the ability to quickly understand complex, technical concepts.

    How you'll make an impact

    • Effectively communicate technical solutions to customers during service disruptions.
    • Troubleshoot production issues and coordinate with development teams to implement fixes.
    • Collaborate with team members to improve engineering tools and dashboards for monitoring computing architecture.
    • Implement automation tools and frameworks to streamline routine tasks.
    • Conduct system tests for security, performance, and reliability.
    • Assist in long-term strategic planning to ensure server capacity meets current and future requirements.
    • Recommend design enhancements for improving reliability, security and processes.
    • Maintain supporting documentation for areas of expertise.
    • Monitor servers proactively using a combination of tools to ensure high availability and superior SLAs.
    • Plan and execute routine maintenance tasks.
    • Conduct post-incident reviews and identify areas of improvement for long term reliability.
    • Participate in a 24x7 on-call rotating schedule.

    What you'll bring to this role

    • Post-secondary Diploma/Degree specialization in Computer Science, Software Engineering or a related field.
    • At least +3 years experience in Software Development/DevOps/SRE or a similar role.
    • Strong application level troubleshooting skills with a systematic problem-solving approach.
    • Experience using and ideally setting up monitoring systems.
    • Experience with Powershell, Python, SQL, and Linux-based systems.
    • Some experience working with cloud services is highly valuable (GCP, AWS, Azure).
    • Desire to work in a fast paced, evolving, growing, and dynamic environment.
    • Knowledge of customer service principles and practices.
    • Experience working within a technical or engineering organization with knowledge of the high-technology industry considered an asset.
    • Ability to work well under pressure and respond to fast changing priorities and deadlines.
    • Highly organized and able to manage multiple tasks and projects simultaneously.
    • Excellent verbal and written communication skills.
    • Strong analytical skills with the ability to problem solve to well-judged decisions.
    • Strong team player with the ability to engage with all levels of the organization.
    • Technical competence using software programs, including but not limited to, Google Suite for business (Sheets, Docs, Slides).

    If you got this far, we hope you're feeling excited about this role Even if you don't feel you meet every single requirement, we still encourage you to apply.

    Tell employers what skills you have

    Troubleshooting
    Azure
    Ubuntu
    Scripting
    SQL
    Reliability Engineering
    Python
    Infrastructure Architecture
    Ansible
    Linux