Site Reliability Engineer - Singapore - TIKTOK PTE. LTD.

    TIKTOK PTE. LTD.
    TIKTOK PTE. LTD. Singapore

    2 weeks ago

    TIKTOK PTE. LTD background
    Description
    Roles & Responsibilities

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

    Why Join Us

    Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

    Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

    To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

    At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

    Join us.

    About The Team

    Site Reliability Engineering (SRE) of Applied Machine Learning (AML) team combines system engineering and the art of machine learning to develop and run massively distributed recommendation system around the world.

    On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis and large system operation, and get heavily involved in the process of hardware/capacity decision-making. SRE ensures that the very centric machine learning services at TikTok have the highest level of availability, as well as creating highly automated systems and pipelines.

    Responsibilities

    Research, design, and develop computer and network software or specialised utility programs.

    Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.

    Update software, enhances existing software capabilities, and develops and direct software testing and validation procedures.

    Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.

    Research, design, and develop computer and network software or specialised utility programs.

    Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.

    Update software, enhances existing software capabilities, and develops and direct software testing and validation procedures.

    Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.

    Qualifications

    Bachelor's degree in Computer Science or equivalent with 3+ years of relevant experience.

    Proven experience in analyzing and troubleshooting distributed systems.

    Prior experience designing and maintaining large-scale systems.

    Experience programming in at least one of the following languages: Python or C/C++.

    Preferred Qualifications

    Ability to thrive in a fast-paced environment.

    Strong understanding of code optimizing and routine tasks automation.

    Proficiency in at least one machine learning framework: TensorFlow, PyTorch, MXNet or PaddlePaddle.

    Solid background of algorithms and data structures.

    TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

    Tell employers what skills you have

    Machine Learning
    Troubleshooting
    Ubuntu
    Pipelines
    Data Structures
    Computer Hardware
    PyTorch
    Administration Management
    Distributed Systems
    Reliability Engineering
    Python
    Infrastructure Architecture
    Technical Consultation
    Technical Engineering
    Research Design