Incident Response Manager - Singapore - ByteDance

    ByteDance
    ByteDance Singapore

    13 hours ago

    Technology / Internet
    Description
    Responsibilities

    The Data Systems Infrastructure (DSI) team sits within the ByteDance global technology structure and supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services, making sure they are scalable and are reliable.


    We are seeking a technically skilled and detail-oriented professional to serve as a front-line responder for incident detection, triage, and response across infrastructure, facilities, and security operations.

    The ideal candidate will have a strong foundation in facility operations, broad knowledge across IT, infrastructure, or engineering disciplines, experience in critical environments, and the ability to analyze incidents, manage them calmly, identify trends, and drive sustained improvements.

    This role requires performance under pressure, data-driven thinking, and a proactive approach to continuous improvement and operational resilience.

    Responsibilities

    • Serve as the first responder in the IRC Operation Center, detecting and responding to events across infrastructure, facilities using tools such as Server Automation, Data Center Infrastructure Management, Network monitoring, Grafana, and related systems.
    • Respond promptly to events including but not limited to:
    - Data Center Environmental systems (e.g. high temperature, humidity, power fluctuations or failures)
    - IT infrastructure (e.g. server performance issues, network outages, system failures)
    - Facility and environmental alerts relevant to operations (e.g. Flooding,
    - External Facing Services (e.g. colocation maintenance notices, service requests from CDN partners, and critical notifications)

    • Conduct detailed investigations to diagnose the root cause of events, assess their impact, and determine appropriate response actions.
    • Monitor and analyze detected events, accurately classify incidents based on potential or actual customer impact, and proactively communicate risks. Coordinate timely escalations by notifying and collaborating with relevant support teams to ensure swift incident resolution.
    • Monitor incident response performance against agreed SLAs, ensuring timely alerts and notifications.
    • Manage incidents efficiently, performing indepth investigations to determine root causes and impacts, while promptly engaging and coordinating with the designated resolver teams to facilitate timely resolution.
    • Draft detailed incident reports and conduct postmortem reviews to document lessons learned.
    • Generate regular reports to deliver comprehensive insights into the effectiveness of incident response and recovery processes.
    • Analyze trends and patterns in events to identify opportunities for improvement and optimization
    • Own and drive the Incident, Problem, and Change Management processes in alignment with ITIL or internal ITSM frameworks.
    • Develop and maintain a comprehensive library of Standard Operating Procedures (SOPs), Methods of Procedure (MOPs), runbooks, and operational guides to ensure consistency and readiness across teams.
    • Lead or support continuous improvement projects aimed at enhancing incident response capabilities, operational security, system reliability, and overall infrastructure performance. Collaborate with crossfunctional teams to implement engineering solutions and process optimizations.
    • Provide technical and operational leadership to the incident response center team, ensuring consistent performance and adherence to best practices.
    Qualifications
    Minimum Qualifications

    • Bachelor's degree in Computer Science, Information Technology, Engineering, or a related technical field.
    • Strong technical background, prioritizing experience in Data Center Facility Operations Center (DC FOC) management; experience in IT infrastructure, network operations, or systems monitoring is also desirable.
    • Proven ability to analyze complex systems, investigate incidents, and identify root causes effectively.
    • Familiarity with monitoring and alerting tools such as Grafana, Nagios, or similar platforms.
    • Experience in incident and problem management processes, with the ability to drive corrective actions and coordinate crossfunctional teams.
    • Strong communication skills to draft reports, conduct reviews, and liaise with technical and nontechnical stakeholders.
    • Proactive mindset with a focus on continuous improvement and operational excellence.

    Preferred Qualifications:

    • 5+ years of experience in IT environments—such as data centers or enterprise systems—combined with handson incident and problem management experience.
    • Proven experience in facility management across mechanical, electrical and plumbing (MEP) systems.
    • Proven ability to perform effectively under pressure and within tight time constraints to resolve issues and meet deliverables.
    • Handson experience with ticketing systems, monitoring tools such as Grafana, server infrastructure, and data center systems.
    • Working knowledge and/or certifications in one or more of the following:
    ITIL Foundation/ CompTIA Server+/ Schneider Electric Data Center Certified Associate (DCCA)/ Cisco Certified Network Associate (CCNA)/ Project Management Professional (PMP)/ Data Analytics and Visualization tools or methodologies

    • Demonstrated experience in driving or contributing to improvement projects focused on operational efficiency, security enhancements, or infrastructure reliability.
    • Ability to manage multiple tasks and projects, ensuring timely delivery and alignment with organizational goals.
    • Strong adaptability and problemsolving skills in ambiguous and rapidly changing environments.
    • Willingness to be on call during weekends, nights, and holidays.
    About Us
    Founded in 2012, ByteDance's mission is to inspire creativity and enrich life.

    With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.



    Why Join ByteDance
    Inspiring creativity is at the core of ByteDance's mission.

    Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible.

    Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.


    As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company.

    By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users.

    When we create and grow together, the possibilities are limitless. Join us.​
    Diversity & Inclusion​
    ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life.

    To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

    We are passionate about this and hope you are too.​

  • Work in company

    Incident Response Manager

    Only for registered members

    The Data Systems Infrastructure (DSI) team sits within the ByteDance global technology structure and supports the company's fast growth by building and operating hyper-scale datacenters, · managing the life cycle of server fleet, providing cloud solutions, · and developing variou ...

    Singapore

    2 weeks ago

  • Work in company

    Cyber Response, Manager

    KPMG Singapore

    KPMG in Singapore is part of a global organization of independent professional services firms providing Audit, Tax and Advisory services. We operate in 138 countries and territories with more than 276,000 partners and employees working in member firms around the world. Each KPMG ...

    Singapore

    13 hours ago

  • Work in company

    Cybersecurity Incident Response Manager

    Only for registered members

    As a Cybersecurity Incident Response Manager in our CISO office, you will lead incident response, threat intelligence, and use case development to protect the organisation from cyber threats. · ...

    Singapore

    1 month ago

  • Work in company

    Incident Response Manager, Singapore

    Blackpanda

    About Blackpanda · Blackpanda is Asia's premier cyber crisis response firm, founded by former elite military special operations forces and cyber defense experts. Headquartered in Singapore, we specialize in incident response and digital forensics across the Asia-Pacific region. · ...

    Singapore

    13 hours ago

  • Work in company

    Product Manager, Workforce Management, Platform Responsibility

    Only for registered members

    TikTok is the leading destination for short-form mobile video. At TikTok our mission is to inspire creativity and bring joy. We strive to do great things with great people. We lead with curiosity humility and a desire to make impact in a rapidly growing tech company. · Deliver bu ...

    Singapore

    1 month ago

  • Work in company

    Product Manager, Workforce Management, Platform Responsibility

    Only for registered members

    We are at the forefront of building and optimizing content safety systems. With a focus on optimising and advancing content safety, we leverage advanced large language and multimodal models to enhance review efficiency, risk control, · and user trust. · Bachelor's degree or above ...

    Singapore

    3 weeks ago

  • Work in company

    Engagement and Systems Manager – Responsible Sourcing

    Only for registered members

    The Engagement and Systems Manager in the HP Global Responsible Sourcing Team is responsible for leading stakeholder engagement initiatives and managing the development, integration, and continuous improvement of systems and processes that support human rights and environmental d ...

    Singapore

    1 month ago

  • Work in company

    Product Manager, Workforce Management, Platform Responsibility

    Only for registered members

    Our team is at the forefront of building and optimizing content safety systems. With a focus on optimising and advancing content safety. · ...

    Singapore, Singapore

    1 month ago

  • Work in company

    Cyber Security Incident Response Manager

    Only for registered members

    Lead and manage end-to-end cyber security incident response activities. · Act as the incident commander for high-severity security incidents. · Coordinate with SOC teams during incidents. · ...

    Central Region

    1 month ago

  • Work in company

    Product Manager, Video Moderation, Platform Responsibility

    Only for registered members

    As a Product Manager for our video moderation system, you will have the unique opportunity to directly engage in the iteration and optimization of a system that processes hundreds of millions of videos daily, setting industry standards. · ...

    Singapore

    4 weeks ago

  • Work in company

    Product Manager, Photo Safety, Platform Responsibility

    Only for registered members

    As a product manager on our Platform Responsibility team you will work with crossfunctional stakeholders to build safety product features and strategies. · ...

    Singapore

    1 month ago

  • Responsibilities · Our team is at the forefront of building and optimizing content safety systems. With a focus on optimising and advancing content safety, we leverage advanced large language and multimodal models to enhance review efficiency, risk control, and user trust. Workin ...

    Singapore

    13 hours ago

  • Work in company

    Product Manager, Teen Experiences, Platform Responsibility

    Only for registered members

    We are at the forefront of building and optimizing content safety systems. With a focus on optimising and advancing content safety, · we leverage advanced large language and multimodal models to enhance review efficiency, risk control, · and user trust. · ...

    Singapore

    1 month ago

  • Work in company

    Lead Product Manager – Agentic Response System

    Only for registered members

    PayPal is seeking a Product Manager to lead the strategy, execution, and evolution of its core decisioning stack focusing on the Agentic Response System. · ...

    Singapore

    1 month ago

  • Work in company

    Product Manager, Red Team, Platform Responsibility

    Only for registered members

    The team is responsible for testing and evaluating global content security systems from an attacker's perspective, · identifying potential vulnerabilities, and providing improvement recommendations. · ...

    Singapore

    2 weeks ago

  • Work in company

    Product Manager, LIVE Safety, Platform Responsibility

    Only for registered members

    +h2>Job summary · We are looking for a Product Manager to join our team in Singapore. The successful candidate will be responsible for driving product strategy and building features to protect content integrity on our platform. · ...

    Singapore

    1 month ago

  • Work in company

    Product Manager, Teen Experiences, Platform Responsibility

    Only for registered members

    +We are at the forefront of building and optimizing content safety systems. We leverage advanced large language and multimodal models to enhance review efficiency, risk control, and user trust. · +Own the end-to-end product strategy for Minor Content Safety, ensuring age-appropri ...

    Singapore

    2 weeks ago

  • The Opportunity · You will belong to an international connected team of specialists helping our clients with their most complex information security needs and contributing toward their business resilience. You will work with a team of cyber security experts to lead, manage and de ...

    Singapore

    13 hours ago

  • Work in company

    Product Manager, Red Team, Platform Responsibility

    Only for registered members

    The product manager will be responsible for testing and evaluating global content security systems from an attacker's perspective. · ...

    Singapore

    1 month ago

  • Responsibilities · Team Introduction · The team is responsible for testing and evaluating global content security systems from an attacker's perspective, identifying potential vulnerabilities, and providing improvement recommendations, ensuring content safety across cultural and ...

    Singapore $100,000 - $150,000 (SGD) per year

    13 hours ago

Jobs
>
Singapore