Lead Data Engineer - Singapore - NodeFlair

NodeFlair
Verified Company
Singapore

3 weeks ago

Posted by:

Wei Jie

beBee Recruiter


Description

Job Summary:


Salary
S$11,000 - S$18,000 / Monthly


Job Type

Seniority
Lead


Years of Experience
At least 5 years


Tech Stacks
Strategy, TDD, Amazon S3, AWS, Oozie, EMR, Hortonworks, Cloudera, HDFS, HBase, MapR, Azure, Hive, Airflow, Spark, NoSQL, Kafka, Cassandra, Hadoop

Are you at your most vibrant when you've successfully distilled data into its simplest, most meaningful form?

Thoughtworks is a global software consultancy with an aim to create a positive impact on the world through technology. Our community of technologists thinks disruptively to deliver pragmatic solutions for our clients' most complex challenges.

We are curious minds who come together as collaborative and inclusive teams to push boundaries, free to be ourselves and make our mark in tech.

Our developers have been contributing code to major organizations and open source projects for over 25 years.

They've also been writing books, speaking at conferences and helping push software development forward, changing companies and even industries along the way.

We passionately believe that software quality is driven by open communication, review and collaboration.

That's why we're such vehement supporters of open source and have made significant contributions to open source tools for testing, continuous delivery (GoCD), continuous integration (CruiseControl), machine learning and healthcare.

Data Engineers develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions.

You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems.

On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product.

It could also be a software delivery project where you're equally happy coding and tech-leading the team to implement the solution.


You'll spend time on the following:

You will partner with teammates to create complex data processing pipelines in order to solve our clients' most ambitious challenges

You will collaborate with Data Scientists in order to design scalable implementations of their models

You will pair to write clean and iterative code based on TDD

Leverage various continuous delivery practices to deploy, support and operate data pipelines

Advise and educate clients on how to use different distributed storage and computing technologies from the plethora of options available

Develop and operate modern data architecture approaches to meet key business objectives and provide end-to-end data solutions

Create data models and speak to the tradeoffs of different modeling approaches

Seamlessly incorporate data quality into your day-to-day work as well as into the delivery process


Here's what we're looking for:
You are equally happy coding and leading a team to implement a solution

You have a track record of innovation and expertise in Data Engineering

You're passionate about craftsmanship and have applied your expertise across a range of industries and organizations

You have a deep understanding of data modelling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop

You have hands-on experience with Hadoop distributions such as MapR, Cloudera and Hortonworks, and/or cloud-based ones (AWS EMR, Azure HDInsight, Qubole, etc.)

You're genuinely excited about data infrastructure and operations with a familiarity working in cloud environments

Working with data excites you: you have created big data architectures, and you can build and operate data pipelines and maintain data storage, all within distributed systems

You share your data engineering expertise with the broader tech community outside of Thoughtworks, speaking at conferences and mentoring more junior data engineers

You ensure effective collaboration between Thoughtworks' and the client's teams, encouraging open communication and advocating for shared outcomes
