Title: Data Analyst - L4
Job Description
As the Data engineer in the Fulfillment Data Engineering team, you will work closely with data modelers, product analytics, product managers, software engineers and business stakeholders across the SEA in understanding the business and data requirements. You will be responsible for building and managing the data asset, including acquisition, storage, processing and consumption channels, and using some of the most scalable and resilient open source big data technologies like Flink, Airflow, Spark, Kafka and more on cloud infrastructure. You are encouraged to think out of the box and have fun exploring the latest patterns and designs.
The Day-to-Day Activities
- Developing scalable and reliable ETL pipelines and processes to ingest data from a large number and variety of data sources
- Developing a deep understanding of real-time data productions availability to inform on the real time metric definitions
- Maintaining and optimizing the performance of our data analytics infrastructure to ensure accurate, re‐liable and timely delivery of key insights for decision making
- Design and deliver the next-gen data lifecycle management suite of tools/frameworks, including in‐gestion and consumption on the top of the data lake to support real-time, API-based and serverless use-cases, along with batch as relevant
͏
Must Have
- At least 5+ years of relevant experience in develop‐ing scalable, secured, distributed, fault tolerant, re‐silient & mission-critical Big Data platforms.
- Proficiency in at least one of the programming languages Python, Scala or Java.
- Strong understanding of big data and related technologies like Flink, Spark, Airfl ow, Kafka etc.
- Experience with different databases – NoSQL, Columnar, Relational.
- You have a hunger for consuming data, new data technologies, and discovering new and innovative solutions to the company's data needs
- You are organized, insightful and can communicate your observations well, both written and verbally to your stakeholders to share updates and coordinate the development of data pipelines
The Nice-to-Haves
- You have a degree or higher in Computer Science, Electronics or Electrical Engineering, Software Engineering, Information Technology or other related technical disciplines.
- You have a good understanding of Data Structure or Algorithms or Machine Learning models.