We are a fast-growing SaaS company in the EduTech space, operating mainly in Singapore, Vietnam, and Malaysia. With almost 2000 schools as customers, our mission is to revolutionize early childhood education through technology and provide innovative solutions to our customers. We are committed to transforming the way education is delivered in the Southeast Asian region and beyond.
#### What you will be doing:
- Collecting data from different sources, processing it, and storing it in a ready-to-use format in the companys data warehouse using tools such as AWS Glue, Databricks, Airflow
- Identifying, designing and implementing data pipeline improvements for greater scalability, optimizing data delivery, and automating manual processes. Parquet in S3 for storage, Airbyte for pipeline management
- Improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
- Engage proactively with business and product teams to gather and comprehend data requirements, ensuring seamless communication and collaboration throughout the data engineering process.
- Develop strategy for long term data platform architecture based on business and engineering needs
- Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues
- Own company data assets (data models), spark, sparkSQL, and hiveSQL jobs to populate data models
#### Requirements
- Deep expertise in AWS Glue, Databricks, Airflow
- Experience with Airbyte, Fivetran, Stitch
- Experience with SQL, Nosql, Parquet
Job Type: Full-time