29 Tân Thuận Street (FPT Tân Thuận Building), District 7, Ho Chi Minh City
Not specified
2019-09-09 -> 2019-09-10
- At least 2 years of experience building ETL pipelines and data warehouses.
- Proficiency with Hadoop v2, MapReduce, HDFS
- Experience with Spark and its features: Spark SQL, Spark Streaming, Structured Streaming.
- Experience with SQL, Python, and Bash shell scripts.
- Experience with various messaging systems, such as Kafka or RabbitMQ.
- Experience processing data in DBMSs (MongoDB, MySQL, SQL Server).
- Experience with Linux servers.
Additional Preferred Qualifications:
- Experience with Cloudera or Hortonworks is a plus.
- Experience with the Hadoop ecosystem is a plus.
- Design and build data pipeline solution architectures that consume large-dimensional structured and unstructured data.
- Write ETL processes, design database systems, and deploy/develop tools for real-time and offline analytic processing.
- Collaborate with Data Analysts and Business Users to understand their requirements and turn them into technical insight.
- Collaborate with data scientists to automate model training, testing, and deployment via ML continuous delivery pipelines.
- Research new technologies/methodologies that can be applied to improve business