Thach Tuan Anh
Data Engineer
📞 (+84) 965631523 ✉ [email protected] 🔗 linkedin.com/in/anhtt25
OBJECTIVE
I am enthusiastic about resolving intricate challenges associated with software development and data engineering. I am drawn to
the realms of creating robust software solutions and managing data infrastructure. Furthermore, I am seeking opportunities
within an applied science environment where I can apply my skills, continuously learn, contribute to research, and transform
innovative ideas into practical solutions. My goal is to effectively address real-world complexities and leverage data to drive value
for businesses and organizations.
EDUCATION
FPT University 09/2020 - Present
Bachelor of Software Engineer
PROJECT
Instacart Market Basket Analysis (05/2024 - 06/2024)
Developed end to end project market basket analysis
Main responsibility:
• Design data lakehouse on Fabric
• Apply medallion architecture to design lakehouse
• ETL data to lakehouse
• Visualization with Power BI
NBA Player Data (03/2023 - 04/2023)
Developed an NBA Player Data Analyzer project that collected, processed, and analysed player statistics.
Main responsibility:
• Data Scraping
• EDA
• Visualization reports about NBA salary
IMDB Movie Analysis (09/2023-09/2023)
Developed an IMDB Movie Project that created a data pipeline for transferring and analyzing movie data from IMDb, Flixtor.
Main responsibility:
• Data ingestion: Web scraping from IMDB, Flixtor using Python
• Data storage: PostgreSQL
• Data visualization: Power BI
• Data orchestration: Apache Airflow
Spotify Data Set (03/2023 - 04/2023)
Collect Spotify song data and process it into a structured dataset to support Music Genre classification
Main responsibility:
• Write script collect data.
• Ensure data quality and availability for the model training process.
Console App WebScraping(01/2023)
Collecting data from social media.
Main responsibility:
• Analyze web structure of social media platforms and perform code development to collect data
• Clean data and store it in the database or the data warehouse.
CERTIFICATIONS
Web Design for Everybody: Basics of Web Development & Coding Specialization 2022
Databases and SQL for Data Science with Python 2022
ETL and Data Pipelines with Shell, Airflow and Kafka 2023
Hands-on Introduction to Linux Commands and Shell Scripting 2023
Getting Started with Data Warehousing and BI Analytics 2023
SKILLS
English Intermediate
Programing languages C, Java, Python, C#
Data visualization Pandas, Numpy,...
Database SQL Server, PostgreSQL, MongoDB, MySQL
Other Have the ability use Ubuntu, Git, Gitlab, Docker,...
WORK EXPERIENCE
Vietinbank 09-2023 - 12-2023
Database Administrator
Setup and configuration database oracle, postgre, mongodb in uat, dev, test environment
Develop data pipeline with Kafka and Spark.
Seta International 08-2024 - 01/2025
Data Engineer
Design ETL pipelines and develop Airflow scripts for data orchestration and automation.
ACTIVITIES
FPTU Data Science Club 01/2023 - 04/2024
Vice President
• Teach and design a training curriculum for beginners
• Manage overview operations, support communication, and members' connection activities
© topcv.vn