Carlos da Costa
SQL
Data Science & Analytics Roadmap
carlosdavidcosta.medium.com /carlosdavidcosta
The 20% of SQL you need to perform 80% of your analysis
tasks. This roadmap does not cover every SQL topic;
it focuses on the parts that will get you using
SQL for data science and analytics quickly.
You can use my free SQL tutorials to support your studies:
My collection of SQL Tutorials
1
The basics of databases
Different types of databases: Relational vs NoSQL.
Basic database concepts: Database, tables, rows, columns,
relationships, primary and foreign keys, and normalization.
Setting up your environment:
Install a relational database management
system (MySQL, PostgreSQL, or SQLite).
Install a database client (MySQL Workbench or DBeaver).
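To make the ideas of tables, relationships, and primary and foreign keys concrete, here is a minimal sketch that uses SQLite through Python's built-in sqlite3 module, so nothing needs to be installed. The customers and orders tables and their columns are hypothetical, chosen only for illustration.

```python
import sqlite3

# In-memory database; table and column names are hypothetical examples.
conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this setting is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.executescript("""
CREATE TABLE customers (
    customer_id INTEGER PRIMARY KEY,   -- primary key: uniquely identifies a row
    name        TEXT NOT NULL
);
CREATE TABLE orders (
    order_id    INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL
                REFERENCES customers(customer_id),  -- foreign key: links to customers
    amount      REAL NOT NULL
);
""")

conn.execute("INSERT INTO customers VALUES (1, 'Ana')")
conn.execute("INSERT INTO orders VALUES (10, 1, 25.0)")

# An order that points at a customer who does not exist breaks the relationship.
try:
    conn.execute("INSERT INTO orders VALUES (11, 99, 5.0)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True

print(fk_rejected)
```

Splitting customers and orders into two tables linked by a key, instead of repeating the customer name on every order, is a small instance of normalization.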
2
SQL Fundamentals
SQL datatypes.
Basic SQL commands: SELECT, INSERT, UPDATE,
DELETE.
Filtering data using WHERE.
Sorting data using ORDER BY.
Aggregate functions: COUNT, SUM, AVG, MIN, MAX.
Filtering grouped data: HAVING.
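The commands in this step can be tried together in one short session. The sketch below runs them against an in-memory SQLite database from Python; the sales table and its values are made up for illustration. It also uses GROUP BY, which the list above does not name, because HAVING filters the groups that GROUP BY produces.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)")

# INSERT: add rows (hypothetical values).
conn.executemany(
    "INSERT INTO sales (region, amount) VALUES (?, ?)",
    [("North", 100.0), ("North", 50.0), ("South", 30.0), ("South", 20.0), ("East", 10.0)],
)

# UPDATE and DELETE: change or remove only the rows matched by WHERE.
conn.execute("UPDATE sales SET amount = 40.0 WHERE region = 'South' AND amount = 30.0")
conn.execute("DELETE FROM sales WHERE region = 'East'")

# SELECT with WHERE and ORDER BY: filter rows, then sort them.
big_sales = conn.execute(
    "SELECT region, amount FROM sales WHERE amount >= 40 ORDER BY amount DESC"
).fetchall()

# Aggregates per group; HAVING filters groups after aggregation.
totals = conn.execute("""
    SELECT region, COUNT(*) AS n, SUM(amount) AS total
    FROM sales
    GROUP BY region
    HAVING SUM(amount) > 100
    ORDER BY region
""").fetchall()

print(big_sales)
print(totals)
```

WHERE filters individual rows before they are grouped, while HAVING filters whole groups after the aggregate has been computed.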
3
Practice
Practicing SQL queries: LeetCode and DataLemur.
Work on real-world datasets to apply SQL for data
analysis.
4
SQL JOINS
Get comfortable with joins and their applications: INNER
JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN.
Practice SQL joins on DataLemur and LeetCode.
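As a starting point for practice, the sketch below contrasts INNER JOIN and LEFT JOIN on two small hypothetical tables, again using SQLite from Python. RIGHT JOIN and FULL JOIN are left out on purpose: SQLite only supports them from version 3.39, so they may not run on an older installation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical data: Bruno has no orders.
conn.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
INSERT INTO customers VALUES (1, 'Ana'), (2, 'Bruno');
INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 15.0);
""")

# INNER JOIN: only customers that have at least one matching order.
inner_rows = conn.execute("""
    SELECT c.name, o.amount
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.customer_id
    ORDER BY o.order_id
""").fetchall()

# LEFT JOIN: every customer; order columns are NULL when there is no match.
left_rows = conn.execute("""
    SELECT c.name, o.amount
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.customer_id
    ORDER BY c.customer_id, o.order_id
""").fetchall()

print(inner_rows)
print(left_rows)
```

The customer without orders disappears from the INNER JOIN result but appears in the LEFT JOIN result with a missing amount (None in Python).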
5
Data Cleaning
Describing tables
Finding duplicates
Dealing with missing data
SQL CASE
Practice: Find a dataset on Kaggle and perform data
cleaning using SQL
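The four cleaning tasks above can be sketched on a tiny hypothetical users table. The example uses SQLite from Python, and PRAGMA table_info is SQLite-specific; MySQL, for instance, uses DESCRIBE for the same purpose.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical data with one duplicated email and one missing age.
conn.executescript("""
CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, age INTEGER);
INSERT INTO users (email, age) VALUES
    ('a@example.com', 34),
    ('a@example.com', 34),
    ('b@example.com', NULL),
    ('c@example.com', 17);
""")

# Describing a table: the second field of each row is the column name.
columns = [row[1] for row in conn.execute("PRAGMA table_info(users)")]

# Finding duplicates: group by the candidate key and keep groups with more than one row.
duplicates = conn.execute("""
    SELECT email, COUNT(*) AS n
    FROM users
    GROUP BY email
    HAVING COUNT(*) > 1
""").fetchall()

# Dealing with missing data: NULL is tested with IS NULL, not with = NULL.
missing_age = conn.execute("SELECT COUNT(*) FROM users WHERE age IS NULL").fetchone()[0]

# CASE: derive a label per row, handling the missing value explicitly.
labels = conn.execute("""
    SELECT email,
           CASE
               WHEN age IS NULL THEN 'unknown'
               WHEN age < 18 THEN 'minor'
               ELSE 'adult'
           END AS age_group
    FROM users
    ORDER BY id
""").fetchall()

print(columns, duplicates, missing_age)
print(labels)
```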
6
Advanced SQL
Subqueries
CTEs (common table expressions)
Window functions
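Here is one small example of each of the three topics, run on a hypothetical sales table with SQLite from Python. It assumes SQLite 3.25 or newer, the version that introduced window functions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical data for illustration.
conn.executescript("""
CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL);
INSERT INTO sales (region, amount) VALUES
    ('North', 100.0), ('North', 50.0), ('South', 40.0), ('South', 20.0);
""")

# Subquery: keep sales above the overall average amount.
above_avg = conn.execute("""
    SELECT region, amount
    FROM sales
    WHERE amount > (SELECT AVG(amount) FROM sales)
    ORDER BY amount DESC
""").fetchall()

# CTE: name an intermediate result, then query it like a table.
top_region = conn.execute("""
    WITH region_totals AS (
        SELECT region, SUM(amount) AS total
        FROM sales
        GROUP BY region
    )
    SELECT region, total
    FROM region_totals
    ORDER BY total DESC
    LIMIT 1
""").fetchone()

# Window function: rank each sale within its region without collapsing rows.
ranked = conn.execute("""
    SELECT region, amount,
           RANK() OVER (PARTITION BY region ORDER BY amount DESC) AS rank_in_region
    FROM sales
    ORDER BY region, rank_in_region
""").fetchall()

print(above_avg)
print(top_region)
print(ranked)
```

Unlike GROUP BY, the window function keeps every row and adds the rank as an extra column, which is why it suits "top N per group" questions.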
7
Continuous learning
Perform a complete exploratory data analysis on a dataset
of your interest and share the insights.