Synapse Analytics: End-to-End Project
(Analyzing NYC Taxi Trip data)
Step 1: Create Synapse Workspace
Step 2: Load the NYC Taxi Trip data
Note: Synapse Analytics uses Azure Data Lake Storage Gen2
as its default data storage.
In this project we will load the NYC Taxi data into ADLS Gen2.
Download the NYC Taxi dataset from the below site:
[Link]
In Synapse Studio, navigate to the Data tab and select Linked.
Under the category Azure Data Lake Storage Gen2, select the
storage vnycdatalake (Primary).
Select the container named users (Primary).
Click Upload and select the [Link] file to upload
the dataset.
Once the parquet file is uploaded it is available through two
equivalent URIs:
[Link]
[Link]
abfss://users@[Link]/
[Link]
Note:
In the above URL’s
vnycdatalake is the storage account
users is the container in the storage account.