
Unloading Data

Sujith Nair
Cloud Data Architect
Snowflake SnowPro Certified
# What is data unloading?

Data unloading is the process of moving data out of Snowflake into files in cloud storage or an internal stage. We use the COPY INTO <location> form of the COPY command to unload data from Snowflake.

# Why is unloading of data needed?

• Snowflake is the source of the data, and the data is needed in other applications.
• Data has been computed inside Snowflake and is needed elsewhere.
• Business users want the data in XLS format.
# When unloading data into cloud storage, what best practices do you use?

To ensure that the export completes quickly and uses the least amount of credits, I would use the PARTITION BY clause in the COPY command so that Snowflake's parallelism is exploited and multiple files are generated based on the partition I want.

I would also specify a file name prefix (CUST in the example below) so that the files are not named generically by Snowflake.

COPY INTO @[Link].UNLOAD_OUTPUT/CUST
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
PARTITION BY C_MKTSEGMENT
# Was there a scenario in your project where you had to unload data to an internal stage?

Requests for data from Snowflake are frequently received from business users, whether for data analysis or for dealing with data quality issues. The way I provide the data is to unload it into an internal stage and use the GET command to download it. I use the SINGLE=TRUE option to ensure that the file does not get split into multiple files.
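
A minimal sketch of that flow, assuming a hypothetical internal stage named MY_INT_STAGE and a local download directory (GET is run from SnowSQL or another client, not from a worksheet):

-- Hypothetical internal stage for ad hoc extracts
CREATE STAGE IF NOT EXISTS MY_INT_STAGE;

-- Unload the query result into the internal stage as one compressed CSV file
COPY INTO @MY_INT_STAGE/customer_extract.csv.gz
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
SINGLE = TRUE;

-- Download the file to the local machine
GET @MY_INT_STAGE/customer_extract.csv.gz file:///tmp/extracts/;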
# Do you prefer unloading data into multiple files or a single file from Snowflake?

To take advantage of the parallelism provided by Snowflake and get the files faster, I prefer multiple files being generated; this is the default behavior. There is also a 5 GB per-file limit on cloud storage, and if your data is bigger than that you need to generate multiple files.

To get a single file we need to use the COPY option SINGLE=TRUE, as shown below.
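
A minimal sketch of the two modes, assuming a hypothetical stage named UNLOAD_OUTPUT:

-- Default: multiple files are written in parallel, with numeric suffixes added to the prefix
COPY INTO @UNLOAD_OUTPUT/cust_
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER;

-- Force a single file, raising MAX_FILE_SIZE toward the 5 GB cloud storage limit
COPY INTO @UNLOAD_OUTPUT/cust.csv.gz
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
SINGLE = TRUE
MAX_FILE_SIZE = 4900000000;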


# Can you unload data from multiple tables in a COPY command?

The COPY statement accepts a full SELECT query, and hence you can join tables in the COPY command and get data from more than one table.

COPY INTO @[Link].UNLOAD_OUTPUT
FROM (
  SELECT C.*
  FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER C
  JOIN SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.NATION N
    ON C.C_NATIONKEY = N.N_NATIONKEY
)
# How are Parquet files compressed when being unloaded?

The default compression when unloading data from Snowflake in Parquet format is Snappy. We can also use LZO if we wish; however, we need to explicitly provide the compression type if we don't want the default.

For CSV and JSON files the default compression type is gzip. The other supported compression types are bzip2, Brotli and Zstandard.
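
A minimal sketch of overriding the default, assuming a hypothetical stage named UNLOAD_OUTPUT_PARQUET:

-- Explicitly request LZO instead of the default Snappy compression
COPY INTO @UNLOAD_OUTPUT_PARQUET/cust_
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
FILE_FORMAT = (TYPE = PARQUET COMPRESSION = LZO);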
# How do we modify column data types when unloading data?

When using the COPY command we can use the CAST function to modify column data types when unloading data in Parquet format.

COPY INTO @UNLOAD_OUTPUT_PARQUET
FROM (
  SELECT
    CAST(C_CUSTKEY AS STRING) AS C_CUSTKEY,
    CAST(C_NATIONKEY AS STRING) AS C_NATIONKEY
  FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
)
FILE_FORMAT = (TYPE = PARQUET) -- assuming the stage does not already define a Parquet file format
# What problems have you encountered when unloading data?

Unload failures due to files already being present in the target folder are a challenge we have encountered. We resolved that by creating a Lambda function (or an Azure Function) that moves each file to a different folder and adds a timestamp to the file name.
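
Snowflake also has copy options that can avoid the collision at the source; a minimal sketch of those alternatives (not what the project used), assuming a hypothetical stage named UNLOAD_OUTPUT:

-- INCLUDE_QUERY_ID makes the generated file names unique per run, so reruns cannot collide
COPY INTO @UNLOAD_OUTPUT/cust/
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
INCLUDE_QUERY_ID = TRUE;

-- Alternatively, OVERWRITE = TRUE replaces existing files that have the same names
COPY INTO @UNLOAD_OUTPUT/cust/
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
OVERWRITE = TRUE;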
# What problems have you encountered when unloading data?

When unloading data in CSV format, we observed that Snowflake truncates the values in decimal columns to a precision and scale of (15,9).

We overcame this problem by casting the data to string before unloading it.
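
A minimal sketch of that workaround on the sample data (C_ACCTBAL is a decimal column; the stage name is hypothetical):

-- Cast the decimal column to string so the CSV keeps the full value
COPY INTO @UNLOAD_OUTPUT/cust_bal_
FROM (
  SELECT C_CUSTKEY,
         CAST(C_ACCTBAL AS STRING) AS C_ACCTBAL
  FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
);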
# How can you limit the size of a file generated by unloading data from Snowflake?

We need to use the MAX_FILE_SIZE copy option to limit the size of the files. The value is specified in bytes. The files generated are around 16 MB in size by default; you can make the files smaller by lowering MAX_FILE_SIZE.

COPY INTO @[Link].UNLOAD_OUTPUT
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
MAX_FILE_SIZE=10000000

Why have smaller files instead of one big file?

• Faster processing
• Able to take advantage of parallel processing
• Easier to consume by other applications, which may have size limits
Thank you!

Learn2CloudData Solutions
