##############################################################
############## cloud big query ##############
Cloud BigQuery : fully managed analytics data warehouse,
enterprise-level data warehouse,
for analytics and dashboard purposes,
sync for all data reports,
no-ops platform
GCP --> navigation menu ---> Big Data --> BigQuery
A database is called a dataset in BigQuery.
Resources ---> create dataset
data location (data locality principle)
default table expiration
encryption (Google-managed or customer-managed)
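A minimal command-line sketch of creating a dataset with these settings (the dataset name, location, expiration value, and description are placeholders, not from the notes):

bq mk --dataset --location=US --default_table_expiration=7200 --description "example reporting dataset" mydataset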
dataset ---> create table (empty table, Google Cloud Storage, upload, Drive, Google Cloud Bigtable)
---> project name
---> dataset name, table name, table type
---> partitioning and clustering (see the sketch below)
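A sketch of the same table creation with partitioning and clustering from the command line (mydataset.sales and the field names are illustrative placeholders):

bq mk --table \
  --time_partitioning_field=sale_ts --time_partitioning_type=DAY \
  --clustering_fields=region \
  mydataset.sales sale_ts:TIMESTAMP,region:STRING,amount:FLOAT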
GCP --> navigation menu ---> Big Data --> BigQuery ---> query editor
---> query history
---> saved queries
---> job history
---> transfers
---> scheduled queries
---> BI Engine
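A simple query that can be pasted into the query editor or run through bq; the public dataset bigquery-public-data.usa_names.usa_1910_2013 is used here only as an illustrative example:

bq query --nouse_legacy_sql \
'SELECT name, SUM(number) AS total
 FROM `bigquery-public-data.usa_names.usa_1910_2013`
 GROUP BY name
 ORDER BY total DESC
 LIMIT 10'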
BigQuery partitions: used for low-cardinality columns,
column-based partitions,
data is stored within a partition,
no need to manually shard tables (DDL sketch below).
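A DDL sketch of a date-partitioned, clustered table (table and column names are placeholders):

bq query --nouse_legacy_sql \
'CREATE TABLE mydataset.sales_by_day (
   sale_date DATE,
   region STRING,
   amount FLOAT64
 )
 PARTITION BY sale_date
 CLUSTER BY region'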
BigQuery advantages: I/O: 100k disks in parallel; separation of storage and compute; CPU: 100 CPUs;
Jupiter network
Best practices for BigQuery:
query performance: avoid SQL anti-patterns, avoid SELECT *, sample data using the preview option,
check the price before running,
limit the cost before running by restricting the number of bytes billed (example below), partition data
by date to retrieve data efficiently, materialize query results in stages, consider the cost of large
result sets, use streaming inserts with caution
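For example, the bytes-billed restriction mentioned above can be enforced per query with --maximum_bytes_billed; the query fails instead of billing past the cap (table name and limit are placeholders):

bq query --nouse_legacy_sql --maximum_bytes_billed=1000000000 \
'SELECT region, SUM(amount) AS total
 FROM mydataset.sales_by_day
 WHERE sale_date = DATE "2024-01-01"
 GROUP BY region'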
BigQuery limits are documented at cloud.google.com/bigquery/quotas.
Storage optimization
BigQuery:
any size, from anywhere
BigQuery calculates the amount of data a query processes, adds computation accordingly, and charges on a
per-query basis.
Features: flexible data ingestion, global availability, security and permissions, cost controls, highly
available, fully integrated, connects with Google products, automatic data transfer service
When to use BigQuery:
when you have structured data and do not need a low-latency system
BigQuery ---> charges for the size of storage used; query charges apply only when a query is executed
BigQuery can be used with Cloud Datalab, Tableau, Qlik, Data Studio, Google Sheets, and shared with co-workers
structured data + analytics workload + low latency === Cloud Bigtable (NoSQL database)
structured data + analytics workload + no low-latency requirement === BigQuery
unstructured data + mobile SDK === Cloud Storage for Firebase
unstructured data === Cloud Storage
structured data + no analytics workload + relational data + no horizontal scaling === Cloud SQL
structured data + no analytics workload + relational data + horizontal scalability === Cloud Spanner
structured data + no analytics workload + non-relational data + mobile SDK === Firebase Realtime DB
Pricing in BigQuery: calculated according to slots:
slots are units of computation capacity
slots are automatically calculated for each query
flat-rate pricing can be used to purchase a fixed number of slots
Estimation: you can estimate a query before submitting it,
in the online console or with a dry run in bq (example below)
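A dry run validates the query and reports the estimated bytes it would process, without running it or incurring charges (the table is a placeholder):

bq query --dry_run --nouse_legacy_sql \
'SELECT region, amount FROM mydataset.sales_by_day WHERE sale_date >= DATE "2024-01-01"'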
BigQuery command line:
bq : command-line utility
bq load ---> load data into a table:
$ bq --location=US load --source_format=CSV mydataset.mytable ./myfile.csv qtr:STRING,sales:FLOAT,year:STRING
$ bq --location=asia-northeast1 load --source_format=CSV mydataset.mytable ./myfile.csv qtr:STRING,sales:FLOAT,year:STRING
$ bq mk --table [PROJECT_ID]:[DATASET].[TABLE] [SCHEMA]
$ bq mk --table mydataset.mytable qtr:STRING,sales:FLOAT,year:STRING
$ bq mk --table mydataset.mytable ./myschema.json
mk --> make
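The schema file passed to bq mk above is a JSON array of field definitions; a sketch matching the inline schema used in these examples (the modes are assumed):

[
  {"name": "qtr",   "type": "STRING", "mode": "REQUIRED"},
  {"name": "sales", "type": "FLOAT",  "mode": "NULLABLE"},
  {"name": "year",  "type": "STRING", "mode": "NULLABLE"}
]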
bq head --max_rows=10 mydataset.mytable ====> see the first 10 rows of a table
bq head myotherproject:mydataset.mytable =====> head command to preview a few rows without running a query
bq head --start_row 100 --selected_fields "field1,field2" mydataset.mytable ====> view selected fields starting at row 100
Copy Table
bq --location=US cp mydataset.mytable mydataset2.mytable2
bq --location=asia-northeast1 cp mydataset.mytable mydataset2.mytable2
Exporting Data
bq --location=US extract --compression GZIP 'mydataset.mytable' gs://example-bucket/myfile.csv
bq --location=US extract --destination_format NEWLINE_DELIMITED_JSON 'mydataset.mytable' gs://example-bucket/myfile.json
bq --location=US extract --destination_format AVRO --compression SNAPPY 'mydataset.mytable' gs://example-bucket/myfile.avro
Delete
bq rm -t myotherproject:mydataset.mytable
bq rm -t mydataset.mytable
bq ls --format=prettyjson --project_id [PROJECT_ID]
bq ls --format=prettyjson
bq show --format=prettyjson [PROJECT_ID]:[DATASET]
bq query --nouse_legacy_sql \
'SELECT * EXCEPT(schema_owner) FROM INFORMATION_SCHEMA.SCHEMATA'
Update Dataset
bq update --description "Description of mydataset" mydataset
bq update --default_table_expiration 7200 mydataset
Remove
bq rm -r -f -d [PROJECT_ID]:[DATASET]
Access Control
bq show --format=prettyjson [PROJECT_ID]:[DATASET] > [PATH_TO_FILE]
bq update --source [PATH_TO_FILE] [PROJECT_ID]:[DATASET]
############ IAM roles for BigQuery ##########
permissions : bigquery.jobs.<create/listAll/list/get>
bigquery.datasets.<delete/get/insert>
predefined roles : roles/bigquery.<metadataViewer/dataViewer/dataEditor>
primitive roles : dataset READER/WRITER/OWNER
project editor/owner ...
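A sketch of granting one of these predefined roles at the project level with gcloud (the project ID and user email are placeholders):

gcloud projects add-iam-policy-binding my-project \
  --member="user:analyst@example.com" \
  --role="roles/bigquery.dataViewer"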