
Data Engineering Essentials and Skills

Uploaded by

Jk nayak

Data engineering

Data engineering builds systems that collect data from various sources, such as RDBMSs, Excel, CSV, and PDF files, and store it in a warehouse for predictive analysis, data science, machine learning, and AI.

• Data can be structured, semi-structured, or unstructured.
• In earlier times very little data was generated; now huge volumes are generated. The question is how to use this data to make key decisions for business development.
• The data engineer's role is to build data pipelines that move data from various sources into a data warehouse, so that data scientists and business analysts can make well-founded decisions.
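The pipeline idea above can be sketched in a few lines of Python. This is a minimal illustration only, assuming two made-up sources (a CSV string and a JSON string) and using an in-memory SQLite database as a stand-in for a real warehouse. The table name `orders`, the sample data, and the function names are all hypothetical.

```python
import csv
import io
import json
import sqlite3

# Hypothetical sample inputs standing in for real source systems.
CSV_SOURCE = "order_id,customer,amount\n1,alice,120.50\n2,bob,80.00\n"
JSON_SOURCE = '[{"order_id": 3, "customer": "alice", "amount": 40.25, "note": "gift"}]'


def extract():
    """Read rows from a structured (CSV) and a semi-structured (JSON) source."""
    rows = list(csv.DictReader(io.StringIO(CSV_SOURCE)))
    rows.extend(json.loads(JSON_SOURCE))
    return rows


def transform(rows):
    """Normalize types and keep only the columns the warehouse table expects."""
    return [
        (int(r["order_id"]), str(r["customer"]).strip().lower(), float(r["amount"]))
        for r in rows
    ]


def load(conn, records):
    """Write the cleaned records into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", records)
    conn.commit()


def run_pipeline():
    conn = sqlite3.connect(":memory:")  # in-memory stand-in for a warehouse
    load(conn, transform(extract()))
    return conn


conn = run_pipeline()
# An analyst-style question answered with SQL on the loaded data.
totals = conn.execute(
    "SELECT customer, SUM(amount) FROM orders GROUP BY customer ORDER BY customer"
).fetchall()
print(totals)
```

The final query shows why the data is loaded into one place: once both sources share a table, a single SQL statement can answer a business question across them.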
Skills for data engineers
1. SQL (full knowledge)
2. Python (basic knowledge: data types, loop statements, lists, tuples, dictionaries, classes, functions, etc.)
3. Distributed computing frameworks: Hadoop basics (disk-based MapReduce, generally slower) and Spark architecture (in-memory processing, generally faster), and their uses
4. PySpark (Python + Spark)
5. Cloud technology (AWS, Azure, GCP, OCI)
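The distributed computing frameworks in item 3 share a map, shuffle, reduce pattern. The sketch below imitates that pattern in a single Python process so the three steps are visible. It is not Hadoop or Spark code, and the partitions and function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical input, pre-split into partitions as a cluster would hold it.
PARTITIONS = [
    ["spark is fast", "hadoop is reliable"],
    ["spark runs in memory"],
]


def map_phase(lines):
    """Emit a (word, 1) pair for every word in one partition."""
    return [(word, 1) for line in lines for word in line.split()]


def shuffle(mapped_partitions):
    """Group values by key across all partitions."""
    grouped = defaultdict(list)
    for pairs in mapped_partitions:
        for key, value in pairs:
            grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Combine the values for each key into a single count."""
    return {key: sum(values) for key, values in grouped.items()}


# Each partition is mapped independently; on a cluster this step runs in parallel.
word_counts = reduce_phase(shuffle(map_phase(p) for p in PARTITIONS))
print(word_counts["spark"], word_counts["is"])
```

On a real cluster each partition would live on a different machine, and the shuffle step would move data over the network. That movement is usually the expensive part.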
Azure services:
i) Azure Data Factory: a data orchestration tool used for ETL or ELT processes. It is largely a low-code tool: pipelines are built mainly with drag-and-drop options, although expressions and JSON definitions are also available.
ii) Azure Databricks: a managed cloud platform built on Apache Spark, developed jointly by Databricks and Microsoft. Here we can write code in Python, R, Scala, or SQL (Java is supported through compiled JAR jobs rather than notebooks).
iii) Azure Synapse Analytics: used for the data warehouse.
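Azure Data Factory is described above as a tool for ETL or ELT processes. The two differ only in where the transformation happens, which the following sketch tries to show. It uses Python's built-in `sqlite3` module as a stand-in data store, not Azure Data Factory itself. The table names and sample rows are hypothetical.

```python
import sqlite3

# Hypothetical raw records as they might arrive from a source system.
RAW_ROWS = [("  Alice ", "120.50"), ("BOB", "80"), ("alice", "40.25")]


def etl(conn):
    """ETL: clean the data in Python first, then load the finished rows."""
    cleaned = [(name.strip().lower(), float(amount)) for name, amount in RAW_ROWS]
    conn.execute("CREATE TABLE sales_etl (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales_etl VALUES (?, ?)", cleaned)


def elt(conn):
    """ELT: load the raw rows as-is, then transform them with SQL in the store."""
    conn.execute("CREATE TABLE sales_raw (customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO sales_raw VALUES (?, ?)", RAW_ROWS)
    conn.execute(
        "CREATE TABLE sales_elt AS "
        "SELECT LOWER(TRIM(customer)) AS customer, CAST(amount AS REAL) AS amount "
        "FROM sales_raw"
    )


conn = sqlite3.connect(":memory:")
etl(conn)
elt(conn)

query = "SELECT customer, amount FROM {} ORDER BY customer, amount"
etl_rows = conn.execute(query.format("sales_etl")).fetchall()
elt_rows = conn.execute(query.format("sales_elt")).fetchall()
print(etl_rows == elt_rows)
```

Both routes end with the same cleaned table. ELT additionally keeps the raw copy (`sales_raw`) in the store, which can be useful when the transformation rules change later.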

[Architecture diagram]
Sources: database (OLTP system), Excel, CSV, IoT/Internet
-> ADF (Azure Data Factory): collects data from different sources; data integration (ETL/ELT)
-> ADB (Azure Databricks): processes big data
-> Data warehouse (SQL DB)
-> BI
Storage: ADLS Gen2
Roles: data engineer, data analyst

Any questions?
