SRIKALAHASTEESWARA INSTITUTE OF TECHNOLOGY
Dept. of Computer Science and Engineering
Srikalahasti.
SQOOP – A HADOOP
TECHNOLOGY
By
A. Venkatamuni
15381A0526
C.S.E. IV Year
CONTENTS
INTRODUCTION
ARCHITECTURE
IMPORTING DATA
EXPORTING DATA
LIST DATA
CONCLUSION
INTRODUCTION
Sqoop is a tool designed to transfer data between Hadoop and relational
database servers.
It is used to import data from relational databases such as MySQL and
Oracle into Hadoop HDFS, and to export data from the Hadoop file system
back to relational databases.
It is provided by the Apache Software Foundation.
ARCHITECTURE
The architecture of Sqoop consists of connectors, metadata, and the map-reduce
job controller, as shown in the figure.
Connectors are one of the main components of Sqoop; they are responsible for
ensuring that the database driver supplied by the user is connected with Sqoop.
The metadata component stores table internals such as indexes and partitions.
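If the built-in connector cannot determine the right driver, the JDBC driver class can be supplied explicitly with the --driver option. A minimal sketch (the connect string, driver class, and table name are illustrative assumptions):
# explicitly name the JDBC driver class to use for the connection
$ sqoop import \
  --connect jdbc:mysql://localhost/databaseName \
  --driver com.mysql.jdbc.Driver \
  --username user --password password \
  --table tableName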
ARCHITECTURE (Contd.,)
The importing and exporting of data is handled through a map-reduce job,
where the import statement given by the user is converted to a map-reduce
job and submitted to the HDFS cluster.
The map job launches multiple mappers, depending on the number defined by
the user on the command line.
For a Sqoop import, each mapper task is assigned a part of the data to be
imported, based on the key defined on the command line.
Each mapper then creates a connection to the database using JDBC, fetches
the part of the data assigned to it by Sqoop, and writes it into HDFS, Hive,
or HBase based on the option provided on the command line.
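A minimal sketch of this parallelism (the database, table, and split column are illustrative assumptions): --split-by names the key column used to partition the data among the mappers, and --num-mappers sets how many mappers are launched.
# import with 4 parallel mappers, splitting the rows on the 'id' column
$ sqoop import \
  --connect jdbc:mysql://localhost/databaseName \
  --username user --password password \
  --table tableName \
  --split-by id \
  --num-mappers 4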
IMPORTING DATA
Sqoop enables RDBMS users to import table data into any of the Hadoop
platforms using the import command (Apache sqoop import, 2013). The cases
discussed below are importing a table from MySQL to HDFS, HBase, and Hive.
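Import a Table from MySQL to HDFS:
A table in MySQL can be imported to HDFS using the import command. A minimal sketch (the database, table, and target directory names are illustrative assumptions):
# import the whole table into the given HDFS directory
$ bin/sqoop import \
  --connect jdbc:mysql://localhost/databaseName \
  --username user --password password \
  --table tableName \
  --target-dir /user/hadoop/tableName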
Import a Table from MySQL to HBase:
A table in MySQL can be imported to HBase using the command as follows:
Case 1: If the table has a primary key, import all the columns of the MySQL table:
$ bin/sqoop import \
  --connect jdbc:mysql://localhost/databaseName \
  --username user --password password \
  --table tableName \
  --hbase-table hbase_tableName \
  --column-family hbase_table_col1 \
  --hbase-create-table
Case 2: If the table doesn't have a primary key, choose one column as the
HBase row key and import only a few columns:
$ bin/sqoop import \
  --connect jdbc:mysql://localhost/databaseName \
  --username user --password password \
  --table tableName \
  --hbase-table hbase_tableName \
  --columns column1,column2 \
  --column-family hbase_table_col \
  --hbase-row-key column1 \
  --hbase-create-table
IMPORTING DATA (Contd.,)
Import a Table from MySQL to Hive:
A table in MySQL can be imported to Hive using the command as follows:
Case 1: Import a MySQL table into Hive if the table has a primary key.
$ bin/sqoop-import \
  --connect jdbc:mysql://localhost:3306/databaseName \
  --username user --password password \
  --table tableName \
  --hive-table tableName \
  --create-hive-table \
  --hive-import \
  --hive-home path/to/hive_home
Case 2: Import a MySQL table into Hive if the table doesn't have a primary key.
$ bin/sqoop-import \
  --connect jdbc:mysql://localhost:3306/databaseName \
  --username user --password password \
  --table tableName \
  --hive-table tableName \
  --create-hive-table \
  --hive-import \
  --hive-home path/to/hive_home \
  -m 1
EXPORTING DATA
The export tool transfers data back from HDFS to the RDBMS database.
The files given as input to Sqoop contain records, which are called rows
in the table.
These are read and parsed into a set of records, delimited with a
user-specified delimiter.
Syntax:
$ sqoop export (generic-args) (export-args)
$ sqoop-export (generic-args) (export-args)
Example:
It is mandatory that the target table is created manually and is present
in the database to which the data is to be exported.
$ mysql
mysql> USE db;
mysql> CREATE TABLE employee (
    id INT NOT NULL PRIMARY KEY,
    name VARCHAR(20),
    deg VARCHAR(20),
    salary INT,
    dept VARCHAR(10));
EXPORTING DATA (Contd.,)
The following command is used to export the table data (which is in the
emp_data file on HDFS) to the employee table in the db database of the
MySQL database server.
$ sqoop export \
  --connect jdbc:mysql://localhost/db \
  --username root \
  --table employee \
  --export-dir /emp/emp_data
The following command is used to verify the table on the MySQL command
line.
mysql> SELECT * FROM employee;
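If the HDFS files use a non-default field delimiter, it can be declared on export. A sketch assuming the emp_data records are comma-separated:
# tell Sqoop how the input fields are delimited before parsing them into rows
$ sqoop export \
  --connect jdbc:mysql://localhost/db \
  --username root \
  --table employee \
  --export-dir /emp/emp_data \
  --input-fields-terminated-by ','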
LIST DATA
We can list both databases and the tables inside those databases. The Sqoop
list-databases tool parses and executes the ‘SHOW DATABASES’ query against
the database server and then lists out the databases present on the
server.
Syntax:
The following syntax is used for Sqoop list-databases command.
$ sqoop list-databases (generic-args) (list-databases-args)
$ sqoop-list-databases (generic-args) (list-databases-args)
Sample Query:
The following command is used to list all the databases in the MySQL
database server.
$ sqoop list-databases \
  --connect jdbc:mysql://localhost/ \
  --username root
LIST DATA (Contd.,)
The Sqoop list-tables tool lists the tables of a particular database on a
MySQL database server. It parses and executes the ‘SHOW TABLES’ query
against that database and then lists out the tables present in it.
Syntax:
The following syntax is used for Sqoop list-tables command.
$ sqoop list-tables (generic-args) (list-tables-args)
$ sqoop-list-tables (generic-args) (list-tables-args)
Sample Query:
The following command is used to list all the tables in the userdb database
of MySQL database server.
$ sqoop list-tables \
  --connect jdbc:mysql://localhost/userdb \
  --username root
CONCLUSION
Sqoop also provides an “eval” command to run user-defined SQL queries
against the tables (a sample invocation is sketched below). To conclude,
Sqoop transfers bulk data between RDBMS systems and distributed systems
very efficiently. It reduces the unnecessary effort of developers in
coding and maintaining the code. As Sqoop transfers the data in parallel,
the data transfer is also very fast. Because of its contributors and
support, Sqoop is of great help in the Hadoop world. It acts as middleware
between RDBMS and non-relational data stores.
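A minimal sketch of the eval command mentioned above (the query is an illustrative assumption):
# run an ad-hoc SQL query against the database and print the result
$ sqoop eval \
  --connect jdbc:mysql://localhost/db \
  --username root \
  --query 'SELECT * FROM employee LIMIT 3'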
Thank you!
Any queries?