Assignment DBMS

The document discusses Big Data in the context of Database Management Systems (DBMS), highlighting its characteristics, challenges, and differences from traditional DBMS. Key characteristics include the 5 V's: Volume, Velocity, Variety, Veracity, and Value, while challenges involve storage, integration, real-time processing, and data quality. Technologies such as Hadoop, Spark, and NoSQL are essential for managing Big Data, enabling organizations to leverage large datasets for improved decision-making and business growth.

Uploaded by

rickshithaanandakumarsmart

DBMS : Assignment - submission date (27/6/25)

Explain the concept of Big Data in the context of DBMS. Discuss its characteristics, challenges,
and how traditional DBMS differs from Big Data systems. Also, briefly describe the technologies
used to manage Big Data.

Introduction to Big Data in DBMS

Big Data refers to huge amounts of structured, semi-structured, and unstructured data that are
processed to extract meaningful insights. Patterns found in the data support decisions about
pursuing new business opportunities, improving products and services, and ultimately growing
the business. Data science is the process used to make sense of the large volumes of data that a
business collects.

Characteristics of Big Data (The 5 V's)

1. Volume:
Refers to the massive size of data generated daily from multiple sources such as social
media, IoT devices, and transactions.

2. Velocity:
Describes the speed at which new data is generated and needs to be processed in real
time (e.g., social feeds, sensor data).

3. Variety:
Data comes in various formats – structured (tables), semi-structured (XML, JSON), and
unstructured (images, videos, logs).

4. Veracity:
Refers to how trustworthy and accurate the data is, given the inconsistencies and
incompleteness often found in raw data.

5. Value:
The ability to derive meaningful insights and business value from the data collected.

Types of Big Data

Type | Description | Examples
Structured | Data organized in rows and columns, easy to query using SQL. | Databases, spreadsheets, online transaction logs
Semi-Structured | Data with some organizational properties but no fixed schema. | XML, JSON, metadata, NoSQL
Unstructured | Data without a pre-defined model or structure. | Images, videos, text files, social media posts
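
As a rough illustration of the three types above, the following Python sketch reads one small sample of each. The sample strings and variable names are made up for this example and are not taken from any real dataset.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields.
structured_src = "order_id,amount\n1,250.0\n2,99.5\n"
rows = list(csv.DictReader(io.StringIO(structured_src)))
total_amount = sum(float(r["amount"]) for r in rows)

# Semi-structured: self-describing keys, but fields may vary per record.
semi_src = '[{"user": "a", "tags": ["x", "y"]}, {"user": "b"}]'
records = json.loads(semi_src)
tag_counts = [len(rec.get("tags", [])) for rec in records]

# Unstructured: free text with no data model; any structure must be derived.
unstructured_src = "Great product, fast delivery. Would buy again!"
word_count = len(unstructured_src.split())

print(total_amount, tag_counts, word_count)
```

The structured sample can be read by column name directly, the semi-structured sample needs a default for the missing field, and the unstructured sample only yields information after some processing (here, a simple word split).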

Sources of Big Data

Big Data is generated from multiple sources.

1. Social Media Platforms: Data from posts, likes, comments, shares on Facebook, Twitter,
Instagram, etc.

2. Sensor-Generated Data: Environmental data (temperature, humidity) and surveillance
footage from traffic or security cameras.

3. Customer Feedback: Reviews and ratings on platforms like Amazon, Flipkart, Myntra, and
service-based sectors.

4. IoT Devices: Smart TVs, ACs, refrigerators, and other devices sending real-time usage and
control data.

5. E-Commerce & Online Transactions: Banking records, shopping history, and digital
payment logs.

6. GPS Devices: Location tracking data from smartphones and vehicles for route
optimization and movement monitoring.

7. Transactional Data: Data from purchases, invoices, receipts, including date, time, items,
and payment methods.

8. Machine-Generated Data: Logs and system data from servers, industrial machines,
wearable devices, and satellites.

Challenges in Big Data Management

1. Storage & Scalability: Traditional single-server systems struggle to scale to petabyte-level
data storage.

2. Data Integration: Combining data from various formats and sources.

3. Real-time Processing: Need for instant processing in areas like fraud detection or live
analytics.

4. Data Quality: Managing errors, inconsistencies, and duplications in large datasets.

5. Security & Privacy: Protecting sensitive and personal information.

6. Analysis Complexity: Requires advanced tools and skills for meaningful insights.
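
The Data Quality challenge listed above can be made concrete with a small sketch. It assumes, purely for illustration, that each record is a dictionary with hypothetical `id` and `email` fields. Real pipelines would apply similar checks at much larger scale, typically with distributed tools.

```python
def clean_records(records):
    """Drop records with missing required fields and records with a repeated id."""
    required = ("id", "email")  # hypothetical required fields
    seen_ids = set()
    cleaned = []
    for rec in records:
        # Reject incomplete records (required field absent or empty).
        if any(rec.get(field) in (None, "") for field in required):
            continue
        # Normalize before comparing so trivial variations do not hide duplicates.
        normalized = {**rec, "email": rec["email"].strip().lower()}
        if normalized["id"] in seen_ids:
            continue
        seen_ids.add(normalized["id"])
        cleaned.append(normalized)
    return cleaned


raw = [
    {"id": 1, "email": "A@example.com "},
    {"id": 1, "email": "a@example.com"},  # duplicate id
    {"id": 2, "email": ""},               # missing email
    {"id": 3, "email": "c@example.com"},
]
cleaned = clean_records(raw)
print(cleaned)
```

Only the first record with each `id` is kept, and incomplete records are dropped rather than repaired. Which rule is appropriate depends on the dataset.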

Difference: Traditional DBMS vs Big Data Systems

Feature | Traditional DBMS | Big Data Systems
Data Type | Primarily structured data | Structured, semi-structured, and unstructured
Scalability | Limited, mainly vertical scaling | Massive, horizontal scaling
Storage | Typically centralized | Distributed across clusters
Processing | Transactional (OLTP) and batch queries | Real-time (streaming) and batch
Technology Used | RDBMS (e.g., MySQL, Oracle) | Hadoop, Spark, NoSQL, etc.
Query Language | SQL | HiveQL, Spark SQL, MapReduce jobs, store-specific query APIs
Flexibility | Rigid schema (schema-on-write) | Schema-on-read
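
The Flexibility row contrasts a rigid schema with schema-on-read. A minimal sketch of the difference follows. It uses Python's built-in sqlite3 module as a stand-in for a traditional RDBMS and plain JSON lines as a stand-in for raw storage in a Big Data system. The table, field names, and sample values are assumptions made for this example.

```python
import json
import sqlite3

# Rigid schema: the table definition is enforced when data is written.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute("INSERT INTO users (id, name) VALUES (?, ?)", (1, "Asha"))
try:
    # A row that violates the declared schema is rejected at write time.
    conn.execute("INSERT INTO users (id, name) VALUES (?, ?)", (2, None))
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
conn.close()

# Schema-on-read: raw records are stored as-is; structure is applied when reading.
raw_lines = [
    '{"id": 1, "name": "Asha"}',
    '{"id": 2, "city": "Chennai"}',  # different fields, still stored
]


def read_names(lines):
    """Apply a 'name' projection at read time, tolerating missing fields."""
    return [json.loads(line).get("name", "unknown") for line in lines]


names = read_names(raw_lines)
print(rejected, names)
```

With the rigid schema, the bad row never enters the table. With schema-on-read, every record is stored and the reader decides how to handle fields that are missing.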

Technologies Used to Manage Big Data

1. Hadoop

o Open-source framework for distributed storage (HDFS) and processing of large datasets using the MapReduce programming model.

2. Apache Spark

o Fast, in-memory data processing engine for batch and near-real-time (streaming) analytics.

3. NoSQL Databases

o Examples: MongoDB, Cassandra, CouchDB

o Handle semi-structured and unstructured data with flexibility and scalability.

4. Hive & Pig

o Tools on top of Hadoop for querying and data processing (Hive uses a SQL-like language, HiveQL; Pig uses a scripting language, Pig Latin).

5. Apache Kafka

o Distributed streaming platform for building real-time data pipelines and streaming apps.

6. Elasticsearch

o Distributed search engine used for indexing and querying large volumes of data quickly.
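
The MapReduce programming model mentioned under Hadoop can be sketched in plain Python as a word count. This is a single-process illustration of the map, shuffle, and reduce steps, not Hadoop's actual API. The function names and input strings are assumptions made for this example.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: combine the values for each key into one result."""
    return {key: sum(values) for key, values in grouped.items()}


# Each string stands in for one input split stored on a different node.
splits = ["big data needs big storage", "data drives decisions"]
mapped = [pair for split in splits for pair in map_phase(split)]
word_counts = reduce_phase(shuffle_phase(mapped))
print(word_counts)
```

In a real cluster the map calls run in parallel on the nodes that hold each split, and the shuffle moves data across the network so that all values for one key reach the same reducer.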

Conclusion

Big Data has revolutionized how modern organizations collect, store, and analyze data. Unlike
traditional DBMS, Big Data systems are designed to handle massive volumes and diverse types
of data, often in real or near-real time. Through technologies like Hadoop, Spark, and NoSQL
databases, businesses can unlock new opportunities, enhance services, and gain competitive
advantages. Understanding the characteristics and management challenges of Big Data is crucial
for developing efficient and future-ready data systems.
