Big Data Analytics - Answer Key
PART A - 2 Marks Each
1. Big Data refers to extremely large datasets that cannot be processed using traditional tools. Examples include
social media data analysis and predictive maintenance in manufacturing.
2. Cloud Computing provides scalable computing resources over the internet, while Big Data refers to
large-scale data storage and analysis.
3. Machine-generated data is data produced automatically by machines without human involvement, e.g., server
logs, IoT sensor data.
4. Hadoop offers distributed storage (HDFS) and parallel processing (MapReduce), making it vital for handling
large datasets efficiently.
5. Types of NoSQL Data Stores: Key-Value Store, Document-Oriented Store, Column-Family Store, Graph
Database.
6. CAP Theorem: A distributed system can only provide two of the following three guarantees at the same time:
Consistency, Availability, Partition Tolerance.
7. Asynchronous Programming allows tasks to run independently without blocking execution, useful for I/O
operations.
8. Scalar Time is represented as a monotonically increasing counter for each event in a system.
9. Clock Skew is the difference in time readings between multiple system clocks in a distributed environment.
10. Transparency hides system complexity from the user. Types include Access, Location, Replication,
Concurrency, and Failure Transparency.
PART B - 16 Marks
7b) Applications of Big Data
Big Data Analytics is applied across many industries, solving complex problems, enhancing efficiency, and
creating competitive advantages. Healthcare: Analyzing patient data for disease prediction, personalized
treatment, and real-time monitoring. Example: IBM Watson Health aids in diagnosis. Retail: Recommending
products based on customer behavior, optimizing inventory, and analyzing customer sentiment. Example:
Amazon's product recommendations. Finance: Fraud detection, risk assessment, and algorithmic trading.
Example: PayPal uses Big Data to detect fraudulent transactions. Smart Cities: Managing urban infrastructure by
monitoring traffic, energy consumption, and public safety using sensors and analytics. Example: Barcelona
optimizes traffic and energy use. Social Media: Sentiment analysis, trend prediction, and targeted advertising.
Example: Twitter sentiment during elections.
8b) Big Data in E-Commerce (Web Analytics)
Web Analytics refers to collecting, measuring, and analyzing user interactions on e-commerce platforms. Role of
Big Data in E-Commerce: 1. Personalized product recommendations based on user behavior (search history,
purchases). 2. Customer behavior analysis via heatmaps to improve UI/UX. 3. Dynamic pricing using real-time
demand and competition data. 4. Fraud detection by monitoring millions of transactions for anomalies. 5. Inventory
management using demand forecasts for efficient stock handling. Case Study Example: Amazon uses Big Data
to analyze clickstream, user reviews, purchase history, and warehouse IoT sensors. Techniques include
collaborative filtering, sentiment analysis, and predictive analytics. Benefits: Improved customer experience,
efficient operations, better security, and data-driven decision-making. Challenges: Data privacy compliance
(GDPR), handling high data volume, and integration complexity.
9b) Key Features of Document-Oriented Database with Diagram
Document-oriented databases store data as documents (JSON, BSON, XML), ideal for handling semi-structured
or unstructured data. Key Features: 1. Schema-less Data Model: Documents can have different structures. 2.
Hierarchical Structure: Supports nested objects and arrays. 3. Scalability: Horizontal scaling via sharding. 4.
Indexed Documents: Fields can be indexed for faster queries. 5. Easy Data Representation: Stores
heterogeneous data, suited for CMS, blogging platforms, etc. Example Diagram: Collection: Users | |----
Document 1: | { | "_id": "001", | "name": "John Doe", | "email": "
[email protected]", | "address": {"street": "123
Street", "city": "NY", "zip": "10001"} | } | |---- Document 2: { "_id": "002", "name": "Jane Smith", "email":
"
[email protected]", "address": {"street": "456 Avenue", "city": "LA", "zip": "90001"}, "phone": "9876543210" }
Use Case Example: MongoDB stores documents in collections, enabling fast and flexible data retrieval and
storage. Conclusion: Document-oriented databases are flexible, scalable, and ideal for applications with varied
data types.