Module 1: Introduction to Information
Storage
Upon completion of this module, you should be able to:
• Describe digital data, types of digital data, and information
• Describe data center and its key characteristics
• Describe key data center management processes
• Describe the evolution of computing platforms
Module 1: Introduction to Information Storage 1
© Copyright 2015 EMC Corporation. All rights reserved.
The Growth of the Digital Universe
• The digital universe is created and defined by software
– Digital data is continuously generated, collected, stored, and
analyzed through software
• The digital universe generates approximately 4.4 trillion GB of
data annually
– Proliferation of IT, Internet usage, social media, and smart devices
adds to data growth
• The Internet of Things (IoT) is also adding to data growth
– IoT is made up of Internet-connected equipment and sensors
Module 1: Introduction to Information Storage 2
© Copyright 2015 EMC Corporation. All rights reserved.
Why Information Storage and Management?
• Organizations are dependent on continuous and reliable access
to information
• Organizations seek to effectively store, protect, process,
manage, and leverage information
• Organizations are increasingly implementing intelligent storage
solutions
– To efficiently store and manage information
– To gain competitive advantage
– To derive new business opportunities
Module 1: Introduction to Information Storage 3
© Copyright 2015 EMC Corporation. All rights reserved.
What is Digital Data?
Digital Data
A collection of facts that is transmitted and stored in electronic form, and
processed through software.
Video
Laptop
Text 10101011010 11101011011 11101110100
00110101110 10011101001 11100100010
10101010101 01110111011 10111011101
Desktop Photos Internal or
Digital Data
External Storage
Tablet and Mobile
Module 1: Introduction to Information Storage 4
© Copyright 2015 EMC Corporation. All rights reserved.
Types of Digital Data
• Data that has no inherent structure and is
Unstructured usually stored as different types of files.
• E.g. Text documents, PDFs, images, and videos
Increasing Growth
• Textual data with erratic formats that can be
Quasi-Structured formatted with effort and software tools
• E.g. Clickstream data
• Textual data files with an apparent pattern,
Semi-Structured enabling analysis
• E.g. Spreadsheets and XML files
• Data having a defined data model, format,
Structured structure
• E.g. Database
Module 1: Introduction to Information Storage 5
© Copyright 2015 EMC Corporation. All rights reserved.
What is Information?
Information
Processed data that is presented in a specific context to enable useful
interpretation and decision-making.
• Example: Annual sales data processed into a sales report
– Enables calculation of the average sales for a product and the
comparison of actual sales to projected sales
• New architectures and technologies have emerged for
extracting information from non-structured data
Module 1: Introduction to Information Storage 6
© Copyright 2015 EMC Corporation. All rights reserved.
Information Storage
• Information is stored on storage devices on non-volatile media
• Types of storage devices:
– Magnetic storage devices: Hard disk drive and magnetic tape
– Optical storage devices: Blu-ray disc, DVD, and CD
– Flash-based storage devices: Solid state drive, memory card, and USB
thumb drive
• Storage devices are assembled within a storage system or “array”
– Provides high capacity, scalability, performance, reliability, and security
• Storage systems along with other IT infrastructure are housed in a
data center
Module 1: Introduction to Information Storage 7
© Copyright 2015 EMC Corporation. All rights reserved.
What is a Data Center?
Data Center
A facility that houses IT equipment including compute, storage, and
network components, and other supporting infrastructure for providing
centralized data-processing capabilities.
• A data center comprises:
– Facility: The building and floor space where the data center is
constructed
– IT equipment: Compute, storage, and network equipment
– Support infrastructure: Power supply, fire detection, HVAC, and
security systems
Module 1: Introduction to Information Storage 8
© Copyright 2015 EMC Corporation. All rights reserved.
Key Characteristics of a Data Center
Availability
Data Integrity Security
Manageability
Performance Capacity
Scalability
Module 1: Introduction to Information Storage 9
© Copyright 2015 EMC Corporation. All rights reserved.
Key Data Center Management Processes
Management Process Description
Monitoring Continuously gathering information on data center
resources
Reporting Presenting the details on resource performance,
capacity, and utilization
Provisioning Configuring and allocating resources to meet the
capacity, availability, performance, and security
requirements
Planning Estimating the amount of resources required to
support business operations
Maintenance Ensuring the proper functioning of resources and
resolving incidents
Module 1: Introduction to Information Storage 10
© Copyright 2015 EMC Corporation. All rights reserved.
Evolution of Computing Platforms
PLATFORM 3
Cloud Big Data Mobile Social
BILLIONS OF USERS Mobile Devices MILLIONS OF APPS
PLATFORM 2
LAN/Internet Client/Server
HUNDREDS OF MILLIONS OF USERS PC TENS OF THOUSANDS OF APPS
PLATFORM 1
Mainframe, Mini Computer
MILLIONS OF USERS Terminals THOUSANDS OF APPS
Module 1: Introduction to Information Storage 11
© Copyright 2015 EMC Corporation. All rights reserved.
First Platform
• Based on mainframes Data Center
– Applications and databases hosted
centrally
Mainframe
– Users connect to mainframes through (Applications
and data)
terminals
• Challenges with mainframes
– Substantial CAPEX and OPEX
• High acquisition costs
• Considerable floor space and energy Terminals
requirements
Module 1: Introduction to Information Storage 12
© Copyright 2015 EMC Corporation. All rights reserved.
Second Platform
• Based on client-server model
Data Center
– Distributed application architecture
– Servers receive and process requests Web Server Application Server Database Server
for resources from clients
– Users connect through a client
program or a web interface
LAN/WAN
• Challenges with client-server model Request Response
– Creation of IT silos
– Hardware and software maintenance
overhead
– Scalability to meet the growth of
users and workloads Clients
(Client software or web browser)
Module 1: Introduction to Information Storage 13
© Copyright 2015 EMC Corporation. All rights reserved.
Third Platform
CLOUD BIG DATA MOBILE SOCIAL
The four Pillars of the Third Platform
• The four pillars are transforming the way organizations are
using technology for business operations
Module 1: Introduction to Information Storage 15
© Copyright 2015 EMC Corporation. All rights reserved.
Module 1: Summary
Key points covered in this module:
• Digital data, types of digital data, and information
• Data center and its key characteristics
• Key data center management processes
• Evolution of computing platforms
Module 1: Introduction to Information Storage 17
© Copyright 2015 EMC Corporation. All rights reserved.