0% found this document useful (0 votes)
9 views12 pages

Unit 2 Cloud Computing

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views12 pages

Unit 2 Cloud Computing

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Unit-2-Cloud Computing -23UCSCE56-2

Content Delivery Services: Amazon CloudFront - Windows Azure Content


Delivery Network
Analytics Services: Amazon Elastic MapReduce - Google MapReduce
Service - Google BigQuery - Windows Azure HDInsight
Deployment and Management Services: Amazon Elastic Beanstack -
Amazon CloudFormation
Identity and Access Management Services: Amazon Identiy and Access
Management - Windows Azure Active Directory
Open Source Private Cloud Software: CloudStack – Eucalyptus -
OpenStack

Content Delivery Services

1. Introduction to Content Delivery in Cloud Computing


 Content Delivery Network (CDN):
A network of geographically distributed servers that deliver web
content, applications, media, and files to users based on their
location.
 Goal: Reduce latency, increase availability, and improve performance.
 How it works:
o Data (content, images, videos, applications) is cached at edge
locations (closest to end users).
o User requests are routed to the nearest edge server instead of
the origin server.
o Improves speed, scalability, and security.

Examples: Amazon CloudFront, Windows Azure CDN, Akamai, Cloudflare.

2. Amazon CloudFront

Definition
 Amazon CloudFront is a fast and secure CDN service by AWS that
delivers web pages, videos, APIs, and applications globally with low
latency.

How It Works
1. Content is stored in an origin server (Amazon S3 bucket, EC2
instance, or custom server).
2. CloudFront distributes content to edge locations worldwide.
3. User requests are directed to the nearest edge location using DNS-
based routing.
4. If the edge location has cached content, it is delivered directly.
5. If not, CloudFront fetches it from the origin and caches it for future
requests.
Key Features
 Global Network: 400+ edge locations worldwide.
 Low Latency: Routes requests to nearest server.
 Security: Supports SSL/TLS, AWS Shield (DDoS protection).
 Integration: Works seamlessly with AWS S3, EC2, Elastic Load
Balancer.
 Dynamic Content Support: Delivers personalized, real-time data.

Use Cases
 Video streaming (Netflix, Prime Video).
 Gaming applications (low latency).
 E-commerce websites (faster product images, catalogs).
 API acceleration.

3. Windows Azure Content Delivery Network (Azure CDN)

Definition
 Azure CDN is a global CDN service by Microsoft Azure to deliver
web content, applications, and streaming media quickly to users.

How It Works
1. Content is stored in Azure storage (Blob, Web Apps, or Media
Services).
2. Content is cached across multiple edge servers worldwide.
3. When a user requests content, Azure CDN delivers it from the nearest
edge location.
4. Updates are propagated to all cache servers automatically.

Key Features
 Global Coverage: 100+ edge servers in multiple continents.
 Scalability: Handles high traffic and sudden demand spikes.
 Integration: Works with Azure Blob Storage, Media Services, and Web
Apps.
 Optimization: Caches both static and dynamic content.
 Security: HTTPS support, token-based authentication.

Use Cases
 Delivering static files (images, CSS, JavaScript).
 Streaming media (videos, music, podcasts).
 Website acceleration for global users.
 Handling large downloads (software, updates).

4. Comparison: Amazon CloudFront vs Azure CDN


Feature Amazon CloudFront (AWS) Azure CDN (Microsoft)
Integration Works with AWS (S3, EC2) Works with Azure (Blob, Web Apps)
Global Coverage 400+ edge locations 100+ edge servers
Security AWS Shield, WAF, SSL/TLS HTTPS, token authentication
Dynamic Content Supported Supported
Use Case Fit Video streaming, APIs, gaming Web apps, static files, media services
5. Advantages of CDN in Cloud Computing
 Reduced Latency: Faster content delivery.
 Scalability: Handles heavy traffic without overloading servers.
 Reliability: Redundant servers ensure uptime.
 Cost Savings: Less bandwidth usage on origin servers.
 Security: Protects against DDoS and provides encryption.

Analytics Services

1. Introduction to Analytics in Cloud Computing


 Definition: Cloud analytics is the use of cloud-based services and
tools to analyze large-scale datasets for decision-making.
 Need:
o Explosion of Big Data (logs, IoT data, social media,
transactions).
o Traditional on-premise systems are costly and less scalable.
 Benefits of Cloud Analytics:
o Elastic scaling (handle TB to PB of data).
o Cost efficiency (pay-as-you-go).
o Integration with storage and compute resources.

2. Amazon Elastic MapReduce (Amazon EMR)

Definition
 Amazon EMR is a cloud big data processing service that uses open-
source frameworks like Apache Hadoop, Spark, HBase, and Presto
to analyze massive amounts of data.
How It Works
1. User uploads datasets to Amazon S3 (or DynamoDB, RDS).
2. EMR creates a cluster of EC2 instances.
3. Frameworks like Hadoop/Spark process the data.
4. Output is stored in S3, DynamoDB, or Redshift.
Key Features
 Scalable clusters (add/remove nodes easily).
 Integrated with AWS ecosystem.
 Pay-as-you-go (no upfront hardware).
 Support for batch, streaming, and machine learning workloads.
Use Cases
 Log processing (web servers, IoT).
 Machine learning model training.
 Data mining for e-commerce recommendations.
 Bioinformatics (genomic data).

3. Google MapReduce Service

Definition
 Google MapReduce is a programming model and processing
technique for analyzing large datasets across a distributed cluster. It
inspired Hadoop.
Working Model
 Map phase: Breaks input data into key-value pairs.
 Shuffle phase: Groups intermediate results by key.
 Reduce phase: Aggregates the grouped data and produces final
results.

Features
 Automatic parallelization across nodes.
 Fault tolerance (failed tasks rerun).
 Scalability (works on thousands of machines).
Use Cases
 Word count in large text corpora.
 Indexing web pages (Google Search originally).
 Log analysis, transaction analysis.

4. Google BigQuery

Definition
 Google BigQuery is a serverless, fully managed, petabyte-scale data
warehouse for real-time analytics.
How It Works
 Upload data into Google Cloud Storage or stream directly into
BigQuery.
 Query using SQL-like syntax.
 BigQuery automatically allocates resources and optimizes execution.
Features
 Serverless: No need to manage infrastructure.
 Real-time analytics: Query streaming data instantly.
 Massive scalability: Handles petabytes.
 Integration: Works with Google Data Studio, AI/ML APIs.
Use Cases
 Business Intelligence (dashboards, reports).
 Marketing analytics (customer segmentation).
 IoT data analysis.
 Healthcare data (patient trends, predictive analysis).

5. Windows Azure HDInsight

Definition
 HDInsight is a cloud-based big data analytics service by Microsoft
Azure that provides managed clusters for Hadoop ecosystem
frameworks.
Supported Frameworks
 Apache Hadoop (batch processing)
 Apache Spark (in-memory, real-time analytics)
 Apache Hive, Pig (SQL-like processing)
 Apache HBase (NoSQL storage)
 Apache Storm (real-time streaming analytics)
Features
 Fully managed Hadoop/Spark clusters.
 Integration with Azure Data Lake, SQL Database, Power BI.
 Cost-effective: Scale up or down clusters easily.
 Secure with Azure Active Directory integration.

Use Cases
 Processing IoT sensor data.
 Fraud detection in finance.
 Predictive analytics for retail.
 Large-scale ETL (Extract, Transform, Load) pipelines.

6. Comparison Table
Service Provider Technology Base Key Strengths
Flexible, integrates with AWS
Amazon EMR AWS Hadoop, Spark
ecosystem
Google
Google Proprietary (basis of Hadoop) Scalable, fault-tolerant, foundational
MapReduce
Google Serverless SQL Data
Google BigQuery Real-time analytics, BI integration
Cloud Warehouse
Enterprise integration, Power BI
Azure HDInsight Microsoft Hadoop, Spark, Storm
support

Deployment and Management Services

1. Introduction
 In cloud computing, deployment and management services simplify
how applications and infrastructure are launched, configured, and
maintained.
 Deployment Services: Help developers launch applications quickly
without worrying about infrastructure management.
 Management Services: Provide tools to define, provision, and manage
cloud infrastructure automatically.
Two important AWS services:
1. Amazon Elastic Beanstalk → Focused on application deployment &
management.
2. Amazon CloudFormation → Focused on infrastructure provisioning
(Infrastructure as Code).

2. Amazon Elastic Beanstalk

Definition
 Amazon Elastic Beanstalk is a Platform as a Service (PaaS) that
enables developers to deploy and manage web applications without
handling underlying infrastructure.

How It Works
1. Developer uploads application code (Java, Python, PHP, [Link], .NET,
Ruby, Go, etc.).
2. Beanstalk automatically handles:
o Provisioning EC2 instances
o Load balancing
o Auto-scaling
o Monitoring with CloudWatch
3. Developers focus only on writing code.

Features
 Supports multiple programming languages & frameworks.
 Automatically scales applications up or down.
 Integrated with AWS services (S3, RDS, CloudWatch, EC2).
 Provides a web console, CLI, and API for management.

Use Cases
 Hosting web applications and APIs.
 Building microservices-based applications.
 Deploying enterprise applications without deep AWS expertise.

3. Amazon CloudFormation

Definition
 Amazon CloudFormation is a service for Infrastructure as Code
(IaC) that allows users to define and provision cloud resources using
JSON or YAML templates.

How It Works
1. User writes a CloudFormation template describing AWS resources
(EC2, VPC, S3, RDS, Load Balancers, etc.).
2. CloudFormation reads the template and automatically creates the
resources in correct order.
3. Infrastructure can be updated by modifying the template.

Features
 Infrastructure as Code → Templates ensure consistency and
repeatability.
 Automation → Automatically provisions, configures, and manages
resources.
 Rollback → If deployment fails, CloudFormation reverts changes.
 Stack Management → Groups related resources into "stacks" for
easier control.

Use Cases
 Provisioning a multi-tier application stack (web servers, app servers,
databases).
 Managing complex infrastructure deployments.
 Automating DevOps pipelines (CI/CD).
 Creating disaster recovery setups.
4. Comparison: Elastic Beanstalk vs CloudFormation
Feature Elastic Beanstalk CloudFormation
Type PaaS (Platform as a Service) IaC (Infrastructure as Code)
Focus Application deployment & scaling Infrastructure provisioning & management
User Effort Minimal (upload code → runs automatically) Higher (requires JSON/YAML templates)
Control Limited control (abstracts infrastructure) Full control (custom infrastructure)
Best For Developers who want fast deployments Architects/DevOps who want infra control

5. Advantages of Deployment & Management Services


 Simplified Operations → Automates routine tasks.
 Scalability → Applications and infrastructure scale dynamically.
 Consistency → Avoids manual configuration errors.
 Cost Efficiency → Pay-as-you-go with optimized resource usage.
 Faster Delivery → Helps in Agile and DevOps environments.

Identity and Access Management (IAM) Services

1. Introduction to IAM in Cloud Computing


 Definition:
Identity and Access Management (IAM) is a framework of policies,
tools, and technologies that ensures the right users have the right
access to the right resources in a cloud environment.
 Goals of IAM:
o Authentication – Verifying a user’s identity (who are you?).
o Authorization – Granting access permissions (what can you
do?).
o Accountability – Tracking and monitoring user actions.
 Importance in Cloud Computing:
o Multi-tenant environments require strict control.
o Protects sensitive resources.
o Enables secure collaboration across teams and organizations.

2. Amazon Identity and Access Management (IAM)

Definition
 Amazon IAM is an AWS service that helps you securely control access
to AWS resources for users, groups, and applications.

Key Features
1. User & Group Management
o Create and manage AWS users and groups.
o Assign unique security credentials (username, password, access
keys).
2. Access Policies
o JSON-based policy documents define permissions.
o Follows the least privilege principle.
3. Roles
o Temporary access to AWS services for applications or external
users.
4. Multi-Factor Authentication (MFA)
o Adds an extra layer of security (password + OTP).
5. Integration
o Works with AWS services like EC2, S3, DynamoDB, Lambda.

How It Works
 Admin creates users/groups.
 Assigns policies (e.g., S3 full access, EC2 read-only).
 Users authenticate and access AWS services as per assigned rights.

Use Cases
 Providing developers access to S3 and EC2 with limited rights.
 Allowing third-party apps to use temporary access credentials.
 Enforcing MFA for administrators.

3. Windows Azure Active Directory (Azure AD)

Definition
 Azure Active Directory (Azure AD) is Microsoft’s cloud-based IAM
service that provides identity management and access control for
Azure resources, Microsoft 365, and external SaaS applications.

Key Features
1. Single Sign-On (SSO)
o Access multiple apps (Office 365, Salesforce, Dropbox, etc.) with
one login.
2. Role-Based Access Control (RBAC)
o Assigns roles to users, controlling their permissions across
resources.
3. Multi-Factor Authentication (MFA)
o Strengthens security with OTP, biometric, or device-based
authentication.
4. Hybrid Identity
o Integrates with on-premises Active Directory.
o Enables seamless login for enterprise users.
5. Conditional Access
o Policies based on user location, device, or risk level.
6. Integration with Microsoft Ecosystem
o Works with Azure services, Teams, Outlook, SharePoint, Power
BI, etc.

How It Works
 Users authenticate via Azure AD credentials.
 Based on roles and policies, Azure AD grants or denies access.
 Supports SSO across cloud and on-prem apps.
Use Cases
 Enterprise employee login for Microsoft 365 apps.
 Controlling access to Azure resources (VMs, Storage).
 B2C (Business-to-Customer) identity management.
 Secure collaboration across multiple organizations.

4. Comparison: AWS IAM vs Azure AD


Feature AWS IAM Azure Active Directory
Scope AWS resources (S3, EC2, DynamoDB) Microsoft ecosystem (Azure, M365, SaaS apps)
Access Control Policies (JSON-based) RBAC + Conditional Access
User Types Users, Groups, Roles Users, Groups, Roles, Enterprise directories
MFA Support Yes Yes
Single Sign-On (SSO) Limited (via external integration) Built-in (for 1000+ SaaS apps)
Integration AWS-specific Microsoft + third-party apps

5. Advantages of IAM Services


 Centralized access control.
 Stronger security through MFA and conditional access.
 Easy onboarding/offboarding of employees.
 Reduces insider threats (least privilege principle).
 Improves user experience (SSO).

Open Source Private Cloud Software

1. Introduction to Private Cloud and Open Source Software


 Private Cloud
o A private cloud is a cloud infrastructure operated solely for a
single organization.
o Provides control, security, and customization.
o Can be hosted on-premises or by a third-party provider.
 Why Open Source Private Cloud?
o Flexibility to customize.
o Cost-effective (no vendor lock-in).
o Community-driven support and development.
o Better integration with hybrid and multi-cloud solutions.

Examples of major open-source private cloud platforms:


1. Apache CloudStack
2. Eucalyptus
3. OpenStack

2. CloudStack

Definition
 CloudStack is an open-source Infrastructure-as-a-Service (IaaS)
platform developed by Apache Software Foundation.
 Provides management of large networks of virtual machines.
Key Features
 Supports compute orchestration, networking, and storage.
 Multi-hypervisor support (XenServer, KVM, VMware, Hyper-V).
 Multi-tenant environment with role-based access.
 Web-based user interface and RESTful API.
 Load balancing, high availability, and auto-scaling.

Architecture
 Management Server: Orchestrates resources.
 Cloud Infrastructure: Consists of zones, pods, clusters, and hosts.
 User Interface: Provides dashboard for users/admins.
Use Cases
 Enterprise private clouds.
 Service providers delivering IaaS to customers.
 Hybrid cloud integrations.

3. Eucalyptus

Definition
 Eucalyptus (Elastic Utility Computing Architecture for Linking
Your Programs to Useful Systems) is an open-source software for
building AWS-compatible private and hybrid clouds.

Key Features
 Fully compatible with Amazon Web Services (AWS APIs).
 Manages compute (EC2), storage (S3, EBS), and networking
resources.
 Supports Linux-based hypervisors like Xen and KVM.
 Allows hybrid deployments (on-premises + AWS integration).
Architecture
 Cloud Controller (CLC): Entry point for managing cloud resources.
 Cluster Controller (CC): Manages a group of nodes.
 Node Controller (NC): Runs on each node to control VM instances.
 Storage Controller (SC): Provides block storage services.
 Walrus: Provides object storage like Amazon S3.
Use Cases
 Enterprises requiring AWS-compatible private clouds.
 Research labs and universities for cloud experiments.
 Organizations needing hybrid cloud (on-prem + AWS).

4. OpenStack

Definition
 OpenStack is the most popular open-source cloud operating
system.
 Provides a complete IaaS platform to manage compute, storage, and
networking resources.
Key Features
 Modular architecture (different services for different resources).
 Multi-tenant support with RBAC.
 Strong ecosystem supported by IBM, Red Hat, HP, and Intel.
 Supports multiple hypervisors (KVM, Xen, VMware, Hyper-V).
 RESTful APIs for integration with third-party tools.

Core Components
 Nova: Compute service for managing VM instances.
 Swift: Object storage system.
 Cinder: Block storage service.
 Neutron: Networking service.
 Keystone: Identity service for authentication.
 Glance: Image service for VM images.
 Horizon: Dashboard (web-based UI).
Use Cases
 Large-scale enterprise private clouds.
 Telecom providers building NFV (Network Function Virtualization)
infrastructure.
 Scientific research requiring massive compute clusters.
 Public and hybrid cloud deployments.

5. Comparison of CloudStack, Eucalyptus, and OpenStack


Feature CloudStack Eucalyptus OpenStack
Developed Apache Software University of California, later OpenStack Foundation (with big
by Foundation HP vendors)
AWS-compatible private Full cloud OS with modular
Focus IaaS orchestration
cloud services
Xen, KVM, VMware,
Hypervisors KVM, Xen KVM, Xen, VMware, Hyper-V
Hyper-V
Walrus (S3-like), EBS-like
Storage Block & object storage Swift (object), Cinder (block)
storage
Basic & advanced
Networking Cluster-based networking Neutron for advanced SDN
networks
UI Web-based + API Web-based + AWS CLI tools Horizon dashboard + APIs
Service providers,
Best for AWS hybrid users Large enterprises, research, telcos
enterprises

6. Advantages of Open Source Private Cloud


 No vendor lock-in → flexibility and customization.
 Cost-effective → free to use (only infrastructure costs).
 Community support → continuous improvement.
 Security and compliance → data remains under organizational
control.
 Scalability → suitable for both small and large enterprises.
Part – A (1 Mark Questions)
1. What is Amazon CloudFront?
2. Expand CDN.
3. Define Azure Content Delivery Network.
4. What is the use of Amazon Elastic MapReduce (EMR)?
5. Expand HDInsight.
6. Which service in Google Cloud is used for Big Data analytics?
7. Define Amazon Elastic Beanstalk.
8. What is CloudFormation used for?
9. Expand IAM.
10. Which Azure service provides identity management?
11. Name two open-source private cloud platforms.
12. Which component of OpenStack handles identity management?
13. Which OpenStack service provides object storage?
14. Define Walrus in Eucalyptus.
15. State one advantage of a private cloud over a public cloud.

Part – B (7 Mark Questions)


1. Explain the working of Amazon CloudFront with a neat diagram.
2. Write short notes on Windows Azure Content Delivery Network.
3. Compare Amazon Elastic MapReduce (EMR) and Google BigQuery.
4. Explain Windows Azure HDInsight and its applications.
5. Write a note on Amazon Elastic Beanstalk.
6. Explain Amazon CloudFormation with an example template.
7. Discuss the role of AWS Identity and Access Management (IAM) in
cloud security.
8. Explain Azure Active Directory and its features.
9. Explain CloudStack architecture and its features.
10. Write short notes on Eucalyptus and its components.

Part – C (10 Mark Questions)


1. Discuss Amazon CloudFront in detail. Explain how content delivery
works.
2. Explain Google MapReduce Service with an example.
3. Compare and contrast Amazon Elastic MapReduce, Google
BigQuery, and Azure HDInsight.
4. Explain in detail Amazon CloudFormation. Write about its benefits
and use cases.
5. Discuss in detail AWS IAM and Azure Active Directory. Compare
their features.
6. Explain OpenStack components and architecture with a neat
diagram.
7. Compare CloudStack, Eucalyptus, and OpenStack with architecture
and use cases.
8. Discuss the importance of content delivery services in cloud
computing.

You might also like