Course Designers
Experts from Industry Experts from Higher Technical Institutions Internal Experts
Mr. Renganathan, Agile Coach and Cloud Engineer, Tata K. Prakash, Assistant Professor, Dept. Of CSE, Government Dr. Savaridassan.P, Assistant Professor, Department of Networking
Communications College of Engineering, Dharmapuri and Communications, SRMIST- KTR
Course Course Course L T P C
21CSE632T ESSENTIALS OF DATA SCIENCE ON CLOUD COMPUTING E PROFESSSIONAL ELECTIVE
Code Name Category 2 1 0 3
Pre- Co-
Progressive
requisite Nil requisite Nil Nil
Courses
Courses Courses
Data Book /
Course Offering Department Networking and Communications Nil
Codes/Standards
Course Learning
The purpose of learning this course is to:
Rationale (CLR):
CLR-1: Perform cloud based bigdata processing
CLR-2: Use various Cloud based tools for data processing
CLR-3: Use various Cloud based tools for data analysis
CLR-4: Train machine learning models using cloud services
CLR-5: Design interactive dashboards for data exploration and reporting
Programme Outcomes
Course Outcomes At the end of this course, learners will be able to: (PO)
(CO): 1 2 3
CO-1: Explain the basic concepts of data science and cloud computing 2 2
CO-2: Apply the data processing techniques using cloud-based tools and services 2 2
CO-3: Perform data analysis using cloud-based tools 2 2
CO-4: Deploy machine learning models in the cloud 2 2
CO-5: Perform data visualization using cloud-based tools 2 2
Module-1 - Introduction to Data Science and Cloud Computing 9
Hour
Overview of data science and its applications, Introduction to cloud computing and its benefits for data science, Overview of popular cloud platforms (AWS, Azure, Google Cloud), Introduction to cloud storage
services (e.g., Amazon S3, Azure Blob Storage, Google Cloud Storage)
Module -2 - Data Cleaning and Preprocessing Techniques 9
Hour
Understanding data formats and structures, Data ingestion strategies for cloud storage Data Preprocessing and Transformation in the Cloud, Introduction to ETL (Extract, Transform, Load) processes in the
cloud, Using cloud-based tools for data transformation (e.g., AWS Glue, Azure Data Factory)
Module -3 - Big Data Processing in the Cloud 9
Hour
Introduction to big data concepts and challenges, Overview of cloud-based big data processing frameworks (e.g., Apache Spark, Hadoop), Setting up and managing big data clusters on cloud platforms,
Introduction to cloud-based data analysis tools and services (e.g., AWS Athena, Google BigQuery, Azure Synapse Analytics), Writing SQL queries for data analysis in the cloud, Exploratory data analysis (EDA)
techniques in cloud environments.
Module -4 - Building and Deploying Machine Learning Models in the Cloud 9
Hour
Introduction to machine learning concepts and algorithms, Supervised vs. unsupervised learning, Overview of cloud-based machine learning services (e.g., AWS SageMaker, Azure Machine Learning, Google
AI Platform), Data preprocessing for machine learning tasks, Training machine learning models using cloud services, Deploying machine learning models as APIs or serverless functions on the cloud.
Module -5 - Data Visualization and Cost Optimization 9
Hour
Importance of data visualization in data science, Introduction to cloud-based data visualization tools (e.g., AWS QuickSight, Google Data Studio, Power BI), Designing interactive dashboards for data exploration
and reporting, Scalability considerations for data science projects in the cloud, Cost optimization strategies for cloud-based data science projects.
1. Foster Provost and Tom Fawcett, Data Science for Business, first edn, O'Reilly Media,
Learning 2013.
Resources 2. Jiahui Liu and Wei Tan, Cloud Computing for Data Analysis and Scientific Research.
Learning Assessment
Continuous Learning Assessment (CLA)
Summative
Formative Life Long Learning
Bloom’s Final Examination
CLA-1 Average of unit test CLA-2 –
Level of Thinking (40% weightage)
(50%) (10%)
Theory Practice Theory Practice Theory Practice
Level 1 Remember 15% - 15% - 15% -
Level 2 Understand 25% - 20% - 25% -
Level 3 Apply 30% - 25% - 30% -
Level 4 Analyze 30% - 25% - 30% -
Level 5 Evaluate - - 10% - - -
Level 6 Create - - 5% - - -
Total 100 % 100 % 100 %
Course Designers
Experts from Industry Experts from Higher Technical Institutions Internal Experts
Mr. Hari Prasad Prabhu Dr E Kavitha
Dr. Vedhavathy T R, Assistant Professor, Department of
Senior Consultant Associate Professor
Networking and Communications, SRM IST-KTR
Deloitte, Chennai Anna University (Villupuram Campus)