Distributed Computing
Unit I Introduction to Distributed Computing
● Fundamentals of distributed computing:
○ Characteristics of Distributed Systems:
◆ Issues,
◆ Goals, and
◆ Types of distributed systems,
○ Distributed System Models
● Introduction to Artificial Intelligence and Data Science in
distributed computing:
○ Distributing computational tasks,
◆ handling large volumes of data, and
◆ leveraging parallel processing capabilities,
○ Issues related to data storage and retrieval,
◆ data consistency,
◆ communication overhead,
◆ synchronization, and
◆ fault tolerance.
● Use cases and applications of integrating AI and data science in
distributed systems:
○ Predictive Maintenance,
○ Fraud Detection,
○ Intelligent Transportation Systems,
○ Supply Chain Optimization,
○ Energy Management,
○ Healthcare and Medical Diagnostics,
○ Customer Behavior Analysis, and
○ Natural Language Processing (NLP)
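
As a rough illustration of "distributing computational tasks" and "leveraging parallel processing capabilities" from the topics above, the sketch below splits a word-count job across workers and merges the partial results. It is only a single-machine stand-in: threads play the role of cluster nodes, and the names count_words and distributed_word_count are hypothetical, not taken from the syllabus or any particular framework.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_words(chunk):
    """Map step: one worker counts the words in its own chunk of lines."""
    return Counter(word for line in chunk for word in line.lower().split())


def distributed_word_count(lines, workers=4):
    """Split the input across workers, then merge their partial counts."""
    # Partition the input so each worker gets a roughly equal share.
    chunks = [lines[i::workers] for i in range(workers)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Reduce step: fold every partial count into one result.
        for partial in pool.map(count_words, chunks):
            total.update(partial)
    return total
```

In a real deployment the chunks would live on different machines, so the merge step is where the communication overhead and fault tolerance issues listed above would show up.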
Unit II Distributed Data Management and Storage
● Overview of Distributed Computing Frameworks and Technologies
○ Parallel Computing,
○ Distributed Computing Models,
○ Message Passing,
○ Distributed File Systems:
◆ Hadoop Distributed File System (HDFS) and
◆ Google File System (GFS),
○ Cluster Computing:
◆ Amazon Web Services (AWS),
◆ Microsoft Azure, and
◆ Google Cloud Platform (GCP),
○ Message Brokers and Stream Processing,
○ Edge Computing
● Data Replication and Consistency Models:
○ Eager Replication,
○ Lazy Replication,
○ Quorum-Based Replication,
○ Consensus-Based Replication,
○ Selective Replication,
○ Strong Consistency,
○ Eventual Consistency,
○ Read-your-writes Consistency,
○ Consistent Prefix Consistency,
○ Causal Consistency
● Distributed data indexing and retrieval techniques:
○ Distributed Hash Tables (DHTs),
○ Distributed Inverted Indexing,
○ Range-based Partitioning,
○ Content-based Indexing,
○ Peer-to-Peer (P2P) Indexing,
○ Hybrid Approaches
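
To make the Distributed Hash Table topic above more concrete, here is a minimal sketch of consistent hashing, the key-placement idea that many DHT designs build on. It assumes a single in-memory ring and omits routing, replication, and virtual nodes. The class name HashRing and the node names in the usage note are hypothetical.

```python
import bisect
import hashlib


def _hash(text: str) -> int:
    """Map a string to a point on the ring (SHA-1 used only for spreading)."""
    return int(hashlib.sha1(text.encode()).hexdigest(), 16)


class HashRing:
    """Assigns each key to the first node at or after its hash on the ring."""

    def __init__(self, nodes=()):
        self._ring = []  # sorted list of (hash, node) pairs
        for node in nodes:
            self.add(node)

    def add(self, node: str) -> None:
        bisect.insort(self._ring, (_hash(node), node))

    def remove(self, node: str) -> None:
        self._ring.remove((_hash(node), node))

    def lookup(self, key: str) -> str:
        if not self._ring:
            raise LookupError("ring is empty")
        # Find the successor position, wrapping around past the last node.
        i = bisect.bisect_right(self._ring, (_hash(key), ""))
        return self._ring[i % len(self._ring)][1]
```

For example, after building HashRing(["node-a", "node-b", "node-c"]) and then calling remove("node-b"), only the keys that node-b used to own are reassigned. Keys held by the other nodes stay where they were, which is the property that makes this scheme attractive when nodes join and leave.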