Updated BCA DS III IV Syllabus
Updated BCA DS III IV Syllabus
Under
State Education Policy
1
Board of Studies in Computer Science for UG
No : BCU/BoS/Comp.Sci. & Appln.(PG & UG)/278/2024
2
Name of the Degree Program : Bachelor of Computer
Applications- BCA (DS)
Discipline Course : Computer Science
Starting Year of Implementation : 2024-25 (I & II Semesters)
2025-26 (III & IV Semesters)
2026-27 (V & VI Semesters)
Programme Outcomes (PO): By the end of the program the students will be able to:
3
ASSESSMENT
Weightage for the Assessments (in percentage)
Type of Course Formative Assessment/ I.A. Summative Assessment (S.A.)
Theory 20% 80 %
Practical 20% 80 %
4
Detailed Structure for BCA- DS Course
24BCAL12 Language L2 04 80 20 03 03
Total Credits 23
24BCA21 Data Structure 03 80 20 03 03
24BCA22 Object Oriented 03 80 20 03 03
Programming Using Java
24BCA23 Operating Systems 03 80 20 03 03
24BCA21P Data Structure Lab 04 40 10 03 02
24BCA22P Java Programming Lab 04 40 10 03 02
II
24BCA23P Unix & Shell Programming 04 40 10 03 02
Lab
24BCAL21 Language L1 04 80 20 03 03
24BCAL22 Language L2 04 80 20 03 03
24BCACC2 Constitution Values-II 02 40 10 1.5 02
24BCACC3 Environmental Studies 02 40 10 1.5 02
Total Credits 25
24BCA31 Database Management 04 80 20 03 03
III System
24BCA32 Foundations of Data 04 80 20 03 03
Science
5
24BCA33 Python Programming 04 80 20 03 03
24BCA31P DBMS Lab 04 40 10 03 02
24BCA32P Data Science Lab 04 40 10 03 02
24BCA33P Python Lab 04 40 10 03 02
24BCAE1 Data Visualization 03 40 10 1.5 02
24BCAL31 Language I 04 80 20 03 03
24BCAL32 Language II 04 80 20 03 03
Total Credits 23
24BCA41 Artificial Intelligence 04 80 20 03 03
24BCA42 Data Analytics 04 80 20 03 03
24BCA43 Scientific Programming 04 80 20 03 03
using R
24BCA41P Artificial Intelligence Lab 04 40 10 03 02
24BCA42P Data Analytics Lab 04 40 10 03 02
IV
24BCA43P R Programming Lab 04 40 10 03 02
24BCAE2 Data Mining 03 40 10 1.5 02
24BCASE1 Basics of Natural Language 03 40 10 1.5 02
Processing
24BCAL41 Language I 04 80 20 03 03
24BCAL42 Language II 04 80 20 03 03
Total Credits 25
24BCA51 Data Science Project 04 80 20 03 03
Management
24BCA52 Business Data Analytics 04 80 20 03 03
24BCA53 Time Series Analysis 04 80 20 03 03
24BCA54 Machine Learning 04 80 20 03 03
V
24BCA52P BDA Lab 04 40 10 03 02
24BCA53P Time Series Analysis Lab 04 40 10 03 02
24BCASE2 Quantitative Techniques 03 40 10 1.5 02
24BCAPJ Project 08 120 30 03 06
Total Credits 24
6
24BCA61 Cloud Computing & AWS 04 80 20 03 03
for Data Science
24BCA62 Inferential Statistics 04 80 20 03 03
7
Detailed Syllabus for BCA – Data Science
SEMESTER – III
Course Outcomes
COs Description
CO1 Understand the fundamental concepts of data, database systems, DBMS architecture,
data models, and the various types of DBMS along with their classifications.
CO2 Design high-level conceptual data models using the Entity-Relationship approach,
apply design principles, and understand physical storage structures, file organization,
and indexing mechanisms.
CO3 Apply the relational model concepts, perform normalization using functional
dependencies, and write effective SQL queries to manage and manipulate data,
constraints, and views and Basics of PL/SQL.
CO4 Utilize relational algebra to query databases, analyze query processing and
optimization techniques, and explain transaction management, concurrency control,
and recovery mechanisms in DBMS.
8
Join and Dependency Preservation. Normalization: 1NF, 2NF, 3NF, BCNF and Higher
Normal Forms. Normalization through Synthesis. Structure of Relational Databases. SQL –
Data Definition and Data Types. Specifying Constraints in SQL. Schema Change Statements.
Insert, Delete and Update Statements. Views (Virtual Tables). Assertions and Triggers.
Embedded SQL. Dynamic SQL.
Text Books:
1. Elmasri R. and Navathe S.B., Fundamentals of Database Systems, 7th Edition,
Addison-Wesley, USA,2016.
2. Silberschatz A., Korth H.F., and Sudarshan S., Database System Concepts, 7th
Edition, Tata McGraw Hill,NewDelhi, 2019.
3. Ivan Bayross, SQL, PL/SQL – The Programming Language of Oracle, 4th Edition,
BPB Publications,NewDelhi,2003.
Reference Books:
1. C.J. Date, A. Kannan, S. Swamynathan, An Introduction to Database Systems, 8th
Edition, Pearson Education,England,UK, 2009.
2. Raghu Ramakrishnan and Johannes Gehrke, Database Management Systems, 3rd
Edition, McGraw Hill,NewDelhi, 2003.
3. Kevin Loney and George Koch, Oracle 9i – The Complete Reference, McGraw-Hill
International Edition, USA,2002.
Web Links:
1. https://www.youtube.com/playlist?list=PLBlnK6fEyqRi_CUQ-
FXxgzKQ1dwr_ZJWZ
2. https://www.youtube.com/watch?v=T7AxM7Vqvaw&list=PLdo5W4Nhv31b33kF46f9a
Fj
oJPOkdlsRc
3. https://www.tutorialspoint.com/dbms/index.htm
4. https://www.geeksforgeeks.org/dbms/dbms/
9
Theory 25BCA32: Foundations of Data Science
Teaching Hours : 04 Hours/Week Credits: 03
Duration of Exam: 03 Hours Maximum Marks: 100 (Exam 80 + IA 20)
Course Outcomes
COs Description
CO1 Explain the evolution, roles, and applications of Data Science, along with data security
concerns.
CO2 Use different strategies to collect, clean, integrate, and transform data for analysis.
CO3 Compute key statistical measures like mean, standard deviation, skewness, and
correlation, and visualize data using plots.
CO4 Develop simple and multiple regression models, evaluate them using visual tools, and
apply them for predictions.
CO5 Use statistical and analytical methods to interpret data, draw insights, and support
decision-making processes.
Text Books:
1. Jojo Moolayil, “Smarter Decisions : The Intersection of IoT and Data Science”, 1st Edition,
Packt Publishing Ltd., UK, 2016.
2. Cathy O‟Neil and Rachel Schutt , “Doing Data Science”, 1st Edition,O'Reilly, USA, 2013.
3. David Dietrich, Barry Heller, Beibei Yang, “Data Science and Big data Analytics”, 1st
Edition, EMC Education Services, 2015.
10
Reference Books:
1. Joel Grus, “Data Science from Scratch”, 1st Edition, O‟REILLY, USA, 2015.
2. Rafael A. Irizarry, “Introduction to Data Science”, 1st Edition,Chapman & Hall,USA, 2022.
3. Gupta. S.C. & Kapoor,V.K. , “Fundamentals of Mathematical Statistics”, 12th Edition,
Sultan Chand & Sons Pvt. Ltd., New Delhi,2002.
Web Links:
1. https://www.geeksforgeeks.org/data-science/data-science-for-beginners/
2. https://www.geeksforgeeks.org/methods-of-data-collection/
3. https://www.scaler.com/topics/data-science/data-preprocessing/
4. https://datatab.net/tutorial/descriptive-inferential-statistics
5. https://www.geeksforgeeks.org/machine-learning/linear-regression-python-
implementation/
11
Theory 25BCA33: Python Programming
Teaching Hours : 04 Hours/Week Credits: 03
Duration of Exam: 03 Hours Maximum Marks: 100 (Exam 80 + IA 20)
Course Outcomes
COs Description
CO1 Understand the basic syntax, data types, variables, operators, control structures, and
functions in Python, and apply them to develop simple programs.
CO2 Demonstrate proficiency in handling Python‟s complex data structures such as lists,
dictionaries, tuples, and sets, and apply file handling for text, binary, and CSV
formats.
CO3 Apply object-oriented programming concepts in Python and use libraries like NumPy
and Pandas for efficient data manipulation and analysis.
CO4 Utilize Python packages like Matplotlib, Plotly, and APIs for effective data
visualization and apply them in real-time data analysis projects.
CO5 Understand the basic syntax, data types, variables, operators, control structures, and
functions in Python, and apply them to develop simple programs.
12
UNIT – IV Data Analysis and Visualization with Python 14 Hours
Data Analysis and Visualization in Python: Importing and Exporting Data (CSV, JSON),
Understanding and Formatting Data, Using Matplotlib and Plotly for Visualization,
Generating and Plotting Data (Line Graphs, Bar Charts), Random Walks, Dice Simulation,
Working with APIs, Downloading Data and Visualizing Repositories using Plotly, Mapping
Global Datasets with JSON.
Text Books:
1. Wesley J. Chun, Core Python Applications Programming, 3rd Edition, Pearson
Education, USA, 2016.
2. Yashavant Kanetkar and Aditya Kanetkar, Let Us Python, 3rd Edition, BPB
Publications, New Delhi, 2020.
3. Jeeva Jose & P. Sojan Lal, Introduction to Computing and Problem Solving with
Python, 1st Edition, Khanna Publishers, New Delhi, 2016.
4. Eric Matthes, Python Crash Course – A Hands-On, Project-Based Introduction to
Programming, 2nd Edition, No Starch Press, USA, 2019.
5. Gowrishankar S, Veena A, Introduction to Python Programming, 1st Edition, CRC
Press/Taylor & Francis,USA, 2018. ISBN: 978-0815394372.
Reference Books:
1. Allen B. Downey et al., How to Think Like a Computer Scientist: Learning with
Python, 1st Edition, John Wiley, USA, 2015.
2. John Zelle, Python Programming: An Introduction to Computer Science, 2nd Edition,
Course Technology, Cengage Learning,USA, 2013.
3. A.N. Kamthane & A.A. Kamthane, Programming and Problem Solving with Python,
1st Edition, McGraw Hill Education, NewDelhi, 2017.
4. Mark Lutz, Learning Python, 5th Edition, O‟Reilly Publications,USA,2013. ISBN:
978-1449355739.
5. Ljubomir Perkovic, Introduction to Computing Using Python – An Application
Development Focus,1st Edition, Wiley,USA, 2012.
Web Links:
1. https://www.w3schools.com/python/
2. https://www.tutorialspoint.com/python/index.htm
3. https://www.geeksforgeeks.org/python/python-programming-language-tutorial/
13
Theory : Elective - I 25BCAE1: Data Visualization
Teaching Hours : 03 Hours/Week Credits : 02
Duration of Exam : 1.5 Hours Maximum Marks : 50 (Exam40 + IA10)
Course Outcomes
COs Description
CO1 Understand the fundamentals of data visualization, including its types, importance, and
role in business and decision-making.
CO2 Identify and distinguish between different types of data (numerical, categorical,
temporal, geographical) and explain the data visualization process
CO3 Evaluate and compare various data visualization tools such as Excel, Tableau, Power BI, and
Python for specific use cases.
CO4 Apply the principles of data storytelling and dashboard design to communicate
insights effectively with narrative and interactivity.
CO5 Design visually appealing and meaningful data visualizations by applying principles of
good design, appropriate use of color, and data modeling techniques.
Text Books:
1. Cole Nussbaumer Knaflic,"Storytelling with Data: A Data Visualization Guide for
Business Professionals", 1st edition,Wiley, USA,2015.
2. Edward Tufte,“The Visual Display of Quantitative Information”,2nd Edition , Graphics
Press, USA,2001.
Reference Books:
1. Kieran Healy,"Data Visualization: A Practical Introduction", 1st Edition, Princeton
University Press,USA, 2018.
2. Alberto Ferrari and Marco Russo, "Analyzing Data with Power BI and Power Pivot for
Excel",1st edition, Microsoft Press,USA, 2017.
3. Devin Knight, Brian Knight, Mitchell Pearson, and Manuel Quintana, "Microsoft Power BI
Complete Reference",1st edition, Packt Publishing,UK,2018.
14
Lab 25BCA31P: DBMS Lab
Teaching Hours : 04 Hours/Week Credits : 02
Duration of Exam : 3 Hours Maximum Marks : 50 (Exam 40 + IA 10)
15
11. Parameterized Cursor
Use a parameterized cursor to display students based on department input.
12. Stored Function
Create a stored function to calculate grade based on marks:
Above 80: Distinction
60–79: First Class
40–59: Second Class
Below 40: Fail
13. Stored Procedure
Write a stored procedure to update the marks of a student given their RollNo and new
marks.
14. Trigger – BEFORE INSERT
Create a BEFORE INSERT trigger on STUDENT to ensure marks are not entered as
negative.
15. Trigger – AFTER UPDATE
Create a trigger to log changes into a table STUDENT_LOG whenever the marks are
updated.
16. Cursor with Loops
Use a cursor with a loop to count and display the number of students in each
department.
16
Lab 25BCA32P: Data Science Lab
Teaching Hours : 04 Hours/Week Credits : 02
Duration of Exam : 3 Hours Maximum Marks : 50 (Exam40 + IA10)
Part - A
1. Load a dataset from Kaggle and implement the basic operations – head(),
info(),describe(), detect missing values, drop or fill null values, remove duplicate
values.
2. Create 2 datasets – customers.csv(customerid, Name, City) and purchases.csv
(customerid, purchaseamount). Implement the following operations:
a. a)Merge 2 datasets b) Create new column: Discount = purchaseamount * 0.1
c) Normalize purchaseamount using Min-Max scaling
3. From iris dataset, implement dimensionality reduction technique to reduce 4d to 2d.
Visualize original vs reduced dataset using matplotlib.
4. Create a dataset containing customer id and age and implement Data Discretization.
5. Create a dataset containing students marks in 5 different subjects and display the
mean and standard deviation for each subject.
6. Create a dataset containing Student Id, Height(in CM) and compute the Skewness and
Kurtosis.
7. Create a dataset containing the 5 consecutive monthly sales of 3 different products
and draw box plot to identify the outliers and distribution spread.
8. Create a dataset containing Customer Age, Income, Spending Score and Savings.
Calculate Correlation matrix and plot heat map.
9. Create a dataset containing Car‟s Id, Weight, Horsepower and MPG. Calculate
Pearson Correlation Coefficients.
10. Create a dataset containing test scores of the student with 3 different teaching
methods. Perform one-way ANOVA to test if means differ significantly.
Part – B
1. From Boston Housing Dataset, predict the housing price based on average number of
rooms. Evaluate Plot regression line and residuals.
2. From AutoMPG Dataset, predict miles per gallon using multiple car features. Check
residual plots and model accuracy.
3. Implement Polynomial Regression on Salary Data (Employee Experience Vs Salary).
Fit Polynomial regression (degree 2 or 3 ) for better fit. Visualize polynomial curve fit
and residuals.
4. Use Diabetes dataset from sklearn to create a pipeline that scales features and apply
linear regression. Evaluate R2 and RMSE.
5. Use Advertising Dataset to build regression model and analyze residual plots. Plot
residual histogram and scatter plots.
6. Use California Housing dataset and train the regression model and plot the
distribution of prediction errors. Use Seaborn distplot/kdeplot for residuals.
17
Lab 25BCA33P: Python Lab
Teaching Hours : 04 Hours/Week Credits : 02
Duration of Exam : 3 Hours Maximum Marks : 50 (Exam 40 + IA 10)
Part A
1. Basic Python Operations: Write a Python program to declare variables, perform
arithmetic operations, and display results.
2. Conditional Statements: Create a program to check if a number is even or odd using
if-else.
3. Looping Constructs: Write a Python program to print the first n Fibonacci numbers
using a for loop.
4. String Manipulation: Implement a program that accepts a string and counts the
number of vowels and consonants.
5. List & Dictionary Basics: Create a program to store student details in a dictionary and
retrieve details based on user input.
6. Loop Control Statements: Demonstrate the use of break, continue, and pass in loops.
7. NumPy Basics: Write a program to create NumPy arrays, perform element-wise
operations, and reshape arrays.
8. Pandas Series: Create a Pandas Series and perform indexing, slicing, and querying
operations.
9. DataFrame Operations: Load a dataset into a Pandas DataFrame and perform sorting
and filtering operations.
10. Handling Missing Values: Write a program to handle missing values by filling them
with mean/median values.
Part B
1. String Operations: Write a program to count the occurrences of each word in a given
string.
2. List Operations: Implement a program to insert, delete, and update elements in a list.
3. Dictionary Manipulation: Create a dictionary with employee details and perform
CRUD operations.
4. Iterators & Generators: Implement a program to generate prime numbers up to n using
a generator function.
5. Functions in Python: Write a Python program to calculate the factorial of a number
using recursion.
6. File Handling: Write a program to read and write student marks into a text file and
display the contents.
7. Matplotlib Basics: Plot a line graph and a bar chart using Matplotlib.
8. Data Import & Export: Write a program to load a CSV file into Pandas and perform
basic data analysis.
9. Grouping & Aggregation: Implement a program to group a dataset by category and
calculate summary statistics.
10. Scikit-learn Demo: Load a dataset from Scikit-learn and display its basic properties.
18
SEMESTER – IV
Course Outcomes
COs Description
CO1 Understand the fundamental concepts of Artificial Intelligence including intelligent
agents, knowledge-based systems, and problem-solving strategies using uninformed
and informed search techniques.
CO2 Apply logical reasoning techniques such as propositional and predicate logic, perform
inferences using forward and backward chaining, and solve constraint satisfaction
problems and basic learning methods.
CO3 Design AI systems using planning techniques and reasoning under uncertainty through
probabilistic and fuzzy logic approaches, and interpret data using perception models
like vision and natural language processing.
CO4 Explore the application domains of AI including machine learning models, neural
networks, expert systems, and evaluate ethical and societal implications of AI
technologies.
19
UNIT – III Planning, Reasoning under Uncertainty, and Perception 14 Hours
Introduction to Planning: Planning Problem, State-Space Search, The Blocks World Problem,
STRIPS Representation. Handling Uncertainty: Non-Monotonic Reasoning, Probabilistic
Reasoning, Introduction to Fuzzy Logic and Fuzzy Set Theory. Introduction to Perception:
Computer Vision – Image Classification, Object Detection. Natural Language Processing
(NLP): Introduction, Syntactic Processing, Semantic Analysis, Discourse and Pragmatic
Processing.
Text Books:
1. M.C. Trivedi, A Classical Approach to Artificial Intelligence, 1st Edition, Khanna
Book Publishing Company, NewDelhi,2024 (AICTE Recommended).
2. Nilsson Nils J., Artificial Intelligence: A New Synthesis,1st Edition, Morgan
Kaufmann Publishers Inc., USA, 1998,ISBN: 978-1-55860-467-4.
3. Dan W. Patterson, Introduction to Artificial Intelligence & Expert Systems, 1st edition,
PHI Learning,NewDelhi, 2010.
4. Rajiv Chopra, Data Science with Artificial Intelligence, Machine Learning and Deep
Learning, 1st Edition,Khanna Book Publishing Company,NewDelhi, 2024.
5. Russell, S. and Norvig, P., Artificial Intelligence – A Modern Approach, 3rd Edition,
Prentice Hall, USA, 2010.
Reference Books:
1. M.C. Trivedi, Introduction to AI and Machine Learning, 1st Edition, Khanna Book
Publishing Company, New Delhi,2024.
2. Van Hirtum, A. & Kolski, C., Constraint Satisfaction Problems: Algorithms and
Applications, 1st Editon, Springer, Swizerland, 2020.
3. Elaine Rich, Kevin Knight, Shivasankar B. Nair, Artificial Intelligence, 3rd Edition,
McGraw Hill, USA, 2019.
4. Rajiv Chopra, Machine Learning and Machine Intelligence,1st Edition, Khanna Book
Publishing Company, New Delhi, 2024.
20
Theory 25BCA42: Data Analytics
Teaching Hours : 04 Hours/Week Credits : 03
Duration of Exam : 03 Hours Maximum Marks : 100 (Exam 80 + IA 20)
Course Outcomes
COs Description
CO1 Understand the evolution, scope, and significance of data analytics and its applications
in various domains
CO2 Analyze relationships between variables using correlation and regression techniques,
and interpret the results.
CO3 Apply probability concepts and statistical methods such as t-tests, ANOVA, and Chi
square tests to real-world data.
CO4 Demonstrate proficiency in data visualization and dashboard creation using Power BI
for effective communication of insights.
CO5 Transform and clean data using Power Query and prepare it for analytical modeling
and reporting.
21
Text Books:
1. Kumar, U.D. :Business Analytics – The Science of Data – Driven Decision Making, 1st
Edition, Wiley, USA, 2014.
2. Dr Anil Maheshwari, Data Analytics Made Accessible, 1st Edition, Amazon.com
Services LLC., USA, 2016.
3. Johnson, R.A., Miller, I. and Freund, . :Probability and Statistics for Engineers,
8th Edition,Pearso, USA, 2013.
Reference Books:
1. Gert, H.N., Thorlund, L. and Thorlund, J :Business Analytics for Managers – Taking
Business Intelligence Beyond Reporting, 1st Edition, Wiley, USA,2018.
2. Dr. Gaurav Aroraa , Chitra Lele , Dr. Munish Jindal ; “ Data Analytics: Principles, Tools,
and Practices: A Complete Guide for Advanced Data Analytics Using the Latest Trends,
Tools, and Technologies, 1st Edition, Amazon, USA, 2021.
22
Theory 25BCA43: Scientific Programming using R
Teaching Hours : 04 Hours/Week Credits : 03
Duration of Exam : 03 Hours Maximum Marks : 100 (Exam 80 + IA 20)
Course Outcomes
COs Description
CO1 Demonstrate the ability to install and utilize R packages for performing data analysis tasks.
CO2 Apply R programming skills for efficient data analysis and statistical computing.
CO3 Use key statistical packages in R for analytics and data interpretation.
Perfor
CO4 Data management operations, including importing/exporting data in various formats.
CO5 Integrate foundational concepts from statistics, probability, and linear algebra within
the R programming environment.
Text Books:
Reference Books:
23
Theory : Elective-II 25BCAE2: Data Mining
Teaching Hours : 03 Hours/Week Credits : 02
Duration of Exam : 1.5 Hours Maximum Marks : 50 (Exam 40 + IA 10)
Course Outcomes
COs Description
CO1 Explain the basic concepts, techniques, and issues involved in data mining and pattern
discovery.
CO2 Apply frequent pattern mining algorithms like Apriori and FP-Growth to discover
associations in datasets.
CO3 Implement classification techniques including regression, Bayesian, decision trees, and
k-nearest neighbour for predictive modeling.
CO4 Analyze clustering methods such as hierarchical and partitional algorithms using
similarity and distance measures to detect data groupings and outliers.
Text Books:
1. 1.Margaret H Dunham, “Data Mining Introductory and Advanced Topics”, 1st
Edition,Pearson Education,USA, 2012.
2. Jiawei Han and Micheline Kamber, “Data Mining - Concepts and Techniques”, Third
Edition, Elsevier, USA,2012
Reference Books:
1. GalitShmueli,et al, “Data Mining for Business Analytics: Concepts, Techniques and
Applications in R”, 1st Edition,Wiley, India, 2018.
2. Pang-Ning Tan, Michael Steinbach, Vipin Kumar, Introduction to Data Mining,
Second edition,Addison Wesley,USA, 2018.
24
Theory: Skill Enhancement Course 25BCASE1: Basics of NLP
Teaching Hours : 03 Hours/Week Credits : 02
Duration of Exam : 1.5 Hours Maximum Marks : 50 (Exam 40 + IA 10)
Course Outcomes
COs Description
CO1 Explain the scope and core components of NLP, and perform text preprocessing using
tokenization, stemming, lemmatization, and POS tagging.
CO2 Apply statistical models like N-grams and vectorization techniques (BoW, TF-IDF,
Word2Vec) for feature extraction in textual data.
CO3 Analyze syntactic and semantic structures using parsing, NER, and similarity
measures, and apply clustering techniques on text data.
CO4 Design and implement basic NLP applications such as sentiment analysis, text
classification, and understand the role of neural networks in modern NLP tasks.
Text Books:
1. Daniel Jurafsky and James H. Martin, “Speech and Language Processing: An
Introduction to Natural Language Processing, Computational Linguistics, and Speech
Recognition”, second edition,Prentice Hall, USA,2008 .
2. James A.. Natural language Understanding 2e, Pearson Education, USA,1994
3. Bharati A., Sangal R., Chaitanya V.. Natural language processing: a Paninian
perspective, 1st Edition,PHI, NewDelhi,2000.
25
Lab 25BCA41P: Artificial Intelligence Lab
Teaching Hours : 04 Hours/Week Credits : 02
Duration of Exam : 3 Hours Maximum Marks : 50 (Exam 40 + IA 10)
List of Programs
1. Demonstrate basic problem-solving using Breadth-First Search on a simple grid.
2. Implement Departmenth-First Search (DFS) on a small graph.
3. Solve the Water Jug Problem using Breadth First Search (BFS).
4. Implement a Hill Climbing search to find the peak in a numeric dataset.
5. Apply the A* Search algorithm to find the shortest path in a 4x4 grid.
6. Implement the Minimax search algorithm for 2-player games. You may use a game
tree with 3 plies.
7. Solve the 4 – Queens Problem as a CSP backtracking problem.
8. Use constraint propagation to solve a Magic Square puzzle.
9. Apply optimization techniques to find the maximum value in a list.
10. Represent and evaluate propositional logic expressions.
11. Implement a basic rule-based expert system for weather classification.
12. Implement a basic AI agent with simple decision-making rules.
13. Implement a basic Rule-Based Chatbot.
14. Using Python NLTK, perform the following Natural Language Processing (NLP)
tasks for text content.
a. Tokenizing
b. Filtering Stop Words
c. Stemming
d. Part of Speech tagging
e. Chunking
f. Named Entity Recognition (NER)
26
Lab 25BCA42P: Data Analytics Lab
Teaching Hours : 04 Hours/Week Credits : 02
Duration of Exam : 3 Hours Maximum Marks : 50 (Exam 40 + IA 10)
Part - A
27
Company P NP Total
A 15 9 24
B 16 15 31
Total 31 24 55
Part - B
1. Install Power BI Desktop, explore the interface, and load data from an Excel file.
2. Use Power Query Editor to clean and transform a dataset.
3. Use Power Query Editor to combine data from multiple tables.
4. Visualize data using basic chart types - Bar Chart, Column Chart and Pie Chart by
a. loading a dataset.
5. Use cards, maps, and matrices for advanced visuals by loading a dataset.
6. Enhance dashboard interactivity using slicers and filters.
7. Use drill-through pages and bookmarks for navigation and detail views.
28
Lab 25BCA43P: Scientific Programming using R Lab
Teaching Hours : 04 Hours/Week Credits : 02
Duration of Exam : 3 Hours Maximum Marks : 50 (Exam 40 + IA 10)
PART - A
1. Write a program to demonstrate variable assignments and data types in R.
2. Use if, else if, and else statements to categorize age groups.
3. Implement a for loop to calculate the factorial of a number.
4. Define a user-created function to compute the mean and standard deviation of a
numeric vector.
5. Write a program using tryCatch() to handle errors during division operations.
6. Demonstrate the use of debug(), browser(), and traceback() for debugging a faulty
function.
7. Use regular expressions to extract all email addresses from a given text string.
8. Write a script that logs current time, pauses execution using Sys.sleep(), and resumes
with another log.
9. Execute a system command from R to list all files in the current directory using
system() or shell().
PART - B
1. Define an S3 class for "Student" and implement a method to calculate average marks.
2. Create an S4 class for "Employee" with slots like ID, Name, and Salary, and a method
to increment salary.
3. Install and demonstrate the use of the ggplot2 package with a simple bar chart.
4. Explore dplyr functions to filter, arrange, and summarize a dataset.
5. Write and build a basic custom R package including documentation and metadata
using devtools.
6. Implement a linear regression model on a sample dataset using lm() in R.
7. Perform a time-series plot for monthly sales data using ts() and plot().
8. Create a simple dashboard using shiny that displays summary statistics of a dataset.
9. Build a case study in R to analyze COVID-19 dataset: read data, clean, visualize, and
interpret trends.
29
BLUEPRINT FOR QUESTION PAPER
FOR 03 CREDITS
Marks Unit I Unit II Unit III Unit IV Number of questions to Total
be answered
2 3 3 3 3 10 20
6 2 2 2 2 5 30
10 1 1 1 1 3 30
Total 80
FOR 02 CREDITS
Marks Unit I Unit II Unit III Number of questions to Total
be answered
2 2 2 2 5 10
5 2 2 2 4 20
10 1 1 1 1 10
Total 40
30