0% found this document useful (0 votes)

37 views3 pages

Programming Assignment 2

The document outlines the programming assignment for IIT Kharagpur's AI4ICPS I Hub Foundation, focusing on the classification of pulsar candidates using machine learning techniques. Students are instructed to implement a Support Vector Machine (SVM) function while adhering to specific coding guidelines and utilizing a provided dataset. The assignment emphasizes the importance of proper data handling, including normalization and training/testing splits, and requires the evaluation of model accuracy across various hyperparameter values.

Uploaded by

harsha.p1720

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views3 pages

Programming Assignment 2

Uploaded by

harsha.p1720

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

IIT KHARAGPUR AI4ICPS I HUB FOUNDATION

Hands-on Approach to AI, Cohort-3, February – May 2025

Programming Assignment 2
Due date: Sunday 30th March 2025, EOD – IST.

Important Instructions about Programming Assignments

1. Programming assignments will be evaluated automatically. Do not change the skeleton code
provided to you.
2. Write your code only in the designated places in the skeleton code and process the input data
provided to you in the designated variables. Do not alter the input output structure in the skeleton
code.
3. Do not import any additional libraries. Do not use any additional files for the processing other than
those mentioned in the skeleton code from a.(i) to a.(iv).

Failure to comply with these instructions may lead to you getting zero marks for the assignment, even
if the solution is largely correct.

Question:

Objective: Pulsars are a rare type of neutron star that produces radio emissions detectable here on Earth.
They are of considerable scientific interest as probes of space-time, the interstellar medium, and states of
matter. Each pulsar produces a slightly different emission pattern, which varies slightly with each rotation.
Thus, a potential signal detection known as a 'candidate', is averaged over many rotations of the pulsar, as
determined by the length of an observation. In the absence of additional information, each candidate could
potentially describe a real pulsar. However, in practice, almost all detection is caused by radio frequency
interference (RFI) and noise, making legitimate signals hard to find. Machine learning tools are now being
used to automatically label Pulsar candidates to facilitate rapid analysis. Classification systems in particular
are being widely adopted, which treat the candidate data sets as binary classification problems. Here, the
legitimate pulsar examples are a minority positive class, and spurious examples are the majority negative
class.

i. Randomly pick 80% of the data as a training set and the rest as a test set.
ii. Normalize each feature of the dataset to have a zero mean and unit variance. Note that while
normalizing the features, their mean and variance should be computed over the train split only.
Once the mean and variance are computed using only the train split, you normalize the test split
using the mean and variance computed over the train split.
iii. Note that training requires solving the dual optimization problem. To solve the dual optimization
problem, you must use the python package: cvxopt.solvers

Write a SVM function that takes a new datapoint as input and predicts the class. In SVM, the
hyperparameter C regulates the regularization strength, affecting the balance between a smooth decision
boundary and the accurate classification of training points. Now, for a given set of hyperparameter values
C = [0.1, 1, 10, 100, 1000], what will be their corresponding accuracies, provided we are
using the linear kernel?

Instructions:

1. Do not import any more libraries or modify any functions given in the skeleton code.
2. Input for evaluating the test cases; do not change the hyperparameter C value.
3. The output will be in decimal points.
4. You must use random_state=42 during the train test split.

Dataset: The dataset contains samples of pulsar candidates collected during the High Time Resolution
Universe Survey (South). It has around 17898 instances with 8 continuous attributes.

The target attribute is “Class” which can be legitimate (1) or spurious (0). Please note that the dataset may
contain missing values. To handle these missing values, you should use appropriate techniques.

Data Filename: pulsar_star_dataset.csv

Dataset description: The first four attributes are simple statistics obtained from the integrated pulse profile
(folded profile). This is an array of continuous variables that describe a longitude-resolved version of the
signal that has been averaged in both time and frequency. The remaining four variables are similarly
obtained from the DM-SNR curve. These are summarized below:

1. Mean of the integrated profile.

2. Standard deviation of the integrated profile.

3. Excess kurtosis of the integrated profile.

4. Skewness of the integrated profile.

5. Mean of the DM-SNR curve.

6. Standard deviation of the DM-SNR curve.

7. Excess kurtosis of the DM-SNR curve.

8. Skewness of the DM-SNR curve.

9. Class.

Here, DM-SNR stands for two things: Dispersion Measure (DM) and Signal-to-Noise Ratio (SNR). DM,
as the name suggests, measures the dispersion or spread of pulsar's signals during their journey from pulsar
to earth. SNR, on the other hand, measures the strength of a pulsar's signal relative to background noise.
DM is calculated from the time delay of each signal when it arrives on earth, while SNR is calculated at
the peak intensity of each signal.

Sample Test Cases:

"input": "0.9",
"output": "0.97"

"input": "9",
"output": "0.975"

"input": "90",
"output": "0.975"

"input": "900",
"output": "0.98"
"input": "9000",
"output": "0.98"

Predicting Pulsar Star Using Machine Learning - 1 - s15423
No ratings yet
Predicting Pulsar Star Using Machine Learning - 1 - s15423
6 pages
DAL Assignment 6 Endsem
No ratings yet
DAL Assignment 6 Endsem
8 pages
Machine Learning Approach To Detect Pulsar Star: Tensorflow and Random Forest Model With Python
No ratings yet
Machine Learning Approach To Detect Pulsar Star: Tensorflow and Random Forest Model With Python
14 pages
Pulsar Detection with ML Techniques
No ratings yet
Pulsar Detection with ML Techniques
7 pages
DAL Assignment 6
No ratings yet
DAL Assignment 6
7 pages
Debesai Gutierrez Koyluoglu
No ratings yet
Debesai Gutierrez Koyluoglu
11 pages
Practical Manual - Machine Learning Application
No ratings yet
Practical Manual - Machine Learning Application
4 pages
Pulsar Candidate Identification With Artificial Intelligence Techniques
No ratings yet
Pulsar Candidate Identification With Artificial Intelligence Techniques
23 pages
Pulsar Data Analysis with Machine Learning
No ratings yet
Pulsar Data Analysis with Machine Learning
10 pages
Neural Network Pulsar Classification
No ratings yet
Neural Network Pulsar Classification
1 page
Pulsar Candidate Selection Evolution
No ratings yet
Pulsar Candidate Selection Evolution
22 pages
Statistical Signal Processing Overview
No ratings yet
Statistical Signal Processing Overview
3 pages
Problem Statement: PREDICTING A PULSAR STAR
No ratings yet
Problem Statement: PREDICTING A PULSAR STAR
1 page
تقنيات استخراج الميزات الصوتية
No ratings yet
تقنيات استخراج الميزات الصوتية
6 pages
Support Vector Machines and Singular Value Decomposition
No ratings yet
Support Vector Machines and Singular Value Decomposition
7 pages
Vela Pulsar Analysis with Ooty Radio
No ratings yet
Vela Pulsar Analysis with Ooty Radio
9 pages
Semester End Examinations - July 2024: USN 1 M S
No ratings yet
Semester End Examinations - July 2024: USN 1 M S
4 pages
The GMRT High Resolution Southern
No ratings yet
The GMRT High Resolution Southern
16 pages
Learning To Detect
No ratings yet
Learning To Detect
11 pages
E9 205 - Machine Learning For Signal Processing: Practice For Midterm Exam # 1
No ratings yet
E9 205 - Machine Learning For Signal Processing: Practice For Midterm Exam # 1
8 pages
Computer Vision Quickly Identifies Radio Signals With Unlimited Accuracy
No ratings yet
Computer Vision Quickly Identifies Radio Signals With Unlimited Accuracy
9 pages
Digital Modulation Recognition Using Support Vector Machine Classifier
No ratings yet
Digital Modulation Recognition Using Support Vector Machine Classifier
5 pages
Variable Selection Benchmark Guide
No ratings yet
Variable Selection Benchmark Guide
30 pages
Chapter 3 - Report
No ratings yet
Chapter 3 - Report
16 pages
10.1515 - Astro 2000 0312
No ratings yet
10.1515 - Astro 2000 0312
12 pages
Automatic Modulation Recognition in Cognitive Radi
No ratings yet
Automatic Modulation Recognition in Cognitive Radi
10 pages
Finalexam01summer PDF
No ratings yet
Finalexam01summer PDF
2 pages
PCCCS504 Module 4
No ratings yet
PCCCS504 Module 4
4 pages
Question Bank 2023 Final All Questions
No ratings yet
Question Bank 2023 Final All Questions
78 pages
Data Science Final Exam Fall 2023 SOL
No ratings yet
Data Science Final Exam Fall 2023 SOL
6 pages
DM-I Q Paper 2024
No ratings yet
DM-I Q Paper 2024
12 pages
Ds 2
No ratings yet
Ds 2
27 pages
IIT Kharagpur Machine Learning Exam Guidelines
No ratings yet
IIT Kharagpur Machine Learning Exam Guidelines
12 pages
MLT Bcai 651 Lab Manual
No ratings yet
MLT Bcai 651 Lab Manual
42 pages
Estimation and Detection: Lecture 9: Introduction Detection Theory (Chs 1,2,3)
No ratings yet
Estimation and Detection: Lecture 9: Introduction Detection Theory (Chs 1,2,3)
38 pages
Statistical Learning
No ratings yet
Statistical Learning
92 pages
Hyperparameter Tuning
No ratings yet
Hyperparameter Tuning
17 pages
Slay The Day
No ratings yet
Slay The Day
21 pages
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
No ratings yet
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
2 pages
Dis1 Sol
No ratings yet
Dis1 Sol
9 pages
Determination of Required SNR Values (Radar Detection)
No ratings yet
Determination of Required SNR Values (Radar Detection)
6 pages
EE378A - Combined Notes
No ratings yet
EE378A - Combined Notes
76 pages
Wireless Networks Assignment11
No ratings yet
Wireless Networks Assignment11
6 pages
WSN20100100007 87680380
No ratings yet
WSN20100100007 87680380
5 pages
ML Ans
No ratings yet
ML Ans
18 pages
ML 20230316 1
No ratings yet
ML 20230316 1
9 pages
SSPI Lecture 3 Estimation Intro 2025
No ratings yet
SSPI Lecture 3 Estimation Intro 2025
56 pages
Lecture 04
No ratings yet
Lecture 04
33 pages
Lecture Notes - SVM
No ratings yet
Lecture Notes - SVM
13 pages
HW 1
No ratings yet
HW 1
4 pages
13 Hinteregger
No ratings yet
13 Hinteregger
4 pages
Dimensionality Reduction & Model Evaluation
No ratings yet
Dimensionality Reduction & Model Evaluation
80 pages
Application of Machine Learning To Predict The The
No ratings yet
Application of Machine Learning To Predict The The
6 pages
++FPGA Based Arrhythmia Detection
No ratings yet
++FPGA Based Arrhythmia Detection
10 pages
19ECE357 - V Sem End - Odd 2023
No ratings yet
19ECE357 - V Sem End - Odd 2023
4 pages
ML Tutorial I
No ratings yet
ML Tutorial I
3 pages
IJMLC DivyanshKhanna RohanSahu
No ratings yet
IJMLC DivyanshKhanna RohanSahu
7 pages
Human Activity Recognition by Machine Learning Methods
No ratings yet
Human Activity Recognition by Machine Learning Methods
12 pages
Narrowband Method
No ratings yet
Narrowband Method
11 pages
E3sconf Icmpc2023 01030
No ratings yet
E3sconf Icmpc2023 01030
12 pages
SMAI Workshop Prompts by SPRINGPAD
No ratings yet
SMAI Workshop Prompts by SPRINGPAD
4 pages
Esco Tendermanagement tenderManagementApplicationTests
No ratings yet
Esco Tendermanagement tenderManagementApplicationTests
1 page
Visapp 2024 266 CR
No ratings yet
Visapp 2024 266 CR
9 pages
Ijeee V11i7p105
No ratings yet
Ijeee V11i7p105
11 pages
TCS T4 Spring Boot2
100% (1)
TCS T4 Spring Boot2
75 pages
Toughsonic 3 30mm Data Sheet
No ratings yet
Toughsonic 3 30mm Data Sheet
2 pages
MRC First Announcement20feb
No ratings yet
MRC First Announcement20feb
12 pages
PET Parison Size Impact on Mould Design
No ratings yet
PET Parison Size Impact on Mould Design
6 pages
Most Important MCQ'S: Guess Papers by M.N.A. GHUMMAN
No ratings yet
Most Important MCQ'S: Guess Papers by M.N.A. GHUMMAN
7 pages
Cover
No ratings yet
Cover
2 pages
SDS Guide for Chemical Synthesis
No ratings yet
SDS Guide for Chemical Synthesis
7 pages
The Voynich Manuscript
No ratings yet
The Voynich Manuscript
2 pages
Scholz 2011 Environmental Literacy in Science and Society - From Knowledge To Decisions
No ratings yet
Scholz 2011 Environmental Literacy in Science and Society - From Knowledge To Decisions
656 pages
JST Vol. 30 (1) Jan. 2022 (View Full Journal)
No ratings yet
JST Vol. 30 (1) Jan. 2022 (View Full Journal)
904 pages
Additive Manufacturing of Linear Shaped Charges To Address Run Up
No ratings yet
Additive Manufacturing of Linear Shaped Charges To Address Run Up
89 pages
Control Device System Series 8040 Overview
No ratings yet
Control Device System Series 8040 Overview
9 pages
Understanding the Digital Self
No ratings yet
Understanding the Digital Self
11 pages
Syphilis Rapid Test MSDS Overview
No ratings yet
Syphilis Rapid Test MSDS Overview
3 pages
García-Martí Et Al. 2024
No ratings yet
García-Martí Et Al. 2024
34 pages
C How To Program 9th Edition Deitel Paul - Ebook PDF Download
100% (2)
C How To Program 9th Edition Deitel Paul - Ebook PDF Download
86 pages
10.1201 9781003390848 Previewpdf
No ratings yet
10.1201 9781003390848 Previewpdf
70 pages
Calculus Paper
No ratings yet
Calculus Paper
1 page
Parts of A Flowering Plant Lesson 1
No ratings yet
Parts of A Flowering Plant Lesson 1
3 pages
Evidence Guide NC 2013
No ratings yet
Evidence Guide NC 2013
65 pages
Ma Emf Mag1000 MT101-KFL-VN180907-2018.09.07
No ratings yet
Ma Emf Mag1000 MT101-KFL-VN180907-2018.09.07
24 pages
Gene PPARG Sequence
No ratings yet
Gene PPARG Sequence
3 pages
Authorized Hacker Techniques Tools and Incident Handling 3rd Edition Ebook and TestBank Bundle
No ratings yet
Authorized Hacker Techniques Tools and Incident Handling 3rd Edition Ebook and TestBank Bundle
326 pages
0000 REP JJ 0003 - Detailed Design Report & Appendices
100% (1)
0000 REP JJ 0003 - Detailed Design Report & Appendices
172 pages
Future of LCA Report A3
No ratings yet
Future of LCA Report A3
22 pages
Wetland Inventory and Profiling PDF
100% (1)
Wetland Inventory and Profiling PDF
26 pages
Informed Search vs. Uninformed Search
No ratings yet
Informed Search vs. Uninformed Search
1 page
Nr-Detection & Estimation Theory
No ratings yet
Nr-Detection & Estimation Theory
2 pages
The Challenge of Fate - Thorwald Dethlefsen
91% (23)
The Challenge of Fate - Thorwald Dethlefsen
124 pages
Some Metaphysical Questions
100% (1)
Some Metaphysical Questions
2 pages
A Roadside Stand Textual Questions and Answers
No ratings yet
A Roadside Stand Textual Questions and Answers
13 pages

Programming Assignment 2

Uploaded by

Programming Assignment 2

Uploaded by

IIT KHARAGPUR AI4ICPS I HUB FOUNDATION

Hands-on Approach to AI, Cohort-3, February – May 2025

Important Instructions about Programming Assignments

Data Filename: pulsar_star_dataset.csv

1. Mean of the integrated profile.

2. Standard deviation of the integrated profile.

3. Excess kurtosis of the integrated profile.

4. Skewness of the integrated profile.

5. Mean of the DM-SNR curve.

6. Standard deviation of the DM-SNR curve.

7. Excess kurtosis of the DM-SNR curve.

8. Skewness of the DM-SNR curve.

Sample Test Cases:

You might also like