CAM Assignment

Uploaded by

Abaidullah Sajid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views6 pages

CAM Assignment

Uploaded by

Abaidullah Sajid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Computer Applications in Manufacturing

Submitted To: Col Dr Imran Shafi

Submitted By:

Noman Ali

Reg. No.369524

ME-DE-43

Syndicate-C

Assignment # 02

DEPARTMENT OF MECHANICAL ENGINEERING

COLLEGE OF E&ME, NUST, RAWALPINDI

Task 1: Datasets Exploration at various sites.

Kaggle:
Kaggle is a popular online platform for data science and machine learning enthusiasts. It offers a
variety of datasets contributed by individuals, researchers, companies, and organizations. These
datasets cover a wide range of topics including finance, healthcare, sports, and social sciences.
Datasets on Kaggle are crowdsourced from multiple contributors around the world, including
businesses like Google, government bodies, universities, and individuals. The data is gathered
from various sources such as research studies, surveys, public databases, sensors, social media
platforms, and web scraping. Kaggle datasets are widely used for educational purposes,
research, competitions, and industrial applications. Companies and individuals use them to build
and test machine learning models, with some datasets contributing to real-world solutions like
fraud detection, sentiment analysis, and predictive analytics. Users can access Kaggle datasets
to practice data manipulation, apply machine learning techniques, and participate in competitions.
The platform also offers kernels (scripts) to help beginners and advanced users work with the
data easily. The few datasets found on Kaggle are shown below:

Sr. Dataset Explanation

No.
1. Student Performance The “Student Performance Factors” dataset could be found on Kaggle
Factors which includes student demographic information, academic support info
and personal factors related with a model for the prediction of performance.
The study includes demographic data of a child (e.g., gender, parental
education), school support or number of activities on subjects,
extracurricular and family background. It can be used to train models that
predict academic success, or study how different factors such as
demographics and financial aid are correlated with student outcomes. It
serves as a base for educators and scholars on which they can recognize
principal factors of academic success. It contains about 6600 samples of
data.
2. Mobile Device Usage This dataset has data on mobile phone usage patterns and how
and User Behaviour users interact with their phones. Some of these features include
Dataset demographics, app usage frequency, mobile device type, battery
drain, data usage, OS of the device and the time spent on different
activities. The data set is useful for understanding various segments
of mobile users, and informing trends in app usage, device
ownership and user behaviour on a mobile. Most common
applications are in Marketing, Behavioural studies and Mobile app
development. It contains 700 samples of user data. Each entry is
categorized into one of five user behaviour classes, ranging from
light to extreme usage, allowing for insightful analysis and modeling.
3. Electric Vehicle Sales by This dataset presents detailed data on electric vehicle sales across
State in India various Indian states. It includes information on the number of EVs
sold, the distribution of sales by state, and possibly other
demographic or geographic indicators influencing sales trends. This
dataset is useful for analysing the adoption rate of electric vehicles
across different regions in India and identifying patterns related to
state-wise EV penetration. It contains almost 98000 sample data.
4. 3D Printer Dataset for This dataset focuses on data relevant to 3D printing, including key
Mechanical Engineers parameters such as print speed, layer height, material used, and
resulting quality of prints. It offers insights into the impact of various
3D printing settings on the final output, making it useful for engineers
working on optimizing 3D printing processes. The dataset can be
applied to performance analysis, material efficiency, and quality
improvement in additive manufacturing. It contains almost 50 sample
data.
5. Materials and their This dataset has details for tensile strength, hardness, ductility and
Mechanical Properties elasticity with respect to different materials. This is perfect for
materials science research, engineering projects as well analysis of
material performance across various applications. The dataset aids
researchers and engineers to comprehend material behaviour in
different states, thus being beneficial for mechanical designing as
well selecting the best fit materials. It contains almost 1500 sample
data.
GitHub:
GitHub is a version control platform where developers can share code, collaborate on software
projects, and host open-source projects. It has repositories with datasets and machine learning
models contributed by users. Datasets on GitHub are created by developers, data scientists,
researchers, and organizations. The data hosted on GitHub varies in source. Some may be the
results of scraping, experiments, API integrations, or manually gathered data. It’s a mix of code-
generated datasets and manually curated data files. GitHub datasets are used by developers and
data scientists to test algorithms, run machine learning models, or build web and mobile
applications. GitHub hosts a wide variety of code repositories that include ready-to-use datasets.
Developers can clone these repositories, access the datasets, and integrate them into their
projects. GitHub’s collaborative features also enable contributions to ongoing datasets, improving
or expanding the available data. The few datasets found on GitHub are shown below:

Sr. Dataset Explanation

No.
1. Pipe Specification The dataset provides a user interface for industrial pipe material
Selection specifications, including material grade, material type, pressure
rating (flange), and corrosion allowance based on a specific fluid. It
is designed to help engineers and industry professionals in selecting
the appropriate piping materials and configurations for various fluid
transportation systems, ensuring safety and efficiency based on
environmental and operational conditions.
2. Car Simulations The dataset aims to simulate the multibody dynamics behavior of a
car body's motion as it moves over uneven terrain. It includes code
for simulating suspension systems, body movement, and interaction
with irregular surfaces. This dataset is useful for automotive
engineers and researchers focused on vehicle dynamics and
suspension design, providing a foundation to model and analyze
real-world driving scenarios.
3. Introduction to Mechanical The dataset offers resources related to the fundamentals of
Manufacturing mechanical manufacturing. It includes lecture notes, assignments,
and potentially data or examples on manufacturing processes such
as machining, casting, forming, and additive manufacturing. The
dataset is beneficial for students and professionals looking to
understand key concepts in manufacturing engineering and enhance
their practical skills in mechanical production.
4. Structural Analysis This dataset is a Julia package for topology optimization. It provides
tools for optimizing material layouts within a given design space,
under specified constraints and boundary conditions, to achieve the
best performance. This package is useful for mechanical engineers
and researchers working on structural optimization, offering
customizable simulations to design lightweight and efficient
structures.
5. Composite Materials The dataset is a Python package designed to solve problems related
to laminated composite materials using classical laminate theory
(CLT). It provides tools for computing mechanical properties,
stresses, strains, and failure criteria for composite laminates. The
dataset and code are particularly useful for mechanical and
aerospace engineers working on the analysis and optimization of
composite structures.
UCI: Machine Learning Repository:
The UCI Machine Learning Repository is a collection of databases, domain theories, and datasets
used by the machine learning community for empirical research in algorithms and data
exploration. Created by the University of California, Irvine (UCI), this repository has contributions
from academic researchers, engineers, and scientists. The datasets were collected from various
sources such as academic research papers, experiments, web scraping, and public records.
Some datasets are also results from competitions. These datasets are widely used in academic
research to test machine learning algorithms, explore data analysis techniques, and benchmark
performance. They're frequently cited in papers and research projects. Researchers, students,
and practitioners can access these datasets to validate algorithms, conduct data analysis, or for
teaching purposes. Many use the UCI datasets for educational projects and tutorials to learn data
preprocessing and algorithm development. The few datasets found on UCI are shown below:

Sr. Dataset Explanation

No.
1. Heart Disease The Heart Disease dataset from the UCI Machine Learning
Repository contains medical records used to diagnose heart
disease. It includes features such as age, gender, blood pressure,
cholesterol levels, and results from various medical tests. The target
variable indicates the presence or absence of heart disease. This
dataset is widely used for classification and prediction tasks in
medical diagnostics, particularly for predicting the likelihood of heart
disease based on patient data.
2. Car Evaluation The dataset contains data used to evaluate car acceptability based
on several features like buying price, maintenance cost, number of
doors, passenger capacity, trunk size, and safety features. The
dataset is useful for classification tasks, where the target variable
indicates the overall evaluation of the car (unacceptable, acceptable,
good, or very good). It's widely used for decision-making models in
the automotive industry.
3. Online Retail The dataset consists of transactional data from a UK-based online
retailer. It includes features such as invoice numbers, product
descriptions, quantities, prices, and customer identifiers. This
dataset is useful for analyzing customer behaviour, sales patterns,
and inventory management. It's commonly used for tasks like market
basket analysis, customer segmentation, and sales forecasting.
4. Individual Household The dataset contains measurements of electric power consumption
Electric Power in one household over a period. It includes features like date, time,
Consumption global active power, voltage, and energy sub-metering. This dataset
is valuable for time-series analysis and research on energy usage
patterns, efficiency, and forecasting in household settings.
5. Concrete Compressive The dataset contains data on concrete samples with varying
Strength compositions and curing times. It includes features such as the
amount of cement, water, and aggregates, as well as the
compressive strength of the concrete. This dataset is useful for
regression analysis and modelling the strength of concrete based on
its constituents, aiding in material science and construction
engineering studies.
Deep Learning Datasets with features:
The datasets of deep learning are shown below:

Sr. Dataset Explanation

No.
1. MNIST It's a dataset that has numbers, which's useful for AI to learn and
recognize patterns.
2. MS COCO A big dataset which helps AI with tasks, like detecting objects and
segmenting images. It has a range of contents, making it very
diverse.
3. ImageNet This collection contains several images that are categorized based
on WordNet. It's helpful for AI to learn concepts.
4. VisualQA It's a dataset that focuses on image related questions challenging AI
to use both vision and language skills.
5. CIFAR 10 This dataset includes ten categories of images. Its main purpose is
to train AI models in recognizing images.
6. Fashion MNIST Focusing on numbers this alternative data set concentrates on
fashion-related images. It helps AI fashion items effectively.
7. Street View House Like MNIST this one helps in recognizing street scenes.
Numbers
8. Sentiment140 This dataset is specifically designed for sentiment analysis helping
AI understand emotions expressed in text data.
9. WordNet An extensive database containing synonym words and their
associated concepts. It greatly assists AI in understanding language
comprehension.
10. Wikipedia Corpus A repository of information sourced from various articles. It serves as
a resource, for AI learning purposes.
11. Free Spoken Digit A dataset created for identifying spoken digits using samples.
12. Free Music Archive A vast music analysis dataset that consists of high-quality audio
features and metadata.
13. Ballroom A collection of audio excerpts representing various dance styles,
enabling AI to analyze musical patterns.
14. Million Song A repository of audio features and metadata for a million music
tracks, ideal for AI research.
15. LibriSpeech A dataset containing a thousand hours of English speech, to train AI
listening models.
16. VoxCeleb A speaker identification dataset derived from YouTube, featuring
famous voices.
17. Urban Sound A set of urban sound clips for AI to classify into different categories.
Classification
18. IMDB reviews A valuable dataset for AI, used in analysis, providing movies public
reviews.
19. Twenty Newsgroups A collection of a thousand Usenet articles from twenty newspapers,
assisting AI in text analysis.
20. Yelp Reviews A dataset of user reviews with images and varying file sizes, a fruitful
playground for AI study.

List of Datasets For Machine-Learning Research
100% (1)
List of Datasets For Machine-Learning Research
61 pages
List of Datasets For Machine-Learning Research
No ratings yet
List of Datasets For Machine-Learning Research
61 pages
List of Datasets For Machine-Learning Research
No ratings yet
List of Datasets For Machine-Learning Research
48 pages
Dataset Websites
No ratings yet
Dataset Websites
7 pages
Datasets
No ratings yet
Datasets
5 pages
Device Properties Overview
No ratings yet
Device Properties Overview
2 pages
Datasets for Aspiring Data Scientists
No ratings yet
Datasets for Aspiring Data Scientists
7 pages
Lecture 3
No ratings yet
Lecture 3
25 pages
BFS Use Cases in Data Science
No ratings yet
BFS Use Cases in Data Science
3 pages
Data Sources for Data Mining Explained
No ratings yet
Data Sources for Data Mining Explained
15 pages
Essential Data Science Resources List
No ratings yet
Essential Data Science Resources List
3 pages
Where To Find Data PDF
No ratings yet
Where To Find Data PDF
10 pages
Lab 4: Data Processing Techniques
No ratings yet
Lab 4: Data Processing Techniques
3 pages
Brian Matongora - Data Ecology Week One
No ratings yet
Brian Matongora - Data Ecology Week One
6 pages
DSBDA Manual
No ratings yet
DSBDA Manual
76 pages
Exp. 1 Demonstrate Various Industry-Based Data Science Tools
No ratings yet
Exp. 1 Demonstrate Various Industry-Based Data Science Tools
3 pages
Data Analysis and Machine Learning With Kaggle (2021) - Banachewicz & Massaron
No ratings yet
Data Analysis and Machine Learning With Kaggle (2021) - Banachewicz & Massaron
51 pages
3 Data
No ratings yet
3 Data
23 pages
Machine Learning Amarture Part 4
No ratings yet
Machine Learning Amarture Part 4
10 pages
Machine Learning 2
No ratings yet
Machine Learning 2
37 pages
Data Science
No ratings yet
Data Science
14 pages
AI and Python in Business Applications
No ratings yet
AI and Python in Business Applications
52 pages
Ahmed Rashad's Pioneer Assignment #1
No ratings yet
Ahmed Rashad's Pioneer Assignment #1
4 pages
AI Tools & Websites
No ratings yet
AI Tools & Websites
7 pages
Instructions For Big Data Assignment
No ratings yet
Instructions For Big Data Assignment
5 pages
Macse502 Programming-For-data-science Eth 1.0 83 Macse502
No ratings yet
Macse502 Programming-For-data-science Eth 1.0 83 Macse502
4 pages
Essential Data Science Projects Guide
No ratings yet
Essential Data Science Projects Guide
1 page
Python For Data Science
No ratings yet
Python For Data Science
22 pages
Open Datasets For Data Science
No ratings yet
Open Datasets For Data Science
1 page
SL-III Lab Manual
No ratings yet
SL-III Lab Manual
74 pages
Data Science Self-Learning Guide
100% (3)
Data Science Self-Learning Guide
16 pages
Lecture 4 - Machine Learning Pipeline
No ratings yet
Lecture 4 - Machine Learning Pipeline
38 pages
Kaggle Book
100% (1)
Kaggle Book
57 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
51 pages
19 No-Code Data Science Tools
No ratings yet
19 No-Code Data Science Tools
8 pages
Python Projects for Learners
No ratings yet
Python Projects for Learners
9 pages
Data Science
No ratings yet
Data Science
9 pages
Data Analysis with Orange Tool
No ratings yet
Data Analysis with Orange Tool
8 pages
Where To Find Large Datasets Open To The Public
No ratings yet
Where To Find Large Datasets Open To The Public
41 pages
DBDAL LAB - MANUAL - Final
No ratings yet
DBDAL LAB - MANUAL - Final
93 pages
DSBDAlab Manual
No ratings yet
DSBDAlab Manual
116 pages
Ai Procycle
No ratings yet
Ai Procycle
13 pages
Data Analytics QP May 25
No ratings yet
Data Analytics QP May 25
4 pages
Fact Sheet: Emerging Data Tools
No ratings yet
Fact Sheet: Emerging Data Tools
1 page
DT-1 Sample Worksheet
No ratings yet
DT-1 Sample Worksheet
3 pages
Practical 1
No ratings yet
Practical 1
8 pages
10 Data Science Project Ideas
No ratings yet
10 Data Science Project Ideas
12 pages
Top 5 Free Data Resources To Practice Data Analytics
No ratings yet
Top 5 Free Data Resources To Practice Data Analytics
1 page
Bigdata CW
No ratings yet
Bigdata CW
31 pages
Data Science
No ratings yet
Data Science
8 pages
Day5 FDP IoT Part1
No ratings yet
Day5 FDP IoT Part1
89 pages
Week1 Exploratory Data Analysis
No ratings yet
Week1 Exploratory Data Analysis
2 pages
Python Programming for Data Science Lab
100% (1)
Python Programming for Data Science Lab
29 pages
2023 Data Science Survey Insights
No ratings yet
2023 Data Science Survey Insights
2 pages
Ch 7b
No ratings yet
Ch 7b
15 pages
Lecture 1
No ratings yet
Lecture 1
25 pages
4 Me421 Sdof Undamped
No ratings yet
4 Me421 Sdof Undamped
13 pages
Week 4
No ratings yet
Week 4
18 pages
Quant Test 8 Scholar Den
No ratings yet
Quant Test 8 Scholar Den
110 pages
PEL Internship Project Report
No ratings yet
PEL Internship Project Report
17 pages
Week 5
No ratings yet
Week 5
15 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
FFR Report Final PDF
No ratings yet
FFR Report Final PDF
13 pages
Fatigue
No ratings yet
Fatigue
126 pages
Code
No ratings yet
Code
5 pages
John Yee PHD CV
No ratings yet
John Yee PHD CV
6 pages
SAP - EEP - PTP - Service Entry Sheet User Manual V2.
No ratings yet
SAP - EEP - PTP - Service Entry Sheet User Manual V2.
34 pages
Blablabla
No ratings yet
Blablabla
120 pages
Untitled
No ratings yet
Untitled
389 pages
Powerdrive MD2: Commissioning Manual
No ratings yet
Powerdrive MD2: Commissioning Manual
180 pages
SB Imperva SecureSphere CEF Guide
No ratings yet
SB Imperva SecureSphere CEF Guide
21 pages
FANOVI启动手册与问题解决
No ratings yet
FANOVI启动手册与问题解决
43 pages
SCRIPT - To Tune The 'SESSION - CACHED - CURSORS' and 'OPEN - CURSORS' Parameters (ID 208857.1)
No ratings yet
SCRIPT - To Tune The 'SESSION - CACHED - CURSORS' and 'OPEN - CURSORS' Parameters (ID 208857.1)
3 pages
Freebitcoin No Captcha With Lottery Auto Roll 100 Reward Points 1000 Bonus BTC PDF Free
No ratings yet
Freebitcoin No Captcha With Lottery Auto Roll 100 Reward Points 1000 Bonus BTC PDF Free
4 pages
Convert Any Website To Android App Using Android Studio - TechsBucket
100% (1)
Convert Any Website To Android App Using Android Studio - TechsBucket
16 pages
Oracle CPU Spikes: Analyzing Top SQL
No ratings yet
Oracle CPU Spikes: Analyzing Top SQL
7 pages
A1 Worksheet Part 2
No ratings yet
A1 Worksheet Part 2
5 pages
Python Programming PRACTICAL NO.9 ANSWERS
No ratings yet
Python Programming PRACTICAL NO.9 ANSWERS
6 pages
Digital India Initiatives Overview
No ratings yet
Digital India Initiatives Overview
17 pages
Geostatistical Ore Reserve Estimation
No ratings yet
Geostatistical Ore Reserve Estimation
8 pages
50 Most Important Formulas in Excel PDF
100% (1)
50 Most Important Formulas in Excel PDF
42 pages
PowerPoint Slide Layout Guide
No ratings yet
PowerPoint Slide Layout Guide
8 pages
Policies
No ratings yet
Policies
13 pages
SW Ins
No ratings yet
SW Ins
12 pages
Frontend Development Javascript React Roadmap
No ratings yet
Frontend Development Javascript React Roadmap
26 pages
DynaPath Delta User Manual
79% (14)
DynaPath Delta User Manual
477 pages
ERP Trends and Implementation
No ratings yet
ERP Trends and Implementation
18 pages
Blockchain Unconfirmed Transaction NEW Hack Script 2021
No ratings yet
Blockchain Unconfirmed Transaction NEW Hack Script 2021
4 pages
PPDM Plus Dps Plus Ordering Licensing Guide For Dsa Gii
No ratings yet
PPDM Plus Dps Plus Ordering Licensing Guide For Dsa Gii
99 pages
Is Assignment 1 10 B
No ratings yet
Is Assignment 1 10 B
3 pages
A8 Printer Network Configuration Tool (v1.0.1) Manual ENG
No ratings yet
A8 Printer Network Configuration Tool (v1.0.1) Manual ENG
59 pages
Advanced Penetration Testing
No ratings yet
Advanced Penetration Testing
7 pages
Open Broadcaster Software Overview
No ratings yet
Open Broadcaster Software Overview
35 pages
Enterprise Application Development With Java EE
No ratings yet
Enterprise Application Development With Java EE
21 pages
Export Import Made Very Easy - Learn Import Export Business Like Abcd (Exim Book + Online Support + Updates)
No ratings yet
Export Import Made Very Easy - Learn Import Export Business Like Abcd (Exim Book + Online Support + Updates)
9 pages
Data Types & Programming Techniques
No ratings yet
Data Types & Programming Techniques
11 pages