
Domain 2 - Data Collection, Processing and Engineering
MCQ
1. What happens if data exists in multiple locations when building AI-specific data sets?
The data needs to be imported and transformed into a single dataset
New data needs to be collected from a single location instead
One data set needs to be chosen as a primary data set
New data needs to be created
2. After data is collected for use with an AI model, what is the next step?
Determining the data type
Determining the characteristics of the data
Looking for missing or corrupt data elements
Assessing the quality of the data
3. Why would data collected for an AI model need to be converted into binary?
To make looking for corrupt or missing data easier
So the data is a more manageable size
To ensure there is enough data
So the AI model understands it
4. When might one choose a local AI hosting solution for their data?
When data is particularly sensitive and requires stringent security controls
When a project requires extensive computational power
When everyone using the data is in one location
When they want lower initial cost and lower maintenance
5. What is a white paper?
A comprehensive document serving as a complete reference for an AI project, outlining its
design, implementation, and outcomes.
A troubleshooting guide for AI models
A procedure with a sequence of operations
A report identifying relevant correlations in a data set
6. What does processing a data set involve?
Transforming and manipulating data to ensure it is ready to be used in an AI model
Building the initial vector features for an AI model
Creating a true picture of a real-life situation in which data is entered into an AI model
Ensuring that data sets are large enough to build an unbiased AI model
7. What is a feature vector?
An ordered list of numerical properties of observed phenomena
A characteristic of data, such as numeric, string, or date
A repository used to store code for development projects
A tool used to cleanse data to ready it for an AI learning model
8. Why are feature vectors needed?
To ensure data integrity so an AI model can make accurate predictions
To ensure a data set is large enough to create an unbiased AI model
To ensure data is balanced so an AI model can make accurate predictions
To convert data into a machine-readable format so that an AI model can make informed predictions
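As a quick illustration of the feature vector concept in questions 7 and 8, here is a minimal Python sketch (the record fields and category mapping are made up for the example) showing how one raw observation becomes an ordered list of numbers a model can consume.

```python
# Hypothetical raw observation; the field names are illustrative only.
raw_record = {"age": 34, "height_cm": 172.0, "membership": "premium"}

# Categorical values must be mapped to numbers before joining the vector.
membership_levels = {"basic": 0, "standard": 1, "premium": 2}

# The feature vector: an ordered list of numerical properties of the observation.
feature_vector = [
    float(raw_record["age"]),
    raw_record["height_cm"],
    float(membership_levels[raw_record["membership"]]),
]
print(feature_vector)  # [34.0, 172.0, 2.0]
```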
9. What are some important questions to ask when assessing data quality? Choose 3 answers.
Does it contain personal information?
Is it complete?
Is it large enough to produce the results without being too large?
Is it balanced?
Does it have corrupt elements?
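The quality checks in question 9 (completeness, size, balance, corrupt elements) can be sanity-checked in code. Below is a minimal pandas sketch, assuming an illustrative file named collected_data.csv with a target column named label; both names are assumptions for the example.

```python
import pandas as pd

# Illustrative file and column names; substitute your own data set.
df = pd.read_csv("collected_data.csv")

# Is it complete? Count missing values per column.
print(df.isna().sum())

# Is it large enough? A simple row count as a first sanity check.
print(f"rows: {len(df)}")

# Is it balanced? Inspect the class distribution of the target column.
print(df["label"].value_counts(normalize=True))
```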
10. What is the most important question to answer when determining which features to use in
AI machine learning?
Is this feature expensive to use?
Is this feature big enough?
Does this feature impact the outcome and help the model's performance?
Does this feature make the AI model look better?
11. Which of the following are considerations for data when working with an AI model?
Data quality
Data corruption
Data representation
Data type
12. Which statements regarding data set size are true? Choose 3 answers.
There is a set percentage of statistical data that one must adhere to when collecting data
The size of a data set is important
The size of a data set is not important
The amount of data collected must be sufficient to build an unbiased AI model
If there is not enough data for a data set, one should consider increasing the number of
sources used
13. A recreational area is starting a new hikers' programme and wants to group people based on
experience level. No data currently exists to show these categories. What is the best way to
collect this data?
Surveys from existing hikers
Data from an IoT device
A web crawler to extract data from websites
Records of the number of existing hikers
14. Which three statements regarding a data test set are true?
It should include proper representation of each category or class of data
It should be used to train an AI model
It should be updated accordingly as data changes
It should include random sampling of data
Its records should be kept with the training set records throughout the AI building process
15. The data set an AI model accesses usually comes in which two forms?
Training and assessment
Numerical and alphabetical
Beta and production
Training and testing
16. Match the area of importance in documenting data decisions to its definition.
Assumptions → Beliefs or conditions taken to be true for a system to work as intended
Predicates → Logical statements or conditions defining properties or relationships between
different entities in an AI system
Constraints → Restrictions on an AI system
17. Which three statements regarding data for the data collection process are true?
It must be a good starting point for an AI model or it cannot be used
It should be as free from bias as possible
It should be relevant to the problem being solved
It needs to be large enough to produce solid outcomes
It should contain personal information
18. Match the type of data collection bias to its description.
Selection bias → When the data collected does not represent the entire population intended
to be analyzed
Digital divide bias → When the needs and opinions of those with limited access to or ability
with technology are overlooked
Observer bias → When subjective interpretation influences the data being collected
Historical bias → When data is only collected at a specific time rather than at all times of the
year
19. What is the purpose of randomizing data when building training and testing data sets?
To get a true picture of a real-life situation for data entry into an AI model
To ensure the AI model does not become confused when making predictions
To ensure data is error-free
To ensure the data sets are large enough to build an unbiased AI model
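To make the idea of randomized training and testing sets in questions 14, 15, and 19 concrete, here is a minimal sketch using scikit-learn's train_test_split (the toy X and y values are made up): shuffling gives a realistic mix of records, and stratifying keeps each class properly represented in both sets.

```python
from sklearn.model_selection import train_test_split

# Toy feature matrix X and binary labels y, for illustration only.
X = [[i] for i in range(100)]
y = [i % 2 for i in range(100)]

# shuffle=True randomizes the records before splitting; stratify=y keeps
# each class proportionally represented in the training and testing sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, shuffle=True, stratify=y, random_state=42
)
print(len(X_train), len(X_test))  # 80 20
```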
20. What is involved in deep transparency?
Retraining a model to use different settings, such as parameters
Keeping track of important details such as algorithms, data, interpretability, auditing, and
accountability
Managing customer expectations by not overpromising
Building one or more connections between AI and data, specifically between the applications
and the data being used to develop an AI model
21. Arrange the steps for feature engineering in the correct order.
Double check that the features used are relevant to the solution → position 1
Categorize features into different types → position 2
Transform any data not in the best format → position 3
Validate transformations to ensure they will help build as accurate an AI model as possible
→ position 4
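A minimal pandas sketch of those four feature engineering steps, using a made-up customer table; the column names and the churn scenario are assumptions for illustration only.

```python
import pandas as pd

df = pd.DataFrame({
    "age": [34, 41, 29],
    "signup_date": ["2021-01-03", "2020-11-20", "2022-06-15"],
    "plan": ["basic", "premium", "basic"],
    "favourite_colour": ["red", "blue", "green"],  # likely irrelevant to churn
})

# 1. Keep only features believed relevant to the solution.
df = df.drop(columns=["favourite_colour"])

# 2. Categorize features into different types.
numeric_cols = ["age"]
date_cols = ["signup_date"]
categorical_cols = ["plan"]

# 3. Transform data not in the best format: dates become tenure in days,
#    categories become numeric codes.
df["signup_date"] = pd.to_datetime(df["signup_date"])
df["tenure_days"] = (pd.Timestamp("2023-01-01") - df["signup_date"]).dt.days
df["plan_code"] = df["plan"].astype("category").cat.codes
df = df.drop(columns=["signup_date", "plan"])

# 4. Validate the transformations: everything should now be numeric and non-null.
assert df.select_dtypes(include="number").shape[1] == df.shape[1]
assert not df.isna().any().any()
print(df)
```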
22. Match the appropriate technique or tool with the data quality issue it handles.
Imputation methods → Missing data
Consistency checks → Misaligned data
Anomaly detection techniques → Data corruption
Cybersecurity measures → External threats such as viruses
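As a small illustration of the first and third pairings in question 22, the sketch below imputes a missing value with the column median and flags a corrupt reading with a simple interquartile-range rule; the temperature values are invented, and a basic IQR rule stands in for a full anomaly-detection technique.

```python
import pandas as pd

# Illustrative data with a missing value and an obviously corrupt reading.
df = pd.DataFrame({"temperature": [21.0, 22.5, None, 23.1, 980.0]})

# Imputation method for missing data: fill gaps with the column median.
median = df["temperature"].median()
df["temperature"] = df["temperature"].fillna(median)

# Simple anomaly check for data corruption: flag values far outside
# the interquartile range.
q1, q3 = df["temperature"].quantile([0.25, 0.75])
iqr = q3 - q1
outliers = df[(df["temperature"] < q1 - 1.5 * iqr) | (df["temperature"] > q3 + 1.5 * iqr)]
print(outliers)
```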
23. Why is it important to consult a relevant subject matter expert in the field an AI model's
solution is designed for? Choose 2 answers.
To identify any risky features involving demographic information
To verify that the features selected for the model are valid
To build initial vector features for an AI model
To ask them to write programming code for the AI model
To have them test the AI model for us
24. What are tokens in relation to AI building?
Smaller units of words and sentences
Smaller units of time
Numerical representations of data
Part of the Python programming language
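A minimal sketch of word-level tokenization in plain Python, showing tokens as smaller units of a sentence; real AI systems use dedicated tokenizers that also split words into sub-word units.

```python
# Naive word-level tokenization: split on whitespace and strip punctuation.
sentence = "Data must be tokenized before a language model can use it."
tokens = [word.strip(".,!?").lower() for word in sentence.split()]
print(tokens)
# ['data', 'must', 'be', 'tokenized', 'before', 'a', 'language', 'model', 'can', 'use', 'it']
```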
25. Which statements regarding feature vectors are true? Choose 3 answers.
They help ensure a data set is large enough to create an unbiased AI model
They can be numerical or categorical
Inconsistencies in vectors will cause inconsistencies within AI predictions
Multiple feature vectors across features need to be scaled properly
They can only be numerical
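The scaling point in question 25 can be illustrated with scikit-learn's StandardScaler; the feature vectors below (age in years, income in dollars) are made up for the example.

```python
from sklearn.preprocessing import StandardScaler

# Features on very different scales can dominate a model unless the
# vectors are scaled consistently across all records.
feature_vectors = [[25, 40000.0], [39, 85000.0], [51, 62000.0]]

scaler = StandardScaler()
scaled = scaler.fit_transform(feature_vectors)
print(scaled)  # each column now has mean 0 and unit variance
```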
26. Which statements regarding cloud-based AI hosting solutions are true? Choose 3 answers.
There is no need to maintain physical servers
They offer scalable resources
They have the advantage of built-in tools and services for AI and machine learning
They offer greater control and data security
They come with a higher initial investment and maintenance cost
27. What are three budget considerations when planning an AI project?
Legal requirements specific to the industry for which the AI model is being used
Cost-benefit analysis to ensure a solid return on investment
Equipment for testing, processing, and running the AI model
Technological and human resources needed
Guidelines used for algorithm selection
28. Which three statements are true regarding converting data to a format AI can process?
Data needs to be converted to all share the same data type
Many systems require images to be converted to binary to recognize them
Conversion may be done for you by some software
AI systems can be programmed to take images and render them into binary numbers for
proper rendering
Data needs to be turned into tokens
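A minimal sketch of rendering an image into binary numbers, as described in question 28, using the Pillow library; the file name photo.png is an assumption, and many frameworks perform this conversion for you.

```python
from PIL import Image

# Hypothetical file name; any image on disk would do.
img = Image.open("photo.png")

# Convert to a 1-bit black-and-white image, then read the raw pixel values,
# which is the kind of numerical/binary form many AI systems expect.
binary_img = img.convert("1")
pixel_values = list(binary_img.getdata())  # 0 or 255 per pixel
print(pixel_values[:20])
```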
