Chapter 4: Empirical Investigation

SE Investigation
What is software engineering investigation?
Applying scientific principles and techniques to investigate
properties of software and of software-related tools and
techniques.
Scientific investigation is a rigorous, systematic approach,
designed to eliminate bias and other subjective influences in the
search for, identification of, and measurement or validation of
facts and cause-effect relationships, from which scientific laws
may be deduced.
Why talk about software engineering investigation?
Because the standard of empirical software engineering
research is quite poor.
SE Investigation: Examples
Experiment to confirm rules of thumb
Should the LOC in a module be less than 200?
Should the number of branches in any functional
decomposition be less than 7?
Experiment to explore relationships
How does the project team experience with the application
affect the quality of the code?
How does the requirements quality affect the productivity of
the designer?
How does the design structure affect maintainability of the
code?
Experiment to initiate novel practices
Would it be better to start OO design with UML?
Would the use of SRE improve software quality?
SE Investigation: Why?
To improve (process and/or product)
To evaluate (process and/or product)
To prove a theory or hypothesis
To disprove a theory or hypothesis
To understand (a scenario, a situation)
To compare (entities, properties, etc.)
SE Investigation: What?
Person’s performance
Tool’s performance
Person’s perceptions
Tool’s usability
Document’s understandability
Program’s complexity, etc.
SE Investigation: Where & When?
In the field
In the lab
In the classroom
Anytime, depending on what questions you are asking
SE Investigation: How?
Hypothesis/question generation
Data collection
Data evaluation
Data interpretation
Feeding results back into an iterative process
SE Investigation: Characteristics
Data sources come from industrial settings
This may include people, program code, etc.
Usually:
Surveys
Case studies (hypothesis generation)
Experiments (hypothesis testing)
Where Does the Data Come From?
First Degree Contact
Direct access to participants
Example:
Brainstorming
Interviews
Questionnaires
System illustration
Work diaries
Think-aloud protocols
Participant observation
Where Does the Data Come From?
Second Degree Contact
Access to work environment during work time,
but not necessarily participants
Example:
Instrumenting systems
Real-time monitoring
Where Does the Data Come From?
Third Degree Contact
Access to work artifacts, such as source code,
documentation
Example:
Problem report analysis
Documentation analysis
Analysis of tool logs
Off-line monitoring
Practical Considerations
Hidden Aspects of Performing Studies
Negotiations with industrial partners
Obtaining ethics approval and informed consent
from participants
Adapting “ideal” research designs to fit with
reality
Dealing with the unexpected
Staffing of project
Investigation Principles
There are four main principles of investigation:
1. Selecting investigation technique: conducting surveys,
case studies, formal experiments
2. Stating the hypothesis: What should be investigated?
3. Maintaining control over variables: dependent and
independent variables
4. Making the investigation meaningful: verifying theories,
evaluating the accuracy of models, validating measurement
results.
SE Investigation Techniques
Three ways to investigate:
Formal experiment: A controlled investigation of an
activity, by identifying, manipulating and documenting
key factors of that activity.
Case study: Document an activity by identifying key
factors (inputs, constraints and resources) that may affect
the outcomes of that activity.
Survey: A retrospective study of a situation to try to
document relationships and outcomes.
Guidelines for Choosing a Particular
Investigation Method
If the activity has already occurred, we can perform a survey
or a case study. If it is yet to occur, then a case study or a
formal experiment may be chosen.
If we have a high level of control over the variables that
can affect the outcome, then we can use a formal experiment.
If we have no control over the variables, then a case study
is the preferred technique.
If replication is not possible at higher levels, then a formal
experiment is not possible.
If the cost of replication is low, then we can consider a
formal experiment.
Hypothesis
The first step is deciding what to investigate.
The goal of the research can be expressed as a
hypothesis, stated in quantifiable terms, that is to be tested.
The test result (the collected data) will confirm or
refute the hypothesis.
Example:
Can Software Reliability Engineering (SRE) help us
achieve an overall improvement in software
development practice in our company?
Examples /1
Experiment: research in the small
You have heard about software reliability
engineering (SRE) and its advantages and may
want to investigate whether to use SRE in your
company. You may design a controlled (dummy)
project and apply the SRE technique to it. You
may want to experiment with the various phases
of its application (defining the operational profile,
developing test cases, and deciding on the
adequacy of test runs) and document the results
for further investigation.
Examples /2
Case study: research in the typical
You may have used software reliability
engineering (SRE) for the first time in a project
in your company. After the project is completed,
you may perform a case-study to capture the
effort involved (budget, personnel), the number
of failures investigated, and the project duration.
Examples /3
Survey: research in the large
After you have used SRE in many projects in
your company, you may conduct a survey to
capture the effort involved (budget, personnel),
the number of failures investigated, and the
project duration for all the projects. Then, you
may compare these figures with those from
projects using conventional software testing
techniques to see whether SRE leads to an
overall improvement in practice.
Hypothesis (cont’d)
Other Examples:
Can integrated development and testing tools
improve our productivity?
Does Cleanroom software development produce
better-quality software than using the
conventional development methods?
Does code produced using Agile software
development have a lower number of defects per
KLOC than code produced using the
conventional methods?
Control /1
What variables may affect the truth of a hypothesis? How do
they affect it?
Variable:
Independent (values are set by the experiment or
initial conditions)
Dependent (values are affected by change of
other variables)
Example: Effect of “programming language” on the “quality” of
resulting code.
Programming language is the independent variable
and quality is the dependent variable.
Control /2
A common mistake: ignoring other variables that may
affect the values of a dependent variable.
Example:
Suppose you want to determine whether a change in
programming language (independent variable) can affect
the productivity (dependent variable) of your project. For
instance, you currently use FORTRAN and you want to
investigate the effects of changing to Ada. The values of all
other variables should stay the same (e.g., application
experience, programming environment, type of problem,
etc.)
Without this, you cannot be sure that the difference in
productivity is attributable to the change in language.
But the list of other variables may grow beyond control!
Control /3
How to identify the dependent and
independent variables?
Example:
A → D
F & B → Z
D & C → F
Given: A, B, C
Using causal ordering: A, B, C → D → F → Z
Planning Experiments
A Process Model for Performing Experiments: six-phase
process
1. Conception
Defining the goal of investigation
2. Design
Generating quantifiable (and manageable)
hypotheses to be tested
Defining the experimental objects or units
Identifying the experimental subjects
Identifying the response variable(s)
Planning Experiments
3. Preparation
Getting ready to start, e.g., purchasing tools,
hardware, training personnel, etc.
4. Execution
5. Review and analysis
Review the results for soundness and validity
6. Dissemination & decision making
Documenting conclusions
Formal Experiments: Principles
1. Replication
An experiment should be repeatable under
identical conditions.
Confounded results (where the effects of two or
more variables cannot be separated) should be
avoided.
2. Randomization
The experimental trials must be organized in such
a way that the effects of uncontrolled variables
are minimized.
Types of experimental design
crossing and nesting
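The randomization step above can be sketched in code. This is an illustrative sketch, not part of the slides: the subject names, the fixed seed, and the three-treatment setup are assumptions.

```python
import random

subjects = [f"dev{i}" for i in range(1, 13)]   # 12 hypothetical developers
treatments = ["A", "B", "C"]                   # three hypothetical treatments

random.seed(42)            # fixed seed only so the sketch is reproducible
random.shuffle(subjects)   # randomize the order of subjects

# Deal the shuffled subjects round-robin into the treatments, so the
# effects of uncontrolled subject differences are spread at random.
assignment = {t: subjects[i::len(treatments)] for i, t in enumerate(treatments)}

for treatment, group in assignment.items():
    print(treatment, group)
```

Each treatment ends up with four subjects, and which subjects land in which group is determined by chance rather than by any uncontrolled variable.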
Formal Experiments: Principles
3. Local control
Blocking: allocating experimental units to blocks or
groups so the units within a block are relatively
homogeneous. The blocks are designed so that the
experimental design captures the anticipated variation
in the blocks by grouping like varieties, so that the
variation does not contribute to the experimental error.
Balancing: blocking and assigning treatments so
that an equal number of subjects is assigned to each
treatment. Balancing is desirable because it simplifies
the statistical analysis.
Example: Blocking & Balancing
You are investigating the comparative effects of three design techniques
on the quality of the resulting code.
The experiment involves teaching the techniques to 12 developers and
measuring the number of defects found per 1000 LOC to assess the code
quality.
It may be the case that the twelve developers graduated from three
universities. It is possible that the universities trained the developers in
very different ways, so that being from a particular university can affect
the way in which the design technique is understood or used.
To eliminate this possibility, three blocks can be defined so that the first
block contains all developers from university X, the second block from
university Y, and the third block from university Z. Then, the treatments
are assigned at random to the developers from each block. If the first
block has six developers, two are assigned to design method A, two to B,
and two to C.
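The blocked, balanced assignment described above can be sketched as follows; the developer identifiers are made up, and only the universities X, Y, Z and the three techniques come from the example.

```python
import random
from collections import Counter

# Blocks: developers grouped by university, so each block is
# relatively homogeneous.
blocks = {
    "X": ["x1", "x2", "x3", "x4", "x5", "x6"],  # first block has six developers
    "Y": ["y1", "y2", "y3"],
    "Z": ["z1", "z2", "z3"],
}
treatments = ["A", "B", "C"]  # the three design techniques

random.seed(7)
assignment = {}
for university, developers in blocks.items():
    shuffled = random.sample(developers, len(developers))  # randomize within block
    for i, dev in enumerate(shuffled):
        assignment[dev] = treatments[i % len(treatments)]  # balance treatments

# Within block X (six developers), each technique is assigned to exactly two.
print(Counter(assignment[d] for d in blocks["X"]))
```

Randomizing only *within* each block keeps the university effect out of the treatment comparison, while the round-robin assignment keeps the design balanced.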
Formal Experiments: Principles
3. Local control
Correlation: the most popular technique for assessing
relationships among observational data.
Correlation may be linear or nonlinear.
Nonlinear correlation is hard to measure and may stay hidden.
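A minimal sketch of why a nonlinear relationship can stay hidden: the data below are invented for illustration, and the Pearson coefficient is computed by hand from its definition.

```python
import math

def pearson(xs, ys):
    """Linear (Pearson) correlation coefficient of two equal-length samples."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

xs = [-3, -2, -1, 0, 1, 2, 3]
linear = [2 * x + 1 for x in xs]   # perfect linear relationship
quadratic = [x * x for x in xs]    # strong but nonlinear relationship

print(pearson(xs, linear))     # ≈ 1.0: linear correlation detects it
print(pearson(xs, quadratic))  # 0.0 by symmetry: the relationship stays hidden
```

Even though `quadratic` is completely determined by `xs`, the linear correlation is zero, which is exactly the kind of hidden relationship the slide warns about.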
Formal Experiments: Types
Factorial design:
Crossing (each level of each factor appears with each
level of the other factor)
Nesting (each level of one factor occurs entirely in
conjunction with one level of another)
A proper nested or crossed design may reduce the
number of cases to be tested.
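Crossing and nesting can be sketched concretely; the factor levels below are illustrative assumptions (the languages borrow from the earlier FORTRAN/Ada example, the experience levels and team names are made up).

```python
from itertools import product

languages = ["FORTRAN", "Ada"]       # factor 1
experience = ["junior", "senior"]    # factor 2

# Crossed design: every level of each factor appears with every
# level of the other factor -> 2 x 2 = 4 experimental cells.
crossed = list(product(languages, experience))
print(crossed)

# Nested design: each level of one factor occurs entirely within one
# level of the other, e.g. each team works in only one language.
nested = {
    "FORTRAN": ["team1", "team2"],
    "Ada": ["team3", "team4"],
}
```

In the crossed design every language/experience combination is tested; in the nested design each team appears under exactly one language, so the team factor is nested within the language factor.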
Formal Experiments: Types
Advantages of factorial design
Resources can be used more efficiently
Coverage (completeness) of the target
variables’ range of variation
Implicit replication
Disadvantages of factorial design
Higher costs of preparation, administration
and analysis
Number of combinations will grow rapidly
Some of the combinations may be worthless
Formal Experiments: Selection
Selecting the number of
variables:
Single variable
Multiple variables
Example: measuring the time to code a program module
with or without using a reusable repository:
without considering the effect of programmer
experience (a single-variable design), or
considering the effect of programmer experience
(a multiple-variable design)
Formal Experiments: Baselines
A baseline is an “average” treatment of a variable across a
number of experiments (or case studies).
It provides a measure for identifying whether a value is
within an acceptable range.
It may help to check the validity of measurements.
Empirical Research Guidelines
Contents
1. Experimental context
2. Experimental design
3. Data collection
4. Analysis
5. Presentation of results
6. Interpretation of results
1. Experimental Context
Goals:
Ensure that the objectives of the experiment
have been properly defined
Ensure that the description of the experiment
provides enough detail for practitioners
1. Experimental Context
C1: Be sure to specify as much of the context as possible. In
particular, clearly define the entities, attributes and measures
that capture the contextual information.
C2: If a specific hypothesis is being tested, state it clearly
prior to performing the study, and discuss the theory from
which it is derived, so that its implications are apparent.
C3: If the research is exploratory, state clearly, prior to
data analysis, what questions the investigation is intended to
address, and how it will address them.
2. Experimental Design
Goal:
Ensure that the design is appropriate for the
objectives of the experiment
Ensure that the objective of the experiment
can be reached using the techniques specified
in the design
2. Experimental Design /1
D1: Identify the population from which the subjects and
objects are drawn.
D2: Define the process by which the subjects and objects
were selected (inclusion/exclusion criteria).
D3: Define the process by which subjects and objects are
assigned to treatments.
D4: Restrict yourself to simple study designs or, at least, to
designs that are fully analyzed in the literature.
D5: Define the experimental unit.
2. Experimental Design /2
D6: For formal experiments, perform a pre-experiment or
pre-calculation to identify or estimate the minimum
required sample size.
D7: Use appropriate levels of blinding.
D8: Avoid the use of controls unless you are sure
the control situation can be unambiguously defined.
D9: Fully define all treatments (interventions).
D10: Justify the choice of outcome measures in
terms of their relevance to the objectives of the
empirical study.
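The pre-calculation in guideline D6 can be sketched with the standard sample-size approximation for comparing two group means. The effect size and standard deviation below are illustrative assumptions, not values from the guidelines.

```python
import math

def min_group_size(delta, sigma, z_alpha=1.96, z_beta=0.84):
    """Approximate subjects per group for a two-sided two-sample test:
    n >= 2 * (z_{1-alpha/2} + z_{1-beta})^2 * sigma^2 / delta^2
    The defaults correspond roughly to alpha = 0.05 and power = 0.80."""
    return math.ceil(2 * (z_alpha + z_beta) ** 2 * (sigma / delta) ** 2)

# Hypothetical: detect a difference of 2 defects/KLOC between treatments
# when the standard deviation is 3 defects/KLOC.
print(min_group_size(delta=2, sigma=3))
```

Running this kind of calculation before the experiment tells you whether the available subjects can plausibly detect the effect you care about, which is exactly what D6 asks for.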
3. Data Collection
Goal
Ensure that the data collection process is well
defined
Monitor the data collection and watch for
deviations from the experiment design
3. Data Collection
DC1: Define all software measures fully, including the
entity, attribute, unit and counting rules.
DC2: Describe any quality control method used to ensure
completeness and accuracy of data collection.
DC3: For observational studies and experiments, record data
about subjects who drop out from the studies.
DC4: For observational studies and experiments, record data
about other performance measures that may be adversely
affected by the treatment, even if they are not the main focus
of the study.
4. Analysis
Goal
Ensure that the data collected from the
experiment are analyzed correctly
Monitor the data analysis and watch for
deviations from the experiment design
4. Analysis
A1: Specify any procedures used to control for
multiple testing.
A2: Consider using blind analysis (avoid “fishing
for results”).
A3: Perform sensitivity analysis.
A4: Ensure that the data do not violate the
assumptions of the tests used on them.
A5: Apply appropriate quality control procedures to
verify the results.
5. Presentation of Results
Goal
Ensure that the reader can understand the
objective, the process, and the results of the
experiment
5. Presentation of Results
P1: Describe or cite a reference for all procedures used.
Report or cite the statistical package used.
P2: Present quantitative results as well as significance levels.
Quantitative results should show the magnitude of effects
and the confidence limits.
P3: Present the raw data whenever possible. Otherwise,
confirm that they are available for review by the reviewers
and independent auditors.
P4: Provide appropriate descriptive statistics.
P5: Make appropriate use of graphics.
6. Interpretation of Results
Goal
Ensure that the conclusions are derived
solely from the results of the experiment
6. Interpretation of Results
I1: Define the population to which inferential
statistics and predictive models apply.
I2: Differentiate between statistical significance and
practical importance.
I3: Specify any limitations of the study.