UVA-QA-0779
Nov. 7, 2011
MODELING DISCRETE CHOICE: CATEGORICAL DEPENDENT VARIABLES,
LOGISTIC REGRESSION, AND MAXIMUM LIKELIHOOD ESTIMATION
Consider an individual choosing between two or more discrete alternatives: a shopper in
a grocery store deciding between apple or orange juice, or a prospective student determining
which of several university offers he ought to accept. For the juice manufacturer and the
university, the ability to predict the outcome of such choices is of vital importance.
In this note, we will discuss how this might be done. The process we will follow bears
some similarity to a regular linear regression but also has substantial differences, primarily due
to the fact that the choices are discrete; that is, they correspond to a categorical dependent
variable in regression.
The Concept of Utility
A fundamental construct in estimating choice behavior is the concept of utility—a
measure of one’s relative satisfaction or pleasure resulting from a particular action (in the above
examples, consumption of apple or orange juice and studies at various universities, respectively).
Suppose that, for a given individual, the utility from consumption of apple juice equals
uA and utility from consumption of orange juice equals uO. To get a sense of utility, I can rate,
on a 100% scale, how much I like apple juice and how much I like orange juice. Let us say I like
apple at 60% and orange at 50%. (Note that these values are relative to one another.) My utility
would be 0.6 and 0.5 for apple and orange juices, respectively. (Later in the note, we will
estimate these utilities using a linear model: Utility = a + b1 × x1 + b2 × x2 +… as a function of
certain attributes—fruit, size, packaging, price, etc.—but for now, assume that uA and uO are
known.)
A very simple choice model could say that if uA > uO, the individual chooses apple juice;
otherwise, he or she chooses orange. If uA = uO, one would be indifferent between the two. The
difference uA − uO is called the surplus. Such a model is called a deterministic utility model.
This technical note was prepared by Assistant Professor Anton Ovchinnikov. Copyright © 2011 by the University of
Virginia Darden School Foundation, Charlottesville, VA. All rights reserved. To order copies, send an e-mail to
[email protected]. No part of this publication may be reproduced, stored in a retrieval system,
used in a spreadsheet, or transmitted in any form or by any means—electronic, mechanical, photocopying,
recording, or otherwise—without the permission of the Darden School Foundation.
Random Utility
The problem with the above model is that, as long as uA > uO, the individual always
chooses apple juice, regardless of the magnitude of the surplus: The case of uO = 0.5 and
uA = 0.6 is no different from the case of uO = 0.1 and uA = 0.9. This contradicts the way people
typically behave: When surplus is small, people tend to be indifferent regarding the two juices,
but when it is large, the preference for apple is stronger. It is reasonable to suppose that, in the
former case, one might still occasionally purchase orange juice—for the sake of variety, lack of
attention to the choice, or other reasons—even though one generally prefers apple. In the latter
case, however, such instances should be much rarer.
The goal of random utility models is to capture the above behavior. Underlying such
models is the assumption that, rather than being a set number, the utility is a draw from a
particular distribution. In this case, uA and uO could be means of such distributions. If the means
of the distributions are close, then one would see a situation that resembles indifference: One
would choose apple somewhat more often but still occasionally would choose orange. If the
means are far apart, then apple juice would be chosen much more often than orange.
A Logit Model
A particular form of random utility model that has gained wide acceptance in business
analytics is a logit model (e.g., conjoint analysis, a popular marketing research methodology, is
based on a logit model). Underlying the logit model is an assumption that utilities follow a
Gumbel distribution. This distribution fits the actual choice data from numerous empirical
studies well and results in an analytically appealing form for the choice probabilities.
In particular, given the expected utilities uA and uO, the Gumbel distribution suggests
that the choice probabilities equal
Prob (Apple juice is chosen) = exp (uA) / [exp (uO) + exp (uA)]
and, correspondingly,
Prob (Orange juice is chosen) = exp (uO) / [exp (uO) + exp (uA)].
That is, if uA = 0.6 and uO = 0.5 as in the previous example, then
Prob (Apple juice is chosen) = exp (0.6) / [exp (0.5) + exp (0.6)] = 1.822 / 3.471 = 52.5%
and
Prob (Orange juice is chosen) = 47.5%.
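The random-utility interpretation can be checked directly by simulation. The short sketch below (added for illustration in Python; the sample size and seed are arbitrary choices, not from the original note) draws Gumbel-distributed utilities around uA = 0.6 and uO = 0.5 and counts how often apple juice wins; the frequency lands near the 52.5% computed above.

    import numpy as np

    rng = np.random.default_rng(0)   # fixed seed so the run is reproducible
    n = 100_000                      # number of simulated choice occasions
    u_A, u_O = 0.6, 0.5              # mean utilities from the note's example

    # Random utility: each occasion's utility is its mean plus a Gumbel draw.
    util_A = u_A + rng.gumbel(size=n)
    util_O = u_O + rng.gumbel(size=n)

    share_A = np.mean(util_A > util_O)   # fraction of occasions apple wins
    print(f"Apple chosen {share_A:.1%} of the time")   # close to 52.5%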
Further, since utility is a measure of relative satisfaction/pleasure, without loss of
generality, one can assume that either of the two utilities equals zero and rescale the other. For
example, if uO = 0, then uA = 0.6 − 0.5 = 0.1. Then, recalling that exp (0) = 1, we obtain
Prob (Apple juice is chosen) = exp (uA) / [1 + exp (uA)]
and
Prob (Orange juice is chosen) = 1 / [1 + exp (uA)].
It is easy to verify that such a substitution has no impact on the resulting choice
probabilities (i.e., with uO = 0 and uA = 0.1, the choice probabilities are still 47.5% and 52.5%).
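A minimal sketch (again illustrative, using the note's numbers) verifies this invariance:

    import math

    def logit_probs(u_A, u_O):
        """Logit choice probabilities for two alternatives."""
        e_A, e_O = math.exp(u_A), math.exp(u_O)
        return e_A / (e_A + e_O), e_O / (e_A + e_O)

    print(logit_probs(0.6, 0.5))   # (0.525..., 0.475...) original utilities
    print(logit_probs(0.1, 0.0))   # same probabilities with u_O anchored at 0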
Estimating a Logit Model: Dummy Dependent Variables, Logistic Regression, and
Maximum Likelihood Estimation
The statistical estimation of a logit model is conceptually similar to a “standard”
linear regression (hence the name, logistic regression), yet it involves substantial technical
differences. To illustrate the idea and the process, we abandon the juice example and instead
consider the following situation:
An administrator at a business school D (name disguised for confidentiality
purposes) collected data about each applicant’s GMAT score and choice of
business school D versus business school H—one of D’s closest competitors.
(The data is presented in Exhibit 1 and depicted in Figure 1.) Given a student’s
GMAT score, what can be said about that particular student’s choice?
For someone who is familiar with linear regression, a natural tendency would be to
regress the dependent variable choice (D versus H) onto the independent variable GMAT score.
But the dependent variable is categorical, not continuous, so one needs to introduce a dummy
variable, say, 1 for H and 0 for D. The result of such a linear regression is presented in Figure 1.
Figure 1. Fitting the D versus H choice data to a linear model.
Using the equation for the line in Figure 1, it is not difficult to obtain a point prediction. For example, for independent variable GMAT = 700, Figure 1 would suggest that the dependent variable choice equals 0.0116 × 700 − 7.7291 = 0.4044. But because the dependent variable is categorical (i.e., corresponds to a dummy variable that can be either 0 or 1), the prediction of 0.4044 is meaningless: the dependent variable can be only 0 or 1. In this situation, a natural desire is to interpret the 0.4044 number as a probability (in this case, given our definition of the dummy variable, the probability that an applicant chooses H). That, however, immediately leads to a problem: for GMAT = 650, for example, the predicted “probability” is −0.1754, which is negative and therefore makes no sense.
This inconsistency happens because fitting a line to a categorical dependent variable violates the linearity and homoscedasticity assumptions of linear regression. The logit model does not rely on these assumptions, so it ultimately results in better predictions.
The process of estimating a logit model is called maximum likelihood estimation (MLE). MLE is a “counterpart” to the least-squares minimization used in estimating linear regression, but it accounts for the fact that the estimated quantities are probabilities. MLE proceeds as follows:
1. Express the utility of the choice given a GMAT score as a linear model uH|GMAT = a + b × GMAT, and anchor the alternative choice at 0; that is, set uD = 0.
2. Compute the corresponding choice probabilities:
i. Prob (H is chosen given GMAT) = exp (uH|GMAT) / [1 + exp (uH|GMAT)]
ii. Prob (D is chosen given GMAT) = 1 / [1 + exp (uH|GMAT)] = 1 − Prob (H is chosen given GMAT)
3. Given these probabilities, estimate the likelihood of observing the data you have.
i. Applicant ID1 chose D, and the likelihood of that is Prob (D is chosen given
GMAT = 655) = 1 / [1+ exp (a + b × 655)].
ii. Applicant ID2 also chose D, and the likelihood of that is again Prob (D is chosen given GMAT = 660). The likelihood of ID1 and ID2 choosing what they chose is Prob (D is chosen given GMAT = 655) × Prob (D is chosen given GMAT = 660).
iii. …and so on.
4. Compute the total likelihood (a product of the likelihoods for each data point).
5. Solve an optimization problem that will maximize the total likelihood by changing
coefficients a and b (e.g., using Excel Solver).
Hint: Because the objective function (the total likelihood) involves a product of non-
linear quantities, this is obviously a highly non-linear optimization model. It can be simplified if
one considers logarithms of the likelihoods, instead of the likelihoods themselves. A helpful
property of the logarithms is that log (L1 × L2 × L3×…) = log (L1) + log (L2) + log (L3) +…
Further, log [exp (uH)] = uH = a + b × GMAT, which is a linear function of coefficients a and b.
These features make a model with log-likelihoods easier to solve.
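For readers who prefer code to a spreadsheet, the sketch below reproduces the Solver procedure in Python (an illustrative translation, not the note's original tool; the scipy Nelder-Mead optimizer and the centering of the GMAT scores are the editor's assumptions, made for numerical convenience). It maximizes the total log-likelihood over a and b using the Exhibit 1 data.

    import numpy as np
    from scipy.optimize import minimize

    # Exhibit 1 data: GMAT scores and the dummy (1 = chose H, 0 = chose D).
    gmat = np.array([655, 660, 660, 662, 662, 674, 676, 680, 680, 682,
                     683, 687, 687, 689, 692, 696, 700, 701, 703, 708,
                     708, 710, 719, 719, 725, 727, 728, 728, 731, 731,
                     737, 738, 741, 747, 747], dtype=float)
    chose_h = np.array([0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                        0, 1, 0, 0, 0, 1, 1, 0, 0, 1,
                        0, 1, 0, 1, 1, 0, 1, 1, 1, 1,
                        1, 1, 0, 1, 1], dtype=float)

    gmat_c = gmat - gmat.mean()   # centering keeps the search well behaved

    def neg_log_likelihood(params):
        a_c, b = params
        u_h = a_c + b * gmat_c   # step 1, with a centered intercept
        # Steps 2-4: log Prob(H) = u_h - log(1 + e^u_h);
        #            log Prob(D) =      -log(1 + e^u_h).
        ll = chose_h * u_h - np.logaddexp(0.0, u_h)   # stable log(1 + e^u)
        return -ll.sum()   # minimizing the negative maximizes the likelihood

    # Step 5: the Solver counterpart: a derivative-free search over a and b.
    res = minimize(neg_log_likelihood, x0=[0.0, 0.0], method="Nelder-Mead",
                   options={"xatol": 1e-8, "fatol": 1e-8, "maxiter": 5_000})
    a_c, b = res.x
    a = a_c - b * gmat.mean()   # undo the centering
    print(a, b)        # approx. -48.47 and 0.0683, matching Exhibit 2
    print(-res.fun)    # approx. -15.48, the maximized total log-likelihood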
Exhibit 2 presents a snapshot of the Solver model that performs the above procedure.
Figure 2 presents the resulting probability, obtained using a logistic regression. From Exhibit 2,
the resulting equation is
Prob (H is chosen given GMAT) = exp (−48.47 + 0.0683 × GMAT) / [1 + exp (−48.47 + 0.0683 × GMAT)].
Returning to the previous example, for GMAT = 700 the resulting probability is
Prob (H is chosen given GMAT = 700) = exp (−48.47 + 0.0683 × 700) / [1 + exp (−48.47 + 0.0683 × 700)] = 0.5258 / 1.5258 = 34.46%.
Likewise, for GMAT = 650, it equals 1.7% (Figure 2).
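These predictions are easy to reproduce. The short check below (illustrative; it uses the full-precision coefficients from Exhibit 2 rather than the rounded ones shown above) returns both probabilities:

    import math

    def prob_h(gmat_score, a=-48.471082, b=0.068326198):
        """Prob (H is chosen given GMAT) under the fitted logit model."""
        u = a + b * gmat_score
        return math.exp(u) / (1 + math.exp(u))

    print(prob_h(700))   # approx. 0.3446, i.e., 34.46%
    print(prob_h(650))   # approx. 0.017, i.e., 1.7%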
Figure 2. Fitting the D vs. H choice data to linear and logistic models.
The procedures for estimating logistic regression coefficients are embedded in many statistical packages. For example, the StatTools add-on to Excel allows one to run a binary logistic regression (the case in which there are two alternatives to choose from, as considered in this note). The resulting output is presented in Exhibit 3; not surprisingly, the coefficients are the same as in the model we obtained “manually” (i.e., with Solver).
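Outside Excel, the same binary logistic regression takes only a few lines in most statistical environments. For instance, Python's statsmodels library (shown here as an illustrative alternative to StatTools; gmat and chose_h are the Exhibit 1 arrays defined in the earlier MLE sketch) fits the identical model:

    import statsmodels.api as sm

    X = sm.add_constant(gmat)          # design matrix: intercept plus GMAT
    fit = sm.Logit(chose_h, X).fit()   # binary logistic regression via MLE
    print(fit.params)                  # approx. const -48.47, GMAT 0.0683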
Several statistical methods and software packages exist that can estimate much more complex logit models, such as the multinomial logit (MNL, the case in which there are three or more alternatives to choose from), latent class logits (when, in addition to estimating an MNL, one also wants to determine whether the respondents come from different subgroups that have different underlying preferences/utilities), the nested logit (the case in which several choices are embedded in one another, as when a customer first chooses a store from which to buy and then chooses the brand to buy), and many others.
Exhibit 1
Sample GMAT Score Data
ID GMAT Choice Dummy
1 655 D 0
2 660 D 0
3 660 D 0
4 662 D 0
5 662 D 0
6 674 D 0
7 676 D 0
8 680 D 0
9 680 D 0
10 682 D 0
11 683 D 0
12 687 H 1
13 687 D 0
14 689 D 0
15 692 D 0
16 696 H 1
17 700 H 1
18 701 D 0
19 703 D 0
20 708 H 1
21 708 D 0
22 710 H 1
23 719 D 0
24 719 H 1
25 725 H 1
26 727 D 0
27 728 H 1
28 728 H 1
29 731 H 1
30 731 H 1
31 737 H 1
32 738 H 1
33 741 D 0
34 747 H 1
35 747 H 1
Data source: Sample data created by author.
Exhibit 2
uH|GMAT = a + b × GMAT, with a = −48.471082 and b = 0.068326198
ID  GMAT  Choice  Dummy  uH        exp (uH)  Prob (H is chosen)  Likelihood  Log (Likelihood)
1   655   D       0      −3.7174   0.0243    0.0237              0.9763      −0.0240
2   660   D       0      −3.3758   0.0342    0.0331              0.9669      −0.0336
3   660   D       0      −3.3758   0.0342    0.0331              0.9669      −0.0336
4   662   D       0      −3.2391   0.0392    0.0377              0.9623      −0.0384
5   662   D       0      −3.2391   0.0392    0.0377              0.9623      −0.0384
6   674   D       0      −2.4192   0.0890    0.0817              0.9183      −0.0853
7   676   D       0      −2.2826   0.1020    0.0926              0.9074      −0.0971
8   680   D       0      −2.0093   0.1341    0.1182              0.8818      −0.1258
9   680   D       0      −2.0093   0.1341    0.1182              0.8818      −0.1258
10  682   D       0      −1.8726   0.1537    0.1332              0.8668      −0.1430
11  683   D       0      −1.8043   0.1646    0.1413              0.8587      −0.1524
12  687   H       1      −1.5310   0.2163    0.1778              0.1778      −1.7268
13  687   D       0      −1.5310   0.2163    0.1778              0.8222      −0.1958
14  689   D       0      −1.3943   0.2480    0.1987              0.8013      −0.2215
15  692   D       0      −1.1894   0.3044    0.2334              0.7666      −0.2658
16  696   H       1      −0.9160   0.4001    0.2858              0.2858      −1.2526
17  700   H       1      −0.6427   0.5258    0.3446              0.3446      −1.0653
18  701   D       0      −0.5744   0.5630    0.3602              0.6398      −0.4466
19  703   D       0      −0.4378   0.6455    0.3923              0.6077      −0.4980
20  708   H       1      −0.0961   0.9083    0.4760              0.4760      −0.7424
21  708   D       0      −0.0961   0.9083    0.4760              0.5240      −0.6462
22  710   H       1      0.0405    1.0414    0.5101              0.5101      −0.6731
23  719   D       0      0.6555    1.9260    0.6582              0.3418      −1.0736
24  719   H       1      0.6555    1.9260    0.6582              0.6582      −0.4182
25  725   H       1      1.0654    2.9020    0.7437              0.7437      −0.2961
26  727   D       0      1.2021    3.3270    0.7689              0.2311      −1.4649
27  728   H       1      1.2704    3.5622    0.7808              0.7808      −0.2474
28  728   H       1      1.2704    3.5622    0.7808              0.7808      −0.2474
29  731   H       1      1.4754    4.3726    0.8139              0.8139      −0.2060
30  731   H       1      1.4754    4.3726    0.8139              0.8139      −0.2060
31  737   H       1      1.8853    6.5885    0.8682              0.8682      −0.1413
32  738   H       1      1.9537    7.0544    0.8758              0.8758      −0.1326
33  741   D       0      2.1586    8.6593    0.8965              0.1035      −2.2679
34  747   H       1      2.5686    13.0474   0.9288              0.9288      −0.0738
35  747   H       1      2.5686    13.0474   0.9288              0.9288      −0.0738
Total                                                                        −15.4808
Please refer to UVA-QA-0779X (spreadsheet supplement) for the setup of the Solver model.
Data source: Created by author.
Exhibit 3
StatTools Report
Analysis: Logistic Regression
Performed By: XXXXXXX
Date: XXXXXXX
Updating: Static

Summary Measures
Null Deviance    47.80356733
Model Deviance   30.96154543
Improvement      16.8420219
p-Value          < 0.0001

Regression Coefficients
           Coefficient    Standard Error  Wald Value    p-Value  Lower Limit    Upper Limit    Exp(Coef)
Constant   −48.47037424   15.38526195     −3.150441923  0.0016   −78.62548765   −18.31526082   8.90398E−22
GMAT       0.068325198    0.021740921     3.142700311   0.0017   0.025712994    0.110937402    1.070713446

Classification Matrix
      1    0    Percent Correct
1     11   4    73.33%
0     3    17   85.00%

Summary Classification
Percent Correct   80.00%
Base              57.14%
Improvement       53.33%
Note: Each cell contains a comment that explains the entry. Please refer to UVA-QA-0779X.
Data source: Created by author.