Symbolic Logic: 5th Edition Overview
Symbolic Logic: 5th Edition Overview
EDITION
9
OXFORD & IBH PUBLISHING Ca
New Delhi Bombay Calcutta
First Published January 1960
This book has been published with a subsidy under the Indo-American
Text Book Programme operated by National Book Trust, India.
Quantification Theory
4.1 Singular Propositions and General
Propositions 63
4.2 Proving Validity: Preliminary Quantification
Rules 71
4.3 Proving Invalidity 78
4.4 Multiply-Ceneral Propositions 83
xil Contents
1 Logic, in Dictionary of Phibsophy and Psychology, James Mark Baldwin, ed., Macmillan
Publishing Co., Inc., New York, 1925.
2 Introduction: Logic and Language [Ch. 1
the completed reasoning process. Their question is always: Does the conclusion
that is reached follow from the premisses used or assumed? If the premisses
provide adequate grounds for accepting the conclusion, if asserting the prem¬
isses to be true warrants asserting the conclusion to be true also, then the
reasoning is correct. Otherwise, the reasoning is incorrect. Logicians’ methods
and techniques have been developed primarily for the purpose of making this
distinction clear. Logicians are interested in all reasoning, regardless of its
subject matter, but only from this special point of view.
tive logic. In this book, however, we shall be concerned only with deductive
arguments, and shall use the word ‘argument’ to refer to deductive arguments
exclusively.
An argument may contain false propositions exclusively and still be valid, as,
for example,
This argument is valid because if its premisses were true, its conclusion would
have to be true also, even though, in fact, they are all false. These two examples
show that although some valid arguments have true conclusions, not all of them
do. The validity of an argument does not, therefore, guarantee the truth of its
conclusion.
When we consider the argument
we can see that although both premisses and conclusion are true, the argument
is invalid. Its invalidity becomes obvious when it is compared with another
argument of the same form:
2Some logicians use the term ‘valid’ to characterize statements that are logically true. We shall
adopt that usage in Chapter 10, Section 10.6. Until then, however, we apply the terms ‘valid’ and
‘invalid’ to arguments exclusively.
Sec. 1.4] Symbolic Logic 5
If Rockefeller is President, then Rockefeller is famous.
Rockefeller is not President.
Therefore, Rockefeller is not famous.
This argument is clearly invalid because its premisses are true but its conclu¬
sion is false. The two latter examples show that although some invalid argu¬
ments have false conclusions, not all of them do. The falsehood of its conclusion
does not guarantee the invalidity of an argument. But the falsehood of its
conclusion does guarantee that either the argument is invalid or at least one of
its premisses is false.
An argument must satisfy two conditions to establish the truth of its conclu¬
sion. It must be valid, and all of its premisses must be true. Such an argument is
termed ‘sound’. To determine the truth or falsehood of premisses is the task of
scientific inquiry in general, since premisses may deal with any subject matter
at all. But determining the validity or invalidity of arguments is the special
province of deductive logic. The logician is interested in the question of
validity even for arguments that might be unsound because their premisses
might happen to be false.
A question might be raised about the legitimacy of that interest. It might be
suggested that logicians should confine their attention to arguments that have
true premisses only. It is often necessary, however, to depend upon the validity
of arguments whose premisses are not known to be true. Scientists test their
theories by deducing from them conclusions that predict the behavior of
observable phenomena in the laboratory or observatory. The conclusion is then
tested directly by observation of experimental data and if it is true, the results
confirm the theory from which the conclusion was deduced. If the conclusion is
false, the results disconfirm or refute the theory. In either case, the scientist is
vitally interested in the validity of the argument by which the testable con¬
clusion is deduced from the theory being investigated, for if that argument is
invalid, his whole procedure is without point. Although an oversimplification
of scientific method, our example shows that questions of validity are important
even for arguments whose premisses are not true.
It has been explained that logic is concerned with arguments and that
arguments contain propositions or statements as their premisses and conclu¬
sions. These premisses and conclusions are not linguistic entities, such as
declarative sentences, but are, rather, what declarative sentences are typically
uttered to assert. The communication of propositions and arguments, however,
requires the use of language, and this complicates our problem. Arguments
formulated in English or any other natural language are often difficult to
appraise because of the vague and equivocal nature of the w'ords in which they
6 Introduction: Logic and Language [Ch. 1
are expressed, the ambiguity of their construction, the misleading idioms they
may contain, and their pleasing but deceptive metaphorical style. The resolu¬
tion of these difficulties is not the central problem for the logician, however, for
even when they are resolved, the problem of deciding the validity or invalidity
of the argument still remains.
To avoid the peripheral difficulties connected with ordinary language,
workers in the various sciences have developed specialized technical vocabu¬
laries. The scientist economizes the space and time required for writing his
reports and theories by adopting special symbols to express ideas that would
otherwise require a long sequence of familiar words to formulate. This has the
further advantage of reducing the amount of attention needed, for when a
sentence or equation grows too long, its meaning is more difficult to grasp. The
introduction of the exponent symbol in mathematics permits the expression of
the equation
AxAxAxAxAxAxAxAxAxAxAxA
= BXBXBXBXBXB XB
A12 = B7
A like advantage has been obtained by the use of graphic formulas in organic
chemistry. The language of every advanced science has been enriched by
similar symbolic innovations.
A special technical notation has been developed for logic as well. Aristotle
made use of certain abbreviations to facilitate his own investigations. Modern
symbolic logic augmented this base by the introduction of many more special
symbols. The difference between the old and the new logic is one of degree
rather than of kind, but the difference in degree is tremendous. Modern
symbolic logic has become immeasurably more powerful a tool for analysis and
deduction through the development of its own technical language. The special
symbols of modern logic permit us to exhibit with greater clarity the logical
structures of arguments that may be obscured by formulation in ordinary
language. It is easier to divide arguments into the valid and the invalid when
they are expressed in a special symbolic language, for with symbols the
peripheral problems of vagueness, ambiguity, idiom, metaphor, and amphiboly
do not arise. The introduction and use of special symbols serve not only to
facilitate the appraisal of arguments, but also to clarify the nature of deductive
inference.
The logician’s special symbols are much better adapted to the actual
drawing of inferences than is ordinary language. Their superiority in this
respect is comparable to that of Arabic over the older Roman numerals for
purposes of computation. It is easy to multiply 148 by 47, but very difficult to
compute the product of CXLVIII and XLVII. Similarly, the drawing of
Sec. 1.4] Symbolic Logic 7
inferences and the evaluation of arguments is greatly facilitated by the adop¬
tion of a special logical notation. To quote Alfred North Whitehead, an
important contributor to the advance of symbolic logic:
Although this book treats symbolic logic systematically rather than histori¬
cally, a few historical remarks may be appropriate at this point. Since the
1840s, symbolic logic has developed along two different historical paths. One
of them began with the English mathematician George Boole (1815-1864).
Boole applied algebraic notations and methods first to symbolize and then to
validate arguments of the kind studied by Aristotle in the fourth century b.c.
This route may be characterized as an effort to apply mathematical notations
and methods to traditional, nonmathematical kinds of arguments. The other
path began with the independent efforts of the English mathematician Augus¬
tus De Morgan (1806-1871) and the American scientist and philosopher
Charles Peirce (1839-1914) to devise a very precise notation for relational
arguments. The earlier logic had largely ignored this type of argument, which,
nevertheless, plays a central role in mathematics. This historical path may be
characterized as an effort to create a new quasi-mathcmatical kind of logical
notation and analytical technique for use in mathematical derivations and
demonstrations.
These two historical paths coalesced in the brilliant works of the German
mathematician and philosopher Gottlob Frege (1848-1925), the Italian math¬
ematician Guiseppe Peano (1858-1932), and the English philosophers Alfred
North Whitehead (1861-1947) and Bertrand Bussell (1872-1970), whose
Principia Mathematica was an important landmark in the history of symbolic
logic. Some of Boole’s contributions are reported in the first two sections of
Chapter 7 and in Appendix B. The contributions of the others have become so
thoroughly incorporated into modern symbolic logic that only occasional
references to their more distinctive ideas are appropriate.
All statements can he divided into two kinds, simple and compound. A
simple statement is one that does not contain any other statement as a compo¬
nent part, whereas every compound statement does contain another statement
as a component part. For example, ‘Atmospheric testing of nuclear weapons
will be discontinued or this planet will become uninhabitable’ is a compound
statement that contains, as its components, the two simple statements ‘Atmos¬
pheric testing of nuclear weapons will be discontinued’ and ‘this planet will
become uninhabitable’. The component parts of a compound statement may
themselves be compound, of course.
As the term ‘component’ is used in logic, not every statement that is part of
a larger statement is a component of the larger statement. For example, the last
six words of the statement, The third wife of Bertrand Russell was a beautiful
girl’, can be regarded as a statement in its own right. But it is not a component
part or a component of the larger statement containing it. For a part of a
statement to be a component of a larger statement, two conditions must be
satisfied. First, the part must be a statement in its own right; and second, if the
part is replaced in the larger statement by any other statement, the result of
that replacement must be meaningful. Although the first condition is satisfied in
the example given, the second is not. For if the part ‘Bertrand Russell was a
beautiful girl’ is replaced by the statement ‘Where there’s smoke, there’s fire’,
the result of that replacement is arrant nonsense.1
The statement ‘Roses are red and violets are blue’ is a conjunction, a
compound statement formed by inserting the word ‘and’ between two state¬
ments. Two statements so combined are called conjuncts. The word ‘and’ has
other uses, however, as in the statement ‘Castor and Pollux were twins', which
is not compound, but a simple statement asserting a relationship. We introduce
the dot *•’ as a special symbol for combining statements conjunctively. Using
lrThis account of compound and component statements has been suggested by Proiessor
('. Mason Myers of Northern Illinois University, Professor Alex Blum of Bar-llan University, and
Professor James A. Martin of the University" yf Wyoming. See Martin’s ‘How Not to Define
Truth-Functionality,' Logique et Analyse, 1970, 14' Anncc, N° 52, pp. 476-482.
Sec. 2.1] Simple and Compound Statements 9
this notation, the preceding conjunction is written ‘Roses are red-violets are
blue’. Where p and q are any two statements whatever, their conjunction is
written p*q.
Every statement is either true or false, so we can speak of the truth value of
a statement, where the truth value of a true statement is true and the truth
value of a false statement is false. There are two broad categories into which
compound statements can be divided, according to whether or not there is
anything other than the truth values of its component statements that deter¬
mines the truth value of the compound statement. The truth value of the
conjunction of two statements is completely determined by the truth value of
its conjuncts. A conjunction is true if both its conjunct!* are true, but false
otherwise. For this reason a conjunction is a truth-functional compound
statement, and its conjuncts are truth-functional components of it.
Not every compound statement is truth-functional. For example, the
truth-value of the compound statement ‘Percival Lowell believed that Mars is
inhabited’ is not in any way determined by the truth value of its component
simple statement ‘Mars is inhabited’. This is so because it could be true that
Percival Lowell believed that Mars is inhabited, regardless of whether it is
inhabited or not. A person can believe one truth without believing another, and
a person can believe one falsehood without believing all of them. So the
component ‘Mars is inhabited’ is not a truth-functional component of the
compound statement ‘Percival Lowell believed that Mars is inhabited,’ and
the latter statement is not a truth-functional compound statement.
We define an occurrence of a component of a compound statement to be a
truth-functional component of that compound statement provided that: If
that occurrence of the component is replaced in the compound (or in any
component of the compound statement which contains the component in
question) by different statements that have the same truth value as each other,
the different compound statements produced by these replacements will also
have the same truth values as each other. And now a compound statement is
defined to be a truth-functional compound statement, if all of its components
are truth-functional components of the statement.2
The only compound statements we shall consider here will be truth-func-
tionally compound statements. Therefore, in the rest of this book we shall use
the term “simple statement” to refer to any statement that is not truth-func-
tionally compound.
Since conjunctions are truth-functionally compound statements, our dot
symbol is a truth-functional connective. Given any two statements p and q
there are just four possible sets of truth values they can have, and in every case
the truth value of their conjunction pmq is uniquely determined. The four
possible cases can be exhibited as follows:
2Compare this definition with the one proposed by Professor David II. Sanford in his ‘What Is
Truth Functional Component?’ Lo^iquc ct Analyse. 1970, I t*' Annce, N° 52, pp. 483-486.
10 Arguments Containing Compound Statements [Ch. 2
Representing the truth values true and false by the capital letters T and ‘F\
respectively, the way in which the truth value of a conjunction is determined
by the truth values of its conjuncts can be displayed more briefly by means of
a truth table as follows:
V_P'cl
T T T
T F F
F T F
F F F
Since it specifics the truth value of p*q in every possible case, this truth table
can be taken as defining the dot symbol. Other English words such as ‘more¬
over’, ‘furthermore’, ‘but’, ‘yet’, ‘still’, ‘however’, ‘also’, ‘nevertheless’, ‘al¬
though’, and so forth, and even the comma and the semicolon, are also used to
conjoin two statements into a single compound one, and all of them can be
indifferently translated into the dot symbol, so far as truth values are con¬
cerned.
The statement ‘It is not the case that lead is heavier than gold’ is also
compound, being the negation (or denial or contradictory) of its single com¬
ponent statement ‘lead is heavier than gold’. We introduce the symbol
called a curl (or a tilde) to symbolize negation. There are often alternative
formulations in English of the negation of a given statement. Thus where L
symbolizes the statement ‘lead is heavier than gold’, the different statements ‘it
is not the case that lead is heavier than gold’, ‘it is false that lead is heavier than
gold’, ‘it is not true that lead is heavier than gold’, ‘lead is not heavier than
gold’ are all indifferently symbolized as — L. More generally, where p is any
statement whatever, its negation is written ~~p. Since the negation of a true
statement is false and the negation of a false statement is true, we can take the
following truth table as defining the curl symbol:
P_
T “ F
F T
When two statements are combined disjunctively by inserting the word ‘or’
between them, the resulting compound statement is a disjunction (or alterna¬
tion), and the two statements so combined are called disjuncts (or alternatives).
The word ‘or’ has two different senses’, one of which is clearly intended in the
Sec. 2.1] Simple and Compound Statements 11
p_(j_pyji
f T T
TFT
F T T
F F F
The United Nations will become more responsible or there will be a third
world war.
The United Nations will not become more responsible.
Therefore, there will be a third world war.
where it is explicitly stated that the disjunction is exclusive, by use of the added
phrase ‘but not both’, for example, we do have the symbolic apparatus for
expressing that sense, as will be explained below.
The use of parentheses, brackets, and braces for punctuating mathematical
expressions is familiar. In the absence of a special convention, no number is
uniquely denoted by the expression ‘6 + 9 -f- 3\ although when punctuation
makes clear how its constituents are to be grouped, it denotes either 5 or 9.
Punctuation is needed to resolve ambiguity in the language of symbolic logic
too, since compound statements may themselves be combined to produce more
complicated compounds. Ambiguity is present in paq v r, which could be
either the conjunction of p with q v r, or else the disjunction of paq with r.
These two different senses are unambiguously given by different punctuations:
pa(q v r) and (paq) v r. In case p and q are both false and r is true, the first
punctuated expression is false (since its first conjunct is false), but the second
punctuated expression is true (since its second disjunct is true). Here, a differ¬
ence in punctuation makes all the difference between truth and falsehood. In
symbolic logic, as in mathematics, we use parentheses, brackets, and braces for
punctuation. To cut down on the number of punctuation marks required,
however, we establish the symbolic convention that in any expression the curl
will apply to the smallest component that the punctuation permits. Thus the
ambiguity of — p v </, which might mean either (~~p) v q or ~~(p v q), is
resolved, by our convention, to mean the first of these. The curl can, and
therefore by our convention does, apply to the first component p rather than lo
the larger expression p v (/.
The word ‘either’ has a variety of different uses in English. It has conjunctive
force in The Disjunctive Syllogism is valid on either interpretation of the word
‘or’/ It frequently serves merely to introduce the first disjunct in a.disjunction,
as in ‘Either the United Nations will become more responsible or there will be
a third world war’. Perhaps the most useful function of the word ‘either’ is to
punctuate some compound statements. Thus the sentence
More stringent antipollution measures will be enacted and the laws will
be strictly enforced or thejjuality of life will be degraded still further.
can have its ambiguity resolved in one direction by placing the word ‘cither’ at
its beginning, or in the other direction by inserting the word ‘either’ right after
the word ‘and’. Such punctuation is effected in our symbolic language by
parentheses. The ambiguous formula paqv r discussed in the preceding para¬
graph corresponds to the ambiguous sentence considered in this one. The two
different punctuations of the formula correspond to the two different punctu¬
ations of the sentence effected by the two different insertions of the word
‘either’.
Not all conjunctions are formulated by explicitly placing the word ‘and’
between complete sentences, as in ‘Charlie’s neat and Charlie’s sweet’. Indeed,
the latter would more naturally be expressed as ‘Charlie’s neat and sweet’. The
Sec. 2.1] Simple and Compound Statements 13
familiar ‘Jack and Jill went up the hill’ is the more natural way of expressing the
conjunction ‘Jack went up the hill and Jill went up the hill’. It is the same with
disjunctions: ‘Either Alice or Betty will be elected’ expresses more briefly the
proposition alternatively formulated as ‘Either Alice will be elected or Betty
will be elected’; and ‘Charlene will be either secretary or treasurer’ expresses
somewhat more briefly the same proposition as ‘Either Charlene will be
secretary or Charlene will be treasurer’.
The negation of a disjunction is often expressed by using the phrase ‘nei-
ther-nor’. Thus the disjunction ‘Either Alice or Betty will be elected’ is denied
by the statement ‘Neither Alice nor Betty will be elected’. The disjunction
would be symbolized as A v B and its negation as either —(A v B) or as
(— A)'(^B). (The logical equivalence of these two formulas will be discussed
in Section 2.4.) To deny that at least one of two statements is true is to assert
that both of the two statements are false.
The word ‘both’ serves various functions. One is simply a matter of emphasis.
To say ‘Both Jack and Jill went up the hill’ is only to emphasize that the two of
them did what they are asserted to have done by saying ‘Jack and Jill went up
the hill'. A more useful function of the word ‘both’ is punctuational, like that of
‘either’. ‘Both .. . and -• are not-’ is used to make the same statement as
‘Neither . . . nor • - is.In such sentences, the order of the words ‘both' and
‘not’ is very significant. There is a great difference between
and
EXERCISES3
I. If A and B are true statements and X and Y are false statements, which of the
following compound statements are true?
II. If A and B are known to he true, and X and Y are known to be false, but the
truth values of P and Q are not known, of which of the following statements can
you determine the truth value?
III. Using the letters A, B, C, and D to abbreviate the simple statements: ‘Atlanta
wins their conference championship’, ‘Baltimore wins their conference champi¬
onship’, ‘Chicago wins the superbowl’, and ‘Dallas wins the superbowl’, symbol¬
ize the following:
*1. Either Atlanta wins their conference championship and Baltimore wins their
conference championship or Chicago wins the superbowl.
2. Atlanta wins their conference championship and^either Baltimore wins theif
conference championship or Dallas does not win the superbowl.
3. Atlanta and Baltimore will hotboth win their conference championships but
Chicago and Dallas will both not win the superbowl.
4. Either Atlanta or Baltimore will win their conference championships but
neither Chicago nor Dallas will win the superbowl.
*5. Either Chicago or Dallas will win the superbowl but they will not both win
the superbowl.
6. Chicago will win the superbowl unless Atlanta wins their conference cham¬
pionship.
7. It is not the case that neither Atlanta nor Baltimore wins their Conference
championship.
8. Either Chicago or Dallas will fail to win the superbowl.
9. Either Chicago or Dallas will win the superbowl unless both Atlanta and
Baltimore win their conference championships.
10. Either Chicago will win the superbowl and Dallas will not win the superbowl
or both Atlanta and Baltimore will win their conference championships.
IV. Using capital letters to abbreviate simple statements, symbolize the following:
*1. The words of his mouth were smoother than butter, but war was in his heart.
(Psalm 55:21)
2. Promotion cometh neither from the east, nor from the west, nor yet from the
south. (Psalm 75:6)
3. As for man, his days are as grass: as a flower of the field, so he Hourisheth.
(Psalm 103:15)
4. Wine is a mocker, strong drink is raging. (Proverbs 20:1)
*5. God hath made man upright; but they have sought out many inventions
(Ecclesiastes 7:29)
6. The race is not to the swift, nor the battle to the strong... (Ecclesiastes 9:11)
7. Love is strong as death; jealousy is cruel as the grave. (The Song of Solomon
8:6)
8. A bruised reed shall he not break, and the smoking flax shall he not quench.
(Isaiah 42:3)
9. Saul and Jonathan were lovely and pleasant in their lives ... (2 Samuel 1:23)
10. His eye was not dim, nor his natural force abated. (Deuteronomy 34:7)
11. The voice is Jacob’s voice, but the hands are the hands of Esau. (Genesis
27:22)
12. He shall return no more to his house, neither shall his place know him any
more. (Job 7:10)
16 Arguments Containing Compound Statements [Ch. 2
The compound statement Tf the train is late, then we shall miss our con¬
nection* is a conditional (or a hypothetical, an implication, or an implicative
statement). The component between the ‘if’ and the ‘then* is called the
antecedent (or the implicans or protasis), and the component that follows the
Then* is the consequent (or the implicate or apodosis). A conditional does not
state either that its antecedent is true or that its consequent is true; it says only
that if its antecedent is true, then its consequent is true also, that is, its
antecedent implies its consequent. The key to the meaning of a conditional is
the relation of implication that is asserted to hold between its antecedent and
its consequent, in that order.
If we examine a number of different conditionals, we can see that there are
different kinds of implications which they may express. In the conditional ‘If all
cats like liver and Dinah is a cat, then Dinah likes liver’, the consequent follows
logically from its antecedent. On the other hand, in the conditional ‘if the
figure is a triangle, then the figure has three sides’, the consequent follows from
the antecedent by the very definition of the word ‘triangle’. The truth of the
conditional ‘If this piece of gold is placed in aqua regia, then this piece of gold
dissolves’ is not a matter of either logic or definition. The connection asserted
here is causal and must be discovered empirically. These examples show that
there are different kinds of implications that constitute different senses of the
‘if-then’ phrase. Having noted these differences, we now seek to find some
identifiable common meaning, some partial meaning that is common to these
different types of conditionals.
Our discussion of ‘if-then’ will parallel our previous discussion of the word
‘or’. First, we pointed out two different senses of that word. Second, we noted
that there was a common partial meaning: That at least one disjunct is true was
seen to be involved in both the inclusive and the exclusive ‘or’. Third, we
introduced the special symbol V to represent this common partial meaning
(which was the whole meaning of ‘or’ in its inclusive sease). Fourth, we
observed that because arguments like the Disjunctive Syllogism are valid on
either interpretation of the word ‘oi^, symbolizing any occurrence of the word
‘or’ by the wedge symbol, preserves the validity of such arguments. And
because we are interested in arguments from the point of view of determining
their validity, this translation of the word ‘or’ into V, which may abstract from
or ignore part of its meaning in some cases, is wholly adequate for our present
purposes.
A common partial meaning of these different kinds of conditional statements
emerges when we ask what circumstances would suffice to establish the
falsehood of a conditional. Under what circumstances would we agree that the
conditional Tf this piece of gold is placed in this solution, then this piece of
gold dissolves’ is false? Clearly, the statement is false in the case that this piece
of gold is actually placed in this solution and does not dissolve. Any conditional
Sec. 2.2] Conditional Statements 17
with a true antecedent and a false consequent must be false. Hence any
conditional if p, then q is known to be false in case the conjunction pa~~q is
known to be true, that is, in case the antecedent of the conditional is true and
its consequent false. For the conditional to be true, the indicated conjunction
must be false, which means that the negation of that conjunction must be true.
In other words, for any conditional if p, then q to be true, ~~(p'~~q), the
negation of the conjunction of its antecedent with the negation of its conse¬
quent, must be true also. We may, then, regard the latter as a part of the
meaning of the former.
We introduce the new symbol ‘D’, called a horseshoe, to represent the
partial meaning that is common to all conditional statements. We define
‘p D */’ as an abbreviation for (p* ~~q)\ The horseshoe is a truth-functional
connective, whose exact significance is indicated by the following truth table:
P_ q ~q p—q p2JI
T T F F T T
T F T T F F
F T F F T T
F F T F T T
Here, the first two columns represent all possible truth values for the compo¬
nent statements p and q, and the third, fourth, and fifth represent successive
stages in determining the truth value of the compound statement — (p* — q) in
each case. The sixth column is identically the same as the fifth, since the
formulas that head them are defined to express the same proposition. The
horseshoe symbol must not be thought of as representing the meaning of
‘if-then’, or the relation of implication, but rather symbolizes a common partial
factor of the various different kinds of implications signified by the ‘if-then’
phrase.
We can regard the horseshoe as symbolizing a special, extremely weak kind
of implication. It is expedient for us to do so, since convenient ways to read
‘p D q’ are ‘if p, then q\ ‘p implies q\ or ‘p only if q. The weak implication
symbolized by ‘ D’ is called a material implication. Its special name indicates
that it is a special concept and not to be confused with the other, more usual
kinds of implication. Some conditional statements in English do express merely
material implications, as for example ‘If Russia is a democracy, then I’m a
Dutchman’. It is clear that the implication expressed here is neither logical,
definitional, nor causal. No ‘real connection’ is alleged to hold between what
the antecedent states and what is stated by the consequent. This sort of
conditional is ordinarily intended as an emphatic or humorous method of
denying the truth of its antecedent, for it typically contains a notoriously or
ridiculously false statement as its consequent. Any such assertion about truth
values is adequately symbolized by using the truth-functional connective 4 D *.
Although most conditional statements express more than a merely material
18 Arguments Containing Compound Statements [Ch. 2
EXERCISES
I. If A and B are true statements and X and Y are false statements, which of the
following compound statements are true?
II. If A and B are known to be true, and X and Y are known to be false, but the truth
values of P and are not known, of which of the following statements can you
determine the truth values?
III. Symbolizing ‘Amherst wins its first game’ as A, ‘Colgate wins its first game* as C,
and ‘Dartmouth wins its first game’ as D, symbolize the following compound
statements:
Sfi£u24L Argument Forms and Truth Tables 19
*1. Both Amherst and Colgate win their first games only if Dartmouth does not
win its first game.
2. Amherst wins its first game if either Colgate wins its first game or Dartmouth
wins its first game.
3. If Amherst wins its first game, then both Colgate and Dartmouth win their
first games.
4. If Amherst wins its first game, then either Colgate or Dartmouth wins its first
game.
*5. If Amherst does not win its first game, then it is not the case that either
Colgate or Dartmouth wins its first game.
6. If it is not the case that both Amherst and Colgate win their first game, then
both Colgate and Dartmouth win their first games. *
7. If Amherst wins its first game, then not both Colgate and Dartmouth win
their first games.
8. If Amherst does not win its first game, then both Colgate and Dartmouth do
not win their first games.
9. Hither Amherst wins its first game and Colgate does not win its first game or,
if Colgate wins its first game, then Dartmouth does not win its first game.
*10. If Amherst wins its first game, then Colgate does not win its first game, but if
Colgate does not win its first game, then Dartmouth wins its first game.
11. If Amherst wins its first game, then, if Colgate does not win its first game,
then Dartmouth wins its first game.
12. Hither Amherst and Colgate win their first games or it is not the case that if
Colgate wins its first game, then Dartmouth wins it first game.
13. Amherst wins its first game only if either Colgate or Dartmouth does not win
its first game.
14. If Amherst wins its first game only if Colgate wins its first game, then
Dartmouth does not win it its first game.
15. If Amherst and Colgate both do not win their first games, then Amherst and
Colgate do, not both win their first games.
which is obviously invalid, since its premisses are true but its conclusion false.
Any argument is proved to be invalid if another argument of exactly the same
form can be constructed with true premisses and a false conclusion. This
reflects the fact that validity and invalidity are purely formal characteristics of
arguments: any two arguments having the same form are either both valid or
both invalid, regardless of any differences in their subject matter.4 The concept
of two arguments having exactly the same form is one that deserves further
examination.
It is convenient, in discussing forms of arguments, to use small letters from
the middle part of the alphabet, ‘p\ 7/’, V, V,... as statement variables, which
are defined simply to be letters for which, or in place of which, statements may
be substituted. Now, we define an argument form to be any array of symbols
that contains statement variables, such that when statements are substituted for
the statement variables—the same statement being substituted for every
occurrence of the same statement variable throughout—the result is an argu¬
ment. For definiteness, we establish the convention that in any argument form,
‘p’ shall be the first statement variable that occurs in it, 7/* shall be the second,
V the third, and so on.
Any argument which results from the substitution of statements for the
statement variables of an argument form is said to have that form, or to be a
substitution instance of that argument form. If we symbolize the simple
statement ‘The United Nations will become more responsible* as U, and the
simple statement ‘There will be a third world war’ as W, then the Disjunctive
Syllogism presented earlier can be symbolized as
UvW (1)
.*. w
pvq (2)
-p
..q
4 Here we assume that the simple* statements involved are neither logically true (e.g., ‘All
equilateral triangles are triangles') nor logically false (e.g., ‘Some triangles are nontriangles'). We
assume also that the only logical relations among the simple statements involved arc those
asserted or entailed by the premisses. The point of these restrictions is to limit our considerations
in Chapters 2 and 3 to truth-functional arguments alone, and to exclude other kinds of arguments
whose validity turns on more complex logical considerations to be introduced in Chapters 4 and 5.
Sec. 2.3] Argument Forms and Truth Tables 21
from which it results by substituting the statements U and W for the statement
variables p and </, respectively. But that is not the only form of which it is a
substitution instance. The same argument is obtained by substituting the
statements U vW, and W for the statement variables p, </, and r, respec¬
tively, in the argument form
V (3)
(/
r
We define the specific form of a given argument as that argument form from
which the argument results by substituting a different simple statement for
each distinct statement variable. Thus the specific form of the argument (1) is
the argument form (2). Although the argument form (3) is a form of the
argument (1), it is not the specific form of it. The technique of refutation by-
logical analogy can now be described more precisely. If the specific form of a
given argument can be shown to have any substitution instance with true
premisses and false conclusion, then the given argument is invalid.
The terms ‘valid’ and ‘invalid’ can be extended to apply to argument forms as
well as arguments. An invalid argument form is one that has at least one
substitution instance with true premisses and a false conclusion. The technique
of refutation by logical analogy presupposes that any argument of which the
specific form is an invalid argument form, is an invalid argument. Any argu¬
ment form is valid that is not invalid: So a valid argument form is one that has
no substitution instance with true premisses and false conclusion. Any given
argument can be proved valid if it can be shown that the specific form of the
given argument is a valid argument form.
To determine the validity or invalidity of an argument form, we must
examine all possible substitution instances of it to see if any of them have true
premisses and false conclusions. The arguments with which we are here
concerned contain only simple statements and truth-functional compounds of
them. We are interested only in the truth values of their premisses and
conclusions. We can obtain all possible substitution instances whose premisses
and conclusions have different truth values, by considering all possible ar¬
rangements of truth values for the statements that are substituted for the
distinct statement variables in the argument form to be tested. Possible
substitution instances can be set forth most conveniently in a truth table, with
an initial or guide column for each distinct statement variable that appears in
the argument form. Thus to prove the validity of the Disjunctive Syllogism
form
pvq
—p
:.q
22 Arguments Containing Compound Statements [Ch. 2
P q pyq .~p
T T T F
TFT F
F T T T
F F F T
Each row of this table represents a whole class of substitution instances. The
T’s and F’s in the two initial columns represent the truth values of statements
that can be substituted for the variables p and q in the argument form. These
determine the truth values in the other columns, the third of which is headed
by the first ‘premiss’ of the argument form, and the fourth by the second
‘premiss’. The second column’s heading is the ‘conclusion’ of the argument
form. An examination of this truth table reveals that whatever statements are
substituted for the variables p and </, the resulting argument cannot have true
premisses and a false conclusion. This is so because the third row represents the
only possible case in which both premisses are true, and there the conclusion is
true also.
Because truth tables provide a purely mechanical or effective method of
deciding the validity or invalidity of any argument of the general type here
considered, we can now justify our proposal to symbolize all conditional
statements by means of the truth-functional connective ‘ D ’. The justification
for treating all implications as though they were mere material implications is
that valid arguments containing conditional statements remain valid when
those conditionals are interpreted as expressing material implications only. The
three simplest and most intuitively valid forms of argument involving condi¬
tional statements are
.*. -p
and the
That they all remain valid when tliejir‘conditionals are interpreted as expressing
material implications is easily established by truth tables. The validity of
Sec. 2.3] Argument Forms and Truth Tables 23
Modus Ponens is shown by the same truth table that defines the horseshoe
symbol:
P_q P -> (I
T T T
T F F
FT T
F F T
Here the two premisses are represented by the third and first columns, and the
conclusion, by the second. Only the first row represents substitution instances
in which both premisses are true, and in that row the conclusion is true also.
The validity of Modus Pollens is shown by the truth table:
p_q_p D q _21!_ZR
T T T F F
T F F T F
FIT FT
F F T T T
Here only the fourth row represents substitution instances in which both
premisses (the third and fourth columns) are true, and there the conclusion (the
fifth column) is true also. Since the Hypothetical Syllogism form contains three
distinct statement variables, the truth table for it must have three initial
columns and will require eight rows for listing all possible substitution in¬
stances:
p_ '/ r p D q (j D r p J r
T T T T T T
T T F T F F
T F T F T T
T F F F T F
F T T T T T
F T F T F T
F F T T T T
F F F T T T
In constructing it, the three initial columns represent all possible arrangements
of truth values for the statements substituted for the statement variables p, c/,
and r. The fourth column is filled in by reference to the first and second, the
fifth by reference to the second and third, and the sixth by reference to the first
and third. The premisses are both true only in the first, fifth, seventh, and
eighth rows, and in these rows the conclusion is true also. This suffices to
demonstrate that the Hypothetical Syllogism remains valid when its con-
24 Arguments Containing Compound Statements [Ch. 2
ditionals are symbolized by means of the horseshoe symbol. Any doubts that
remain about the claim that valid arguments containing conditionals remain
valid when their conditionals are interpreted as expressing merely material
implication, can be allayed by the reader’s providing, symbolizing, and testing
his own examples by means of truth tables.
To test the validity of an argument form by a truth table requires a table
with a separate initial or guide column for each different statement variable,
and a separate row for every possible assignment of truth values to the
statement variables involved. Hence, testing an argument form containing n
distinct statement variables, requires a truth table having 2n rows. In con¬
structing truth tables, it is convenient to fix upon some uniform pattern for
inscribing the T’s and F’s in their initial or guide columns. In this book we shall
follow the practice of simply alternating T’s and F’s down the extreme
right-hand initial column and alternating pairs of T’s with pairs of F’s down the
column directly to its left. Next, we shall alternate quadruples of T’s with
quadruples of F’s,. .., and finally, we shall fill in the top half of the extreme
left-hand initial column with T’s and its bottom half with F’s.
There are two invalid argument forms that bear a superficial resemblance to
the valid argument forms Modus Ponens and Modus Pollens. These are
pDq pDq
q and —p
.'.p
and are known as the Fallacies of Affirming the Consequent and of Denying the
Antecedent, respectively. The invalidity of both can be shown by a single truth
table:
p q pD q ~~p -</
T T T F F
T F F F T
F T T T F
F F T T T
The two premisses in the Fallacy of Affirming the Consequent head the second
and third columns, and are true in both the first and third rows. But the
conclusion, which heads the first column, is false in the third row—which
shows that the argument form does have a substitution instance with true
premisses and a false conclusion, and is therefore invalid. Columns three and
four are headed by the two premisses in the Fallacy of Denying the Anteced¬
ent, which are true in both the third and fourth rows. Its conclusion heads the
fifth column and is false in the third row—which shows that the second
argument form is invalid also.
It must be emphasized that althpiigh a valid argument form has only valid
arguments as substitution instances, an invalid argument form can have both
Sec. 2.3] Argument Forms end Truth Tobies 25
EXERCISES
I. For each of the following arguments indicate which, if any, of the argument forms
in Exercise 11 below have the given argument as a substitution instance, and
indicate which, if any, is the specific form of the given argument:
*a. A f. M 3 (N 3 O) k. (A D B)-(C D D)
.'. A vB O 3 ~~\1 A vC
O 3 -N WBvD
b. (>D g. (P 3 QHR 3 S) I. (EDF)-(GDII)
:.c :.p 3 q ^Fv ^G
-iiv
c. E 3 (F-G) h. r 3 u m. 1 D J
-(F-G) 3 *E .'. (T 3 U) v ( V’T) D7) 3(1 37)
d. H i. W 3 X n. A' 3 (A 3 M)
1 . .X 3 (W 3 X) K 3 A
.'.H-I .’.A 3 M
*e. JD(K-L) *j. yv(z*—Y) o. /V 3 (N 3 0)
Jv(K-L) y A7 3 N
.’. K-L ~(2—n /V 3 O
II. Use truth tables to determine the validity or invalidity of each of the following
argument forms:
• • ~P pDq Mpvq)
:.pDr ~(p*q)
26 Arguments Containing Compound Statements [Ch. 2
III. Use truth tables to determine the validity or invalidity of each of the following
arguments:
*1. If Alice is elected class president, then either Betty is elected vice-president
or Carol is elected treasurer. Betty is elected vice-president. Therefore, if
Alice is elected class-president, then Carol is not elected treasurer.
2. If Alice is elected class president, then either Betty is elected vice-president
or Carol is elected treasurer. Carol is not elected treasurer. Therefore, if
Betty is not elected vice-president, then Alice is not elected class president.
3. If Alice is elected class president, then Betty is elected vice-president and
Carol is elected treasurer. Betty is not elected vice-president. Therefore,
Alice is not elected class president.
4. If AJice is elected class president, then if Betty is elected vice-president,
then Carol is elected treasurer. Betty is not elected vice-president. There¬
fore, either Alice is elected class president or Carol is elected treasurer.
*5. If the seed catalog is correct, then if the seeds are planted in April, then the
flowers bloom in July. The flowers do not bloom in July. Therefore, if the
seeds are planted in April, then the seed catalog is not correct.
6. If the seed catalog is correct, then if the seeds are planted in April, then the
flowers bloom in July. The flowers bloom in July. Therefore, if the seed
catalog is correct, then the seeds are planted in April.
7. If the seed catalog is correct, then if the seeds are planted in April, then the
flowers bloom in July. The seeds are planted in April. Therefore, if the
flowers do not bloom in July, then the seed catalog is not correct.
8. If the seed catalog is correct, then if the seeds are planted in April, then tin*
flowers bloom in July. The flowers do not bloom in July. Therefore, if the
seeds are not planted in April, then the seed catalog is not correct.
9. If Ed wins first prize, then Fred wins second prize, and if Fred wins second
prize, then George is disappointed. Either Ed wins first prize or George is
disappointed. Therefore, Fred docs not win second prize. .
*10. If Ed wins first prize, then either Fred wins second prize or George is
disappointed. Fred does not win second prize. Therefore, if George is
disappointed, then Ed does not win first prize.
11. If Ed wins first prize, then Fred wins second prize, and if Fred wins second
prize, then George is disappointed. Either Fred does not win second prize or
George is not disappointed. Therefore, Ed does not win first prize.
12. If Ed wins first prize, then Pred wins second prize, and if Fred wins second
prize, then George is disappointed. Either Ed does not win first prize or Fred
does not win second prize. Therefore, either Fred does not win second prize
or George is not disappointed.
13. If the weather is warm and the sky is clear, then we go swimming and we go
boating. It is not the case that if the sky is clear, then we go swimming.
Therefore, the weather is not warm.
14. If the weather is warm and the sky is clear, then cither we go swimming or
we go boating. It is not the case that if the sky is clear, then we go
swimming. Therefore, if we do riot go boating, then the weather is not warm.
*15. If the weather is warm and the sky is clear, then either we go swimming or
we go boating. It is not the case’that if we do not go swimming, then the sky
is not clear. Therefore, either the weather is warm or we go boating.
Sec. 2.4] Statement Forms 27
p_21}_E12:E
T F T
F T T
That there are only T’s in the column headed by the statement form in question
shows that all of its substitution instances are true. Any statement that is a
substitution instance of a tautologous statement form is formally true and is,
itself, said to be tautologous or a tautology.
Similarly, although the statements ‘Cortez discovered the Pacific’ (C) and
‘Cortez discovered the Pacific and Cortez did not discover the Pacific’ (C*
are both false, we discover their falsehood in quite different ways. The first
28 Arguments Containing Compound Statements [Ch. 2
simply happens to be false, and that must be learned empirically; the second is
necessarily false, and that can be known independently of empirical investiga¬
tion. The statement Cm~~C is formally false, a ‘.substitution instance of a
statement form, all of whose substitution instances are false. One statement is
said to contradict, or to be a contradiction of, another statement when it is
logically impossible for them both to be true. In this sense, contradiction is a
relation between statements. But there is another, related sense of that term.
When it is logically impossible for a particular statement to be true, that
statement itself is said to be self-contradictory, or a self-contradiction. Such
statements are also said more simply to be contradictory or contradictions. We
shall follow the latter usage here. A statement form that has only false substi¬
tution instances is said to be contradictory or a contradiction, and the same
terms are applied to its substitution instances. The statement form p • — p is
proved a contradiction by the fact that in its truth table only F's occur in the
column that it heads.
Statements and statement forms that are neither tautologous nor contradic¬
tory are said to be contingent or contingencies. For example, p, —p, p v ry, p‘(j,
and pD(/ are contingent statement forms; and B, C, —C, B*C, Bv C are
contingent statements. The term is appropriate, since their truth values are not
formally determined but are dependent or contingent upon what happens to be
the case.
It is easily proved that p D (q D p) and ~^p D (p D q) are tautologies.
When expressed in English as ‘A true statement is implied by any statement
whatever', and as ‘A false statement implies any statement whatever, they
seem rather strange. They have been called by some writers the paradoxes of
material implication. When it is kept in mind that the horseshoe symbol is a
truth-functional connective that stands for material implication rather than
either ‘implication in general’ or more usual kinds of implications such as
logical or causal, then the tautologous statement forms in question are not at all
surprising. And when the misleading English formulations are corrected by
inserting the word ‘materially’ before ‘implied’ and ‘implies’, then the air of
paradox vanishes. Material implication is a special, technical concept that the
logician introduces and uses because it simplifies his task of discriminating valid
from invalid arguments.
Two statements are said to be materially equivalent when they have the
same truth value, and we symbolize the statement that they are materially
equivalent by inserting the symbol ‘=’ between them. Being a truth-functional
connective, the three-bar symbol is defined by the following truth table:
P <1 P = <1
T T T
T . F F
F T F
F F T
Sec. 2.4] Statement Forms 29
To say that two statements are materially equivalent, is to say that they
materially imply each other, as is easily verified by a truth table. Hence the
three-bar symbol may be read ‘if and only if . A statement or statement form of
the pattern p == q is called a biconditional. Two statements are said to be
logically equivalent when the biconditional that expresses their material
equivalence is a tautology. The ‘principle of Double Negation’, expressed as
p = ~~~~p, is proved to be tautologous by a truth table.
There are two logical equivalences that express important interrelations of
conjunctions, disjunctions, and negations. Because a conjunction asserts that
both its conjuncts are true, its negation need assert only that at least one is
false. Thus negating the conjunction p*q amounts to asserting the disjunction
of the negations of p and q. This statement of equivalence is symbolized as
~~(p*<j) EH (~~p v — (f), and proved to be a tautology by the following truth
table:
Similarly, because a disjunction asserts merely that at least one disjunct is true,
to negate it is to assert that both are false. Negating the disjunction pvq
amounts to asserting the conjunction of the negations of p and q. It is symbol¬
ized as ~~(p v (/) ^ p • — q) and is easily proved tautologous by a truth table.
These two equivalences are known as De Morgan’s 'Fheorems, after the English
mathematician-logician Augustus De Morgan (1806-1871) and can be stated
compendiously in English as:
pvq
-p
.'.q
pDq
<1
• *P
(1) (2) (3) (4) (5) (6) (7) (8) (9) (10)
Here columns (2), (4), (7), (10) are the initial or guide columns. Column (3) is
filled in by reference to columns^) and (4), and column (1) by reference to
column (3). Column (6) is filled in by reference to column (7), column (9) is
filled in by reference to column (10), and then column (8) by reference to
columns (6) and (9). Finally, column (5) is filled in by reference to columns (1)
and (8). That its main connective has only T’s under it in the truth table
establishes that the statement form being tested is a tautology.
EXERCISES
*1- PD ~~P 3. p D (p D p)
4. (p Dp) Dp
*
2. (p D —p)-(—p D p)
Statement Forms 31
Sec. 2.4]
8. (-“P’tfHfl 3 P)
*5. p 3 (P'P)
9. [(p D q) 3 <f] 3 9
6. (p-ffP P
10. [(p 3 </) 3 Pi D P
7.
n use truth tables to decide which of the following are logical equivalences:
6. [pv(qT)]=[(pvq)T]
*1. (p 3 q) = (~P D ~(/)
7. [p v (</-r)] — [(p v9)*(pvr)]
2. (p D<j) = (~q 3 ~p)
8. (p = <?) = [(p,<?)v(~P‘~‘J)l
3. [(p-q)Dr]=[p3(q 3r)l
9. p = [p-(p 3<f)]
4. [pD(qDr)l=[(p3</P»']
‘5. [p-(qvr)] = [(p*<j)v(p*r)l 10. p = lp*(9°P)]
The Method of Deduction
Av (B D D)
D (D D E)
ADC
~~C
.\B D R
32
Sec. 3.1] Formal Proof of Validity 33
1. Av(BDD)
2. D(DD E)
3. A D C
4. -C /.'.BDE
5. -A 3, 4, M.T.
6. BDD 1,5, D.S.
7. DDE 2, 4, M.P.
8. BDE ,
6 7, H.S.
—CD(DD E)
.’. D D E
34 The Method of Deduction [Ch. 3
pD </
p
:.q
i. . .-A.
RULES OF INFERENCE
These nine Rules of Inference are elementary valid argument forms, whose
validity is easily established by truth tables. They can be used to construct
formal proofs of validity for a wide range of more complicated arguments. The
names listed are standard for the most part, and the use of their abbreviations
permits formal proofs to be set down with a minimum of writing.
EXERCISES
I. For each of the following arguments, state the Rule of Inference by which its
conclusion follows from its premiss or premisses:
Sec. 3.1] Formal Proof of Validity 35
11. Each of the following is a formal proof of validity for the indicated argument.
State the ‘justification’ for each line that is not a premiss:
3. 1. K 3 L
9. ^v-iSd/)
2. MD N *5. 1. (-Xv -y) 3 [A 3 (P—^)]
3. (0 3 A1)-(F 3 L) 2. (-X--R) 3 [(F—p) 3 Z]
36 The Method of Deduction [Ch. 3
111. Construct a formal proof of validity for each of the following arguments
*1. A 3 B 4. A7 3 2V
C D D iV 3 O
(-Bv-D)-(-Av^) (A7 3 O) D(N DP)
-Av-C (M DP) DQ
■■Q
v/ 2. F 3 (F- —G) *5. (fl D ~S)"(r 3 ~t/)
(FvG) 3 1/ (V D ~W)*(X D ~Y)
E (7’ D W)-(l/ D S)
Vvrt
—7'v ~(/
3. 7 3 X 6. A 3 (B-C)
7v(Xv-Z.) ~A D [(D 3 E)-(F 3 G)]
-X (B*C) v [(~A 3 D)-(~A 3 F)]
.\-/,-~~X
..EvC,
Sec. 3.1] Formal Proof of Validity 37
7. (~/f v/) 3 (/ 3 K) 9. V D W
(~~L—M) 3 (K 3 iV) xdy
(H 3 L) • (L 3 //) Z D W
—O X D /I
:.] d n W D X
[(V D Y)-(ZDA)} D (VvZ)
.'. Y v A
8. (P 3 Q)-(R 3 S) 10. (BvC) D (Dv/{)
(() D T)-(S D U) [(/)v/i)vf’J D (G v //)
(C vi/) D - D
—T K D ~~c;
IV. Construct a formal proof of validity for each of the following arguments, using the
abbreviations suggested:
*1. If cither algebra is required or geometry is required, then all students will
study mathematics. Algebra is required and trigonometry is required.
Therefore, all students will study mathematics. (A: Algebra is required.
C: Geometry is required. S: All students will study mathematics. T: Trigo¬
nometry is required.)
2. Either Smith attended the meeting or Smith was not invited to the meeting.
If the directors wanted Smith at the meeting, then Smith was invited to the
meeting. Smith did not attend the meeting. If the directors did not want
Smith at the meeting and Smith was not invited to the meeting, then Smith
is on his way out of the company. Therefore, Smith is on his way out of the
company. (A: Smith attended the meeting. /: Smith was invited to the
meeting. D< The directors wanted Smith at the meeting. W: Smith is on his
way out of the company.)
3. If a scarcity of commodities develops, then prices rise. If there is a change of
administration, then fiscal controls will not be continued. If die threat of
inflation persists, then fiscal controls will be continued. If there is overpro¬
duction, then prices do not rise. Either there is overproduction or there is a
change of administration. Therefore, either a scarcity of commodities does
not develop or the threat of inflation does not persist. fS: A scarcity of
commodities develops. P: Prices rise. C: There is a change of administration.
F: Fiscal controls will be continued. 1: The threat of inflation persists.
O: There is overproduction.)
4. If the investigation continues, then new evidence is brought to light. If new
evidence is brought to light, then several leading citizens are implicated. If
several leading citizens are implicated, then the newspapers stop publiciz¬
ing the case. If continuation of the investigation implies that the news¬
papers slop publicizing the case, then the bringing to ligfit of new evidence
implies that the investigation continues. The investigation does not con¬
tinue. Therefore, new evidence is not brought to light. (C: The investigation
continues. N: New evidence is brought to light. /: Several leading citizens
are implicated. S: The newspapers stop publicizing the case.)
[Ch. 3
*5. If the king does not castle and the pawn advances, then either the bishop is
blocked or the rook is pinned. If the king does not castle, then if the bishop
is blocked, then the game is a draw. Either the king castles or if the rook is
pinned, then the exchange is lost. The king'does not castle and the pawn
advances. Therefore, either the game is a draw or the exchange is lost.
(K: The king castles. P: The pawn advances. B: The bishop is blocked.
R: The rook is pinned. D: The game is a draw. E: The exchange is lost.)
6. If Andrews is present, then Brown is present, and if Brown is present, then
Cohen is not present. If Cohen is present, then Davis is not present. If Brown
is present, then Emerson is present. If Davis is not present, then Farley is
present. Either Emerson is not present or Farley is not present. Therefore,
either Andrews is not present or Cohen is not present. (A: Andrews is
present. B: Brown is present. C: Cohen is present. D: Davis is present.
E: Emerson is present. F: Farley is present.)
7. If either George enrolls or Harry enrolls, then Ira does not enroll. Either Ir$
enrolls or Harry enrolls. If either Harry enrolls or George does not enroll,
then Jim enrolls. George enrolls. Therefore, either Jim enrolls or Harry does
not enroll. (GT: George enrolls. H: Harry enrolls. I: Ira enrolls. /: Jim enrolls.)
8. If Tom received the message, then Tom took the plane, but if Tom did not
take the plane, then Tom missed the meeting. If Tom missed the meeting,
then Dave was elected to the board, but if Dave was elected to the board,
then Tom received the message. If either Tom did not miss the meeting or
Tom did not receive the message, then either Tom did not take the plane or
Dave was not elected to the board. Tom did not miss the meeting. Therefore,
either Tom did not receive the message or Tom did not miss the meeting.
(R: Tom received the message. P: Tom took the plane. M: Tom missed the
meeting. D: Dave was elected to the board.)
9. If Dick was recently vaccinated, then he has a fever. Either Dick was
recently vaccinated, or if pocks begin to appear then Dick must be quaran¬
tined. Either Dick has measles or if a rash develops, then there are compli¬
cations. If Dick has measles, then he has a fever. If Dick was not recently
vaccinated and Dick does not have measles, then either a rash develops or
pocks begin to appear. Dick does not have a fever. Therefore, either there
are complications or Dick must be quarantined. (V: Dick was recently
vaccinated. F: Dick has a fever. P: Pocks begin to appear. £>: Dick must be
quarantined. M: Dick has measles. R: A rash develops. C: There are
complications.) **
*10. Either taxes are increased or if expenditures rise, then the debt ceiling is
raised. If taxes are increased, then the cost of collecting taxes increases. If a
rise in expenditures implies that the government borrows more money, then
if the debt ceiling is raised, then interest rates increase. If taxes are not
increased and the cost of collecting taxes does not increase, then if the debt
ceiling is raised, then the government borrows more money. The cost of
collecting taxes does not increase. Either interest rates do not increase or the
government does not borrow more money. Therefore, either the debt ceiling
is not raised or expenditures do not rise. (T: Taxes are increased. E: Expen¬
ditures rise. D: The debt, ceiling is raised. C: The cost of collecting taxes
increases. G: The government borrows more money. I: Interest rates
increase.)
Sec. 3.2] The Rule of Replacement 39
A-R
B
*It will be stated more formally in an appropriate context and demonstrated in Chapter 8.
40 The Method of Deduction [Ch. 3
A formal proof of validity for tlie argument given on page 39 can now Ire
written:
1. A-B /.\fl
2. B#A 1, Com.
3. B 2, Simp.
Some argument forms, although very elementary and perfectly valid, are not
included in our list of nineteen Rules of Inference. Although the argument
A-B
B
p-q
is not included in our list. Hence B does not follow from A-B by any single
elementary valid argument form, as defined by our list. It can be deduced,
however, by using two elementary valid arguments, as shown above. We could
add the intuitively valid argument form
- p-q
to our list, of course, but if we expanded our list in this way we might end up
with a list that was too long and, therefore, unmanageable.
The list of Rules of Inference contains several redundancies. For example.
Modus Tollens could be dropped from our list without any real weakening of
the machinery. This is so because any step deduced by its use could be deduced
by using other Rules of the list instead. Thus in our first proof, on page 33, line
5, ^A, which was deduced from lines 3 and 4, A D C and by Modus
Tollens, could have been deduced without it, since D follows from
A D Cby Transposition, and ^A from D —A and —Cby Modus Ponens.
Sec. 3.2] The Rule of Replacement 41
the last ten Rules of Inference can be applied either to whole lines or to parts
of lines. Not only can the statement A D (B D C) he inferred from the whole
line(A*B) D Cby Exportation, but from the line [(4* B) D C] v D we can infer
[A D (B D C)]v D by Exportation. The Rule of Replacement authorizes
specified logically equivalent expressions to replace each other wherever they
occur, even where they do not constitute whole lines of a proof. But the first
nine Rules of Inference can be used only with whole lines of a proof serving as
premisses.
There are two very important differences between substitution and re¬
placement. First, in substituting statements for statement variables in an
argument form, any statement variable can have any statement substituted for
it. In replacement, on the other hand, a statement can be replaced only by a
statement logically equivalent to it, where the logical equivalence of the two
statements is given in Rules 10 through 19. Second, in substitution, every
occurrence of a statement variable in an argument form must have the same
statement substituted for it. In replacement, however, one (or several) occur¬
rences of a statement may be replaced by a logically equivalent statement,
without other occurrences of that statement having to be replaced also.
In the absence of mechanical rules for the construction of formal proofs of
validity, some rules of thumb or hints on procedure may be suggested. The first
is simply to begin deducing conclusions from the premisses by the given Rules
of Inference. As more and more of these subconclusions become available as
premisses for further deductions, one is more likely to see how to deduce the
conclusion of the argument that is to be proved valid.
Another hint is to try to eliminate statements that occur in the premisses, but
not in the conclusion. Such elimination can proceed only in accordance with
the Rules of Inference. The Rules contain many techniques for eliminating
statements, however. Simplification is one such rule: By its application, the
right-hand conjunct of a conjunction can be dropped—provided that the
conjunction is a whole line in the proof. By Commutation, the left-hand
conjunct can be switched over to the right-hand side, from which location it
can be dropped by Simplification. The ‘middle’ term q can be eliminated by a
Hypothetical Syllogism, given two premisses or subconclusions of the patterns
p D q and q D r. Distribution is a useful rule for transforming a disjunction of
th§ form p v (</-r) into the conjunction (p v q) «(p v r), the conjuncts of which
can be eliminated, either by Simplification alone, or by Commutation followed
by Simplification. Another rule of thumb is to introduce, by Addition, a
statement that occurs in the conclusion but not in the premisses. Another
method is to work backward from the conclusion by looking for some state¬
ment or pair of statements from which the conclusion could be deduced by one
of the Rules of Inference. Then those intermediate statements are deduced
either from the premisses or from other intermediate statements, and so on,
until some that are derivable .from the premisses are found. A judicious
combination of these methods is often the best way to proceed. Practice, of
course, is the best way to acquire facility in using the method of deduction.
Sec. 3.2] The Rule of Replecement 43
EXERCISES
I. For each of the following arguments, state the Rule of Inference by which its
conclusion follows from its premiss:
*1. (~A 3 B)-(C v ~D) *5. [P 3 (() v R)] v [P 3 (() v H)|
• • (~A 3 7i)*(~D v C) :.P D(()vR)
2. (~Fv7-’)-(Gv -H) 6. lS-(7’-in| 3 (V = -W)
(E 3 F)’(C v ~~ll) f(S•■/’)•(/] D(V=~~W)
.3. (7 3 ~/) v (~K 3 -/,) 7. [.V-(Y-Z)] 3 (A = ~«)
.'.(ID ~/) v (L 3 K) .’. X 3 [(Y-Z) 3 (A ~ -~B)]
4. M 3 ~(/Vv ~0) 8. (C—/3) 3 (/•' = ~F)
M 3 (-—AT- -O) ~~D) 3 [(/-> —-/•") v -F)]
9. (C;'v//)•(/v/)
[(Gvi/)-i]v[(Cvii)-;|
*10. (K-F) 3 {Af-I(:V-0)-/>|}
3 {m-[(cmv)-pj}
11. ~{gv ~[(H-~S)-(7’v '-(')]}
—{<;> V [—(K-~S) V ~(7’v -</)]}
II. Each of the following is a formal proof of validity for the indicated argument.
State the ‘justification’ for each line that is not a premiss:
IV. Construct a formal proof of validity for each of the following arguments, using the
suggested notation in each case.
46 The Method of Deduction [Ch. 3
*1. If I study, I make good grades. If I do not study, I enjoy myself. Therefore,
either I make good grades or I enjoy myself. (S, G, E)
2. If the supply of silver remains constant and the use of silver increases, then
the price of silver rises. If an increase in the use of silver implies that the
price of silver rises, then there will be a windfall for speculators. The supply
of silver remains constant. Therefore, there will be a windfall for specula¬
tors. (S, (7, P, W)
3. Either Adams is elected chairman or both Brown and Clark are elected to
the board. If either Adams is elected chairman or Brown is elected to the
board, then Davis will lodge a protest. Therefore, either Clark is elected to
the board or Davis will lodge a protest. (A, 8, C, D)
4. If he uses good bait, then if the fish are biting, then he catches the legal
limit. He uses good bait, but he does not catch the legal limit. Therefore, the
fish are not biting. (G, ft, C)
*5. Either the governor and the lieutenant-governor will botkrun for reelection
or the primary race will be wide open and the part^,will be torn by
dissension. The governor will not run for reelection. Therefore, the party
will be torn by dissension. (G, L, W, T)
6. If the Dodgers win the pennant, then they will win the Series. Therefore, if
the Dodgers win the pennant, then if they continue to hit, then they will win
the Series. (P, S, H) „
7. If he attracts the farm vote, then he will carry the rural areas, and if he
attracts the labor vote, then he will carry the urban centers. If he carries
both the urban centers and the rural areas, then he is certain to be elected.
He is not certain to be elected. Therefore, either he does not attract the farmf
vote or he does not attract the labor vote. (F, ft, L, (7, C)
8. Either Argentina does not join the alliance or Brazil boycotts it, but if
Argentina joins the alliance, then Chile boycotts it. If Brazil boycotts the
alliance, then if Chile boycotts it, then Ecuador will boycott it. Therefore, if
Argentina joins the alliance then Ecuador will boycott it. (A, ft, C, E)
9. If Argentina joins the alliance, then both Brazil and Chile will boycott it. If
either Brazil or Chile boycotts the alliance, then the alliance will be
ineffective..Therefore, if Argentina joins the alliance, then the alliance will
be ineffective. (A, ft, C, 1)
*10. Steve took either the bus or the train. If he took the bus or drove his own car,
then he arrived late and missed the first session. He did not arrive late.
Therefore, he took the ttain. (ft, T, C, L,M)
11. If you enroll in the course and study hard, then you will pass, but if you
enroll in the course and do not study hard, then you will not pass. Therefore,
if you enroll in the course, then either you study hard and pass or you do not
study hard and do not pass. (E, S, P)
12. If Argentina joins the alliance, then either Brazil or Chile will boycott it. If
Brazil boycotts the alliance, then Chile will boycott it, also. Therefore, if
Argentina joins the alliance, then Chile will boycott it. (A, ft, C)
13. If either Argentina or Brazil joins the alliance, then both Chile and Ecuador
will boycott it. Therefore, if Argentina joins the alliance, then Chile will
boycott it. (A, ft, C, E\
14. If prices fall or wages rise, Ithen both retail sales and advertising activities
increase. If retail sales increase, then jobbers make more money, but jobbers
do not make more money. Therefore, prices do not fall. (P, W, ft, A, ])
*15. If I work, then I earn money, but if I am idle, then I enjoy myself. Either I
work or I am idle. However, if I work, then I do not enjoy myself, while if
I am idle, then I do nbt earn money. Therefore, 1 enjoy myself if and only if
I do not earn money. (W, M, /, E)
16. If you enter the primary, then if you campaign vigorously, then you win the
nomination. If you win the nomination and receive the support of the party
regulars, then you will be elected. If you take the party platform seriously,
then you will receive the support of the party regulars but will not be
elected. Therefore, if you enter the primary, then if you campaign vigor¬
ously, then you do not take the party platform serioilsly. (P, C, JV, ft, E„T)
17. Either the tariff is lowered or imports continue to decrease and our own
industries prosper. If the tariff is lowered, then our own industries prosper.
Therefore, our own industries prosper. (Ty /, O)
18. Either he has his old car repaired or he buys a new car. If he has his ojd car
repaired, then he will owe a lot of money to the garage. If he owes a lot of
money to the garage, then he will not soon be out of debt. If he buys a new
car, then he must borrow money from the bank, and if he must borrow
money from the bank, then he will not soon be out of debt. Either he will
soon be out of debt or his creditors will force him into bankruptcy. There¬
fore, his creditors will force him into bankruptcy, (ft, zV, G, S, ft, C)
19. If she goes on a picnic, then she wears sport clothes. If she wears sport
clothes, then she does not attend both the banquet and the dance. If she does
not attend the banquet, then she still has her ticket, but she does not still
have her ticket. She does attend the dance. Therefore, she does not go on a
picnic. (P, S, ft, D, T)
20. If she studies the sciences, then she prepares to earn a good living, and if she
studies the humanities, then she prepares to live a good life. If she prepares
to earn a good living or she prepares to live a good life, then her college
years are well spent. But her college years are not well spent. Therefore, she
does not study either the sciences or the humanities. (S, E, //, L, C)
*21. If you plant tulips, then your garden will bloom early, and if you plant
asters, then your garden will bloom late. So if you plant either tulips or
asters, then your garden will bloom either early or late. (!\ E, A, L)
22. If you plant tulips, then ymir garden will bloom early, and if you plant
asters, then your garden will bloom late. So if you plant both tulips and
asters, then your garden will bloom both early and late. (P, E, A, L)
23. If we go to Europe, then we tour Scandinavia. If we go to Europe, then if
we tour Scandinavia, then we visit Norway. If we tour Scandinavia, then if
we visit Norway, then we shall take a trip on a fiord. Therefore, if we go to
Europe, then we sh^ll take a trip on a fiord. (E, S, N, F)
24. If Argentina joins the alliance, then either Brazil or Chile boycotts it. If
Ecuador joins the alliance, then either Chile or Peru boycotts it. Chile does
not boycott it. Therefore, if neither Brazil nor Peru boycotts it, then neither
Argentina nor Ecuador joins the alliance. (A, ft, C, E, P)
25. If either Argentina or Brazil joins the alliance, then if either Chile or
Ecuador boycotts it, then although Peru does not boycott it, Venezuela
[Ch. 3
boycotts it. If either Peru or Nicaragua does not boycott it, the Uruguay
will join the alliance, 'flierefore, if Argentina joins the alliance, then if Chile
boycotts it, then Uruguay will join the alliance. {A, B, C, E, P, V, N, U)
26. If either Saudi Arabia raises the price of oil or Saudi Arabia does not raise
the price of oil, then America will be in grave difficulties. Therefore,
America will be in grave difficulties. (S, A)
If the Senator votes against this bill, then he is opposed to more severe
penalties against tax evaders.
If the Senator is a tax evader himself, then he is opposed to more severe
penalties against tax evaders.
Therefore, if the Senator votes against this bill, he is a tax evader himself,
V D O
H D O
.'. V D II
Instead of constructing a truth table for the specific form of this argument, we
can prove its invalidity by making an assignment of truth values to the
component simple statements V, O, and II, which will make the premisses true
and the conclusion false. The conclusion is made false by assigning T to V and
F to H; and both premisses are made true by assigning T to O. This method of
proving invalidity is closely related to the truth table method. In effect, making
the indicated truth value assignment amounts to describing one row of the
relevant truth table—a row that suffices to establish the invalidity of the
argument being tested. The relationship appears more clearly, perhaps, when
the truth value assignments are written out horizontally:
V O H VO O HDO VDH
. — . ... . ... ... .i . . . —. —
T T F T T F
Sec. 3.4] The Rule of Conditions Proof 49
This new method of proving invalidity is shorter than writing out a complete
truth table. The amount of time and work saved is proportionally greater for
more complicated arguments. In proving the invalidity of more extended
arguments, a certain amount of trial and error may be needed to discover a
truth value assignment that works, i But even so, this method is quicker and
easier than writing out the entire truth table. It is obvious that the present
method will suffice to prove the invalidity of any argument that can be shown
to be invalid by a truth table.
EXERCISES
Prove the invalidity of each of the following arguments by the method of assigning
truth values: ..
► *1. ADD / 3 (K 3 L) *5. u
C3D K ~J (-/. 3 M) U== (V-W)
BvC (L v M) 3 N Vf (7’ v A‘)
/.AvD 3 N Tv X
T-X
2. E 3 (Fv G) 4. (O v P) 3 {) 6. X ^ (V 3 Z)
G 3 (Ihl) Q 3 (Tv R) y ■=(-*—/)
O 3 (~~S 3 P) Ze(Xv-V)
E 3 / (S 3 O) 3 -Zt y
..F=Q . ’. X v Z
7. (ADH)-(C 3 D) 9. P=K)^-/0
A vC 3 (-/iv -S)
(«vD) 3 (E-F) [« D(()v-r.)]-(lJD9)
E 3 (F 3 G) [G 3 (S-T)l-(i 3 -V)
G 3 (A 3 //) [«>•«) -f/|-[i/D(yvii)]
// (()vVJ—V
8. / v (V'K)
(ivy) 3 (/.= -*#) X = (Z 3 Y)
(L 3 —Af) 3 y = (z=-A)
(/V 3 0)-(0 3 -V/) Z nE (A 3 fl)
(/ 3 K) 3 O A ~ (Z* = Z)
.'.O Z* v - W
U ~ B
Next, we introduce a new rule to use in the method of deduction: the rule of
Conditional Proof. In this section, the new rule will be applied only to
arguments whose conclusions are conditional statements. The new rule can best
be explained and justified by reference to the principle of Exportation and the
correspondence, noted in Chapter 2, between valid argument forms and
tautologies.
50 The Method of Deduction [Ch. 3
To every argument there corresponds a conditional statement whose ante¬
cedent is the conjunction of the argument’s premisses and whose consequent is
the argument’s conclusion. As has been remarked, ,an argument is valid if and
only if its corresponding conditional is a tautology. If an argument has a
conditional statement for its conclusion, which we may symbolize as A D C,
then if we symbolize the conjunction of its premisses as P2 the argument is
valid if and only if the conditional
p => (A D C) (1)
(PmA) D C (2)
But (2) is the conditional associated with a somewhat different argument. This
.second argument has, as its premisses, all of the premisses of the first argument
plus an additional premiss that is the antecedent of the conclusion of the first
argument. The conclusion of the second argument is the consequent of the
conclusion of the first argument. Now, if we deduce the conclusion of the
second argument, C, from the premisses conjoined in P*A by a sequence of
elementary valid arguments, we thereby prove that its associated conditional
statement (2) is a tautology. But since (1) and (2) are logically equivalent, this
fact proves that (1) is a tautology also, from which it follows that the original
argument—with one less premiss and the conditional conclusion A D C—is
valid also. Now, the rule of Conditional Proof permits us to infer the validity of
any argument
P
.'.A D C
P
A
.*.C
2For definiteness, we specify that the conjunction has its conjuncts associated to the left, that
is, if the premisses are Plt F2, and P3, the conjunction is (fVP2)*P3. If there is a fourth premiss, P4,
the conjunction is [(Pj'PJ’Isl’P* an^ so on-
Sac. 3.4] The Rule of CondNtonal Proof Sf
additional premiss, and then deducing the consequent of its conclusion by a
sequence of elementary valid arguments. Thus a conditional proof of validity
for the argument
(AvB)D (C-D)
(DvE) D F
.’.A D F
may be written as
1. (AvB)D (C-D)
2. (DvE) 3 F /.’.ADF
3. A /.' .F (C.P.)
4. AvB 3, Add.
5. C-D 1, 4, M.P.
6. D-C 5, Com.
7. D 6, Simp.
8. DvE 7, Add.
9. F 2, 8, M.P.
Here, the second slant line and three-dot ‘therefore’ symbol, as well as the
parenthesized ‘C.P.’, indicate that the rule of Conditional Proof is being used.
Since the rule of Conditional Proof can be used in dealing with any valid
argument having a conditional statement as conclusion, it can be applied more
than once in the course of the same deduction. Thus a conditional proof of
validity for
AD(BD C)
B D (C 3 D)
.’.A D(BDD)
A D (B D C)
B D (C D D)
A
B D D
and since the latter, itself, has a conditional conclusion, it can be given a
conditional proof by proving the validity of
AD(BDC)
BD (CD D)
A
B
:.d
52 The Method of Deduction [Ch. 3
Each use of the conditional method should be signaled by an additional slant
line and ‘therefore’ sign, in addition to the notation ‘(C.P.)’. The suggested
proof would be written:
1. AD (BDC)
2. B D (C D D) //.A D(BDD)
3. A /:.BDD (C.P.)
4. B /.-.D (C.P.)
5. BDC 1, 3, M.P.
6. C 5, 4, M.P.
7. CD D 2, 4, M.P.
8. D 7, 6, M.P.
AD B
.’.AD( A-B)
cannot be proved valid using only the original list of nineteen Rules of
Inference. Rut it is easily proved valid using the new rule of Conditional Proof.
Its conditional proof of validity is
EXERCISES
Give conditional proofs of validity for Exercises *21, 22, 23, 24, and 25 on pages
47-48.
The method of indirect proof, often called the method of proof by reductio
ad ahsurdutn, is familiar to all w'ho have studied elementary geometry. In
deriving his theorems, Euclid.often begins by assuming the opposite of what
A D (B-C)
(B v D) D E
D\A
.*. e
1. AD (B-C)
2. (B v D) D E
3. DvA /.'.£
4. ~~E I.P. (Indirect Proof)
5. ~~(B v D) 2, 4, M.T.
6. 5, De M.
7. 6, Com.
8. 7, Simp.
9. A 3, 8, D.S.
10. B-C 1, 9, M.P.
11. B 10, Simp.
12. —B 6, Simp.
13. B-~~B ’ll, 12, Conj.
A
.*. B v (B D C)
4 In this context, one statement is irrelevant to another if there is no simple statement that is
a component of both.
cannot be proved valid by the methods set forth in the preceding sections, its
validity is easily established by using the rule of Indirect Proof. One proof of
its validity is this:
1. A A’ . B v (B D
2. ~[B v (BD C)] I.P.
3. ~[Bv (~B'/C)] 2, Impl.
4. ' ~B) vC] 3, Assoc.
5. ~(B v -*)• ~c 4, De M.
6. ~(B v 5, Simp.
7. ~B-~B 6, De M.
Our nineteen Rules of Inference, plus the rules of Conditional and Indirect
Proof, provide us with a method of deduction that is complete. Any argument
whose validity can be established by the use of truth tables can be proved valid
by the method of deduction as set forth in Sections 3.1, 3.2, 3.4 and 3.5. This
will not be proved, however, until the end of Chapter 8.
EXERCISES
For each of the following arguments, construct both a formal proof of validity and
an indirect proof and compare their lengths:
*3. (H D !)•(/ D K)
(I v K) DL
.*.-(//v ])
The conditional and indp^ct methods of proof can be used not only^ to
establish the validity of^rfguments, but also to prove that certain statements
and statement forarrfare tautologies’. Any conditional statement corresponds, in
a sense, to an argument whose single premiss is the antecedent of the condi¬
tional, and whose conclusion is the conditional’s consequent. The conditional is
Sec. 3.6] Proofs of Tautologies 55
a tautology if and only if that argument is valid. Hence a conditional is proved
tautologous by deducing its consequent from its antecedent by a sequence of
elementary valid arguments. Thus the statement (A*B) DA is proved tau¬
tologous by the same sequence of lines that proves the validity of the argument
A-B
.'. A
It has already been noted that the conditional method can be used repeatedly
in a single proof. Thus the conditional statement
“T&t. Use the method of indirect proof to verify that the following are tautologies:
*1. (A D B) v (A D —B) 6. A v (A D B)
2. (A D B) v (—A D B) 7. P=-B
3. (A D B)v(B D A) * 8. A = [A*(A v £)]
4. (ADB)v(BDC) j* 9. A = [A v (A-B)]
*5. (A D B) v (~A D C) 10. -[(A D -A)-(-A D A)]
AD B
..A D A-B)
Here, the fifth line is inferred not from any one or two of the preceding lines,
but from the sequence of lines 2, 3, 4, which constitutes a valid deduction of
line 4 from lines 1 and 2. In lino 5, we infer the validity of the argument
/\ D B
/.A D (A-B)
A D B
A
.’.A-B
1. ADB A D (A*B)
P> 2. A assumption
3. B 1, 2, M.P.
4. A*B 2, 3, Conj.
5. A D (A'B) 2-4, C.P.
1. (AvB)D[(CvD)DE] /. A D [(C-D) D E]
i->2. A
3. AvB 2, Add.
4. (CvD)DE 1,3, M.P.
r-5. C-D
6. Q 5, Simp.
7. CvD Q, Add.
8. E_ 4, 7, M.P.
9. (C-D) D E 5-8, C.P.
10. A D [(C-D) D E] 2-9, C.P.
Sec. 3.7] The Strengthened Rule of Conditional Proof 59
In this proof, lines 2 through 9 lie within the scope of the first assumption,
while lines 5, 6, 7, and 8 lie within the scope of the second assumption. From
these examples, it is clear that the scope of an assumption a in a proof consists
of all lines a through <p, where the line following <p is of the form a D <p and is
inferred by C.P. from that sequence of lines. In the preceding proof, the second
assumption lies within the scope of the first because it lies between the first
assumption and line 10, which is inferred by C.P. from the sequence of lines 2
through 9.
When we use this new method of writing out a Conditional Proof of validity,
the scope of every original premiss extends all the way to the end of the proof.
The original premisses may be supplemented by additional assumptions,
provided that the latter’s scopes are limited and do not extend to the end of the
proof. Each line of a formal proof of validity must be either a premiss, or an
assumption of limited scope, or must follow validly from one or two preceding
lines by a Rule of Inference, or must follow from a sequence of preceding lines
by the principle of Conditional Proof.
Some arguments can be proved valid by the Strengthened Rule of Condi¬
tional Proof, although the original Rule of Conditional Proof, as stated in
Section 3.4, cannot be applied to them. Consider the following argument and
its proof:
L AD B
2. [AD(A-B)]DC /.*.C
1, 3, M.P.
3, 4, Conj.
6. AD(A-B) 3-5, C.P.
7. C ' 2, 6, M.P.
.\q
is valid, no matter what statements are substituted for the variables p and q.
Thus from lines 11 and 12 in the proof on page 53, the conclusion E can be
derived in just two more lines. Such a continuation would proceed:
We do not yet have the desired conclusion, but we can get it from line 15 in
just three easy steps.
From now on, the Strengthened Rule of Conditional Proof will be referred to
simply as the Rule of Conditional Proof.5 * * * * 10
5 For further discussion of the relations between the Rule of Indirect Proof and the (Strength¬
ened) Rule of Conditional Proof, see Donald Scherer, ‘The Form of Reductio ad Absurdurn, Mind,
vol. 80 (1971), pp. 247-252; Paiil Foulkes, ‘The Form of Reductio ad Absurdurn, Mind, vol. 82
(1973), pp. 579-580; Charles II. Lambros; ‘Scherer on Reductio ad Absurdurn, Mind, vol. 82
(1973), pp. 581-585; J. M. Lee, ‘The Form of Reductio ad Absurdurn, Notre Dame Journal of
Formal Logic, vol. 14 (1973), pp. 381-386; and Lyman C. D. K. Kulathungam, ‘Reductio-ad-
Absurdurn: A Family Feud between Copi and Scherer’, Notre Dame Journal of Formal Logic, vol.
10 (1975), pp. 245-254.
Sec. 3.8] Shorter Truth Table Technfque-ft<toc//o Ad Absurdum Method 61
EXERCISES
^ Use the strengthened method of conditional proof to prove the validity of the
following arguments:
*1. A D B . 4. Q\ (R 3 S)
B D f(C 3 -C) 3 D] [K D(/US)1 Df/’vU)
/.ADD (r D 3 V)
<;>v v
2. (E v F) 3 G 5. [W D (-V*-y)]-[Z D -(XvV)l
II 3 (/•/) (-A D \V)-{^B D /)
.-.(E 3 (;)•(// D /) (A D X)m(B D V)
..A — y
3. (X D L)m(M 3 N) 6. (C v D) D (X D F)
(LvN) 3 {[O D (Ov I*)] D (K-M)} [E D (/<>F)| D C;
c; D [(*-//V --//) D (OH)]
(AvB)D(C-D)
(D v E) D F
:.A 3 F
F, however, to make the conclusion false. Hence the argument is invalid only if
the statement F is both true and false, which is impossible. This method of
proving the validity of an argument is a version of the reductio ad absurdum
technique that uses truth value assignments rather than Rules of Inference.
It is easy to extend the use of this method to the classification of statements
(and statement forms). Thus to certify that Peirce’s Law [(p D q) D p] D p is
a tautology, we assign it the truth value F, which requires us to assign T to its
antecedent [(p D q) D p] and F to its consequent p. For the conditional
[(p D q) D p] to be true while its consequent p is false, its antecedent (p D q)
must be assigned the truth value F also. But for vhe conditional p D q to be
false, its antecedent p must be assigned T and its consequent q assigned F.
However, we were previously forced to assign F to p, so assuming Peirce’s
Law false leads to a contradiction, which proves it a tautology.
If it is possible to assign truth values consistently to its components on the
assumption that it is false, then the expression in question is not a tautology, but
must be either contradictory or contingent. In such a case, we attempt to assign
truth values to make it true. If this attempt leads to a contradiction, the
expression cannot possibly be true and must be a contradiction. But if truth
values can be assigned to make it true and other truth values can be assigned to
make it false, then it is neither a tautology nor a contradiction, but is contin¬
gent.
The reductio ad absurdum m#iipd of assigning truth values is by far the
quickest and easiest method of testing arguments and classifying statements. It
is, however, more readily applied in some cases than in others. If F is assigned
to a disjunction, F must be assigned to both disjuncts, and where T is assigned
to a conjunction, T must be assigned to both conjuncts. Here the sequence of
assignments is forced. But where T is assigned to a disjunction or F to a
conjunction, that assignment, by itself, does not determine which disjunct is
true or which conjunct is false. At this point we shall have to experiment and
make various ‘trial assignments’, which will tend to diminish the advantage of
the method for such cases. Despite these complications, however, in the vast
majority of cases, the reductio ad absurdum method is superior to any other
method known.
EXERCISES
1. Use the reductio ad absurdum method of assigning truth values to decide the
validity or invalidity of the arguments and argument forms in the exercises on
pages 25-26.
2. Use the reductio ad absurdum method of assigning truth values to establish that the
statements in Exercises I and II on page 56 are tautologies.
3. Use the reductio ad absurdum method of assigning truth values to classify the
statement forms in Exercise I on pages 30-31 as tautologous, contradictory, or con¬
tingent.
Quantification Theory
The validity of such an argument depends upon the inner logical structure of
the noncompound statements it contains. To appraise such arguments, we must
develop methods for analyzing noncompound statements and symbolizing their
inner structures.
The second premiss of the preceding argument is a singular proposition; it
says that the individual Socrates has the attribute of being human. We call
‘Socrates’ the subject term and ‘human’ the predicate term. Any (affirmative)
singular proposition states that the individual referred to by its subject term has
the attribute designated by its predicate term. We regard as individuals not
only persons, but any things, such as animals, cities, nations, planets, or stars, of
which attributes can significantly be predicated. Attributes can be designated
not only by adjectives, but by nouns or even verbs. Thus ‘Leslie is a gossip’ and
‘Leslie gossips’ have the same meaning, which can also be expressed as ‘Leslie
is gossipy’.
In symbolizing singular propositions, we use the small letters V through
to denote individuals, ordinarily using the first letter of an individual’s name to
denote that individual. Because these symbols denote individuals, we call them
individual constants’. To designate attributes, we use capital letters, being
guided by the same principle in their selection. Thus in the context of the
preceding argument, we denote Socrates by the small letter V and symbolize
the attributes human and mortal by the capital letters 7T and ‘M’. To express
a singular proposition in our symbolism, we write the symbol for its predicate
63
64 Quantification Theory [Ch. 4
term to the left of the symbol for its subject term. Thus we symbolize ‘Socrates
is human' as 7/s’ and ‘Socrates is mortal' as ‘Ms’.1
Examining the symbolic formulations of singular propositions having the
same predicate term, we observe them to have a common pattern. The
symbolic formulations of the singular propositions ‘Aristotle is human', ‘Boston
is human', ‘California is human’, ‘Descartes is human’,..., which are 7/a’,
7/b’, 7fc\ 7/d', ..., each consists of the attribute symbol 7/' followed by an
individual constant. We use the expression 7/x\ to symbolize the pattern
common to all such singular propositions. The small letter Y—called an
‘individual variable'—is a mere place marker that serves to indicate where an
individual constant can be written to produce a singular proposition. The
singular propositions *Ha\ ‘Hh‘Hc\ 7/d’,... are either true or false; but ‘Hx
is neither true nor false, since it is not a proposition. Such expressions as 7/x’ are
called ‘propositional functions'. These are defined to be expressions that
contain individual variables and become propositions when their individual
variables are replaced by individual constants.2 Any singular proposition can
be regarded as a substitution instance of the propositional function from which
it results by the substitution of an individual constant for the individual
variable in the propositional function. The process of obtaining a proposition
from a propositional function by substituting a constant for a variable is called
‘instantiation’. The negative singular propositions ‘Aristotle is not human’ and
‘Boston is not human', symbolized as ‘—//a’ and Hb\ result by instantiation
from the propositional function of which they are substitution in¬
stances. Thus we see that symbols other than attribute symbols and individual
variables can occur in propositional functions.
. General propositions such as ‘Everything is mortal’ and ‘Something is mortal'
differ from singular propositions in not containing the names of any individuals.
However, they also can be regarded as resulting from propositional functions,
not by instantiation, but by the process called ‘generalization’ or ‘quantifica¬
tion’. The first example, ‘Everything is mortal', can alternatively be ex¬
pressed as
Here the relative pronoun ‘it’ refers back to the word ‘thing’, which precedes it
in the statement. We can use the individual variable Y in place of the pronoun
‘it’ and its antecedent to paraphrase the first general proposition as
1 Some logicians enclose the individual constant in parentheses, symbolizing 'Socrates is human’
as *H{s)\ but we shall not follow that practice here.
2Some writers have defined 'propositiohal functions’ to be the meanings of such expressions; but
here we define them as the expressions themselves.
Scc- 4,11 Singular Propositions and General Propositions 65
(x)Mx
and as
The phrase There is at least one x such that’ is called an ‘existential quantifier'
and is symbolized as ‘(3:*)’. Using the new symbol, we can completely symbol¬
ize our second general proposition as
(3x)Mx
Assuming the existence of at least one individual, we can say that the two top
propositions are contraries, that is, they might both be false but cannot both be
true; the two bottom propositions are subcontraries, that is, they can both be
true but cannot both be false. Propositions that are at opposite ends of the
diagonals are contradictories, of which one must be true and the other false.
Finally, on each side, the truth of the lower proposition is implied by the truth
of the proposition directly above it.
Traditional logic emphasized four types of subject-predicate propositions
illustrated by the following:
(x)(Hx D Mx)
(x)(llx ID ~~Mx)
(3 .v)(//x*Afx)
4 We have the same symbolic convention for quantifiers (both universal and existential) that we
established for negation on page 12: a quantifier applies to or has for its scope the smallest
component that the punctuation permits.
68 Quantification Theory [Ch. 4
(3x)(ffx-^Afx) '
Where the Greek letters phi and psi are used to represent any attribute
symbols whatever, the four general subject-predicate propositions of tradi¬
tional logic may be represented in a square array as
()
(3.0 (4h- • 0.v) (30 (4>v • *4,x)
Of these, the A and the O are contradictories, and the E and the I are
contradictories also. But none of the other relationships discussed in connection
with the square array on page 66 hold for the traditional A, E, Z, and O
propositions, even if we assume that there is at least one individual in the
universe. Where 4 <l>x* is a propositional function that has no true substitution
instances, then regardless of what attribute is symbolized by the proposi¬
tional functions *<bx D tyx’ and ‘4>x D —'I'x’ have only true substitution
instances. This is so because all their substitution instances are conditional
statements with false antecedents. In such cases both the A and E propositions
that are the universal quantifications of these complex propositional functions
are true, so A and E propositions are not contraries. Again, where ‘<I>x is a
propositional function that has no true substitution instances, then, regardless
of what €yVx might be, the propositional functions ‘<J>x • ♦x’ and ‘<t>x • ~~'Vx have
only false substitution instances,*Ior their substitution instances are conjunc¬
tions whose first conjuncts are false. In such cases the 1 and O propositions that
are the existential quantifications of these complex propositional functions are
false. Thus I and O propositions are not subcontraries. In all such cases, then,
since the A and E propositions are true and the / and O propositions are false,
the truth of a universal does not imply the truth of the corresponding particu¬
lar; no implication relation holds between them.
If we make the assumption that there is at least one individual, then
‘(x)(<t>x D 'I'*)’ does imply \3x)(<bx O ^x)\ The latter, however, is not an f
proposition. An / proposition qf the form 'Some <J>’s are 'k’s is symbolized as
‘(3x)(<I>x-'I,x)\ which asserts that .there is at least one thing having both the
attribute $ and the attribute 'V. But the proposition D asserts
Sec' Singular Propositions and General Propositions 69
only that there is at least one object that either has the attribute or does not
have the attribute <1>, which is a very different and much weaker assertion.
The four traditional subject-predicate forms A, E, 7, and O are not the only
forms of general propositions. There are others that involve the quantification
of more complicated propositional functions. Thus the general proposition ‘All
members are either parents or teachers’, which does not mean the same as ‘All
members are parents or all members are teachers’, is symbolized as
(x)[Mx D (Pxv lx)]. And the general proposition ‘Some Senators are either
disloyal or misguided’, is symbolized as '(3x)[Sx-(Dxv Mx)]\ It should be
observed that such a proposition as ‘Apples and bananas are nourish¬
ing’ can be symbolized either as the conjunction of two A propositions,
*[(*)(-A* Nx)] • [(x)(Bx D Nx)]’, or as a single noncompound general proposi¬
tion, ‘(x)[(Ax v Bx) D Nx\\ It should not, however, be symbolized as
<(x)[(Ax*Bx) D Nx]\ since to say that apples and bananas are nourishing is to
say that anything is nourishing which is cither an apple or a banana, not that
anything is nourishing which is both an apple and a banana (whatever that
might be). It must be emphasized that there are no mechanical rules for
translating statements from English into our logical notation. In every case one
must understand the meaning of the English sentence and then re-express that
meaning in terms of propositional functions and quantifiers.
EXERCISES
I. Translate each of the following into the logical notation of propositional functions
and quantifiers. In each case, use the abbreviations suggested and begin each
formula with a quantifier, not with a negation symbol:
16. Some medicines are dangerous only if taken in excessive amounts. (Mx: x is
a medicine. Dx: x is dangerous. Ex: x is taken in excessive amounts.)
17. All fruits and vegetables are wholesome and delicious. (Fx: x is a fruit. Vx: x
is a vegetable. Wx: x is wholesome. Dx: x is delicious.)
18. Everything enjoyable is either immoral, illegal, or fattening. (Ex: x is enjoya¬
ble. Mx; x is moral. Lx: x is legal. Ex: x is fattening.)
19. A professor is a good lecturer if and only if he is both well informed and
entertaining. (Px: x is a professor. Gx: x is a good lecturer. Wx: x is well
informed. Ex: x is entertaining.)
*20. Only policemen and firemen are both indispensable and underpaid. (Px: x is
a policeman. Fx: x is a fireman. Ix: x is indispensable. Ux: x is underpaid.)
21. Not every actor is talented who is famous. (Ax: x is an actor. Tx: x is talented.
Fx: x is famous.)
22. Any girl is healthy if1 she is well nourished and exercises regularly. (Gx: x is
a girl. IIx: x is healthy. Wx: x is well nourished. Ex: x exercises regularly.)
23. It is not true that every watch will keep good time if and only if it is wound
regularly and not abused. (Wx: x is a watch. Kx: x keeps good time. Rx: x is
wound regularly. Ax: x is abused.)
24. Not every person who talks a great deal has a great deal to say. (Px: x is a
person. Tx: x talks a great deal. Hx: x has a great deal to say.)
*25. No automobile that is over ten years old will be repaired if it is severely
damaged. (Ax: x is an automobile. Ox: x is over ten years old. Rx: x will be
repaired. Dx: x is severely damaged.)
26. Some horses are gentle and have been well trained.
27. Some horses are gentle only if they have been well trained.
28. Some horses are gentle if they have been well trained.
29. Any horse is gentle that has been well trained.
*30. Any horse that is gentle has been well trained.
31. No horse is gentle unless it has been well trained.
32. Any horse is gentle if it has been well trained.
33. Any horse has been well trained if it is gentle.
34. Any horse is gentle if ancfonly if it has been well trained.
*35. Gentle horses have all been well trained.
36. Only well-trained horses are gentle.
37. Only gentle horses have been well trained.
38. Only horses are gentle if they have been well trained.
39. Some horses are gentle even though they have not been well trained.
*40. If something is a well-trained horse, then it must be gentle.
41. Some horses that are well trained are not gentle.
42. Some horses are neither gentle nor well trained.
43. No horse that is well trained fails to be gentle.
44. A horse is gentle only if it has been well trained.
45. If anything is a gentle horse, then it has been well trained.
46. If any horse is well trained, then it is gentle.
Sec. 4.2] Proving Validity: Preliminary Quantification Rules 71
(a*)(<p.v)
(where v is any individual symbol)
1. (x)(Hx D Mx)
2. Hs //.Ms
3. Hs D Ms 1, UI
4. Ms 3, 2, M.P.
5This rule and the three that follow are variants of rules for natural deduction’, which were
devised independently by Gerhard Gentzen and Stanislaw Jankowski in 1934.
72 Quantification Theory [Ch. 4
<I>iy
(where ‘xj denotes any arbitrarily selected individual,
.■.(*)(<!>*) and Or/ is not within the scope of any assumption
containing the special symbol ‘r/’)
We can use the new notation and the additional rule UG to construct a
formal proof of validity for the argument: ‘No mortals are perfect; all humans
are mortal; therefore, no human? are perfect’.
1. (x)(Mx D ~~Px)
2. (x)(Hx DMx) /. *. (x)(Hx D -Px)
3. lly D My 2, UI
4. My D ~~Py 1,UI
5. Ily D ~~Py 3, 4, H.S.
6. (x)(IIx D ~Px) 5, UG
We can explain the need for the two restrictions in our statement of UG by
considering two obviously invalid arguments. First, the silly argument ‘Chicago
is large, therefore, everything is large’. If we ignored the restriction on UG that
Sec. 4.2] Proving Validity: Preliminary Quantification Rules 73
the premiss must contain the special symbol y\ we might be led to construct
the following ‘proof’:
The mistake occurs at line 2. The individual symbol in line 1 is <\ but UG can
be used only on a line containing
The second invalid argument to consider is the obviously invalid: ‘Not
everything is edible; therefore, nothing is edible’. If we ignored the restriction
on UG that the Of/ must not be within the scope of an assumption containing
the special symbol ‘i/\ we might be led to construct the following ‘proof’:
1. ~~(x)Ex (x)*Ex
2. Etj
3. (x)Ex 2, UG (wrong)
47 Ey D (x)Ex 2-3, C.P.
5. ^Eij 4, 1, M.T.
6. (x)^Ex 5, UG
But we can take any individual constant other than ‘y\ say, *w\ which has had
no prior occurrence in that context and use it to denote the individual or one of
the individuals whose existence has been asserted by the existential quantifica¬
tion. Knowing that there is such an individual, and having agreed to denote it
by *w\ we can infer from the existential quantification of a propositional
function the substitution instance of that propositional function with respect to
the individual symbol We add, as our final quantification rule, the principle
that from the existential quantification of a propositional function we may
validly infer the truth of its substitution instance with respect to an individual
constant that has no prior occurrence in that context. The new rule may be
written as
1. (x)(Dx D Cx)
2. (3 x)(Ax*Dx) /.’ . (3x)(Ax'Cx)
3. Aw-Dw 2, El
4. Dw D Cw 1,UI
5. DwAw 3, Com.
6. Dw 5, Simp.
7. Cw 4, 6, M.P.
8. Aw 3, Simp.
9. AwCw 8, 7, Conj.
10. (3 x)(Ax-Cx) 9, EG
We can show the need for tffe indicated restriction on the use of El by
considering the obviously invalid argument: ‘Some cats are animals; some dogs
are animals; therefore, some cats are dogs'. If we ignored the restriction on El
that the substitution instance inferred by it can contain only an individual
constant that had no prior occurrence in the context, we might be led to
construct the following ‘proof’:
1. (3 x)(Cx-Ax)
2. (3 x)(Dx*Ax) /.‘.(3 x)(Cx9Dx)
3. CwAw 1, El.
4. Dw •Aw 2, El (wrong)
Sec. 4*2] Proving Validity: Preliminary Quantification Rules 75
5. Cw 3, Simp.
6. Dw 4, Simp.
7. CwDw 5, 6, Conj.
8. (3x)(Cx-Dx) 7, EG
The mistake here occurs at line 4. The second premiss assures us that there is
at least one thing that is both a dog and an animal. We are not free to use the
symbol w to denote that thing, however, because *w' has already been used to
denote one of the things that was asserted by the first premiss to be both a cat
and an animal. To avoid errors of this sort, we must ojbey the indicated
restriction in using El. It should be clear that whenever we use both El and UI
in a proof to instantiate with respect to the same individual constant, we must
use El first. (It has been suggested that we could avoid the necessity of using El
before UI by changing the restriction on El to read ‘where v is an individual
constant, other than ‘t/’, that was not introduced into the context by any
previous use of El/ Even apart from its apparent circularity, that formulation
would not prevent the construction of an erroneous ‘formal proof of validity'
for such an invalid argument as ‘Some men are handsome. Socrates is a man.
Therefore, Socrates is handsome.')
Like the first nine Rules of Inference presented in Section 3.1, the four
quantification rules UI, UG, EG, and El can be applied only to whole lines in
a proof.
Any assumption of limited scope may be made in a Conditional Proof of
validity and, in particular, we are free to make an assumption of the form ‘4>t/\
Thus the argument ‘All freshmen and sophomores are invited and will be
welcome; therefore, all freshmen are invited’ is proved valid by the following
Conditional Proof:.
More than one assumption of limited scope can be made in proving the validity
of arguments involving quantifiers, as in the following Conditional Proof:
1. (x)[(AxvBx) D (Cx*Dx)]
2. (x){(CxvEx) D [(FxvGx) D IIx) //.(x)[Ax D (Fx D Hx)]
3. (Ay v By) D (Cy *Dy) 1, UI
4. (Cy v Ey) D [(Fy v Gy) D Hy] 2, UI
[Ch. 4
—> 5. Ay
6. Ay v By 5, Add
7. CyDy 3, 6, M.P.
8. Cy 7, Simp.
9. Cy v Ey 8, Add.
10. (Fy v Gy) 3 Hy 4,9, M.P.
pll. Fy
12. FyvGy 11, Add.
13. Hy 10,12, M.P.
14. Fy 5 Ily 11-13, C.P.
15. Ay 3 (Fy 3 i/y) 5-14, C.P.
16. (x)[Ax 3 (Fx 3 J/x)] 15, UG
XERCISES
I. Construct formal proofs of validity for the following arguments, using the rule of
Conditional Proof wherever you wish:
I. Construct formal proofs of validity for the following arguments, using the rule of
Conditional Proof wherever you wish:
*1. All athletes are brawny. Charles is not brawny. Therefore, Charles is not an
athlete. (Ax, Bx, c)
2. No contractors are dependable. Some contractors are engineers. Therefore,
some engineers are not dependable. (Cx, Dx, Ex)
3. All fiddlers are gay. Soni^ hunters are not gay. Therefore, some hunters are
not fiddlers. (Fx, Gx, I lx)
Sec. 4.2] Proving Validity: Preliminary Quantification Rules 77
4. No judges are idiots. Kanter is an idiot. Therefore, Ranter is not a judge.
(/*, /*, k)
5. All liars are mendacious. Some liars are newspapermen. Therefore, some
newspapermen are mendacious. (Lx, M.v, \Jx)
6. No osteopaths are pediatric ians. Some quacks are pediatricians. Therefore,
some quacks are not osteopaths. (Ox, Px, (Jx)
7. Only salesmen are retailers. Not all retailers are travelers. Therefore, some
salesmen are not travelers. (S.v, Rx, Tx)
8. There are no uniforms that are not washable. There are no washable velvets.
Therefore, there are no velvet uniforms. ((■ v, W'\\ V.v)
9. Only authoritarians are bureaucrats. Authoritarians are all churlish. There¬
fore, any bureaucrat is churlish. (Ax, Rx, Cx)
*10. Dates are edible. Only items of food are edible. All items of food are good.
Therefore, all dates arc good. (Dx, Ex, Fx, Cx)
11* All dancers are graceful. Mary is a student. Marv is a dancer. Therefore, some
students are graceful. (Dx, Cx, S.v, in)
12. Tigers are fierce and dangerous. Some tigers are beautiful. Therefore, some
dangerous things are beautiful. (Tx, Fx, Dx, Rx)
St 13. Bananas and grapes are fruits. Fruits and vegetables are nourishing. There¬
fore, bananas are nourishing. (Rx, Cx, Fx, Vx, \Tx)
14. A communist is either a fool or a knave. Fools are naive. Not all communists
are naive. Therefore, some communists are knaves. (Cx, Fx, Kx, Nx)
*15. All butlers and valets are both obsequious and dignified. Therefore, all butlers
are dignified. (Rx, Vx, Ox, Dx)
16. All houses built of brick are warm and cozv. All houses in Engle¬
wood are built of brick. Therefore, all houses in Englewood are warm.
(lix, Rx, Wx, Cx, Fx)
17. All professors are learned. All learned professors are savants. Therefore, all
professors are learned savants, i Tx, 1.x, Sx)
18. All diplomats are public servants. Some diplomats are eloquent. All
eloquent public servants are orators. Therefore, some diplomats are orators.
(Dx, Fx, Ex, Ox)
Y 19. Doctors and lawyers are college graduates. Any altruist is an idealist. Some
lawyers are not idealists. Some doctors are altruists. Therefore, some college
graduates are idealists. (Dx, Lx, Cx, Av, lx)
*20. Bees and wasps sting if they are either angry or frightened. Therefore, any
bee stings if it is angry. (Rx, W'r, Sv, Ax, Fx)
21. Any authors are successful if and only if they are well read. All authors are
intellectuals. Some authors are successful but not well read. Therefore, all
intellectuals are authors. [Ax, Sx, Wx, lx)
22. Every passenger is either in first class or in tourist class. Each passenger is in
tourist Mass if and only if he is not wealthy. Some passengers are wealthy. Not
all passengers are wealthy. Therefore, some passengers are in tourist class.
(Px, Fx, Tx, Wx)
23. All members are both officers and gentlemen. All officers are fighters. Only a
pacifist is either a gentleman or not a fighter. No pacifists are gentlemen if
they are fighters. Some members are fighters if and only if they are officers.
Therefore, not all members are fighters. (Mx, Ox, Cx, lx, Px)
78 Quantification Theory [Ch. 4
24. Wolfhounds and terriers are hunting dogs. Hunting dogs and lap dogs are
domesticated animals. Domesticated animals, are gentle and useful. Some
wolfhounds are neither gentle nor small. Therefore, some terriers are small
but not gentle. ( Wx, Tx, Hx, Lx, Dx, Gx, Ux, Sx)
25. No man who is a candidate will be defeated if he is a good campaigner. Any
man who runs for office is a candidate. Any candidate who is not defeated
will be elected. Every man who is elected is a good campaigner. Therefore,
any man who runs for office will be elected if and only if he is a good
campaigner. (Mx, Cx, Dx, Gx, Rx, Ex)
And for any number k, if there are exactly k individuals, say a, b, c,..., k, then
(x)(<l>x) =
and
(3jc)(4>x) = (<t>a v <I>h v v ... v 4>fc)
we first symbolize it as
(x)(Wx D Hx)
(;x)(Ex D Hx)
(x)(Wx D Ex)
In the case of a model containing exactly one individual, say, a, the given
argument is logically equivalent to
V Va D 11a
Ei D 11a
.’. Wa D Ea
which is proved invalid by assigning the truth value T to *Wa and 7/r/’ and F
to ‘Ea’. (Tliis assignment of truth values is a shorthand way of describing the
model in question as one that contains only the one individual a, which is \V (a
whale), and H (heavy), but not E (an elephant).)6 Hence the original argument
is not valid for a model containing exactly one individual and is, therefore,
invalid.
6Here wc assume that the simple propositional functions Ax', 'Bx\ 'Cx\ ... arc neither
necessary, that is, logically true of all individuals (for example, x is identical with itself), nor
impossible, that is, logically false of all individual* (for example, x is different from itself). We also
assume that the only logical relations among the simple propositional functions are those asserted
or logically implied by the premisses of the argument being proved invalid. The point of these
restrictions is to permit the arbitrary assignment of truth values to substitution instances of these
simple propositional functions without inconsistency—for our model-descriptions must, of course,
he consistent.
80 Quantification Theory [Ch. 4
which is symbolized as
(x)(Wx D Hx)
(3.t)( Ex • Hx)
/. (x)(Wx D Ex)
For a model containing just the one individual a, this argument is logically
equivalent to
Wa D Ha
Eam Ha
. ’. Wa D Ea
which is a valid argument. But for a model consisting of the two individuals a
and fo, the given argument is logically equivalent to
which is proved invalid by assigning the truth value T to €Wa\ ‘Wb\ 1Ha', 7/b’,
‘Eb’, and the truth value F to ‘Ea. Hence the original argument is invalid,
because there is a model for which it is logically equivalent to an invalid
truth-functional argument.
Another illustration is
(3 x)(Dx-Px)
(3x)(Dx'Sx)
(3x)(PxmSx)
Da-Pa
Da • Sa
. ’. Pa • Sa
which is valid. But tor a model consisting of the two individuals a and b, it is
equivalent to
(Da-Pa)v(Db-Ph)
(Da • Sa) v (Db • Sb)
:.(Pa-Sa)v(Pb-Sb)
which is proved invalid by assigning the truth value T to 'Da\ 1Db\ ‘Pa\ 'Sh\
and the truth value F to T// and 'Sa. Here, too, the original argument is
invalid, because there is a model for which it is logically equivalent to an
invalid truth-functional argument.
An invalid argument involving quantifiers may be valid for any model
containing fewer than k individuals, even though it must be invalid for every
model containing k or more individuals. Hence in using this method to prove
the invalidity of an argument involving quantifiers, it may be necessary to
consider larger and larger models. The question naturally arises: How large a
model must we consider in trying to prove the invalidity of an argument of this
type? A theoretically satisfactory answer to this question has been found. If an
argument contains n different predicate symbols, then if it is valid for a model
containing 2n individuals, then it is valid for every model or universally valid.7
This result holds only for propositional functions of one variable and is not true
of the relational predicates discussed in Chapter 5. Although theoretically
satisfactory, this solution is not of much practical help. If we were to go
straight to the theoretically crucial case for deciding the validity or invalidity
of any of the arguments already considered in this section, we should have to
consider models containing eight individuals. And for some of the following
exercises, the theoretically crucial case would be a model containing 28 = 256
individuals. In fact, however, none of the following exercises requires consid¬
eration of models containing more than three individuals to prove their
invalidity.
7 See Paul Bernays and Moses Schonfinkel. ‘Zuin Entscheidungsproblcm der inathematischen
Logik’. Mathematische Annaltm.. vol. 99 (1928), and Wilhelm Ackerinann. Solvable Cases of the
Decision Problem, North-Holland Publishing Company, Amsterdam, 1954, chap. 4.
J/Z Quantification Theory [Ch. 4
EXERCISES
III. Prove the validity or prove the 'invalidity of each of the following arguments:
*1. All aviators are brave: Jones is brave. Therefore, Jones is an aviator.
2. All collegians are debonair. Smith is a collegian. Therefore, Smith is debo-
Sec. 4.4] Multlply-General Propositions 83
3. No educators are fools. All gamblers are fools. Therefore, no educators are
gamblers.
4. No historians are illiterates. All illiterates are underprivileged. Therefore, no
historians are underprivileged.
*5. Only citizens are voters. Not all residents are citizens. Therefore, some
residents are not voters.
6. Only citizens are voters. Not all citizens are residents. Therefore, some
voters are not residents.
7. Automobiles and wagons are vehicles. Some automobiles are Fords. Some
automobiles arc trucks. All trucks are vehicles. Therefore, some Fords are
trucks.
8. Automobiles and wagons are vehicles. Some automobiles are Fords. Some
automobiles are trucks. All vehicles are trucks. Therefore, some Fords are
trucks.
9. Automobiles and wagons are vehicles. Some automobiles are Fords. Some
automobiles are trucks. Some wagons arc not vehicles. Therefore, some
Fords are trucks.
*10. All tenors arc either overweight or effeminate. No overweight tenor is
effeminate. Some tenors are effeminate. Therefore, some tenors are over¬
weight.
11. All tenors are either overweight or effeminate. No overweight tenor is
effeminate. Some tenors are effeminate. Therefore;, some tenors are not
overweight.
12. No applicant will be either hired or considered who is either untrained or
inexperienced. Some applicants are inexperienced beginners. All applicants
who are women will be disappointed if they are not hired. Every applicant
is a woman. Some women will be hired. Therefore, some applicants will be
disappointed.
13. No candidate is cither elected or appointed who is cither a liberal or a
radical. Some candidates arc wealthy liberals. All candidates who are
politicians arc disappointed if they are not elected. Every candidate is a
politician. Some politicians arc elected. Therefore, some candidates are not
disappointed.
14. Abbots and bishops are churchmen. No churchmen are either dowdy or
elegant. Some bishops are elegant and fastidious. Some abbots arc not
fastidious. Therefore, some abbots are dowdy.
15. Abbots and bishops are churchmen. No churchmen are both dowdy and
elegant. Some bishops arc elegant and fastidious. Some abbots are not
fastidious. Therefore, some abbots are not dowdy.
but not such propositions as KFa v (Cb •!!()'. On the other hand, the same
constant can be substituted for free occurrences of different variables, pro¬
vided, of course, that if it is substituted for any free occurrence of a variable, it
must be substituted for all free occurrences of that variable. Thus additional
substitution instances of the propositional function 'Fxv (Cyllx)' are
'Fa v (Ca-Ha)’, 'Fb v (Cb-Hb)\ 'Fc v (Ccllcf,...
Having admitted the letters *u\ *v\ *w , Uj , and ‘z as individual variables in
addition to Y, we now adjust our notation for universal and existential
quantification to conform to our expanded stock of variables. The proposition
"All F’s are GY may be alternatively symbolized as \u)(Fu D Cu)\
\v)(Fv 7D Cv)\ \w)(Fw D Gw)\ \x)(Fx D Cx)\ \i/)(Fy D Gy)', or
'(z)(Fz D Cz)\ Similarly the proposition, ‘There are some //V, may be alterna¬
tively symbolized as ‘(3u)Hu\ '(3v)hv\ '(3w)Ilw, (3x)Hx, (3y)Hy^ or
‘(3z)llz. The difference between \x)Fx and \\f)Fij [as between \3x)Cx and
‘(3f/)Gi/ ] is purely notational and either may be written in place of the other.
Of course, where a propositional function contains free occurrences of two or
more different variables, such as Fx-Cy\ the two propositional functions that
result from quantifying it differently as
are very different indeed, and their difference is moie than merely notational.
The substitution instances of the first are
(3x)(.r is wrong with the house) D (y) [(y is a person in the house)
D (y complains)]
Here the scope of the initial quantifier does not extend past the main implica¬
tion sign. But if we turn now to another proposition which bears a superficial
resemblance to the first:
although a proposition, does not have the same meaning as the original
proposition in English. Instead, it says merely that there is at least one thing
that should be rectified if it is wrong. The sense of the English sentence,
however, is clearly that anything which is wrong should be rectified. Hence a
correct symbolization is neither of the preceding, but rather
is properly symbolized as
is not to be symbolized as
for the last occurrence of the variable V is outside the scope of the initial
quantifier and is left dangling. It cannot be corrected simply by rebracketing,
as
for this expression fails equally to preserve the sense of the English sentence, in
the same way as in the previous example. -That sense is expressed by the
formula
EXERCISES
Symbolize each of the following propositions. In each case, use the suggested
notation and make the symbolic formula parallel the English as closely as possible:
*1. If anything is damaged, someone will be blamed. (Dx: x is damaged. Px: x is^
person. Bx: x will be blamed.)
2. If anything is damaged, the tenant will be charged for it. (Dx: x is damaged. Cx:
x will be charged to the tenant.)
3. If nothing is damaged, nobody will be blamed. (Dx: x is damaged. Px: x is a
person. Bx: x will be blamed.)
4. If something is damaged, but nobody is blamed, the tenant will not be charged
for it. (Dx: x is damaged. Px: x is a person. Bx: x is blamed. Cx: x will be charged
to the tenant.)
*5. If any bananas are yellow, they are ripe. (Bx: x is a banana. Yx: x is yellow. Rx: x
is ripe.)
6. If any bananas are yellow, then some bananas are ripe. (Bx: x is a banana. Yx: x
is yellow. Rx: x is ripe.)
7. If any bananas are yellow, then if all yellow bananas are ripe, they are ripe. (Bx:
x is a banana. Yx: x is yellow. Rx: x is ripe.)
8. If all ripe bananas are yellow, some yellow things are ripe. (Rx: x is ripe. Bx: x
is a banana. Yx: x is yellow.)
9. If all officers present are either captains or majors, then either some captains are
present or some majors are present. (Ox: x is an officer. Px: x is present. Cx: x is
a captain. Mx: x is a major.)
*10. If any officer is present, then either no majors are present or he is a major. (Ox: x
is an officer. Px: x is present. Mx: x is a major.)
11. If some officers are present, then if all officers present are captains, then some
captains are present. (Ox: x is an officer. Px: x is present. Cx: x is a captain.)
12. If some officers are present, then if all officers present are captains, then they are
captains. (Ox: x is an officer. Px: x is present. Cx: x is a captain.)
13. If all survivors are fortunate and only women were survivors, then if there are any
survivors, then some women are fortunate. (Sx: x is a survivor. Px: x is fortunate.
Wx: x is a woman.)
14. If any survivors are women, then if all women are fortunate, they are fortunate.
(Sx: x is a survivor. Wx: x is a woman. Fx: x is fortunate.)
*15. If there are any survivors and only women are survivors, then they are women.
(Sx: x is a survivor. Wx: xis a woman.)
16. If every position has a future and no employees are lazy, then some employees
will be successful. (Px: x is a position. Fx: x has a future. Ex: x is an employee. Lx:
x is lazy. Sx: x will be successful.)
Sec. 4.5] Quantification Rules 89
17. If any employees are lazy, then if some positions have no future, then they will
not be successful. (Ex: x is an employee. Lx: x is lazy. Px: x is a position. Fx: x
has a future. Sx: x will be successful.)
18. If any employees are lazy and some positions have no future, then some em¬
ployees will not be successful. (Ex: x is an employee. Lx: x is lazy. Px: x is a
position. Fx: x has a future. Sx: x will be successful.)
19. If any husband is unsuccessful, then if all wives are ambitious, then some wives
will be disappointed. (Ilx: x is a husband. Sx: x is successful. Wx: x is a wife. Ax:
x is ambitious. Dx: x will be disappointed.)
20. If any husband is unsuccessful, then if some wives are ambitious he will be
unhappy. (Hx: x is a husband. Sx: x is successful. Wx: x is a wife. Ax: x is
ambitious. Ux: x will be unhappy.)
ing, we may note that this is so, even when the conclusion contains more free
variables than are present in the premisses. For example, by the principle of
Addition, we validly infer the propositional furtction of two variables ‘Fx v Gy'
from the propositional function of one variable ‘Fx'.
The original list of nineteen Rules of Inference also permits the inference of
propositional functions from propositions. By the principle of Addition, for
example, we can infer the propositional function ‘Fa v Gx' from the proposi¬
tion ‘Fa'. That such inferences as these are valid, in the new sense explained, is
obvious. Moreover, propositions can validly be inferred from propositional
functions by our Rules of Inference. By the principle of Simplification, for
example, we can infer the proposition ‘Fa' from the propositional function
‘Fa*Gx'. Hence the letters ‘p‘if, V, V in our nineteen Rules of Inference
now range over, or represent, either propositions or propositional functions.
We can now adopt a more general definition of formal proof of validity,
which parallels our earlier definition exactly except that lines of a proof can be
either propositions or propositional functions. If each line after the initial
premisses follows validly from preceding lines, in the generalized sense of
‘valid’ already explained, then the last line validly follows from the initial
premisses. If our initial premisses and conclusion are propositions rather than
propositional functions, then the conclusion validly follows from the initial
premisses, in the original sense of ‘valid’ which applies to arguments whose
premisses and conclusions are all statements or propositions. The following
considerations demonstrate this conclusion. As we go from our original prem¬
isses to propositional functions—if we go validly—then, if the premisses are
true, all substitution instances of the inferred propositional functions must be
true also. And as we proceed from previously inferred propositional functions
to other propositional functions, if we proceed validly, then all substitution
instances of these other propositional functions must be true also. Finally, when
we go from validly inferred propositional functions to the final conclusion,
which is a proposition, then if we proceed validly, since all substitution
instances of the former are true, the final conclusion must be true also.
The preceding remarks require some modification to take account of as¬
sumptions of limited scope thjt contain free variables, but the modifications are
best introduced as alterations in the quantification rules themselves. The
preliminary versions of our quantification rules must be replaced in any case,
for as stated, they apply only to propositions and not to propositional functions.
The two rules for generalizing, UG and EG, must now permit the quantifica¬
tion (or binding) of free variables. The instantiation rules UI and El, on the
other hand, must now permit the freeing of bound variables to permit the
introduction of propositional functions themselves, rather than (pretended)
substitution instances of them.
In our earlier discussion of propositional functions of the single variable V,
we introduced the Greek letters phi and psi, and let ‘<t>x' and N'x’ denote any
propositional function of V, such as ‘Fx', ‘Gx', ‘Fx*llx', ‘(Fx v Gx) D Hx',...
no matter how complicated such functions might be. It will be useful to
Sec. 4.5] Quantification Rules 91
continue to use these Greek letters: typically will denote any proposi¬
tional function that contains at least one free occurrence of the variable \v\
even including those propositional functions that contain free occurrences of
other variables. Thus 4>x may denote any of the following:
This general convention helps to prevent unwanted (that is, invalid) infer¬
ences from being allowed by our four quantification rules. How it contributes
to this end will be explained after formulating each of the four rules.
EXERCISES
"1. Fx v Gx a. Fa v Gy
2. Fx v Gy b. (y)(Fy Gy)
3. Fx D (3y)Gy c. Fa • Gx
4. (y)(Fx D Gy) d. Fb v Gh
*5. Fx=(3y)(GxvHy) e. Fx D (3y)Gy
6. (x)Fx D (3y)(Cx-Hy) f. Fy v Gy
g- (y)(Fa D Gy)
h. Fy => (3y)c*!/
i. (y)(Fz D Gy)
j- Fz=(3y)(Gx-IIy)
k. (x)Fx D (ly)(Gz-IIy)
LU
*—
ill
1.
a
a
<
m. (x)(Fx D Gy)
II. For each formula (where v is ‘i/’) in the left-hand column, indicate which
formulas in the right-hand column are legitimate instances of a Op for the
given <t>v:
1. Fy a. Fa
2. Fx-Gy b. Fx
3. FyGtj c. Fy
4. Fy v (3x)(Gx D Hy) d. Fx<mX
*5. Fy v (3y)(Gc v Hy) e. Fx • Gz
6. Fy D (3y)(GyHy) f. Fx v (3x)(Gx D Hx)
g. Fxv(3y)(GcvIIy)
h. Fx D (3y)(GyIix)
i. Fz v (3x)(Gx D Hz)
j. Fx D (3y)(GyHy)
k. Fzv(3x)(Gx D Hy)
l. Fx D (3y)(GyIlz)
m. Fy v (3x)(Gx D Hy)
Sec. 4.5] Quantification Rules 93
.'. <bi>
except that in these instances both //. and v are individual variables. Here, the
premiss (fA)(Ou) may be a proposition, but the conclusion <l>r must be a
propositional function.
Just as we count valid the inference*
(z)(Fz v Gb)
/. Fhv Gb
where the instantiating constant ‘fo’ occurs in the premiss as well as in the
conclusion, so we wish to count as valid such inferences as
94 Quantification Theory [Ch. 4
in which the instantiating variable occurs free in the premiss as well as in the
conclusion. In general, when Or is inferred from (/x)(Ojx) validly, r must occur
free in Or at every place where /x occurs free in O/x. There may be more free
occurrences of r in Or than there are free occurrences of fi in O/x, however.
There will be more occurrences whenever r occurs free in O/x. All the preced¬
ing inferences must be legitimized by our principle of Universal Instantiation.
Our first Quantification Rule, Universal Instantiation, is stated as
ui MM
The general convention governing O/x and <1 >r serves to prevent such an
erroneous inference as
(x)[(3y)(Fx= —Fy)]
.*.(3 y)(Fy=-Fy)
from being allowed by UI, because r (lif) does not occur free in Or
(\3y)(Fy ~ —Fy)’) at all places that /lx (V) occurs free in <1>/x
(‘(3y)(Fx= —Fy)'). Hence ‘(3y)(Fy = —Fy)' is not a legitimate Or for use in
applying UI where (/x)(0/x) is \x)\(3y)(Fx = ~~Fy)]\ It should be obvious that
the inference is invalid because it fails for a model or possible universe
containing some things that are F and some things that are not l \ which would
make the premiss true, whereas the conclusion, being self-contradictory, is false
for any model or possible universe.
Fa Fa Fa v Gb Fa D Gb
.’.(3x)Fx9 .‘.(3y)Fy' .\(3x)(Fx v Gb) ’ .*. (3y)(Fa D Gy) ’
Or
V. (3m)(<M
Here both premisses and conclusions are propositions. Also valid are such
inferences involving propositional functions as
Fx Fay Gy Fx D Gy Fx-Gx
(3y)Fy ’ (3.r)(Fa v Gx) ’ (3y)(Fx D Gy) ’ (3y)(Fy-Gx)
which are of the same pattern as the preceding, except that here r is a variable
instead of a constant.
Sec. 4.5] Quantification Rules 95
The general convention governing <l>/i and <l>r serves to prevent such an
erroneous inference as
Fx= —Ftj
.‘.(3 x)(Fx= -Fx)
from being allowed by EG, because v (‘tj) does not occur free in (I)/'
(‘Fx == ~~Fif) at all places that fi (V) occurs free in <l>/i (*Fx EE —Fx). Hence
‘Fx EE -Ft/ is not a legitimate <l>r from which ‘(3r)( Fx EE —Fx)' can be
obtained as (3/x)(<I>/li) by EG. It should be obvious that the inference is invalid.
Even though the premiss has substitution instances that are true, the conclusion
is self-contradictory, hence false.
With both UI and EG available, we may illustrate their use in proving the
validity of the argument
VFX*/1)
1. (M)(‘V)
2. 1, UI
3. (3/i2. EG
96 Quantification Theory [Ch. 4
(E) Dp) = D p]
where v occurs free in <!>*> at all and only those places that jut occurs.free in <l>/i,
and where p contains no free occurrence of the variable v. In case p is a true
proposition, or for any true substitution instance of p, if p is a propositional
function, both sides of the equivalence must be true. This is so because if p is
true, then D p is true for every value of v, whence (r)(4>^ D p) is true. The
truth of (3/x)(4>/i) D p also follows immediately from the truth of p. In case p
is a false proposition, or for any false substitution instance of p if p is a
propositional function, and (p)(<\>v D p) is true, every substitution instance of
$iO p is true and so every substitution instance of <1>p must be false, whence
(3/i)(0/i) must be false and so (3/x)(4>/i) D p is true. In case p is a false proposi¬
tion, or for any false substitution instance of p if p is a propositional function,
and (3/i)(<I>/i) D p is true, (3/i)(4>/i) must be false. Thus every substitution
instance of Or must be false, whence every substitution instance of Or D p
must be true. Therefore, (r)(Or D p) is true also. This argument establishes the
truth of every equivalence of the form (E) because if p is true, both sides are
true. If p is false, each side implies the other.
Turning now to Existential Instantiation, we want to permit going from
(3 x)Fx to Fx or Fy only under very stringent restrictions. Not only must the
instantiating variable not have any prior free occurrence (as discussed on pages
74-75), but we must not permit (x)Fx to be inferred from (3x)Fx by way of
Existential Instantiation and Universal Generalization. There are many ways of
imposing such restrictions. One method is to formulate the rule of Existential
Instantiation in such a way that the formula or line finally inferred by its means
contains no free variables that are introduced by it. The feasibility of this
procedure can be seen by the following considerations.
Here, as in earlier sections, we are concerned with constructing proofs of
validity only for arguments whose premisses and conclusions are propositions.
We are not concerned with propositional functions that contain free variables.
Hence we never end a proof with a propositional function that contains a free
variable. Therefore, in any proof of validity in which the rule of Existential
Instantiation involves going from (3/i)(0/i) to <J>y, the propositional function Or
serves only to permit the subsequent inference of a formula that does not
contain any free occurrence of the variable v.
Suppose we already have (3jti)(0/z) as a line in a proof. Suppose further that
in the presence of other lines already obtained, if we also had Or, we would be
able to derive a desired formula p that contains no free occurrence of the
variable r. We can proceed by writing down Or as an assumption of limited
scope. Then, after p has been derived, we can close the scope of the assumption
and infer the formula Or Dp by the strengthened Rule of Conditional Proof.
Sec. 4.5] Quantification Rules 97
From this line (subject to reasonable restrictions), the formula (r)(4>i' D p) can
be inferred by Universal Generalization. And from the latter formula, by
equivalence (E), we can infer (3p)(<f>p) D p. Now, from this formula and the
earlier line (3/x)(4>p), we can obtain p by Modus Ponens. This whole process can
be represented schematically as
i. (3/i)(<t>m)
—> j- $*1’ (where v is a variable and has no free
occurrence in any preceding line)
El: (3]*)(<V)
i—* <!>/'
provided that v is a variable and does not occur free either in p or in any line
preceding
1. (x)(Fx D Gx)
2. (3y)Fy A‘.(3 z)Gz
3. Fu
4. Fu D Gu 1, UI
5. Gu 4, 3, M.P.
6. (3z)Gz 5, EG
7. (3z)Gz 2, 3-6, El
There can be, though there need not be, other lines intervening between the
premiss (3ja)(4>p) (line 2) and the propositional function <\>v (line 3) which is
98 Quantification Theory [Ch. 4
from being allowed by El. This is so because r ('y') does not occur free in Or
(‘Fx'Gy ) at all places that ja (V) occurs free in Oju, (‘Fx*Gx'). Hence *FxmGy' is
not a legitimate Or for use in applying El, where (3ja)(Oja) is *(3x)(Fx mGx)\ It
should be obvious that the argument is invalid. That something is both an F and
a G clearly does not entail that everything is an F
The restriction that r does not occur free in any line preceding Or, ensures
that r does not occur free in (3ja)(Oja). Therefore, if r is different from ja, v does
not occur free in Oja either. The general convention already assures us that
there are no free occurrences of ja in Or. Thus the present restriction on El,
together with the general convention, entails the restriction associated with the
logical equivalence (E), namely, that r occurs free in Or at all and only those
places that p occurs free in Oja.
The restriction that r does not occur free in p serves to prevent any
subsequent and erroneous use of Universal Generalization to derive (ja)(Oja) as
conclusion, from (3ja)(Oja), as premiss. Without the restriction, such an error
would be possible because if p were allowed to contain a free occurrence of
r, it could be Or itself, from which UG could be used to derive the conclusion
The restriction that r does notaccur free in any line preceding Or serves to
prevent such an erroneous “proof” as
from being allowed by El. This js so because v (V) occurs free in line 2, which
precedes Or (*Fx = ~~Fx’) in line 3. Hence ‘Fx = ~~Fx* is not a legitimate Or
for the given Oja for use in applying El, where (3ja)(Oja) is ‘(3y)(Fx = ~*~Fy)\ It
Sec. 4.5] Quantification Rules 99
should be obvious that the argument is invalid, since it fails for a model
containing some things that are F and some things that are not F, which would
make the premiss true, whereas the conclusion, being self-contradictory, is false
for every model.
<l>r
UG:
provided that v is a variable and does not occur free cither in (ja)(0/i) or in any
assumption within whose scope lies.
The general convention governing <l>/t and <!>/> serves to prevent such an
erroneous “proof” as
from being allowed by UG, because i» (‘i/’) does not occur free in <1>y
(‘Fa D —Fy') at all places that g (‘a') occurs free in <I>/i (‘Fa D —Fa’). Hence
line 3, ‘Fa D — Fy\ is not a legitimate <l>r, for the given *!>/*, from which line 4,
‘(a)(Fa D —Fa)\ can be obtained as by UG. It should be obvious that
the argument is invalid, for its premiss is true if there is at least one :hing that
is not an F, whereas its conclusion states that there are no F’s at all.
The restriction that v does not occur free in (g)(<I>/i) serves to prevent such an
erroneous “proof” as
from being allowed by UG, because v (‘a’) does occur free in (/a)(<1>/x) at line 3,
(\y)(Fx= Fy)'). Ilence line 2, *Fx=Fx\ is not a legitimate for the given
(hja, from which ‘(y)(Fx EE Fy) can be derived as ([i)(<P/i) by UG. It should be
obvious that the argument is invalid, for its premiss states only that any thing is
an F if and only if it is an F, whereas its conclusion slates that of any things x
and «/, a is an F if and only if y is an F also.
100 Quantification Theory [Ch. 4
The restriction that v does not occur free in any assumption within whose
scope Or lies, serves to prevent such an erroneous “proof” as
(3x)Fx /.\(x)Fx
Fy
(x )Fx 2, UG (wrong)
(x)Fx 1, 2-3, El
from being allowed by UG, because r (*y') occurs free in the assumption lF\j
within whose scope the premiss Or (*Fy) lies. Hence, in this case, ‘Fi/’ is not a
legitimate Or for the given O/x, from which \x)Fx* can be derived as (ju,)(0]u,) by
UG. The argument is, of course, obviously invalid.
Like their preliminary versions, these final versions of our four quantification
rules are similar to the first nine Rules of Inference in that they can be applied
only to whole lines in a proof.
EXERCISES
Identify and explain all of the mistakes in the following erroneous “proofs”:
1. r—* 1. Fx
2- (tj)Fij 1, UG
3. Fx D (ij)Fij 1-2, C.P.
4. (x)[Fx D (y)Fy] 3, UG
2. 1. (3x)(Fx-Gx) /.\(3x)Fx
|—* 2. Fx • Cx
| 3. Fx 2, Simp.
4. Fx 1, 2-3, El
5. (3x)Fx 4, EG
*3. 1. (x)(3y)(Fx=Gy) /.*. (3y)(x)(Fx= Gy)
2. (3y)(Fx = Gy) 1, UI
3. Fx ~ Gy
IN -
III
3, UG
X
5. (3y)(x)(Fx=Gy) 4, EG
jar*
'x
2? 3-5, El
vD
III
a Fx v Cx 1, 2-5, El
7. ('/)(/•>/ V Cx) 6, UG
8. (*)(?/)(% v Gx) 7, UG
*9. 1. (3 x)(Fx*Gx)
2. (3 x)(—Fx*Cx) /.\(3.t )(Fx-~ Fx)
Fx'Gy
r 4- Fx
5. Fx
3, Simp.
1, 3-4, El
f * —Fx*Gx
Ll -Fx
8. -Fx
6, Simp.
2, 6-7, El
9. Fx * - Fx 5, 8, Conj.
10. (3x)(Fx*—Fx) 9, EG
10. 1. (x)[(Fx D Cx)* ~~Ga]
r» 2. (x)[(Fx D Cx)* ~~Cy]
3. (Fx D Gz)—Gy 2, UI
4- (y)l(Fy 3 Gy) -.~G«/] 3, UG
5. (Fu D Cm)*—Cm 4, UI
6. Fu D Gu 5, Simp.
7. —Gm*(Fm D Cm) 5, Com.
8. -Cm 7, Simp.
9. -Fm 6, 8, M.T.
10. (x)—Fx 9, UG
If all drugs are contaminated, then all negligent technicians are scoun¬
drels. If there are any drugs that are contaminated, then all drugs are
contaminated and unsafe. All germicides are drugs. Only the negligent
are absent-minded. Therefore, if any technician is absent-minded, then
if some germicides are contaminated, then he is a scoundrel.
EXERCISES
II. (Construct a formal proof of validity for each of the following arguments. In each
case use the suggested notation and make the symbolic formulas parallel the
English as closely as possible:
conservatives. If some conservative does not like poetry, then no poets are
conservatives. Therefore, if there is a wealthy man who does not like poetry,
then no businessmen are poets. (Rx: x is a businessman. Px: x is a poet. Wx: x
is a wealthy man. Cx: x is a conservative. Lx: x likes poetry.)
*8. All radioactive substances either have a very short life or have medical value.
No uranium isotope that is radioactive has a very short life. Therefore, if all
uranium isotopes are radioactive, then all uranium isotopes have medical
value. (Rx: xis radioactive. Sx: xhas very short life. Mx: x has medical value.
\ Ux: x is a uranium isotope.)
sane witness would lie if his lying would implicate him in a crime.
Therefore, if any witness implicated himself in a crime, thpn if all witnesses
were sane, that witness did not lie. (Sx: x is sane. Wx: x is a witness. Lx: x
lies, lx: x implicates himself in a crime.)
.
10 If any jewelry is missing, then if all the servants are honest, it will be
returned. If any servant is honest, they all are. So if any jewelry is missing,
then if at least one servant is honest, it will be returned. (Jx: x is jewelry. Mx:
x is missing. Sx: x is a servant. Hx: x is honest. Rx: x will be returned.)
.
11 If there are any liberals, then all philosophers are liberals. If there are any
humanitarians, then all liberals are humanitarians. So if there are any
humanitarians who are liberals, then all philosophers are humanitarians. (Lx:
x is a liberal. Px: x is a philosopher. Hx: x is a humanitarian.)
*12. If something is lost, then if everyone values his possessions, it will be missed.
If anyone values his possessions, so does everyone. Therefore, if something is
lost, then if someone values his possessions, then something will be missed.
(Lx: x is lost. Px: x is a person. Vx: x values his possessions. Mx: x is missed.)
In Section 4.3 a method was presented for proving the invalidity of invalid
arguments that involve quantifiers. There, we were dealing only with singly
general propositions. That method, however, is easily adapted to deal with
arguments that contain multiply general propositions.
Here, as before, we assume that there is at least one individual. That
assumption is satisfied by models containing one, two, or any larger number of
objects. For each such model, any given proposition containing quantifiers is
logically equivalent to a truth-functional compound of singular propositions.
Hence for each such model, any given argument whose propositions contain
quantifiers is logically equivalent to some argument that contains only singular
propositions and truth-functional compounds of singular propositions. An
argument involving quantifiers is valid if and only if, for every model, it is
logically equivalent to a valid truth-functional argument. Hence a given
argument that involves quantifiers is proved invalid by describing a model for
which the given argument is logically equivalent to an invalid truth-functional
argument. We do this by translating the given argument involving quantifiers
into a logically equivalent argument that contaias only singular propositions
and their truth-functional compounds. Then, we use the method of assigning
106 Quantification Theory [Ch. 4
(x)($x) = (0a'0f7*0c-...*0fc)
and
(3x)(0x) = (Oa v 0fo v 0c v ... v 0k)
(x)Fx D (3jf)Gy
Fa D Ga
(x)(3y)(Fx D Gy)
Fa D Ga,
Pa 3 Qa
Ra 3 Qa
Pa 3 Ra
which is proved invalid by assigning the truth value T to 7V and \)a’ and the
truth value F to 'Ra9.
As in the c:ase of arguments containing singly-general propositions, a given
argument may be valid for a one-element model but invalid for a two-element
model. For example,
(3x)(y'(Fx 3 Gy)
(ij){3z)(Gy D Hz)
W(3z)(FxDllz)
is logically ecjuivalent for the one-element model that contains just a to the
valid truth-functional argument
Fa D Ga
Ga D Ha
Fa Z) Ha
but, for the two-element model that contains just a and b, it is logically
equivalent to the truth-functional argument
which is proved invalid by assigning the truth value T to 'Fa9 and the truth
value F to all the others, that is, to 'Fb\ 'Ga\ 'Gb\ ‘Ha\ and lib9.
108 Quantification Theory [Ch. 4
EXERCISES
In Chapter 2 truth tables were used, not only to establish the validity of
arguments, but also to certify the logical truth of propositions (tautologies such
as ‘A v ~A’). The notion of a logically true proposition is, therefore, familiar.
As we have seen, not every valid argument can be established by the method of
truth tables. Some of them are proved valid by using quantification rules.
Similarly, not every logically true proposition can be certified by the method of
truth tables. Some of them are demonstrated by using quantification rules.
The method used in demonstrating the logical truth of tautologies was set
forth in Chapter 3. A demonstration of the logical truth of the tautology
‘A D (A v B)’ can be set down as
r-> 1. A
1 2. AvB 1, AddT
3. A D (A v B) 1-2, C.P.
-*1. (x)Fx
2. Fy 1..UI
3. (3x)Fx 2, EG'
4. (x)Fx D (3x)Fx 1-3, C.P.
Sec. 4.7] Logical Truths Involving Quantifiers 109
1. (3 a:)— Fx
— 2. —Ft/
3. (x)Fx
I_4. Fy_ 3, III
5. (x)Fx D Fy 3 -4, C.P.
6. —(a*)Fv 5, 2, M.T.
7. — (x)Fx 1, 2- 6, El
8. (3a)-Fa* D -(a)Fv 1-7, C.P.
9. (a:) Ft D —(3jc)-Ft 8, Trans., D.N.
— * 1. —(3 a)—Ft
-*2. — Fy
3. (3a)-Fa 2, EG
4. -Ft/ D (3 a*)—Ft 2-3, C.P.
5. Fy 4, 1, M.T., D.N.
6. (a) Ft 5, UG
7. —(3a:)—Fa* D (a)Fa* 1-6, C.P.
are all logically equivalent to each other and to the form QN, and that they
are, therefore, logically true.
Some fairly obvious logical truths are simply stated and are easily demon¬
strated with our present symbolic apparatus. A logically true biconditional for
any propositional functions TV and ‘Gx’ is
[(jr)F;f(*)Gx] = (x)(Fx9Gx)
which asserts that everything has the attribute F and everything has the
attribute G if, and only if, everything has both the attributes F and G. The
demonstrations of the two implications involved may be written side by side:
* 1.
2.
{x)Fx'(x)Gx
\x)Fx I, Simp.
rL 2.
(x)(Fx-Gx)
Fy-Gy 1, UI
3. (x)Gx 1, Simp. 3. Fy 2, Simp.
4. Fy 2, UI 4. Gy 2, Simp.
5. Gy 3, UI 5. (x)Fx 3, UG
6. FyC.y 4, 5, Conj. 6. (x)Gx 4, UG
7. (x)(Fx-Gx) 6, UG 7. (x )Fx*(x)Gx 5, 6, Conj.
8. [(jc)Fr'(jc)Gar] D (x)(FxmGx) 8. (x)(Fx*Gx) I) l(x)Fx'(x)Gx]
1-7, C.P. 1-7, C.P.
—*1. (x)Fx\(x)Gx
r 2. (x)Fx
3. Fy 2, UI
4. Fy v Gy 3, Add.
5. (x)(Fx v Gx) 4, UG
6. (x)Fx D (x)(Fx v Gx) 2-5, C.P.
7. (x)Gx
8. Gy 7, UI
9. Fy v Gy 8, Add.
Sec. 4.7] Logical Truths Involving Quantifiers 111
| 10. (x)(FxvGx)_ 9, UG
11. (x)Gx D (xXFxv Gx) 7-10, C.P.
12. [(x)Fx D (x)(Fx v Gx)] • [(x)Gx D (x)(Fx v Gx)] 6, 11, Conj,
13. (x)(Fx v Gx) v (xXFx v Gx) 12, 1. C.D.
14. (x)(FxvGx) 13, Taut.
15. [(x)Fx v (x)Gx] D (x)(Fx v G.v) 1-14, C.P.
(3x)(Fx-Gx) D [(3a*)Et•(3xK»jc]
variable V, and let ‘Fx' be any propositional function containing at least one
free occurrence of the variable V. Our first logical equivalence here is between
the universal quantification of ‘Fx •£)’ and the conjunction of the universal
quantification of ‘Fx’ with *Q\ which is more briefly expressed as
(x)(Fx-Q)=[(x)Fx-Q]
1. (x)(Fx-Q) -* 1. (x)Fx-Q
2. Fx-Q 1, UI 2. (x)Fx 1, Simp.
3. Fx 2, Simp. 3. Fx 2, UI
4. (x)Fx 3, UG 4. p 1, Simp.
5. (J 2, Simp. 5. Fx'Q 3, 4, Conj.
6. (x)Fx-Q 4, 5, Conj 6. (x)(Fx-Q) 5, UG
T"(x)(Fx-Q) 5 [(x)Fx-Q] 7. [(x)Fx-(J] D (x)(Fx-Q)
1-6, C.P. 1-6, C.P.
(x)(FxDQ)=[(lx)FtDQ]
1- («/)[(*)Fx D Fy]
2- (y)[Fy D (3x)Fx]
3. (3y)[Fy D (x)Fx]
4. (3y)[(3x)Fx D Fy]
The first of these corresponds to UI, and says that given any individual i/, if
everything is an F, then y is an F. Its demonstration is almost trivially obvious,
proceeding:
r* 1. (x)Fx
2. Fz 1, UI
3. (x)Fx D Fz 1-2, C.P.
4. (y)[(x)Fx D Fy] 3, UG
114 Quantification Theory [Ch. 4
The second corresponds to EG, and says that given any thing t/, if y is an F,
then something is an F. The third and fourth correspond to UG and El,
respectively. They are not so immediately obvious, »biit are logically true and
quite easily demonstrated. An intuitive explanation can be given by reference
to the ancient Athenian general and statesman Aristides, who was often called
‘the just*. So outstanding was Aristides for his rectitude that the Athenians had
a saying:
With respect to any attribute, there is always some individual i/, such that if
anything has that attribute, y has it. That is what is asserted by the fourth
proposition listed above, which corresponds to El. The matter can be put
another way. If we turn our attention not to the attribute of being just, but to
its reverse, the attribute of being corruptible, then the sense of the first
Athenian saying is also expressible as
Again generalizing, we may observe that, with respect to any attribute, there
is always some individual y, such that if y has that attribute, everything has it.
That is what is asserted by the third proposition listed above, which corre¬
sponds to UG. Its demonstration proceeds:
p. 1. ~(3y)[Fi, D (x)Fx]
2. (y)~[% ^ (x)Fx] 1, QN
3. ~lFtJ => (x)Fx] 2, UI
4. F\1—(x)Fx 3, Impl.
5. Fy 4, Simp.
6. (x)Fx 5, UG
7. (x)Fx v (3y)[Fy D (x)Fx] 6, Add.
8. -(x)Fx 4, Simp.
9. (3y)[Fy d (x)Fx] 7, 8, D.S.
10. ~(3{/)[/'V => (x)Fx] D (3;t)[Fy D (x)Fx] 1-9, C.P.
11. (ly)[Ftj D (x)Fx] v (3y)[Fy 3 (x)Fx\ 10, Impl., D.N.
12. (3y)[Fy 3 (x)Fxj 11, Taut.
Although we shall not prove it until Chapter 10, the methods of proof so far
assembled (techniques for ‘Natural Deduction’, as they are sometimes called)
permit the demonstration of all logically true propositions that are constructed
out of truth-functional connectives and the quantification of individual varia¬
bles. It will also be proved that only propositions that are logically true can be
demonstrated by these techniques.
Sec. 4.7] Logical Truths Involving Quantifiers IIS
EXERCISES
Construct demonstrations for each of the following. (Each occurrence of *{)’ repre¬
sents either a proposition or a propositional function containing no free occurrence of
the variable V.)
Some propositions containing two or more proper names (of individuals) are
rightly understood to be truth-functional compounds of singular propositions
that have different subject terms. For example, the proposition
But for some other propositions that have the same verbal pattern that analysis
is wholly unsatisfactory. Thus the proposition
On the contrary, dividing the proposition in this way destroys its significance
because its meaning is not thatT)oth Lincoln and Grant were (or had) ac¬
quaintances, but rather that they were acquainted with each other. The given
proposition does not assert that Lincoln and Grant both had a certain attribute,
but that they stood in a certain relation. Lincoln is not said simply to be
acquainted (whatever that might mean), but is said to be acquainted with
Grant. Other propositions that express relations between two individuals are
116
Sec. 5.1] Symbolizing Relations 117
Relations such as these, that can hold between two individuals, are called
‘binary* or ‘dyadic relations. Other relations may relate three or more indi¬
viduals. For example, the propositions
The latter is a valid inference that, as De Morgan observed, all the logic of
Aristotle will not permit one to draw. Us validation by our apparatus of
quantifiers and propositional functions, will be set forth in the next section.
Before discussing proofs of validity for relational arguments (and they will
require no methods of proof beyond those developed in the preceding chapter),
the problem of symbolizing relational propositions must be dealt with. Just as
a single predicate symbol can occur in different propositions, so a single
relation symbol can occur in different propositions. Just as we have the
predicate ‘human* common to the propositions:
Aristotle is human.
Plato is human.
Socrates is human.
118 The Logic of Relations [Ch. 5
Gab
Obc
. ’. Oac
Our second argument is not much more difficult, since none of its propositions
contains more than a single quantifier. Using the individual constants *d\
and *t ’ to denote Helen, David, and Tom, respectively, ‘Gx’ to abbreviate ‘x is
a good-looking man’, and the symbol ‘Lxy’ to abbreviate ‘x likes y\ the
argument can be symbolized as
1. Lhd
2. (x)(Lxd D Lxt)
3. (x)(Lhx D Gx)
Gt
A proof of its validity is so easily constructed that it may well be set down now.
Referring back to the numbered premisses, the proof proceeds:
Sec. 5.1] Symbolizing Relations 119
4. Lhd D Lht 2, UI
5. Lht 4, 1, M.P.
6. Lht D G* 3, UI
7. Gt 6, 5, M.P.
a attracts everything’
as '(x)Aax\
everything is attracted by a'
a attracts something’
as \3x)Aax*.
something is attracted by a
everything attracts a
as \x)Axa\
‘a is attracted by everything’
something attracts a’
as '{3x)Axa\
a is attracted by something’
1. (x)(y)Axy
2. (y)(x)Axy
3. (3x)(3y)Axy
120 The Logic of Relations [Ch. 5
4. (3y)(3 x)Axy
5- (x)(y)~Axy
6- (y)(*)~Axy
and then symbolizing the expression ‘x attracts something' the same way in
which we symbolized ‘a attracts something’. This gives us the formula
7. (x)(3ij)Axy
and then symbolizing the expression ‘i/ is attracted by everything’ the same
way that we symbolized 'a is attracted by everything. This gives us the
formula
8. (3 y)(x)Axy
1. (3 y)(x)Axy
—> 2. (jc)A.rt>
3. Atw 2, UI
4. (3y)Auy 3, EG
5. (3y)Auy 1, 2-4, El
6. (.r)(3y)Axy 5, UG
The inference is valid only one way, however. Any attempt to derive formula
8 from 7 must inevitably run afoul of one of the restrictions on UG. The
argument
(*)(3il)Axy
. (ly)(x)Axy
is easily proved invalid by the method of Section 4.6. For a model containing
just the two individuals a and b, the given argument is logically equivalent to
the truth-functional argument
which is proved invalid by assigning the truth value T to ‘Aad and ‘Abb\ and
the truth value F to 'Aab' and ‘Aba .
A similar pair of inequivalent propositions may be written as
These are clearly inequivalent when the ‘something’ in 9, coming at the end, is
understood as ‘something or other’, and the ‘something’ in 10, coming at the
beginning, is understood as ‘there is at least one thing’. They are symbolized as
9. (y)(3x)Axy
10. (3x)(y)Axy
Whoever visited the building was observed. Anyone who had observed
Andrews would have remembered him. Nobody remembered An¬
drews. Therefore, Andrews didn’t visit the building.
The first proposition of this argument contains two relations, one explicit, the
other implicit. Explicitly, we have the relation of someone visiting the building.
It is explicit because mention is made of both the visitor and what was visited
by him. Implicitly, we have the relation of someone observing someone, which
is implicit because no mention is made of the someone who does the observ¬
ing—the omission being marked by the use of the passive voice. However,
because the only other occurrence of ‘x visited the building’ is also as a unit, in
the conclusion, it need not be treated as a relation at all, but may be symbolized
as a simple predicate. On the other hand, ‘x observed y\ despite its merely
Sec. 5.1] Symbolizing Relations 123
1. (x)[Vx D (ly)Oyx\
2. (x)[Oxa D Rxa]
3. (x)—Rxa /.' . ~v«
4. Oza D Rza 2, UI
5. —Rza 3, UI
6. Oza 4, 5, M.T.
7. (y)~~Oya 6, UG
8. 7, QN
9. Va D (3i/)Oya 1, UI
10. — Va 9, 8, M.T.
Our proof of validity for this argument would not have been helped at all by
symbolizing ‘Andrews visited the building’ as a substitution instance of the
relational ‘x visited if rather than of the simpler ‘ Vx\ But our proof of validity
depended upon our symbolizing ‘was observed’ explicitly as a relation.
Most of our previous examples were illustrations of unlimited generality, in
which it was asserted that everything stood in such-and-such a relation, or
something did, or nothing did. A great many relational propositions are not so
sweeping. Most assertions are more modest, claiming not that everything stands
in such-and-such a relation, but that everything does if it satisfies certain
conditions or restrictions. Thus we may say either that
or that
The second, of course, is the more modest assertion, being less general than the
first. While the first is adequately symbolized, where \ix abbreviates ‘x is a
magnet’, as
(x)(y)[My i Ai/.xj
Perhaps the best way to translate relational propositions into our logical
symbolism is by the kind of stepwise process that has already been illustrated.
Let us use it again, this time for propositions of limited generality. First let us
consider the proposition
Now, using the obvious abbreviations, ‘Gx\ ‘Px\ and 'Bxy for ‘x is a good
amateur’, ‘x is a professional’, and ‘x can beat y\ the given proposition is
symbolized by the formula
(x)[CJx D (ly)(Py-Bxy)]
first as
then as
{lx)[Px-(y)(Ay D Bxy)]
The same method is applicable in more complex cases, where more than one
relation is involved. We symbolize the proposition
first by paraphrasing it as
and then as
x disappoints somebody
With practice, of course, not all such intermediate steps need be written out
explicitly.
126 The Logic of Relations [Ch. 5
(x)[(Gx*Mx) D (3y)Fxy]
(x)l(Bx'Mx) D — (3y)L'xy]
(3x){ | Mx- —(3jf)(Wy-Uxy)]-(3 z)Fxz)
However, some uses of these words are definitely temporal, and when they are,
they can be symbolized using the logical machinery already available, as can
other temporal words like ‘while’, ‘when’, ‘whenever’, and the like. An example
or two should serve to make this clear. Thus the proposition
asserts that all times when Dick and Joan are separated are times when Dick
writes Joan. This can be symbolized using ‘Tv’ for ‘jc is a time’, ‘ Wxijz for ‘x
writes y at (time) z\ and ‘Sjxyz for ‘x and y are separated at (time) z\ as
Perhaps the most vivid illustration of the adaptability of the present notation is
in symbolizing the following remark, usually attributed to Lincoln:
You can fool sorpe of the people all of the time, and all of the people
some of the time, but you cannot fool all of the people all of the time.
The first conjunct: ‘You can fool some of the people all of the time’ is ambigu¬
ous. It may be taken to mean either that there is at least one person who can
always befooled or that, for any time, there is at least one person (or other) who
can befooled at that time. Adopting, the first interpretation and using Tx’ for ‘x
Sec. 5.1] Symbolizing Relations 127
is a person, Tx for x is a time , and 'Fxy* for ‘you can fool .v at (or during) y\
the above may be symbolized as
EXERCISES
I. Using the following ‘vocabulary’, translate the given formulas into idiomatic
English sentences:
Formulav
*1. (x)[Dx 3 (3t/)(Yy-Byx)]
2. (x)[(3ij)(Py-Fxy) 3 (z)(Pz 3 1-xz)]
3. (x)[(Rx-Sx) 3 (y)(My 3 -Cxy)]
4. (*)[(?*-Axx) 3 (Anx)]
128 The Logic of Relations [Ch. 5
II. Symbolize the following sentences, in each case using the indicated symbols:
*1. Dead men tell no tales. (Dx: x is dead. Mx: x is a man. Tx: x is a tale. Txy: x
tells y.)
2. A lawyer who pleads his own case has a fool for a client. (Lx: x is a lawyer.
Fx: x is a fool. Fxy: x pleads y's case. Cxy: x is a client of y.)
3. A dead lion is more dangerous than a live dog. (Lx: x is a lion. A.r: x is alive.
Dx: x is a dog. Dxy: x is more dangerous than y.)
4. Uneasy lies the head that wears a crown. (Ux: x lies uneasy. IIx: x is a head.
Cx: x is a crown. Wxy: x wears y.)
*5. If a boy tells only lies, none of them will be believed. (Bx: x is a boy. Lx: x
is a lie. Txy: x tells y. Bxy: x believes y.)
6. Anyone who consults a psychiatrist ought to have his head examined. (Px: x
is a personvSx: x is a psychiatrist. Ox: x ought to have his head examined.
Cxy: x consults y.)
7. No one ever learns anything unless he teaches it to himself. (Px: x is a person.
Lxy : x learns y. Txyz: x teaches y to z.)
8. Delilah wore a ring on every finger and had a finger in every pie. (cl: Delilah.
Rx: x is a ring. Fxy : x is a finger of y. Oxy : x is on y. Px: x is a pie. Ixy: x
is in y.)
9. Any man who hates children and dogs cannot be all bad. (Mx: x is a man. Cx:
x is a child. Djc: x is a dog. Bx: x is all bad. Hxy: x hates y.)
*10v Anyone who accomplishes anything will be envied by everyone. (Px: x is a
person. Axy: x accomplishes y. Exy: x envies y.)
fl. To catch a fish, one must ha'vp some bait. (Px: x is a person. Fx: x is a fish. Bx:
x is bait. Cxy: x catches y. Hxy: x has y.)
Sec. 5.1] Symbolizing Relations 129
12. Every student does some problems, but no student does all of them. (Sx: x
is a student. Px: x is a problem. Dxy: x does y.)
13. Any contestant who answers all the questions put to him will win any prize
he chooses. (Cx: x is a contestant. ()x: x is a question. Px: x is a prize. Ax{/: x
answers y. Pxy: x is put to y. Wxy: x wins y. Cxy: x chooses y.)
14. Every son has a father, but not every father has a son. (Px: x is a person. Six:
x is male. Pxy: x is a parent of y.)
*15. A person is maintaining a nuisance if he has a dog that barks at everyone
who visits its owner. (Px: x is a person. Nx: x is a nuisance. Mxy : x maintains
tj. Dx: a is a dog. Bxy: x barks at tj. Vxy: x visits y. Hxy: x has y.)
16. A doctor has no scruples who treats a patient who has no ailment. (Dx: x is
a doctor. Sx: x is a scruple. Ilxy: x has y. Px: x is a patient. Ax: x is an
ailment. Txy: x treats y.)
17. A doctor who treats a person who has every ailment has a job for which no
one would envy him. (Dx: x is a doctor. Px: x is a person. Txy: x treats y. Ax:
x is an ailment. Hxy: x has y. Jx: x is a job. Fxyz: x envies y his z.)
18. If a farmer keeps only hens, none of them will lay eggs that are worth
setting. (Fx: x is a farmer. Kxy: x keeps //. IIx: x is a hen. Fx: x is an egg. Lxy :
x lays y. Wx: x is worth setting.)
In symbolizing the following, use only the abbreviations: Px: x is a person. Sx:
x is a store, Bxyz: x buys y from
In symbolizing the following, use only the abbreviations: Cx: x is a charity; Mx:
x is money; Px: x is a person; Bxy: v belongs to i/; Dxyz: x donates »/ to z.
*1. Whoso sheddeth man’s blood, by man shall his blood be shed. (Genesis 9:6)
2. His hand will be against every man, and every man’s hand against him.
(Genesis 16:12)
3. The sun shall not smite thee by day, nor the moon by night. (Psalm 121:6)
4. A wise son maketh a glad father. (Proverbs 10:1)
*5. He that spareth his rod hateth his son. (Proverbs 13:24)
6. The borrower is servant to the lender. (Proverbs 22:7)
7. Whoso diggeth a pit shall fall therein: and he that rolleth a stone, it will
return upon him. (Proverbs 26:27)
8. The fathers have eaten sour grapes, and the children’s teeth are set on edge.
(Ezekiel 18:2)
9. The foxes have holes, and the birds of the air have nests; but the Son of man
hath not where to lay his head. (Matthew 8:20)
10. ... the good that I would, I do not; but the evil which 1 would not, that l
do. (Romans 7:19)
(3 x)Fx
(x)Fx [— Fx Fy
7~F^' :() ’ :.(y)Fy' (3w)Fw
■■p
But our statement of the quantification rules does not require that /x and v be
different variables; they may well be the same. On the whole, it is simpler
Sec. 5.ZJ Arguments Involving Relations 131
(3jc )Fx
(x)Fx - Fx Fx Fij
■'■Fx' I p ’ .■.(x)Fx’ .\&y)Fy
■ P
Its symbolic translation and proof, using 'Mx' and 'Dxif to abbreviate \v is a
man’ and ‘jc despises // may be written as follows:
are animals* will be symbolized as \x)(Ex D Ax)’, where ‘Ex’ and ‘Ax’ abbre¬
viate *x is a horse’ and ‘x is an animal*, respectively. In its conclusion
the word ‘the’ has the same sense that it does in such propositions as The whale
is a mammal* or The burnt child dreads the fire*. We may paraphrase it,
therefore, first as
then as
and finally, writing ‘Hxy’ for *x is a head of y\ we may express the conclusion
by the formula
(x)[(3y)(Ey-Hxy) D (3y)(AyHxy)]
The first specimen argument presented in this chapter, which dealt with the
relation of being older than, raises a new problem, which will be discussed in
the following section.
EXERCISES
II. Construct a formal proof of validity for each of the following arguments:
1. Whoever supports Iekes will vote for Jones. Anderson will vote for no one hut
a friend of Harris. No friend of Kelly has Jones for a friend. Therefore, if Harris
is a friend of Kelly, Anderson will not support Iekes. (Sxy: x supports y. Vxy: x
votes for y. Fxy: x is a friend of y. a: Anderson i. Iekes. /: Jones, h: Harris, k:
Kelly.)
2. All circles are figures. Therefore, all who draw circles draw figures. (Ox: x is a
circle. Fx: x is a figure. Dxy: x draws y.)
3. Any friend of A1 is a friend of Bill. Therefore, anyone who knows a friend of
Al knows a friend of Bill. (Px: x is a person. Fxy: x is a friend of y. Kxy: x knows
y. a: Al. b: Bill.)
*4. Only a fool would lie about one of Bill’s fraternity brothers to him. A classmate
of Bill’s lied about Al to him. Therefore, if none of Bill’s classmates are fools,
then Al is not a fraternity brother of Bill. (Fx: x is a fool. Lxyz: x lies about y
to z. Cxy: x is a classmate of y. Bxy: x is a fraternity brother of y. a: Al. b: Bill.)
5. Whoever belongs to the Country Club is wealthier than any member of the
Elks Lodge. Not everyone who belongs to the Country Club is wealthier than
anyone who does not belong. Therefore, not everyone belongs either to the
Country Club or the Elks Lodge. (Cx: x belongs to the Country Club. Ex: x
belongs to the Elks Lodge. Px: x is a person. Wxy: x is wealthier than y.)
*6. It is a crime to sell an unregistered gun. All the weapons that Red owns were
purchased by him from either Lefty or Moe. So if one of Red’s weapons is an
unregistered gun, then if Red never bought anything from Moe, Lefty is a
criminal. (Rx: x is registered. Gx: x is a gun. Cx: x is a criminal. Wx: x is a
weapon. Oxy: x owns y. Sxyz: x sells y to 2. r: Red. I: Lefty, m: Moe.)
7. Everything on my desk is a masterpiece. Anyone who writes a masterpiece is
a genius. Someone very obscure wrote some of the novels on my desk.
Therefore, some very obscure person is a genius. (Dx: x is on my desk. Mx: x
is a masterpiece. Px: or is a person. Gx: x is a genius. Ox: x is very obscure. Nx:
x is a novel. Wxy: x wrote tj.) •
8. There is a professor who is liked by every student who likes at least one
professor. Every student likes some professor or other. Therefore, there is a
professor who is liked by all students. (Px: x is a professor. Sx: x is a student.
Lxy: x likes y.)
*9. No one respects a person who does not respect himself. No one will hire a
person he docs not respect. Therefore, a person who respects no one will never
be hired by anybody. (Px: x is a person. Rxy: x respects y. Hxy: x hires y.)
10. Whoever donates to the United Fund gives all of his donations to the United
Fund. Therefore, if Jones is a person who does not give all of his donations to
the United Fund, then he does not donate anything at all to the United Fund.
(Px: or is a person. Dxyz: x donates y to 5. f: The United Fund. /: Jones.)
11. Any book that is approved by all critics is read by every literary person.
Anyone who reads anything will talk about it. A critic will approve any book
written by any person who flatters him. Therefore, if someone flatters every
critic, then any book he writes will be talked about by all literary persons. (Bx:
x is a book. Cx: x is a critic. Lx: x is literary. Px: x is a person. Axy: x approves y.
Rxy: x reads y. Txy: x talks about if. Fxy: x flatters y. Wxy: x writes y.)
12. A work of art that tells a story can be understood by everyone. Some religious
works of art have been created by great artists. Every religious work of art tells
an inspirational story. Therefore, if some people admire only what they cannot
understand, then some great artists’ creations will not be admired by every¬
one. (Ax: x is a great artist. Px: x is a person. Sx: x is a story, lx: x is inspira¬
tional. Rx: x is religious. Wx: x is a work of art. Cxy: x creates y. Axy : x admires
y. Txy: x tells y. Uxy: x can understand y.)
There are many interesting attributes that relations themselves may possess.
We shall consider only a few of the more familiar ones, and our discussion will
be confined to attributes of binary or dyadic relations.
Binary relations may be characterized as symmetrical, asymmetrical, or
nonsymmetrical. Various symmetrical relations are designated by the phrases:
‘is next to’, ‘is married to’, and lias the same weight as. A symmetrical relation
is a relation such that if one thing has that relation to a second, then the second
must have that relation to the first. A propositional function 'Rxy’ designates a
symmetrical relation if and only if
(x)(y)(Rxy D Ryx)
On the other hand, an asymmetrical relation is a relation such that if one thing
has that relation to a second, then the second cannot have that relation to the
first. Various asymmetrical relations afe designated by the phrases ‘is north of’,
‘is older than’, and ‘weighs more than’. A propositional function 'Rxy9 desig-
Sec. 5.3] Some Attributes of deletions 135
(x)(y)(Rxy D —Ryx)
(x)(y)(z)[(RxyRyz) D Rxz]
An intransitive relation, on the other hand, is a relation such that if one thing
has it to a second, and the second to a third, then the first cannot have it to the
third. Some intransitive relations are designated by the phrases ‘is mother of',
‘is father of’, and ‘weighs exactly two pounds more than'. A propositional
function ‘Rxif designates an intransitive relation if and only if
(x)(y){z)[(Rxtj-Rtjz) D —Rxz]
(x)Rxx
On the other hand, a relation is said to be reflexive if any thing a has that
relation to itself if there is something b such that either Rab or Rba. Obvious
examples of reflexive relations are designated by the phrases ‘has the same color
136 The Logic of Relations [Ch. 5
hair as’, ‘is the same age as\ and ‘is a contemporary of\ A propositional
function ‘Rxy’ designates a reflexive relation if and only if
(x)~~Rxx
Examples of irreflexive relations are common indeed; the phrases: ‘is north of’,
‘is married to’, and ‘is parent of’ all designate irreflexive relations. Relations
that are neither reflexive nor irreflexive are said to be nonrejlexive. The phrases
‘loves’, ‘hates’, and ‘criticizes’ designate nonreflexive relations.
Relations may have various combinations of the attributes described. The
relation of weighing more than is asymmetrical, transitive, and irreflexive,
while the relation of having the same weight as is symmetrical, transitive, and
reflexive. However, some attributes entail the presence of others. For example,
all asymmetrical relations must be irreflexive, as can easily be demonstrated.
Let ‘Rjci/’ designate any asymmetrical relation. Then, by definition:
l- {x)(y)(Rxy D —Ryx)
From this premiss we can deduce that R is irreflexive, that is, that (x)^Rxx:
2. (y)(Rxy D ^Ryx) 1, UI
3. Rxx D ~^Rxx 2, UI
4. '■'-Rxxv ~~Rxx 3, Imp).
5. ~~Rxx 4, Taut.
6. (x)~~Rxx 5, UG
Other logical connections among these attributes of relations are easily stated
and proved, but our present interest lies in another direction.
At this point, it should be remarked that the method of proving invalidity
that was presented in Chapter 4 (pages 78-81 and pages 105-107), which is
always applicable to nonrelational arguments, is, not always applicable to
relational arguments. Some sets of relational premisses, although perfectly
consistent, cannot all be true for any finite model. Thus premisses asserting that
the relation R is transitive, asymmetrical, and irreflexive, and further that
(x)(3y)Rxy, cannot all be true for any finite model. Consider any model that is
a nonempty set. Since it is nonempty, it contains at least one object, which we
shall call ‘a’. From (x)(3y)Rxy we know that (3y)Ray. By irreflexiveness of R,
~~Raa, so y cannot be a. Hence the model must contain another object, fc, such
that Rab. Now again, from (jc)(3tj)Rxy we know that (3y)Rby. By irreflexive-
Sec. 5.3] Some Attributes of Relations 137
Wtd
Wdh
(x)(y)(z)\(\VxyWyz) D Wxz\
' Wth
the method of proving its validity is immediately obvious. We said that the
argument ‘might’ be stated in the way indicated. But such a way of stating the
argument would be the exception rather than the rule. The ordinary way of
propounding such an argument would be to state only the first two premisses
and the conclusion, on the grounds that everyone knows that having the same
weight as is a transitive relation. Relational arguments are often used, and
many of them depend essentially on the transitivity, or symmetry, or one of the
other attributes of the relations involved. But that the relation in question has
the relevant attribute is seldcftn— if ever—stated explicitly as a premiss. The
reason is easy to see. In most discussions, a large body of propositions can be
presumed to be common knowlege. The majority of speakers and writers save
themselves trouble by not repeating well-known and perhaps trivially true
propositions that their hearers or readers can perfectly well be expected to
supply for themselves. An argument that is incompletely expressed, part of it
being ‘understood’, is an enthymeme.
Because it is incomplete, an enthymeme must have its suppressed premiss or
premisses taken into account when the question of its validity arises. Where a
138 The Logic of Relations [Ch. 5
necessary premiss is missing, the inference is technically invalid. But where the
unexpressed premiss is easily supplied and obviously true, in all fairness it
ought to be included as part of the argument in apy appraisal of it. In such a
case, one assumes that the maker of the argument did have more ‘in mind* than
he stated explicitly. In most cases there is no difficulty in supplying the tacit
premiss that the speaker intended but did not express. Thus the first specimen
argument stated at the beginning of this chapter:
Any horse can outrun any dog. Some greyhounds can outrun any rabbit.
Therefore, any horse can outrun any rabbit.
not only is the needed premiss about the transitivity of being able to outrun left
unexpressed, but also left unexpressed is the nonrelational premiss that all
greyhounds are dogs. When these are added—and they are certainly not
debatable issues—the validity of the argument can be demonstrated as follows:
Missing premisses are not always so easily noticed and supplied as in the
present example. When it is not so obvious which needed premisses are missing
from an enthymematically expressed argument, it is a good policy in beginning
a proof of the argument’s validity to leave a little space just below the given
premisses. Additional premisses can be written in that space when need arises.
The only point to be stressed is that no statement that is as doubtful or
debatable as the argument’s own conclusion is to be admitted as a supplemen¬
tary premiss, for in a valid argument that is enthymematically stated, only the
sheerest platitudes should be left unexpressed for the hearer or reader to fill in
for himself.
EXERCISES
*7. Vacant lots provide no income to their owners. Any owner of real estate must pay
taxes on it. Therefore, any owner of a vacant lot must pay taxes on something
that provides no income to him. (Vx: x is a vacant dot. Rx: x is real estate. Ixy: x
provides income to y. Txy: x pays taxes on y. Oxy: x owns y.)
8. All admirals wear uniforms having gold buttons. Therefore, some naval officers
wear clothes that have metal buttons. (Ax: x is an admiral. Ux: x is a uniform. Gx:
x is gold. Bx: x is a button. Nx: x is a naval officer. Cx: x is clothing. Mx: x is metal.
Wxy: x wears y. Hxy: x has y.)
9. Whenever Charlie moved to Boston, it was after he had met Al. Whenever
Charlie got married, it was before he ever saw Dave. Therefore, if Charlie moved
to Boston and subsequently got married, then he met Al before he ever saw Dave.
(Tx: x is a time. Ax: Charlie met Al at (time) x. Bx: Charlie moved to Boston at
(time) x. Mx: Charlie got married at (time) x. Dx: Charlie saw Dave at (time) x.
Pxy: x precedes y.)
10. A fish that chases every shiner will be hooked by an angler who uses a shiner for
bait. A greedy fish will chase every shiner. So if all anglers are sportsmen, then no
pike that is not hooked by a sportsman who uses minnows for bait is greedy. (Fx:
x is a fish. Sx: x is a shiner. Cxy : x chases y. Hxy : x hooks y. Ax: x is an angler. Bxy :
x uses y for bait. Gx: x is greedy. Px: x is a pike. Rx: x is a sportsman. Mx: x is
a minnow.)
The notion of identity is a familiar one. Perhaps the most natural occasion
for its use is in the process of identification, as in a police line-up when a
witness identifies a suspect and asserts that
Each of these propositions asserts that a relation obtains between the things
referred to by its two terms. The relation asserted to hold is that of identity. In
each of the preceding propositions, at least one term was a definite description,
which is a phrase of the form ‘the so-and-so’. In identifications, however, both
terms may be proper names. Just as the two propositions
assert the relation of killing to hold between the individuals referred to bv the
proper names that appear in the propositions, so the propositions
and
assert the relation of identity to hold between the individuals referred to by the
proper names that appear in them.
The usual notation for the relation of identity is the ordinary equals-sign ‘ = \
It is intuitively obvious that the relation of identity is transitive, symmetrical,
and totally reflexive. In our symbolic notation, we write
This principle permits us to infer, from the premisses v = /i and any formula
containing an occurrence of the symbol v, as conclusion any formula that
results from replacing any occurrences of r in the second premiss by the symbol
ju.1 Any inference of this pattern is valid, and in a proof, it should have the
letters ‘Id/ written beside it. A specimen deduction or two will make this clear.
The argument
may be symbolized and proved valid by the following, in which we use the
letters 7i’ and *p to abbreviate the proper names O. Henry’ and ‘William
Mlere we are using the letters mu and nu to represent any individual symbols, constants or
variables.
142 The Logic of Relations [Ch. 5
1. h = p
2. Wh /.*. Wp
3. Wp 1, 2, Id.
Using the symbols ‘g\ ‘/’, ‘m’, ‘Wxy\ ‘Wx’ to abbreviate ‘George Eliot’, ‘The
Mill on the Floss\ ‘Mary Ann Evans’, ‘x wrote y\ and ‘x is a woman’, we may
formulate the given argument and demonstrate its validity as follows:
1. Wgf
2. g = m
3. Wm /.'.(3x)(Wx-Wxf)
4. Wg 2, 3, Id.
5. Wg-Wgf 4,1, Conj.
6. (lx)(Wx-Wxf) 5, EG
Only a bald inan wears a wig. Kaplan is a man who wears a wig. This man
is not bald. Therefore, thisT man is not Kaplan.
Using the symbols 't\ ‘k’, ‘Mx\ ‘Bx\ ‘Wx’, to abbreviate ‘this man’, ‘Kaplan’, ‘x
is a man’, ‘x is bald’, and ‘.r wears a wig", we may symbolize the present
argument and prove its validity as follows:
This last proof serves to show that we use the principle of Identity not only to
infer from <t>/x and v = p, but also to infer ^~(r = /u) from Or and For
completeness and convenience, we include in our statement of the principle of
Identity its symmetry and its total reflexiveness, even though its symmetry is
readily derived from the others. Our formulation is
Now that the symbol for identity is available, we can define still another
attribute of binary relations that is of importance in characterizing ‘partial
order’ in Set Theory, as we shall see in Chapter 7. A propositional function
‘Rxi/ designates an antm/mmetrical relation if and only if
(x)(y)[(RxyRyx) D x = i/j
using V for ‘Al\ Tx’ for \v is on the team', and ‘Oxtf for \v can outrun if, we
cannot simply write down
Toa(x)(Tx D Oax)
Oaa
In this second proposition, the important word ‘else’ is missing. The first
proposition does not assert that Al can outrun everyone on the team, but
everyone else on it, that is, everyone on it who is other than or not identical
with Al. The proper translation of the first proposition is
Making similar use of the identity sign, we may symbolize the propositions
and
as
is interpreted not as asserting that there is an opening, but that there is no more
than one. This statement may be symbolized as
(*)('/)[(£>*■ Oy) Dx = y]
is interpreted as leaving open the question of whether there are any at all. This
may be symbolized as
The identity sign is also useful in symbolizing the concept of at least. It is not
needed for ‘at least one’, because the existential quantifier, by itself, suffices.
The statement
is symbolized as
(3x)Ax
Sec. 5.4] Identity and the Definite Description 145
But to symbolize
(3x)(3i/)[Ax-Ai/-.y ^ i/l
Putting together the notations for ‘at least one’ and ‘at most one’, we have a
method for symbolizing definite numerical propositions. Thus the statement
meaning exactly one, is symbolized, using 'Bx' for \v is a book’ and 'Dx for \v
is on my desk’, as
meaning exactly two, is symbolized, using \Sx' for \v is a state’, Wx’ for ‘.v is a
Senator’, and 'Ext/ for ‘x elects y\ as
(x){Sx D (3y)(3z)[NymNz*ExymIixzmy /
(ic)\[\-wExw) D (to = y v w — £)]]}
which refer to Scott, Booth, and Chicago, respectively. One fairly standard
notation for this sense of the word ‘the’ involves an upside-down iota. The
three phrases above are (partially) symbolized as
In general, a formula like *(ix)(x wrote Waverleyf is read as ‘the x that wrote
Waverley, and is treated as a proper name. Thus we can replace the individual
variable p in the propositional function typ by (w)(<$>v) to obtain 'k(ij')(<l)J') as a
substitution instance.
Normally, a definite description functions in argument the same wav that a
proper name does. The principle of Identity permits us to infer the conclusion
‘Scott wrote Mannion from the premisses ‘Scott is the author of Waverley and
‘the author of Waverley wrote Marmion . The proposition ‘Something is larger
than Detroit’ follows validly by EG from the premiss ‘The largest city in
Illinois is larger than Detroit’. Of course, El instantiates only with a variable
and UG generalizes only from a variable. Therefore, there is no difference,
with respect to these two principles, between a proper name and a definite
description. But with respect to the principle of Universal Instantiation, certain
differences and difficulties arise.
Where the letter *F' designates an attribute, the symbol complex ‘(?x)(Fx)’
refers to the individual having the^ittribute F if there is one and only one such
individual. But what if no individual, or more than one individual, has this
attribute? In such a case, since there is no unique individual to be referred to
by the expression \ix)(Fx)\ that expression does not refer. The problem of
interpreting phrases that purport to refer but do not really do so can be
handled in the following fashion, attributable to Russell.2 An explicit definition
of a symbol is given by presenting another symbol that is equivalent to it in
meaning. Thus the symbol ‘bachelor’ is defined explicitly by equating it to the
“See ‘On Denoting/ Mind, n.s., vol. 14 (1905). Russell’s paper, together with alternative
treatments attributed to G. Frege, Keith* Donpellau. and Hilary Putnam is reprinted in Part 3 of
Contemporary Philosophical Jjogic, 1. M. Copi and J. A. Gould eds., St. Martin’s Press, New York,
1978.
Sec. 5.4] Identity and the Definite Description 147
second
and finally
The three parts of its meaning can be symbolized—without using the iota—as
follows:
and
x was a genius.
or as
(3x){<J>x-(i/)(<l>t/ D (/ =
can be symbolized by using ‘Ox’ for *x is an ocean’, ‘ Wx' for ‘x is to the west of
America’, and ‘Lxt/’ for ‘x is larger than {/’, as
for this sentence affirms the existence of exactly one immortal man, when there
is none. And unless it occurs in a context that makes clear either that some
particular mountain is being referred to or that mountains in general are under
discussion, the statement
is false. This is so because it affirms that there is only one mountain, whereas in
fact there are many. These remarks should serve to make clear that a phrase of
the form ‘the so-and-so’ or a symbol such as ‘(7x)(Fx)’ cannot be instantiated by
the principle of Universal Instantiation alone. In order to deduce the conclu-
Sec. 5.4] Identity and the Definite Description 149
sion *G(rx)(I<x) from \x)Gx\ we require the additional premiss that there is
exactly one thing which is an F. Where that premiss is missing, the inference is
invalid. But where it is present, the argument is easily proved valid as follows:
1. (x)Gx
2. (3*)[Fx-(y)(Fy D if = x)] /.'. G{ix)(Fx)
3. Fx-(y)(Fy D tj = x)
4. Gx 1,UI
5. Fx*(y)(Ftj Di/ = x)-Gx 3,4, Conj.
6. (3x){Fx*(y)(Fy D y = x)-Gx) 5, EG
7. (3 x){Ff(y)(Fy D y = a) - C } 2, 3-6, El
8. G('ix)(Fx) 7—definition
or that
If either there i5 no .v that is or there is more than one of them, (1) is false and
(2) is true. To make otherwise ambiguous propositions definite and unambigu¬
ous, Russell proposed using formulas of the form (ix)(<l>x) as a kind of ‘scope
indicator’. But the rules governing this use are quite complicated, and we shall
have no use for them here. Indeed, we shall make no further use of formulas of
the form (ix)(<bx) and shall use instead the explicit symbolic translation that
involves quantifiers and the identity symbol.
EXEHCISKS
Prove the validity of the following arguments—using only the identity symbol in
addition to the indicated abbreviations:
1. The architect who designed Tappan Hall designs only oHiee buildings. Therefore,
Tappan Hall is an office building. (Ax: x is an architect, t: Tappan Ilall. Dxy: x
designed y. Ox: x is an office building.)
*2. The Professor of Greek at Siwash is very learned. Therefore, all Professors of
Greek at Siwash are very learned. (Px: x is a Professor of Greek. Sx: x is at Siwash.
Lx: x is very learned.)
150 The Logic of Relations [Ch. 5
3. The smallest state is in New England. All states in New England are primarily
industrial. Therefore, the smallest state is primarily industrial. (Sx: x is a state. Nx:
x is in New England, tx: x is primarily industrial. $xy: x is smaller than y.)
*4. The fastest running person is a Scandinavian. Therefore, anyone who is not a
Scandinavian can be outrun by someone (or other). (Sx: x is a Scandinavian. Px: x
is a person. Fxy: x can run faster than y.)
5. All entrants will win. There will be, at most, one winner. There is at least one
entrant. Therefore, there is exactly one entrant. (Ex: x is an entrant. Wx: x will
win.)
*6. Any fish can swim faster than any smaller one. Therefore, if there is a largest fish,
then there is a fastest fish. (Ex: x is a fish. Lxy: x is larger than tj. Sxy: x can swim
faster than #/.)
7. Adams and Brown were the only men at the banquet who drank. All the men at
the banquet who brought liquor drank. Adams did not bring any liquor. If any
man at the banquet drank, then some man at the banquet who drank must have
brought liquor. All who drank became ill. Therefore, the man at the banquet who
brought liquor became ill. (a: Adams, b: Brown. Mr: x was a man at the banquet.
Dx: x drank. Lx: x brought liquor, lx: x became ill.)
8. Anyone who has climbed Mt. Blanc is braver than anyone who has not. Of all
those on our team, only the youngest member has climbed Mt. Blanc. Everyone on
our team is a veteran. Therefore, the bravest member of our team is a veteran.
(Cx: x has climbed Mt. Blanc. Tx: x is on our team. Vx: x is a veteran. Bxy: x is
braver than y. Oxy: x is older than y.)
9. There is exactly one penny in my right hand. There is exactly one penny in my left
hand. Nothing is in both my hands. Therefore, there are exactly two pennies in mv
hands. (Px: x is a penny. Rx: x is in my right hand. Lx: x is in my left hand.)
10. All accompanists were bagpipers. All bagpipers were in the cabin. At most two
individuals were in the cabin. There were at least two accompanists. Therefore,
there were exactly two bagpipers. (Ax: x was an accompanist. Bx: x was a
bagpiper. C.v: x was in the cabin.)
expression
Fx
YVe have seen all these before. A new kind of proposition, however, although
still singly general, is obtained by instantiating with respect to the individual
variable and generalizing with respect to the predicate variable. I lere we have
Of these, (J) and (5) are clearly equivalent, (1) stating that
Propositions (4) and (8) are also equivalent to each other, with (4) stating that
The remaining propositions, however, are all distinct. They may be expressed
in English as
There are no equivalences here, but proposition (3) logically entails proposition
(6), and proposition (7) logically entails proposition (2). These implications can
be formally established by means of our familiar quantification rules, allowing
the symbols 7*' and V of the rules to denote predicate variables as well as
individual variables—with the same restrictions on their application, of course.
When predicate variables are introduced and their quantification permitted,
a formal, that is, completely symbolic definition, can be given for the identity
symbol. The definition is
(x = y)=df(F)(Fx = Fy)
following as a logical consequence. And from the latter all the attributes of the
identity relation can be deduced.
For the most part, our preceding discussion has concerned only attributes of
things. But things are not the only entities that have attributes. In Section 5.3
we discussed various attributes that can be attributed to, or predicated of,
relations. Attributes, themselves, can have attributes. Thus the attribute of
being honest has the attribute of being desirable; a virtue (which is an attribute)
may, itself, have the attribute of being rare; and the attribute of being incon¬
siderate is common.
Once it has been noted that some attributes can be predicated of other
attributes, the temptation arises to"predicate certain attributes of themselves.
For example, the attribute of being abstract seems, itself, to be abstract,
whereas the attribute of being concrete seems not to be itself concrete. Any
attribute that can be truly predicated of itself will be said to be a predicable
attribute. Thus being predicable is an attribute that belongs to all those and
only those attributes that can be truly predicated of themselves. On the other
hand, any attribute that cannot be truly predicated of itself will be said to be
an impredicable attribute. Thus being impredieable is an attribute that belongs
to all those and only those attributes that cannot be truly predicated of
themselves.
If we now ask whether the attribute of being impredicable can be truly
predicated of itself or not, we are led to the following unhappy conclusion. If
Sec. 5.5] Predicate Variables and Attributes of Attributes 153
the attribute of being impredicable can be truly predicated of itself, then it has
the attribute of being impredicable. It follows then by definition that the
attribute cannot be truly predicated of itself. On the other hand, if the attribute
of being impredicable cannot be truly predicated of itself, then like all other
attributes that cannot be truly predicated of themselves, it has the attribute of
being impredicable. This means that it can be truly predicated of itself. We
have thus been led to a contradiction.
The contradiction can be derived more clearly by symbolizing the attribute
of being impredicable as 7\ and defining it formally as
IF = df-FF
(F)(IF= ^FF)
11= -11
division of all entities into different logical types, but the restriction that any
attribute that may significantly be predicated of an entity of one logical type
cannot significantly be predicated of any entity of any other logical type. For
some attributes, the theory of types seems perfectly obvious. Thus an individual
thing may be orange in color, but it clearly does not make sense to say of any
attribute that it is orange in color. An attribute may have many instances, but it
does not make sense either to affirm or to deny of an individual thing that it
has many instances.
The primary motivation for accepting the theory of logical types, however,
is not its naturalness or obviousness, but the fact that it permits the avoidance
of such contradictions as that of the alleged attribute ‘impredicable*. According
to this theory, the type of an attribute must be higher than the type of any
entity of which it can significantly be predicated. Consequently, it does not
make sense either to affirm or to deny of any attribute that it belongs to itself.
Such expressions as ‘FF* and ‘^FF* must be dismissed as meaningless. Conse¬
quently, no such attribute as ‘impredicable’ can be defined, and the contradic¬
tion vanishes. The version of the theory of types sketched above—sometimes
called the ‘Simple Theory of Types’—suffices to rule out all contradictions of
this kind. It also has a certain consonance with common sense. However, there
are alternative solutions or ways to avoid the contradictions, so the theory of
logical types cannot be regarded as the solution. It is widely accepted, how¬
ever, and we shall follow it in introducing a new type of symbol to represent
attributes of attributes of individuals.
Some words have more than a single meaning, of course, and may in one
sense designate an attribute of individuals, and in another sense designate an
attribute of attributes of individuals. Thus the word ‘rare’ in one sense desig¬
nates an attribute of attributes of individuals. An attribute of individuals is rare
if it is exemplified by only a few individuals and, in this sense, it cannot be
affirmed or denied of individuals, themselves. On the other hand, there is a
different sense of the word ‘rare’, in which it designates an attribute of an
individual piece of meat that has been cooked for only a short while, and in this
sense it cannot significantly be affirmed or denied of any attribute. To avoid
ambiguity, we symbolize attributes of attributes of individuals by boldface
italic capital letters ‘A\ ‘B\ ‘C\7.., to prevent their being confused with
attributes of individuals. With this additional symbolic apparatus, we can
translate into our notation such propositions as ‘Unpunctuality is a fault*, and
‘Honesty is a virtue*. Here we use ‘Ux, *Hx\ ‘FF*, and ‘VF\ to abbreviate ‘x is
unpunctual’, ‘x is honest', ‘F is a fault’, and ‘F is a virtue*, and symbolize the two
stated propositions as ‘FU\ and ‘VI/’. More complex propositions can also be
symbolized. The propositions
and
can be symbolized, using the symbols UF' for ‘F is useful’ and DF’ for ‘F is
desirable’, as
(F)(UF D DF)
and
(3 F)(DF-—UF)
can be symbolized, using the additional symbols 7' for Tom' and 'Mxy' for \v
is mother of y\ either as
or as
KXRKCISES
1. l(x)(F)Fx] = [(F)(x)Fx]
*2. l(3x)(3F)Fx] == l(3F)(3x)Fx]
3. [(3x)(F)Fx] D [(F)(lx)Fx\
4. [(3F)(x)Fx] D [(x)(3F)Fx]
5. l(3H)(x)(3y)iixi/] 3 [(x)(3!/)(3fl)«x!/]
196 The Logic ol Relations (Ch. S
157
158 Deductive Systems [Ch. 6
a science when some of the words or symbols they contain are defined in terms
of their other symbols.
The recognition that definition and deduction are important to science may
suggest a specious ideal for scientific systems. It may be imagined that in an
ideal science, all propositions should be proved by deducing them from others
and all terms should be defined. But this would be ‘ideal’ only in the pejorative
sense that this goal is impossible to realize. Terms can be defined only by means
of other terms, whose meanings are presupposed and must be understood
beforehand if the definitions are to explain the meanings of the terms being
defined. Deductions can establish their conclusions only on the basis of prem¬
isses that must already have been verified if the conclusions are really to be
established by the proofs. Hence, if all terms or symbols of a system are to be
defined within the system, there must be either infinite sequences of definitions,
or circular definitions, as in a pocket dictionary that defines the word ‘big’ as
meaning large, and the word ‘large* as meaning big. Circular definitions are
obviously worthless as explanations, and infinite sequences of definitions are
worthless also. No term can really be explained until the end of the sequence is
reached and an infinite sequence has no end. Similarly, to prove all proposi¬
tions, there must be either infinite regressions of proofs or circular proofs. And
these are equally objectionable.
It must be admitted that within a system of propositions that constitutes a
science, not all propositions can be proved and not all terms can be defined. It
is not that there is some particular proposition that cannot be proved or some
particular term that cannot be defined, but rather that they cannot all be
proved or defined without a vicious regression or circularity. The ideal of
science, then, cannot be a system in which every proposition is proved and
every term defined. The ideal is rather one in which a minimum number of
propositions suffices for the deduction of all the rest and a minimum number of
terms suffices for the definition of all the others. This ideal of knowledge is
described as a deductive system.
existed before their time, in what sense did the Greeks originate the science of
geometry? The answer has already been indicated. Before Pythagoras, man's
geometrical knowledge consisted of a collection or catalog of almost wholly
isolated facts. Geometrical truths constituted a mere list of useful empirical
rules-of-thumb for surveying land or constructing biidges or buildings, and
there was no system to their knowledge of geometrical truths. Bv introducing
order into the subject, the Greeks transformed it from a mere body of isolated
bits of knowledge into a science.
System was introduced into geometry by the deduction of some of its
propositions from others. The propositions of geometry were ordered by listing
first those that could be used as premisses in the demonstrations of propositions
that were put later. This systematization of geometry was begun by Pythagoras
and continued by his successors. It culminated in the Elements of Euclid (c.3(K)
b.c.), in which all geometrical propositions were arranged in order, beginning
with Axioms, Definitions, and Postulates, and continuing with Theorems
deduced from the initial propositions. Geometry was cast by the (-reeks into
the form of a deductive system. Theirs was the first deductive system ever
devised. So great was the achievement that it has served as a model for
scientific thought down to the present time. Even today the most advanced
sciences are those which most nearly approximate the form of a deductive
system. These are the sciences that have achieved a relatively small number of
very general principles from which a relatively large number of other laws and
special cases have been derived. Parts of physics have actually been formulated
as deductive systems and similar attempts have been made, with somewhat less
impressive results, in parts of biology and psychology. Perhaps the boldest
attempt in this direction was that of Spinoza, whose most important work, the
Ethics, was written in ‘geometrical’ form. Starting with axioms and definitions,
Spinoza attempted to'deduce the rest of his metaphysical and ethical doctrines
as theorems that were provable on the basis of those initial assumptions.
Euclid begins his geometry with definitions of some of the terms used in its
development. Thus Definition I reads: ‘A point is that which has no parts’, and
Definition 2 reads: ‘A line is length without breadth’.1 Euclid does not attempt
to define all his terms, of course. The first two definitions define the terms
point’ and ‘line’, respectively. The words used in these definitions, such as
parts’, ‘length’, and ‘breadth’, are not, themselves, defined but are among the
undefined terms of the system for Euclid. As more new terms are introduced,
their definitions make use of previously defined terms as well as the original
undefined ones. Thus Definition 4: ‘A straight line is ... [a line] ... which lies
evenly between its extreme points’, makes use not only of such undefined terms
as ‘evenly’ and ‘between’, but also the previously defined terms ‘point’ and
‘line’.
The use of defined terms is, from the point of view of logic, a matter of
convenience only. Theoretically, every proposition that contains defined terms
1 These and the following are quoted from the Todhunter edition of The Elements of Euclid, No.
891, Everyman’s Library, London and New York, 1933.
160 Deductive Systems [Ch. 6
can be translated into one that contains only undefined ones. This can be done
by replacing each occurrence of a defined term by the sequence of undefined
terms that were used to define it. For example, Postulate 1: ‘Let it be granted
that a straight line may be drawn from any one point to any other point’. This
postulate contains the defined terms ‘straight*, ‘line*, and ‘point*, and can be
expressed, without using those defined terms, as: ‘Let it be granted that a length
without breadth which lies evenly between its extreme parts, which (them¬
selves) have no parts, may be drawn from any one thing which has no parts to
any other thing which has no parts*. This version of the Postulate is extremely
awkward, however. Although they are theoretically eliminable, in actual
practice a considerable saving of space, time, and effort is effected by using
relatively brief, defined terms to replace lengthy sequences or combinations of
undefined ones.
In setting up his deductive system of geometry, Euclid divided his unproved
propositions into two groups, one called ‘Axioms’, the other called ‘Postulates*.
He gave, however, no reason for making this division, and there seems to be no
very clear basis for distinguishing between them. Possibly, he felt that some
were more general than others or psychologically more obvious. The contem¬
porary practice is to draw no such distinction but to regard all the unproved,
initial propositions of a deductive system as having the same standing. Now, it
is customary to refer to them all, indifferently, as ‘axioms’ or as ‘postulates’,
without attaching any difference in meaning to those two terms.
Every deductive system, to avoid falling into circularity or into a vicious
regression, must contain some axioms (or postulates) that are assumed but are
not proved within the system. They need not be precarious assumptions or mere
assumptions. They may be very carefully and convincingly established—but
they are not proved within the system itself. Any argument intended to
establish the truth of the axioms is definitely outside the system, or extra-
systematic.
The older conception of Euclidean geometry held that all of its theorems
followed logically from its axioms and were, therefore, just as true as the
axioms. Euclidean geometry also held that the axioms were self-evident. It is
in this tradition that any statement was regarded as ‘axiomatic’ when its truth is
beyond all doubt, since it was sel£evident and did not require any proof. It
should be clear from what has already been said, however, that we are not
using the word ‘axiom’ in that sense. No claim is made that the axioms of any
system are self-evidently true. Any proposition of a deductive system is an
axiom of that system, if it is assumed, rather than proved, in that system. This
modern point of view has arisen largely as a consequence of the historical
development of geometry and physics.
The self-evident truth of the Euclidean axioms (and postulates) was long
believed. It was not believed quite whole-heartedly, however. Most of the
axioms, such as Axiom 9, ‘The whole is greater than its part’, were not
questioned. Although there was* no doubt about the truth of Axiom 12 (the
famous ‘parallel Postulate*), there was considerable scepticism about its ‘self-
Sec. 6.2] Euclidean Geometry 161
2Listed as Postulate 5 by Sir Thomas L. Heath in The 'Thirteen Books of Euclid’s Elements,
Cambridge University Press, Cambridge, England, 1926. For an interesting discussion of the
history of the parallel postulate, the reader is referred to pp. 202 il. of Volume 1 of that work.
3Ibid., p. 202.
162 Deductive Systems [Ch. 6
space is more probably non-Euclidean than Euclidean. In any event, the truth
or falsehood of its axioms is a purely external property of any deductive system.
The truth of its propositions is an extrasystematic consideration. It is no doubt
important, to the extent that a deductive system is ordered knowledge. When we
concentrate our attention on the system as such, however, its order is its more
important characteristic.
From the purely mathematical or logical point of view, a deductive system
can be regarded as a vast and complex argument. Its premisses are the axioms,
and its conclusion is the conjunction of all the theorems that are deduced. As
with any other argument, the logical question does not concern the truth or
falsehood of the premisses, but deals with the validity of the inference. Granted
the truth of the axioms, does the truth of the theorems necessarily follow? That
is the question with which the logician and the mathematician are concerned.
The answer is, of course, yes—if the demonstrations of the theorems are all
valid arguments. Hence the most important aspect of any deductive system is
the cogency with which its theorems are proved. During the rigorous devel¬
opment of deductive systems in abstraction from the extrasystematic explana¬
tion of their undefined terms, the question of truth or falsehood is obviously
irrelevant.
There are serious errors in the system of geometry set forth by Euclid in his
Elements. Indeed, a mistake occurs in his very first proof. The flaw in his proof,
paradoxically enough, was the result of his knowing too much about his subject.
He did not appeal to his explicitly stated axioms, alone, as premisses, but
depended, also, upon what might be called his geometrical intuition.4 Where a
chain of argument involves familiar notions, there is always the danger of
assuming more than the explicitly stated premisses warrant. This is particularly
serious in the development of a deductive system, for any attempted system¬
atization that appeals to unacknowledged assumptions in the derivations of its
theorems, thereby fails to achieve its aim. In a deductive system, the theorems
must be deduced rigorously from the stated postulates. If they are not, however
true they may be, the result falls short of the goal of systematization.
Since lapses from rigor are most often occasioned by too great familiarity
with the subject matter, mathematicians have found it helpful to minimize or
eliminate such familiarity in the interest of achieving greater rigor. In the case
of geometry, that end is accomplished by abstracting from the meanings of
such geometrical words as ‘point*, ‘line', and plane*, and-developing the
4 Euclid’s proof and a brief discussion of his,mistake can be found on pp. 241-243 of vol. 1 of
The Thirteen Books of Euclid’s Elements, by Heath, op. cit. An example of how the same type of
error can lead to conclusions that are false or even self-contradictory can be found on pp. 77-78,
W. W. Rouse Ball, Mathematical Recreations and Essays, Macmillan Publishing Co., Inc., New
York, 1940.
Sec. 6.3] Formal Deductive Systems 163
Usually, though not always, a formal deductive system is set up with some
particular interpretation ‘in mind’. That is, the investigator has some knowl¬
edge about a certain subject, and wishes to set up a system adequate for its
expression. When the formal system has been constructed, the question natu¬
rally arises whether or not it is adequate for the formulation of all the propo¬
sitions it is intended to express. If it is, it may be saicj^jfo be ‘expressively
complete’ with respect to that subject matter. We are here discussing what can
be said in the system, not what can be proved. With respect to a given subject
matter, a formal deductive system is ‘expressively complete’ when it is possible
to assign meanings to its undefined terms in such a way that every proposition
about that subject matter can be expressed as a formula of the system. Whether
the true propositions can be proved as theorems or not is another question that
will be discussed later in this section.
A system is said to be inconsistent if two formulas—one of which is the
denial or contradictory of the other-»can both be proved as theorems within it.
A system is consistent if it contains no formula such that both the formula and
its negation are provable as theorems within it. As was remarked in Section 3.7,
a contradiction logically entails any proposition whatever. Hence a derivative
definition or criterion for consistency can be formulated as follows*. Any system
is consistent if it contains (that is, can express) a formula that is not provable as
a theorem within the system. This is known as the ‘Post criterion for consist¬
ency’, having been enunciated by the American mathematician and logician,
E. L. Post. Consistency is of fundamental importance. An inconsistent deduc¬
tive system is worthless, for all of its formulas are provable as theorems,
including those which are explicit negations of others. When the undefined
terms are assigned meanings, these contradictory formulas become contradic¬
tory propositions and cannot possibly all be true. Since they cannot possibly be
Sec. 6.4] Attributes of Formal Deductive Systems 165
The most important attribute that a deductive system should possess is that
of rigor. A system has rigor when no- formula is asserted to be a theorem unless
it is logically entailed by the axioms. For the sake of rigor, arbitrary rather than
Sec. 8.5] Logistic Systems 167
familiar symbols are taken as undefined or primitive terms and the system is
developed formally. Listing clearly all the undefined terms and explicitly
stating all the axioms used as premisses for the theorems will help specify
precisely which formulas are to be esteemed as theorems and which are not.
With the increased emphasis on rigor that characterizes the modern period,
critical mathematicians have seen that this procedure is not enough. To achieve
rigor, more is required.
A system is rigorous only when its theorems are proved logically, or derived
logically from its axioms. However clearly its axioms are stated, a formal
system will lack rigor unless the concept of logical proof or logical derivation is
also specified precisely. All deductive systems of the sort that have been
mentioned, even formal deductive systems that contain logical terms in
addition to their own special uninterpreted symbols, depend upon ‘ordinary
logic’ for their development. They assume logic, in the sense that their
theorems are supposed to follow logically from their axioms. But they do not
specify what this ‘logic’ is. Hence all earlier deductive systems, for geometry,
or physics, or psychology, or the like, contain concealed assumptions that are
not explicitly stated. These hidden assumptions are the rules or principles of
logic to which one appeals in constructing proofs or derivations of theorems.
Hence all those deductive systems fall short of complete rigor, since not all of
their presuppositions are acknowledged. Therefore, their developments are not
entirely rigorous, but are more or less loose.
The question naturally arises: llow can this looseness be eli ninated and
greater rigor be achieved? The answer is obvious enough. A deductive system
will be developed more rigorously when it is specified not only what axioms are
assumed as premisses in deriving the theorems, but also what principles of
inference are to be used in the derivations. The axioms must be supplemented
by a list of valid argument forms or principles of valid inference.
The demand for rigor and for system does not stop even here, however. For
the .sake of rigor, in addition to its own special axioms, a deductive system must
specify explicitly what forms of inference are to be accepted as valid. It would
be unsystematic—and probably impossible—simply to list or catalog all re¬
quired rules of logic or valid modes of inference. A deductive system of logic,
itself, must be set up. Such a deductive system will have deduction, itself, as its
subject matter. A system of this type, often referred to as a logistic system, must
differ from the ordinary, less formal varieties in several important respects.
Since the subject matter is deduction itself, the logical terms ‘if ... then ...’,
and’, ‘or’, ‘not’, and so on, cannot occur in it with their ordinary meanings
simply assumed. Uninterpreted symbols must be used in their stead. The logical
principles or rules of inference that it assumes for the sake of deducing logical
theorems from logical axioms must be few in number and explicitly stated.
A second fundamental difference between logistic systems and other formal
deductive systems is that in the latter, the concept of a significant or ‘well
formed’ formula need not be specified. It is absolutely required in a logistic
system, however. In an ordinary (nonformal) deductive system, it will be
168 Deductive Systems [Ch. 6
obvious which sequences of its words are significant propositions of English (or
of whatever the natural language is in which the system is expressed). In a
formal but nonlogistieal deductive system, the sequences of *ts symbols are
easily divided into those which ‘make sense’ and those which do not, for they
will contain such ordinary logical words as ‘if ... then ..‘and’, ‘or’, or ‘not’,
by whose disposition in it the sequence can be recognized as significant or
otherwise. An example will make this clear. In a formal deductive system that
contains ‘A’, ‘JB\ and ‘C’ as uninterpreted primitive symbols, the sequence of
symbols ‘If any A is a B, then* it is a C’ is clearly a complete and ‘significant’
formula that may or may not be provable as a theorem. The sequence of
symbols ‘If any A is a B’ is obviously incomplete, while the sequence ‘And or or
A B not not if’ is clearly nonsense. These are recognized as ‘complete’ or ‘well
forjned’, as ‘incomplete’ or ‘ill formed’ by the presence in them of some
symbols whose meanings are understood. In a logistic system, however, all
symbols are uninterpreted. There are no familiar words within its formulas (or
sequences of symbols) to indicate which are ‘well formed’ and which are not.
Where the symbols ‘A*, ‘B’, and ‘ D’ are uninterpreted, there must be some
method of distinguishing between a well formed formula like ‘A D —B’ and
one like ‘AB% D —’, which is not well formed. By our knowledge of the normal
interpretations of these symbols, we can recognize the difference and classify
them correctly, but for the rigorous development of our system, we must be
able to do this in abstraction from the (intended) meanings of the symbols
involved.
The matter may be expressed in the following terms. As ordinarily con¬
ceived, a nonformal deductive system (interpreted, like Euclidean geometry) is
an arrangement or organization of propositions about some specified subject
matter. Consisting of propositions, it is a language in which the subject matter
may be discussed. Understanding the language, we can divide all sequences of
its words into those which are meaningful statements and those which are
meaningless or nonsensical. This division is effected in terms of meanings and is
thus done nonfonnally. In a logistic system, the situation is different because
prior to the extrasystematic assignment of meanings or interpretation, all
sequences of symbols are without meaning. Yet we want, prior to and inde¬
pendent of its interpretation, a comparable division of all of its formulas into
two groups. When meanings are assigned to the primitive symbols of a logistic
system, some of its formulas will express propositions, while others will not. We
may informally characterize a formula which on the intended interpretation
becomes a significant statement as a ‘well formed formula’ (customarily ab¬
breviated Any formulas which on the intended interpretation do not
become significant statements, are not well formed formulas. In a logistic
system, there must be a purely formal criterion for distinguishing well formed
formulas from all others. To characterize the criterion as ‘purely formal’ is to
say that it is syntactical rather than semantical, pertaining to the formal
characteristics and arrangements of • the symbols in abstraction from their
meanings. Thus a logistic system must contain only uninterpreted symbols and
Sec. 6.5] Logistic Systems 169
must provide a criterion for dividing sequences of these symbols into two
groups. The first group will contain all well formed formulas, the second group
will contain all others. Of the well formed formulas, some will be designated as
Axioms (or Postulates) of the logistic system.
All well formed formulas that are not axioms are divided into two groups,
those which are theorems and those which are not. The former are those which
are derivable from the axioms or postulates, within the system. Although
uninterpreted, the well formed formulas of a logistic system constitute a
‘language in which derivations or proofs can be set down. Some well formed
formulas are assumed as axioms, and other well formed formulas are derived
from them as theorems. It might be proposed to define ‘theorem' as any wff
that is the conclusion of a valid argument whose premisses include only axioms
of the system. This proposed definition of ‘theorem* will be acceptable only if
the concept of a valid argument within the logistic system can be formally
defined. Because all wffs of the system are uninterpreted, the ordinary con¬
ception of validity cannot be used to characterize arguments within the system.
This is the case because the usual concept of validity is semantical. An
argument is regarded as valid if and only if the truth of its premisses entails the
truth of its conclusion. Consequently, a purely formal or syntactical criterion of
validity must be provided for arguments that are expressed within the system.
‘Valid* arguments within the system may have not only postulates or already
established theorems as premisses, with new theorems as conclusions. They
may also have as premisses any wffs, even those which are neither postulates
nor theorems, and they may have as conclusions wffs that are -not theorems. Of
course, it is desired that any argument within the system which is syntactically
‘valid* will become, when given its intended or normal’ interpretation, a
semantically valid argument.
Any logistic system; then, will contain the following elements: (1) a list of
primitive symbols which, together with any symbols defined in terms of them,
are the only symbols that occur within the system; (2) a purely formal or
syntactical criterion for dividing sequences of these symbols into formulas that
are well formed (wffs) and those that are not; (3) a list of wffs assumed as
postulates or axioms; (4) a purely formal or syntactical criterion for dividing
sequences of well formed formulas into ‘valid* and ‘invalid arguments; and (5),
derivatively from (3) and (4), a purely formal criterion for distinguishing
between theorems and nontheorems of the system.
Different logistic systems may be constructed as systematic theories of
different parts of logic. The simplest logistic systems are those which formalize
the logic of truth-functional compound statements. These systems are called
propositional calculi or, less frequently, sentential calculi. One particular
propositional calculus will be presented and discussed in Chapter 8.
Set Theory
'See Chapters 5 and 6, Irving M. Copi, Introduction to Logic, Macmillan Publishing Co., Inc.,
New York and London, 1978.
170
Sec. 7.1] The Algebra of Classes 171
*afi . And we define the class of all objects that do not belong to a; this class is
called the complement of a and is symbolized either by —« or by a.
Many statements about classes can be formulated using the ordinary equals
sign. The equation a = p asserts that all members of a, if any, are also
members of fi9 and that all members of (1, if any, are also members of a. Many
of the properties of the sum, product, and complement of classes can be
expressed by means of equations. It is clear, for example, that the operations of
forming the sum and forming the product of two classes are commutative. For
example, symbolically, we have
Two principles of distribution also hold for class sums and products. Any
object that belongs either to a or to both (1 and y must belong either to a or
to (1 and must also belong either to a or to y, and conversely. In symbols,
a U (fly) = (a U (1 )(« U y)
Moreover, any object that belongs both to a and to either (1 or y must belong
either to both a and (1 or to both a and y. In symbols,
a = a U a and a = aa
a = a U a(l
Turning now to the concept of class complement, we observe that since any
object belongs to a given class if and only if it does not belong to the class of
all objects that do not belong to the given class, the complement of the
complement of a class is the class itself. We thus have a sort of double negative
rule for complementation, which can be expressed in symbols as
a = 3
172 Set Theory [Ch. 7
An object that does not belong to the sum of two classes belongs to neither of
them and must, therefore, belong to both of their complements. And an object
that does not belong to the product of two classfes must belong to the class
complement of at least one of them. These two propositions, and their con¬
verses, which are also true, can be expressed symbolically as
a U p = ap and ap = a U p
1 = 0
a U a = 1 and aa = 0
a U 0 = a, «1 = a, aO = 0, and a U 1 = 1
2Compare these with the Boolean Expansions for statements discussed in Appendix B.
Sec. 7.1] The Algebra of Classes 173
aft = 0
The A proposition, All a is ft, asserts that there is nothing which belongs to a
but not to ft, which means that the product of a and the complement of ft is
empty. The A proposition is, therefore, symbolized as
aft = 0
aft ^ 0
The O proposition: Some a is not ft, asserts that there is at least one member of
a that is not a member of ft, that is. the product of a and ft is not empty. In
symbols, the O proposition is expressed as
aft ^ 0
When formulated in our class notation, it is completely obvious that the A and
O propositions are contradictories, as are the E and I propositions.
Some of the traditional ‘immediate inferences’ involving categorical propo¬
sitions are already contained in the notation of the class algebra. Thus every
proposition has exactly the same symbolization as its obverse, for example,
'aft = O’ symbolizes both All a is ft and No a is non-ft. And conversion, where
174 Set Theory [Ch. 7
aft = 0, fty = 0 ay = 0
The validity of this form can be derived within the algebra of classes by
appealing to the results already set forth in this section. Since yO = 0, and
aft = 0 is a premiss, we have y(aft) = 0, which by association and commuta¬
tion yields (ay)ft = 0. Now a0 = 0, and fty = 0 is a premiss, so we have
a(fty) = which by association and commutation yields (ay)ft = 0. Hence
(ay)ft_U (ay)ft = 0, which by distribution yields (ay)(ft U ft) = 0. Since
ft U ft = 1 and (ay )1 = ay, we have ay = 0, the syllogism’s conclusion.
It can also be shown that all valid syllogisms of the second kind have the
form
c, pi, u, —, /}, y,
Axioms:
a U —a = 1 and <x Pi — nr = 0
The present system3 is a formal deductive system rather than a logistic system.
On its intended interpretation, of course, C is the collection of all classes, 0 and
1 are the empty and universal classes, respectively, and the symbols U, PI, and
— represent class addition, multiplication, and complementation, respectively.
The reader who is interested in deducing some theorems from these axioms
will find the following fairly easy to derive:
3From E. V. Huntington’s ‘Sets of Independent Postulates for the Algebra of Logic’, Transac¬
tions of the American Mathenuitical Society, vol. 5 (1904), p. 288.
176 Set Theory
Th. 9. a =-a.
Th. 10. If a H p ^ 0, then a ^ 0.
Th. 11. a = (a fl ft) U (a H — /?).
*T H . 12. a U (|8 U y) = (a U /i) U y,
Th. 13. a fl (]8 fl y) = (a fl /}) PI y.
Th. 14. 0 = -1.
Th. 15. a U (a fl p) = a.
Th. 16. a ^ —a.
Th. 17. -(a n /?)=-« u -p.
Th. 18. -(a U (i) = -a PI —/?.
Th. 19. If a fl ~P — 0 and P fl —y = 0, then a fl —y = 0.
*T H . 20. If (x H p ^ 0 and p fl —y = 0, then a fl y ^ 0.
4 For more details see Robert R. Stoll, Set Theory and Logic, W. II. Freeman and Company,
Publishers, San Francisco and London, 1963, pp. 273-274; or Robert R. Stoll, Sets, Logic, and
Axiomatic Theory, 2nd ed., W. H. Freeman and Company, Publishers, San Francisco and London,
1974, pp. 201-202.
5The mathematical background is described at length by P. G. B. Jourdain in his Introduction
to Georg Cantor, Contributions to the Founding of the Theory of Transfinite Numbers, P. E. B.
Jourdain, tr., Dover Publications, Inc., New York, 1915.
Sec. 7.3] Zermelo-Fraenkel Set Theory (ZF)-The First Six Axioms 177
Pierre £ France
France £ U.N.
But because the U.N. has only nations as members, it is clear that
Pierre ^ U.N.
F = R
And to deny that the set A is the same as the set B, we write
A 5* B
8Op. cit., p. 85.
178 Set Theory [Ch. 7
x = y = <M(F)(Fx=Fy)
From this definition, it was remarked, there follow all of the characteristics of
the identity relation that are embodied in the rules of inference that were
labeled ‘Principle of Identity’. This principle is included in the underlying logic
that is used in set theory in deriving conclusions from premisses, and in
deriving theorems from axioms in axiomatic set theory.
The difference between sets and attributes was remarked above. Although
two attributes may belong to exactly the same objects and yet be distinct, this
is not true of sets. If the set A contains exactly the same members as the set B,
then A = B. This is a fact about sets. It can be regarded as the most basic fact
about sets. And so it is stated as the first axiom of the ZF system: not as a logical
truth about the identity relation but as an assumption about sets.
Z F -1 A = B = df (x)(x £ A = r £ B)
C C D = di(x)(x £ C D x £ D)
This definition implies that every set is a subset of itself; in other words, set
inclusion is a reflexive relation. It is obvious that if C C D and D C E, then
C C E. Thus set inclusion is a transitive relation. If C C D and D C C, then by
the Principle of Extensionality, C = D. Where CCD but C ^ D, C is said to
be a proper subset of D.
In the case of sets containing only a small number of members, a set may be
symbolized by putting braces around an enumeration of the names of those
members. Thus the set containing just the two objects a and b can be symbol¬
ized as ‘{a, by or as ‘{fc, a}\ The order in which the names of the members is
written is indifferent, since sets containing the same members are identical. A
set containing just two members is called a ‘pair set’ or ‘doubleton’.
Sec- 7 31 Zermelo-Fraenkel Set Theory (ZFJ-The First Six Axioms 179
For sets containing many members, another notation is available. The new
notation involves, in addition to braces, a variable (such as V), a colon (repre¬
senting the concept of such that), and a propositional function that expresses
some condition that is satisfied or fulfilled by all those objects and only those
objects that belong to the set. Thus the set of all citizens of France would be
symbolized by ‘{x: x is a citizen of France}’. More generally, where Fx is
satisfied by just those objects belonging to the set S, S = {.v: Fx}. So for any
object y, y £ {x: Fx} = Fy, This more powerful way of symbolizing sets can
be seen to subsume the first method, for the set {a, b, c} can also be symbolized
as 4{x: x = a v x = b v x = c}\ This notation is in the spirit of Bertrand
Russell's remark that ‘... a class [set] may be defined as all th6 terms [objects]
satisfying some propositional function.’7
Cantor apparently believed—for a time at least8—that any condition on
objects could determine or define a set, the set containing or comprehending
just those objects satisfying that condition. Although Cantor did not formulate
this belief as a principle, it could be stated as
(3S)(x)(xe S = q>x)
where, again, it is understood that the variable ‘S’ has no free occurrence in the
propositional function <px. Thus, given any set A and any condition <p, there is
a set S which is that subset of A which contains just those members of A that
satisfy the condition <p. (Since the expression ‘qpjt* in the Axiom ranges over all
kinds of propositional functions, the Axiom is more than a single axiom and is
usually referred to as an ‘Axiom Schema*.) This Aussonderung principle has
been referred to, also, as Abstraction,13 as Comprehension,14 as Separation,15 as
Specification,16 and as the Axiom of Subsets.17 How the Axiom of Separation
avoids the Russell paradox can be seen as follows. Given any set A, if in the
Axiom of Separation we take, as cpx, the propositional function x ^ x, we have
the set R = {x: x £ A mx $ x}, whence y £R=y £ Amy y. Now putting R
for y yields R £ R = R £ A *R ^ R, which is not a contradiction but can be
simplified to R £ RmR A.18 From this last formula it immediately follows
that R A. But since A was any set whatever, and R’s existence follows from
the Axiom of Separation, it has been shown that, given any set, there is
something that is not a member of it. Halmos has described this result
somewhat whimsically as the conclusion “that nothing contains evert/thing, or,
more spectacularly, there is no universe. ‘Universe* here is used in the sense of
‘universe of discourse,* meaning in any particular discussion, a set that contains
all the objects that enter into that discussion.”19 Here we have a fundamental
difference between the Algebra of Classes, in which there is posited a universal
class (1 or V), and ZF set theory, in which—as has just been proved—there is
no universal set.
For the Axiom of Separation to produce a set S, there must already be given
a set A of which the new set S is a subset. As Halmos remarked, “All the basic
principles of set theory, except only the axiom of extension, are designed to
make new sets out of old ones.’*20 If we were to make a minimal assumption
that there is at least one set A (as we assumed in Chapter 4 that there is at least
one individual), then we can prq^e by the Axiom of Separation that there is an
empty set, that is, a set that has no members. We already have at hand the
propositional function x ^ x that nothing satisfies, so by Separation, given
13Patrick Suppes, Axiomatic Set Theory, D. Van Nostrand Company, New York, 1960, p. 5.
14 Abraham A. Fraenkcl and Yehoshua Bar-Hillel, Foundations of Set Theory, North-Holland
Publishing Company, Amsterdam, 1958, pp. 137 ff.
15Emst Zermelo, op. cit., p. 202; Patrick Suppes, op. cit., p. 6; and Paul J. Cohen, Set Theory
and the Continuum Hypothesis, W. A. Benjamin, Inc., Menlo Park, Calif., 1966, p. 55.
16Paul R. Halmos, Naive Set Theory, D. Van Nostrand Company, New York, 1960, p. 4.
17 Abraham A. Fraenkel, Abstract Set Theory, North-Holland Publishing Company, Amsterdam,
1966, p. 16.
18 See exercise 5 on p. 343.
19 Halmos, op. cit., pp. 6 ff.
20Halmos, op. cit., p. 4.
Sec. 7.3] Zermelo-Fraenkel Set Theory (ZF)-The First Six Axioms 181
the set A, we have
(3B)(x)(x £ B = x £ A -x ^ x)
Z F - 4 Given any sets a and b, there is a set S having as members just a and b.
In symbols: (3S)(x)(x ^ S ~ x — a \ x — b).
So for any sets x and y, there is the pair set (x, i/}, which is sometimes called
the unordered pair—in contrast with another concept to be introduced in the
next section. No distinctness assumption was made in the Pairing Axiom. Thus,
given any set a, we can form the unordered pair {a, a) which can also be
written as ‘{a}/ and is called a unit set or singleton.
Russell puzzled over the question whether {a} was the same as a, itself,
initially being inclined to say that it appeared to be ‘self-evident’ that ‘A whole
composed of one term only is that one term.’27 But he subsequently decided
that ‘it is necessary to distinguish a single term from the class [set] whose only
member it is’.28 He was persuaded by an argument he attributed to Frege: Let
u be a set having more than ope member; let {u} be the set whose only
member is u; then {u} has one member, u has many members, hence u ^ {w}.
Had Russell not also had misgivings about the empty set, he might have been
equally well persuaded by the following argument suggested by Stoll. Because
0 has no members and {0 } has a member, it follows that 0 ^ {0 }, which
proves that u ^ {m}.
Apart from the nonexistence of a universal set and the absence of absolute
complements, we want our sets to conform to the rules governing classes set
25 Even points can be construed as sets, as explained in Alfred North Whitehead, Process and
Reality, Cambridge University Press. .Cambridge, England, 1929, Part 4.
26 A set theory containing individuals .can be found in Suppes, op. cit.
27B. Russell, Principles of Mathematics, op. cit., p. 77. See also pp. 68, 73.
28Ibid., p. 106. See also pp. 514 and 517.
Sec. 7.3]
Zermelo-Fraenkel Set Theory (ZF}-The First six Axioms 1SS
forth in the preceding sections. It is easy to see that any two sets a and b have
an intersection or product a n b. This is assured by the Separation Axiom. If
we let x £ b be the yx in that axiom, we have
a D b = df [ x: x £ a-x £ b)
Other notations for HA are ‘Qa-’, ‘fl {a £ A}\ PI {x: x £ A}\ and It
is obvious that PI {a, h) = aPfc.
Our definition of HA was conditional on the set A’s being nonempty. If we
did not impose that condition, there would be no set al whose membership in
A is needed for using the Separation Axiom. So ‘PI 0' is not yet defined. We
cannot define it as {x: (a)(a £ 0 x £ a)} because every x satisfies the
condition that a £ 0 D x £ a, since a $ 0 for every a. Thus the suggested
definition would make fl 0 contain everything, whereas we have already
shown that there is no universal set in ZF. We could leave it undefined (as
division by zero is left undefined in arithmetic), or we could define H 0 to be
0 itself, in order to make fl S defined for any set S whatever.
We also want any two sets a and b to have a union or sum a U b which
contains all (and only) those things that are either members of a or members of
b. The Separation Axiom will not help us here. We could introduce a new
axiom
(3B)(x)(x 6 B = x £ (i v x £ h)
184 Set Theory [Ch. 7
From this general Union Axiom and the Axiom of Pairing, we can obtain
the more modest a U b from sets a and b as follows. By the Axiom of Pairing,
we have (a, b). The set U {a, b) given by the Union Axiom is
{.t: (3z)(x £ z*z £ {«, b})}. But the only z s belonging to (a, b} are a and b,
themselves, so U {«, b) = {x: x £ «vxf b}, which is the desired a U b. It is
obvious that {.t: x £ 0 } = 0, so U 0 = 0, and that U {x: x £ {</}} = a, so
U {a} = a. It is also clear that U {x: x £ {//, b}} = a U fc, so U {«, b} =
a U b, and that {a} U {b} = (a, b).
We have already remarked that there is no absolute complement of any set
in ZF. But the concept of relattve complement is available. The relative
complement A — B of B in A, whose existence and uniqueness is given by
Separation and Extensionality:
A - B = df {x: x£A-x$ B)
which is the set of all members of a that are not members of R It is obvious
that A — 0 = A, that A — A = 0 — A = 0, and that if A H B = 0 then
A — B = A. The relative complement is more useful in mathematics than an
absolute complement would be. For example, one would want the complement
of the set of even numbers to be the set of odd numbers only rather than a set
containing, in addition to odd numbers, points and circles and functions and
Sec. 7.4] Relations and Functions 185
Where E is a finite set, it is easy to see that (Pi. contains more members
than E does. For example: (P0 = {0 }, (P{r/} = {0,{«}}, (?{«, h) =
{ 0, {a}, {b}, {a, /?}}. It is easy to prove hy mathematical induction29 that if
E contains n members, then (PE contains 2" members, which explains the origin
of the term ‘power set’. For any set E, we know that E C E and 0 C E, so for
any set E, we have both E £ (PE and 0 £ (PE. From this last, it is obvious that
D { x: x £ (PE ) = 0. Other immediate consequences of our definitions and
axioms are that E C E if and only if (PE C (PE that (PE U (PE C <P(E U E),
that(P(£ n F) = (PE n (PE, and that, although E = U (PE, in general we have
only that E C (P UE.
set F consists of all objects that satisfy the predicate or condition <px. By
analogy, we focus our attention on the collection of ordered pairs of objects,
written <x, t/>, between the first and second of whidh the relation Rxy holds. To
develop this notion within set theory, the idea of ‘order' must somehow be
defined in terms of sets. Although its members determine a unique set inde¬
pendently of any order in which they are arranged or in which they may be
named, it is, nevertheless, possible to define the notion of order in purely set
theoretic terms.30 The essence of the notion of order is that the ordered pair
(x, y) is the same as the ordered pair (a, b) if, and only if, both x = a and
y = b. This essence is captured by the Wiener-Kuratowski definition of
ordered pair:
<*,y> =df{{*},{*,y}}
Given any x and y, we use the Axiom of Pairing to produce the set in question
and it is unique by Extensionality. It is immediately obvious that if x = a and
y = b, then (x, y) = (a,b). To establish the converse, we assume that
(x,y) = <a, b>, that is, {{x}, {x,y}} = {{a}, {a, b}}. Since {x} £ {{*},
{x, y}}, we have {x} £ {{«}, [a, b}}. Hence {x} = {a} or {x} = {a, b). In
eithercasex = a.Since{a, b) £ {{«}, [a, b} },wehave {a, b} £ {{x}, {x,y}}.
Hence either (1) {a, b) = {x} or (2) {a, b} = (x, y). By a similar argument,
we also have either (3) {x, y} = {a} or (2) {x, y) = (a, b). If (1) and (3) are
both true, then x = a = y = b, which includes the desired result. If either (1)
or (3) is not true, then {x, y) = {a, b}, whence by our previous result that
x = a, we have that {«, y] = {«, b}. Here, if a ^ b then y = b, but if a = fo,
then y = b. So in every case y = b.
Given two sets A and B, we can form ordered pairs (x, y) with x £ A and
y £ B. The collection of all such pairs is called the Cartesian Product A X Bo f
A and B, and for each ordered pair (a, b) in it, a is called the first coordinate
and b is called the second coordinate. (An example is the following: given the
Euclidean plane with x and y axes established, if A is the set of all real numbers
identified with points on the x-axis and B is the set of all real numbers identified
with points on the y-axis, then thejjet of all ordered pairs (x, y> represents the
Cartesian plane itself—which accounts for the name). That this collection of
ordered pairs is a set, follows from the Power Set, the Union, and the Separa¬
tion Axioms. Given sets A and B, we form their union A U B, which contains
every x £ A and every y £ B. Next, we form the power set (P(A U B) which
contains every set {x} and every set {x, y) as members—along with other
members. Then, we form that set’s power set (P(P(A U B) which contains every
set {{x}, {x, y}} as a member—along with other members. Now, the Separa¬
tion Axiom tells us that there is a subset that is exactly the set wanted.
30 The method for doing so was first devised by N. Wiener and subsequently improved by
C. Kuratowski. See ‘A Simplification of‘the*Logic of Relations’ by N. Wiener, Proc. of the
Cambridge Philosophical Soc., vol. 17 (1914), pp. 387-390, and, ‘Sur la notion de l’ordre dans la
theorie des ensembles’, C. Kuratowski, Fundamenta MathematicOy vol. 2, (1921), pp. 101-171.
Sec. 7.4] Relations and Functions 187
So for any sets A and B, there is a set that contains all and only those ordered
pairs <x, t/> with x £ A and ?/ £ B. Extensionality insures uniqueness, so
‘A X B can symbolize the set in question. Much the same sort of argument,
but using unions instead of powers, will show that if <x, i/> £ A, then both x
and y belong to U U A. One point should be made clear: The Cartesian product
of A and B does not require A 76 B. The product A X A is a perfectly good
Cartesian product, that of A with itself.
We can now define a (binary) relation quite simply as any set of ordered
pairs. R is a relation if and only if
Obvious examples of relations are 0, and the Cartesian product of any sets. A
and B, and any subset of a Cartesian product.
The domain of a relation R, (dom R), is the set of all first coordinates of R.
The range of a relation R, (ran R), is the set of all second coordinates of R, and
the field of a relation R (fid R) is the set of all coordinates of R. Civen any
relation R, the axioms permit us to specify both its domain and its range. We
first use the Union Axiom twice to produce the set UUR, and then by
Separation
gives us the unique ran R. Examples are plentiful: dom 0 = ran 0=0,
dom (A X B) = A and ran (A X B) = B, and if R is equality of members of E,
then dom R = ran R = E.
Except for the empty relation 0, one must always specify a set within, for,
or over which, the relation is defined. If we tried to define the identity relation
X = Y without limiting its domain (or range, or field) we should get into
trouble—for the aggregate Z of all ordered pairs <x, y) such that x = y would
contain every set z = {{x}, (x, x}}, and the union of its union, U UZ would
be the universal set, which has been proved not to exist. The same problem
would arise in connection with set membership (x £ y) and set inclusion
(x C y).
The notion of a function is a familiar one. The pressure of a fixed volume of
gas is a. function of its temperature, and the cost of sending a letter by mail
from New York to San Francisco is a function of its weight. We define a
function in set theoretic terms as a many-one relation, that is, a set of ordered
pairs no two of which have the same first coordinate. If X and Y are sets, a
188 Set Theory [Ch. 7
function from (or on X) to (or into) Y is a relation, /, such that dom f — X and
such that for each x in X, there is a unique y in Y \yith <x, y> £ /. That / is a
function from X to Y is stated as ‘/:X —> Y\ and.is'defined as
For each x in X, the unique y in Y such that <x, y> £ /is represented by ‘/(x)’;
so /(x) = y. The object y is the uaiue that the function f assumes or takes on
at or for the argument x. Another locution commonly used is that f maps x onto
y. The domain of a function / from X into Y is X, but its range can be a proper
subset of Y. If the range of f is equal to Y, then f is said to map X onto Y.
A function /from X to Y that maps distinct elements onto distinct elements
is called one-to-one. If /: X —> Y is one-to-one and onto Y, then / effects a
pairing of the elements of X with those of Y, and, as such, is called a one-to-one
correspondence between X and Y. If there exists a one-to-one correspondence
between X and Y, these sets are said to be equivalent (or equinumerons); this is
symbolized as ‘X ~ Y\
The familiar 0, 1, 2, 3,... are called natural numbers or counting numbers.
They will be defined in set theoretic terms in the following section. A finite set
is defined as a set onto which a one-to-one function maps the set of all natural
numbers that are less than some natural number n. Such a set is said to contain
n elements. Thus the one-to-one function that maps 0 onto the letter a, 1 onto
b, 2 onto c,..., 24 onto y, and 25 onto z, maps the set of all natural numbers
less than 26 onto the set of all letters of the alphabet. An infinite set is then
defined (negatively) as any set that is not finite.
No finite set is equivalent to any of its proper subsets. But the matter is
otherwise with infinite sets. Galileo remarked long ago that, although not all
numbers are perfect squares, yet ‘there are as many squares as there are
numbers, because they are just as numerous as their roots and all the numbers
are roots/31 Some equivalences are even more surprising. There are as many
whole numbers as there are rational fractions. The one-to-one correspondence
between these two sets is shown by the following enumeration of all fractions:
of two fractions, we enumerate first that fraction, the sum of whose numerator
and denominator is smaller. Of two fractions whose numerator and denomina¬
tor have the same sum, we count first the one with the smaller numerator. (Of
course, we do not repeat equal fractions in our enumeration: If 1/2 has already
been counted, we do not also include 2/4 or 3/6 or • ■ • .) With this ordering of
the fractions, the one-to-one correspondence appears as follows:
1/1 1/2 2/1 1/3 3/1 1/4 2/3 3/2 4/1 1/5 5/1 1/6 ...
0 1 2 3 4 5 6 7 8 9 10 11 ---
31 Galileo Galilei, Dialogues Concerning Two New Sciences, Northwestern University Press,
Evanston and Chicago, 1946, p. 31.
Sec. 7.4] Relations and Functions 189
An enumeration is of course a one-to-one function from the natural numbers
(0, 1, 2, 3 ...) onto the set being enumerated. Any set, all of whose members
can be mapped one-to-one onto the natural numbers, is said to be denumerable
(or countable, or enumerable). (Of course, we have not yet shown that the
collection of all natural numbers is a set. This will be discussed in the next
section.)
If there is a one-to-one correspondence between X and a proper subset of Y
but no one-to-one correspondence between X and Y, then Y is said to be larger
than X, or to have more elements than X, and X is said to be smaller than Y or
to have fewer elements than Y. Cantor proved that there are more real numbers
than there are integers by showing that in any enumeration of real numbers
there must be some real number that gets left out. (Of course, the integers can
be mapped one-to-one onto a subset of the real numbers by the function f
where /(n) = n.) With no loss in generality, we confine our attention to real
numbers in the interval from 0 to 1, and consider any enumeration of real
numbers from that interval:
■ «11 «12 .. Uy
• f,21 «22 a 23 ' ' d ‘2j
• «:n a:v> *
.. a:]j
'• "n
a jo «I3 •
.. a,,
where atj is the ;th digit of the ith decimal fraction enumerated. Given any
such array representing a one-to-one mapping of natural numbers into real
numbers, a ‘missing* real number can be specified by the following diagonal
procedure. One missing real number is the decimal fraction A — 0.dx d2 ...
dk ... whose fcth digit dk is defined to be 7 if akk ^ 5, or 3 if akk > 5. This
‘diagonal’ number is clearly in the interval from 0 to 1, but it cannot be any one
of the numbers in the given array because it differs from each of them in at
least one digit. For any k, the diagonal number differs in its fcth decimal place
from the fcth decimal fraction of the array. Hence no enumeration of real
numbers can contain all of them—which proves that the real numbers are
nondenumerable, that is, that there are more of them than there are integers.
Now, we are in a position to consider another implication of the Power Set
Axiom. The Cantor diagonal proof just described is a special form of the proof
of a more general theorem, which states that for any set A, its power set (PA is
larger than A, itself. There is obviously a one-to-one mapping from A onto the
proper subset of (PA that consists of all singletons in (PA. (For every a in A, its
singleton {a} is in (PA, and s(a) = {a} is obviously one-to-one.) Now consider
any one-to-one function / from A into (PA, and form the subset A' of (PA that
consists of just those members a of A which are not members of/(a), the
element of (PA onto which they are mapped: A — {*: x f(x)}. Now,
190 Set Theory [Ch. 7
consider any a in A. If a £ A', then a $f(a) and so /(a) ^ A', but if a £ A',
then a £ f(a) and so again,/(a) ^ A'. But a is any member of A, so no member
of A is mapped by/onto A', which is, therefore, left out of the mapping/. But
/ was any one-to-one function from A to (PA, so (PA is larger than A.
Because Cantor’s principle of Comprehension was not restricted in the way
that the ZF Axiom of Separation is, Cantor’s set theory contained the set S of
all sets. According to the result derived in the preceding paragraph from the
Power Set Axiom, it follows that the power set (PS of the set S of all sets is
larger than S. But this is impossible, because every subset of S, being a set, is
a member of S. The Cantor paradox, then, is that given any set s there is a still
larger set (Ps, yet there is a largest possible set S. Cantor was aware of this
problem and communicated it to Dedekind in 1899.32 But Cantor did not work
out a satisfactory resolution of this difficulty. In the ZF set theory, the Cantor
paradox does not arise. The problem is solved by the more limited Axiom of
Separation that takes the place of Cantor’s too generous principle of Compre¬
hension. Although larger and ever larger sets can be produced by repeated use
of the Power Set Axiom, no ‘largest’ set is ever reached.
There are other definitions for ‘finite’ and ‘infinite’. One can define an infinite
set as a set that is equivalent to one of its proper subsets. Given this definition of
infinite, a finite set can then be defined as a set that is not an infinite set. This
pair of definitions can be proved to be equivalent to those first mentioned on
page 184, in the presence of the Choice Axiom, which will be introduced in
Section 7.6.
Still another definition of finite set, attributed to Dedekind, is ‘A set S is
called finite if there exists a mapping of S into itself such that no proper subset
of S is mapped into itself.’33
Numbers are useful in many ways. Early in childhood we learn to use the
cardinal numbers ‘one’, ‘two’, ‘three’, ... to count and to answer the question
‘How many?’ And we learn also that the ordinal numbers ‘first’, ‘second’, ‘third’,
... can be used in ordering a collection of objects. Our task is to develop or
define these numbers within ZF set theory. We postpone our discussion of
ordinal numbers until Section 7.7. In Section 7.6 we shall consider cardinal
numbers in general. In this section we are concerned with the ‘natural num¬
bers’: 0, 1, 2, 3, ....
Mathematicians have two ways of characterizing a collection of (mathe¬
matical) objects that they propose to study. One way is to set forth an axiom
system that will provide an ‘implicit definition’ of those objects, from which—it
PI. 0 is a number.36
P 2. If x is a number, then xf is a number.
P 3. If x and y are numbers and x' = if, then x = //.
P4. There is no number x such that x' = 0.
P5. If qr() and for every number a, if qx then qx\ then for every number
x, cpx.
Peano himself acknowledged that not only the natural numbers satisfy these
axioms. For example, the sequence 1, 1/2, 1/4, 1/8, ... also satisfy them, if
‘zero' is taken to denote 1, ‘number’ is taken to denote the set of positive
fractions with numerator 1 and denominator any power of 2, and ‘successor of
M A lucid and detailed account of these constructions is found in Edmund Landau, Foundations
of Analysis, Chelsea Publishing Company, New York, 1951. Or see II. B. Enderton, Elements of
Set Theory, Academic Press, Inc., New York, 1977, or Suppes, op. fit.
35H. Dcdekind, 'Was sind mid was sullen die 7.ahlen?\ Brunswick, 1888, W. W. Beinan, tr., in
Essays on the Theory of Numbers, The Open Court Publishing Company, Chicago and London,
1901. See also his ‘Letter to Kefferstein’ (1890) 11. Wang and S. Bauer-Mengelberg, tr., in van
Heijenoort, op. cit. G. Peano, Arithmetics Principiu, Nova Methodo Exposita, Turin, 1889, tr., in
van Heijenoort, op. cit.
3f,Some people find it objectionable to call zero a number, on the grounds, for example, that if
there are no apples in a box, it is misleading to say that the box contains a number of apples, since
the number is zero. One should less misleadingly say that there are no apples in the box or no
number of apples in the box. Indeed Peano’s own original formulation of his system had the
natural numbers begin with 1 rather than with 0. However, the number 0 is so useful in our
notations and computations that it is accepted in mathematics in spite of possibly misleading
locutions to which it may give rise.
192 Set Theory [Ch. 7
x' is taken to denote x/2, But systems other then the natural numbers that
satisfy the axioms can be shown to be ‘just like the natural numbers, and could
be taken to be just different expressions for them.*
It is clearly desirable to characterize the natural numbers in a more univocal
way than Peano’s. However, Peano’s axioms are very useful: If we manage to
construct the natural numbers, somehow, we can check our constructions by
verifying that they satisfy the Peano axioms, for the latter do yield all of the
strictly formal arithmetic truths that mathematicians prize.
Frege, and later independently Russell/17 proposed a brilliant ‘coastructive’
definition of number, based on the following considerations. If A and B are
equinumerous classes, that is, classes between whose members there is a
one-to-one correspondence, we say that A and B have the same number of
members. Every pair set, for example, has exactly two members. Is ‘two’, then,
the name for some characteristic shared by all pair sets? Does ‘two’ signify the
common characteristic of all pairs? It may be that all pair sets share several
characteristics, so the number ‘two’ cannot be defined in quite this way. Frege
and Russell avoided the ‘positing’ of a unique shared characteristic by defining
‘two’ to be the class (or set) of all pairs—which is unique by Extensionality. The
Frege-Russell definition of ‘zero’ is the class whose only member is the empty
set. The definition of ‘one’ is the class of all singletons and the definition of
‘three’ is the class of all triples, and so on. There is much wisdom here and deep
insight as well as ingenuity. But it cannot be incorporated into ZF set theory
directly, because if there were a set of all singletons (or all doubletons, or etc.),
then its Union would be the universal set, which has been proved not to exist.
Zermelo proposed to identify the natural numbers 0, 1, 2, 3, ... with the sets
0, { 0 }> {{0}}» {{{ 0 }}},•• ■ Later, John von Neumann38 proposed the use
of an alternative and more convenient sequence of sets to define the natural
numbers. By the Axioms of the Empty Set, Pairing, and Union, we know that
each of the following sets exist: 0, {0}, {0,(0}}, {0,{0}, {0,{0}}},
... where each set u is followed by the set u U {u}. It is convenient to define
the successor set u+ of the set u to be the sum of u and {u}. Then, 0 is defined
to be the empty set; 1 is defined to be the successor set 0+ which is 0 U { 0 }
or simply {0 }; 2 is defined to be 1+, which is {0} U {{0}} or simply
{ 0, { 0 } }, and so on. On this approach, instead of trying to define n with
Frege and Russell as the set of all n-membered sets, which we have seen to be
impossible in ZF, we define n to be a particular and special n-membered set, a
representative n-membered set, that the ZF axioms guarantee to exist:
37 G. Frege, Die Grundlagen der Arithmetik, Verlag von Wilhelm Koebner, Breslau, 1884,
J. L. Austin, tT., The Foundations of Arithmetic, Basil Blackwell, Oxford, 1950. B. Russell,
Principles of Mathematics, op. cit. There is some difference between these authors’ formulations.
Frege regarded a number as the number of a concept, whereas Russell regarded a number as the
number of the extension of a concept, that is, of the class of objects satisfying the concept.
38J. von Neumann ‘Zur Einfuhrung der transfiniten Zahlen’, Acta litterarum ac scientiarum
Regiae Universitatis Hungaricae Francisco-]osephinae, Sectio scientiarum mathematicarum 1,
1923, tr., in van Heijenoort, op. cit.
Sac. 7.5]
Natural Numbers and the Axiom ol Infinity 193
Now we have the following:
0=0
1 = {0}
2 = {0,1}
3 = {0,1,2}
n+ = {0,1, 2,..., n }
These infinitely many sets exist, but is there a set containing all of them?
Another axiom is required to insure that there is.
By itself, as here stated, the Axiom of Infinity does not provide us with a unique
set whose only members are 0,1,2, 3.because other objects might belong
to a set, S, guaranteed to exist by the Axiom of Infinity. We shall call any set an
inductive set if it contains 0 and the successor of each of its members. It is clear
that 0,1,2,3,... all belong to every inductive set. The Axiom of Infinity states
that there exists an inductive set, S. Every inductive subset of S contains 0, 1,
2,3,.... so the intersection of all inductive subsets of S, which we shall call to,
exists by the Separation Axiom, is unique by Extensionality, and contains just
the natural numbers.
To verify that <o satisfies the Peano axioms is fairly easy. That 0 £ to follows
from the fact that to is an inductive set, as does also the fact that if « £ to, then
u+ £ to, so PI and P2 hold for to. Axiom P4, which states that no u in to is such
that u+ = 0, follows immediately from the fact that u+ always contain u,
whereas 0 is the empty set 0 and does not contain any u. Because to is the
minimal inductive set, it follows that if any subset S of to is an inductive set,
then S = to. That is, if S C to, if 0 £ S, and if x+ £ S whenever x £ S, then
S = to. This is all that is stated by P5, which is a formulation of the principle of
mathematical induction.
To prove that P3 holds for every n in to, we introduce a new concept and
then prove two lemmas concerning it (a lemma is a subsidiary or preliminary
194 Set Theory [Ch.7
theorem introduced for the purpose of proving .some further theorem). The
new concept is that of a transitive set, defined to be a set that contains every
member of any of its members. Formally,
Proof: (by induction, appealing to P5) Let T be the set of natural numbers
that are transitive sets, that is, T = {x: xfwUxCx}.
First we observe that 0 £ T, since for every x, x $ 0, whence x £ 0 D x £ T.
Next we consider any natural number n and show that n £ 7" D n+ £ T.
Suppose n £ T. By Lemma 1, U(n+) = n, and since n+ = n U (n), n £ n+,
and so U(n+) £ n+. Therefore, n+ £ T. Now, by induction, T = co, that is,
every natural number is a transitive set.
Now we can prove P3, that for any n and in in co, if n4 = ra+, then n = m. If
n+ = m+, then U(n+) = U(m+). Since both n and m are in co, both are
transitive sets by Lemma 2. And by Lemma 1, U(n+) = n and U(m+) = w,
whence n = m.
This finally verifies that the von >feumann sets satisfy all of the Peano axioms
and, therefore, have all the usual arithmetic properties of natural numbers.
They are not the philosophically more plausible Frege-Russell numbers (which
cannot be sets in ZF) but are instances of the Frege-Russell numbers. Each
number n is itself equivalent to every set containing just that number n of
members. Hence they are admirably suited for counting the number of mem¬
bers in any finite set. They have, indeed, been called ‘counter sets'.39 Because
each natural number n is a set containing all of the natural numbers less than n,
the present definition is exactly in accotd with what was stated on page 184 in
defining a finite set. Each natural number, then, can serve the function of a
cardinal number and be used to answer the question ‘How many?' when asked
about a finite set.40 Some of their peculiar set theoretic features are surprising.
For example, each natural number n ‘belongs to' or ‘is a member of’ every
larger number, which may seem repugnant to common sense. But to a greater
or lesser degree, every scientific advance marks some departure from the
common sense that preceded it.41
In Section 7.5 the natural numbers 0, 1, 2, 3,... were given a set theoretic
definition. Each of them is the number of members in some finite set or other
and is a finite set with that number of members. The Axiom of Infinity stated in
that section guarantees the existence of the infinite set co. In Section 7.4, it was
proved that the set co has fewer members than its power set (Pio, and more
generally that any set A has fewer members than its power set (PA. 1 here are,
then, infinitely many sets, each containing a different finite number of mem¬
bers, and also infinitely many more sets, each containing infinitely many
members. Thus there is an unending ascension of sets, each larger than any that
precedes it.
Cantor symbolized the cardinal number of a set A by A , where the two
lines over the symbol for the set signify a double abstraction from the concrete
nature of the set, first, abstracting from the particular nature of its members
and secondly, abstracting from the particular order in which those members
are arranged. It is more usual today to symbolize the cardinal number of a set
A by card A\ The essential nature of a cardinal number is given by
^ «
Be’, The Philosophical Review, vol. 74 (1965), pp.
198 Set Theory [Ch. 7
representative, or cardinal number. This has already been done for the finite
case. It will be convenient to delay this specification for the infinite case until
Section 7.7. In that section it will also be remarked*that the cardinal numbers
are serially (totally, or simply) ordered by the relation < (is less than). That is,
for any cardinal numbers m and n, if m ^ n then either m < n or n < m,
where the relation < is asymmetric, irreflexive, and transitive (these attributes
of relations were defined on pages 134-136). This feature of cardinal numbers
is, of course, necessary if they are to fulfill their intended function, which is to
measure the size of sets.
It was remarked in Section 7.4 that the set of all natural numbers, <o, is
equivalent to various other sets: the set of all perfect squares and the set of all
rational fractions. There is a single cardinal number, card to, that is the number
of members in each of these equivalent sets. We define the symbol for it:
K0 = df card co
this is the Hebrew letter aleph with subscript zero, and is read ‘aleph-nuH\ It
is the smallest infinite cardinal number.
Because the Power Set Axiom produces ever larger sets, there are infinitely
many cardinal numbers. Each is the unique representative of sets equivalent to
each other and to itself. Infinite cardinal numbers have an arithmetic that, in
some respects, is very similar to the familiar arithmetic of finite numbers,
although, in other respects, it is very different. The definitions of the funda¬
mental arithmetic operations on cardinal numbers are the same for both finite
and infinite cardinals.
Intuitively, the sum of two numbers, m and n, is the number p if and only if
there are disjoint sets M and N with m = card M and n = card N and p = card
(M U N). We define m -f n to be card (M U N) where M and N satisfy the
conditions just stated. In other words, the arithmetic sum of two cardinal
numbers x and y is defined to be the cardinal number of the set-theoretic sum
(or union) of any two disjoint sets X and Y that have those cardinal numbers,
respectively. That addition, so defined, is independent of the particular disjoint
sets M and N, can be seen from the fact that for any other disjoint sets M' and
N\ if M~M' and N~N\ then (MU N) ~ (M’ U N’). The familiar com¬
mutation and association laws for + follow from the commutation and
association of U.
The differences between arithmetic addition for finite and infinite cardinals
can be brought out clearly by the following contrasts. For finite numbers m and
n,m + n = m only if n = 0. But for infinite cardinal numbers, that is not true.
The set of natural numbers co can be decomposed into a finite and an infinite
part. For any finite number n, co = {0, 1, 2,..., n — 1} U (n, n + 1,
n -I- 2,...}, where the first summand has cardinality n and the second
summand has cardinality K0. Hence
w + — H0 4" ft — Hq (1)
Sec. 7.6] Cardinal Numbers and the Choice Axiom 197
for any finite number n, even where n ^ 0. Another contrast is the following.
The sum of any two finite numbers different from 0 is larger than either
summand. Because the set of all odd numbers {1, 3, 5, 7,...} and the set of all
even numbers (0, 2,4, 6,...} each has the cardinality K0 and their union is the
set of all natural numbers, we have
+ «o = K0 (2)
Finally, it should be remarked that for any finite cardinal numbers m and n,
where m < n, the result of subtracting m from n, (n — m) is unique. The result
of subtracting m from n, where m and n are infinite cardinals, however, is not
unique. (1) and (2) above show that K0 — K0 can equal any cardinal number
from 0 to H0 itself.
Arithmetic addition is defined also for infinitely many summands. If {mi} is
a set of infinitely many cardinal numbers, then where {Mi} is a set of infinitely
many sets, such that mi = card and i ^ j D Mi fl Mj = 0 for every i and
/, then by definition
y mi = card (U )
i
For example, the denumerable set of all natural numbers can be decomposed
into denumerably many sets, each of which contains dentnnerably many
natural numbers, as in the following (diagonal) array:
1 2 4 7.
3 5 8.
6 9.
10...
Mo + *o + *0 + • • • = H0-
Intuitively, the product m*Ti (or inn) of two numbers tn and ti is the result of
adding m to itself n times. An alternative (equivalent) way to define the
product of m and n is to produce sets M and N where n = card M and
n = card N and then use the Cartesian product to define
= card (M X N).
It is easily seen that this product is unique (i.e., independent of the particular
sets M and N whose Cartesian product is formed), that it is equal to 0 if, and
only if, either M = 0 or N = 0. It is also evident that the product of m and n
198 Set Theory [Gh. 7
obeys the familiar commutation and association laws for multiplication and
that, for the infinite case as well as for the finite one, multiplication is distribu¬
tive with respect to addition. » '
There is, however, a complication in the case of multiplication that involves
infinitely many factors—even where there are ‘only* R0 factors and even where
each factor is finite. Suppose 911 to be a set of infinitely many disjoint nonempty
finite sets M{ where mi = card Mit and we wish to define the product IT
Intuitively, this number is the number of distinct selection sets, ja, where each ju.
contains exactly one element from each of the sets in 911. In the finite case
where911 = {{a, b}, (c, d, e}}, that is, where Mt = (a, b) andM2 = {c, d, e),
the distinct sets ju, are {a, c}, {a, d}, {a, e}f {b, c}, {b, d}, {b, e}. There are six
such sets, which is the expected product since 2 = card M1 and 3 = card M2.
The actual process of forming distinct sets ju, is simple. Select one member from
Mv and select one member from Af2. Form their pair set, which is our first ja.
Then select one member of M, and select a different member of M2. Next, form
their pair set, the second /a, and so on. Now, the problem in the infinite case is
this: an arbitrary choice must be made from among the members of each of the
infinitely many sets A/f of the collection 9fl—unless each set Mi (that is not a
singleton) has its members already arranged alphabetically, numerically, by
size, or in any other order that permits the statement of a rule that dictates
which should be chosen first for inclusion in the first fi, which next for inclusion
in the next ja, and so on. Now for any single finite set or even for an infinite
one, there is no great problem of making an arbitrary selection. But the notion
of making an infinite sequence of arbitrary choices is somewhat fanciful. What
is needed here is another axiom, one that asserts the existence of a selection set,
ja, that contains exactly one member from each of any number of sets,
Zermelo stated this axiom in approximately the following form:
The reason for the difference is this: Among boots we can distinguish
right and left, and therefore we can make a selection of one out of each
pair, namely, we can choose all the right boots or all the left boots; but
with socks no such principle of selection suggests itself, and we cannot be
sure, unless we assume the multiplicative axiom, that there is any class
consisting of one sock out of each pair. Hence the problem.
We may put the matter in another way. To prove that a class has K0
terms, it is necessary and sufficient to find some way of arranging its terms
in a progression. There is no difficulty in doing this with the boots. The
pairs are given as forming an N0, and therefore as the field of a progres¬
sion. Within each pair, take the left boot first and the right second,
keeping the order of the pair unchanged; in this way we obtain a
progression of all the boots. But with the socks we shall have to choose
arbitrarily, with each pair, which to put first; and an infinite number of
arbitrary choices is an impossibility. Unless we can find a rule for
selecting, i.e. a relation which is a selector, we do not know that a
selection is even theoretically possible.43
Clearly, if we have one subset /a, we can get another by taking the relative
complement of each set — p, in turn, and using the Choice Axiom to get
another p, and so on. Hence the new axiom guarantees the existence of all the
selection sets jut in U3H, and legitimizes the definition:
If {m*} is a set of cardinal numbers, and if {Af,} is a corresponding set of sets,
such that card = mi for each i, then
Here, in the case of defining the product of cardinal numbers mit the choice
function /for the set U 9K will select an element xt from each nonempty subset
M4 of U 3R; each determines a singleton (xf}, and the union of all of them will
be a selection set /a.
There are several theorems equivalent to the Choice Axiom, one of which
plays an important role in the next section.44 Much has been written about the
43Bertrand Russell, Introduction to Mathematical Philosophy, C,eorge Allen & Unwin Ltd.,
London, 1919, p% 126. .. .
44cwcnnnM nn no 243-250. A very full discussion of various applications of the Choice
Axiom tadiffwirt^ranches'of mathematics can be found in Fraenkel and Bar-Hillel, op. cit, pp.
59-74.
200 Set Theory [Ch. 7
Choice Axiom. Efforts were made to derive it from the other axioms of set
theory. None succeeded. Some people worried that a contradiction might arise
from adding the Choice Axiom to the other axioms. Those worries proved
groundless. In 1938 Godel proved that if the other axioms of set theory are
consistent, then the result of adding the Choice Axiom to them is also consist¬
ent.45 In 1963 Paul J. Cohen proved that the Choice Axiom cannot be derived
from the other axioms.46
Exponentiation of cardinal numbers follows the same pattern. In the finite
case, just as multiplication involves the addition of equal summands, so
exponentiation involves the multiplication of equal factors. We recall that for
finite nonzero cardinal numbers a, g\ d, the term ba is the result of multiply¬
ing b by itself a times, whence ba+c =ha*bv, (b*d)a = ha • da, (ba)c =
(hc)a = ,
and where h ^ 0, bli = 1 and where a 0, 0" = 0.
It is useful at this point to introduce a new notation for certain sets of
functions:
BA = df {/: f is a function on A to B}
This notation is intended to reveal and exploit the analogy with the familiar
notation for exponentiation. That there is an analogy is shown by the following
examples. Suppose that B = {bv h2, ..., frn},son = card 6; andlet A = {r/1},
so card A = 1. In this case there are exactly n functions in BA: fx(ax) = bv
f2(cil) = b2, ... = bn. In this case, card (BA) = (card B)CArdA. Now for
the same B, let A = {av a2}\ here card A = 2. In this case there are n2
functions in BA\
/ll(«l) = bl
/12(«j) = bx fvtflx) = • = b\
II
CM
II
CM
/2l(fl2) = bl f 23(^2) = b3
a
II
In this case also card (BA) = (card B)2 = (card B)cardA. In general, for any
finite nonempty sets A and B, card (BA) = (card B)K"ArdA. So it is plausible to
extend the definition of exponentiation to cover all cardinal numbers a and b,
45 K. Godel, ‘The Consistency of the Axiom of Choice and of the Generalized Continuum
Hypothesis’, Proceedings of the National Actulemy of Sciences, U.S.A., vol. 24 (1938), pp. 556-557.
See also The Consistency of the Axiom of Choice and of the Generalized Continuum Hypothesis
with the Axioms of Set Theory, Annals of Mathematics Studies, No. 3, Princeton, N.J., 1940.
46 Paul J. Cohen, ‘The Independence of the Continuum Hypothesis’, I and II, Proceedings of the
National Academy of Sciences, V.S.A., vol. .50 (1963), pp. 1143-1158, and vol. 51 (1964), pp.
105-110. See also Paul J. Cohen, Set Theory ami the Continuum Hypothesis, W. A. Benjamin, Inc.,
Menlo Park, Calif., 1966.
Sec. 7.6]
Cardinal Numbers and the Choice Axiom 201
where a and b can be either finite or infinite:
Just as cardinal addition is based on the union of disjoint sets that have the
cardinal numbers being added, and just as cardinal multiplication is based on
the Cartesian product of sets having the cardinal numbers being multiplied, so
cardinal exponentiation is based on mappings of one set onto another, where
the range has the cardinal number that is raised to a power and the domain has
the cardinal number that is the exponent. It is easily shown that for any
nonzero infinite cardinal numbers a, b, c, d the same equalities obtain as for
finite cardinals. So we have = 1, and for any finite cardinal a > 0,
If the set B in BA is equal to 2 ( = {0, 1}), we have the set of all characteristic
functions of subsets of A. The definition is this:
For any subset A' C A the characteristic function
unique subset of co, namely, the subset that contains the natural number n if
and only if dn = 1. Thus the binary fraction 0.111... 1 ... determines the set
co, itself; the binary fraction 0.000 ... 0 ... determiqes the empty subset of co;
and the binary fraction 0.101... 1... determines the subset of co that contains
every natural number except 2. The binary fraction 0.0101 ... 0101 ...
determines the set of all even numbers, where only dn’s with even-numbered
subscripts are l’s, and so on. Conversely, every subset A' of to determines a
binary fraction 0.dxd2d3 ..., in which dn = 1 if, and only if, n £ A'. Thus we
have a one-to-one correspondence between (Pco and (x: x is a real number such
that 0 < x < 1} and hence between (Pco and R, so (Poo ~ R, and card
((Poo) = card R. The usual symbol for card R is K, the cardinal num¬
ber of the continuum. Since 2W ~ (Poo, card (2W) = card ((Poo), and since
card (2W) = (card 2)cardl° which is 2*o, we have established that 2N«> = N.
As for K0, it is also true for N that N + N = N"N=:N. More infinite cardinals
exist by the Power Set Axiom. Some of them will be discussed in the next
section.
1. Let S be the power set (P (a, fo, c) with the ordering relation that of set
inclusion, C.
2. Let P be the set of positive integers and let the ordering relation be
divides without remainder, d.
3. Let co be the set of natural numbers with the ordering relation less than
or equal to, <.
Each of these sets is partially ordered by the relation mentioned for it,
where we define partial order in a set X to be a reflexive, antisymmetric,
and transitive relation in X. These notions were discussed and defined in
Chapter 5 (pages 135-136 and 143). Their definitions in the context of ZF
set theory are
a subset of itself, is that x < f(x), for every x in X. This is proved as follows.
Suppose that there were an x in X, such that f(x) < x. Then since X is well
ordered, there must be a least such x, call it x0. It could not be the least element
in X, because /(x0) £ X and f(x0) < x(). For any y in X, if y < x0 then y </(«/)•
But f(x0) is such a y, so f(x0) < f(f(x0)). But since f(x0) < Xq, by the order
preserving nature of /, f(f(x0)) < /(x0). -Now by transitivity, /(x0) < /(x0),
which contradicts the irreflexiveness of <. So the supposition that there is an
x in X, such that f(x) < x, must have been false.
There is an important consequence of the result just obtained. To state it we
must define a new term. If X is a well ordered set, and a £ X, then the subset
{x: x £ X#x< a) is called the initial segment determined by a, and is usually
denoted by s(a). Now we can prove that no well ordered set can be similar to
one of its initial segments. For if X were a well ordered set containing a as an
element, and/were a similarity function from X onto s(a), then we would have
f(a) £ s(a), whence f(a) < a, which was proved to be impossible.
Many truths about well ordered sets can be proved by what is called the
principle of transfinite induction. It is analogous to the principle of mathe¬
matical induction discussed in Section 8.5, and, indeed, for finite sets the two
are equivalent. In the case of infinite well ordered sets, only the new principle
is applicable. It can be stated as follows:
Its proof in the general case is simple. If X — S were not empty, then it
would contain a least element, say x. That implies that every element of the
initial segment $(x) must belong to S and, by the induction hypothesis, x £ S,
also. But this would be impossible, because x cannot belong both to S and to
X — S. Thus X — S must be empty and since S C X, S = X. This principle, as
applied to natural numbers, is called ‘strong induction,’ and its use will be
illustrated in Chapters 8, 9, and 10.
If two simply ordered sets are similar, they are said to be isomorphic (with
each other). Ordered sets that are isomorphic with each other are said to have
the same order type. The most important order types are those of well ordered
sets. Some writers define ‘ordinal numbers’ as the order types of well ordered
sets.47 An order type of a given nonempty well ordered set, however, cannot be
defined in ZF set theory as the set of all well ordered sets that are isomorphic
with the given well ordered set, because the union of such a set of all similar
well ordered sets would be the universal set—which we know does not exist.
This situation is the same as that which made the Frege-Russell definition of
cardinal number incompatible with ZF set theory. In the case of cardinal
numbers, discussed in Section 7.6, we let an unspecified representative set from
The first and smallest of the infinite ordinal numbers covered by this definition
is co. (Infinite numbers are also called transfinitc numbers.) Using the same
definition of successor set that was given in Section 7.5, lU = u U {#/}, but
now allowing u to range over ordinal numbers, we moved beyond to to to *, then
to (co+) + , and so on. These are easily seen to be ordinal numbers also. For if nr
is an ordinal number, then so is n+, which is nr U {«}. For if £ £ or+, then either
£ £ a or £ £ {n}. In case £ £ nr, then since a is an ordinal number, .v(£) = £; and
in case £ £ {o }, £ = «, in which case s(£) = a, that is, s(<\) = or. So in either
case s(£) = £ for any £ £ n+, and n4 is an ordinal number.
Before stating the next ZF axiom, which will permit the construction of
transfinite ordinal numbers out to any distance that might be desired, it will be
convenient to define the various arithmetical operations that can be performed
with ordinal numbers.
Well ordered sets can be combined by putting one directly after the other.
For this to be done, of course, the sets must be disjoint. But if the set A and B
that we want to combine in this way are not disjoint, w'e can easily obtain other
sets A' and B' that are disjoint, and such that A s A' and B s B'. A' can be the
set of all ordered pairs <a, 0> with a in A and B' can be the set of all ordered
pairs <fo, 1> with b in B. The obvious one-to-one correspondences are
a -» <a, 0) and b <h, 1>, with the order within A' and B' simply borrowed
from A and B. Therefore, given that A and B are disjoint well ordered sets, we
48 See John von Neumann, ‘Uber die Definition durch transfinite Induktiori und verwandte
Fragen der allgemeinen Mengenlehre’, Mathematische Atmaltn. vol. 99 (1928), pp. 373-391. See
also R. M. Robinson, ‘The Theory of Classes. A Modification of von Neumann’s System’, rlhe
Journal of Symbolic ljogic, vol. 2 (1937), pp. 29-36.
206 Set Theory [Ch. 7
CO, CJ -f l, to 4" 2, . . •
The union of this set and co we denote by co2. It is easily seen to be an ordinal
number. After co2 come further ordinal numbers: u:2 + 1. »2 + 2 and so on.
After all of them comes <A whose existence as a set is asserted by another
specification of Replacement. Continuing, we get «3 + 1, «3 + ^ and so on,
and then co4. In the same way, we obtain further ordinal numbers. Thus we get
successively, co, co2, co3.By Replacement, again we obtain an ordinal
number, co2, that is related to these other ordinal numliers in the series in the
way that co is to the natural numbers. It is convenient to introduce new symbols
for some of the transfinite ordinal numbers produced by he Axiom of Re-
placement. For example, the ‘next’ ordinal after the sequence «o-, co« >, co ,
208 Set Theory [Ch. 7
... is usually written as t0. Also, there is an ordinal number which will be
discussed later. . .
It might be helpful at this point to look at some particular transfinite well
ordered sets composed of familiar elements. Three examples follow.
2n — 1
1. The set of all fractions ———, together with 1, ordered by the familiar
in which the initial segment s( 1) is clearly similar to to. The whole well ordered
set must have an ordinal number greater than to. (In fact, its ordinal number is
to + 1.)
2. The set of all odd numbers in order of magnitude, followed by the set of
all even numbers in order of magnitude. This set looks like 1, 3, 5, ...; 2, 4,
6,... and must have an ordinal number twice as large as to. (In fact, its ordinal
number is to + to or to -2.)
3. The set of all positive integer powers of prime numbers pn, ordered
according to the relation < defined as follows: p^ < pjn if either pm < pn or
both pm = pn and i < /. This set looks like
and must have an even larger ordinal number. (In fact, its ordinal number is
to-to or to2.)
Some remarks about the Axioms of Replacement are appropriate at this
point. First, the condition in the hypothesis that <p(x, y) and <p(x, z) implies that
y = z serves to prevent taking as a specification of <p(x, t/), the formula x C y.
If that were allowed, then—since 0 is a subset of every set—taking X as { 0 }
would make S contain all sets, which has been shown to be impossible. Perhaps
it might occur to the reader that the cdXdjtion of being single-valued could
more simply be expressed by specifying a function/such that/(x) = y, rather
than invoking an expression like <jp(x, y). A function is well defined, however,
only if its domain is a set and a set can be specified that contains its range. The
Replacement Axioms are needed precisely to produce a set that is the range of
/. The Replacement Axioms are enormously powerful. Indeed, the Separation
Axioms can be derived from Replacement. Given any set A and any condition
<p(x), if we specify the X in the Replacement Axiom to be A and we specify
<p(x, y) to be (x = y and qp(x)), then the ‘<p(x, y) and <p(x, z) imply y = z part of
the hypothesis of the Replacement Axiom is satisfied. Its conclusion then asserts
(leaving out the trivially true *x = x’)
(3S)(x)(x.e S = X€ A-<px)
Here, we specify the set A in Replacement to be the power set (P(P( 0), which
is {0, {0 }}. If (i and b are the two objects whose pair set is wanted, we
specify q>(x, y) to be (x = 0 -y = a) v (x = { 0 } -i/ = b). Then, for each .r in
(P(P(0), there is exactly one y such that qr>(x, y): for x = 0, y = a, and for
x = {0},y = b. With these specifications, the hypothesis of the Replacement
Axiom is satisfied and its conclusion then states that
(3S)(x)(.r £ S = x = a v x = b)
0, 1, 2, ..., a?, (o + 1, to + 2, ..., to2, co2 -f- 1, co2 + 2, ..., io3, ...,
w4, . . ., w2, w2 4- 1, co2 + 2, . .., to-' + to, to2 4- to + I, to2 + to + 2.
w2 4- w2, to2 4* <o2 4* 1, . . ., to2 4- <o3, . . ., to2 4 <a:4, . . ., to22, . . .,
to23,. . ., to3,. . . , to4,...»to1*',..., to*0, to*** ,..., i 0 4- 1» *'|) 4* 2,...,
Eo + ^ • i P0 + w2, ..., i0 4 to2, . .., f0 4 tou\ .. . ,
f02, . . . , f:0(0, . . . , F0tOw, . . . , f0z.-
Up to any point in this ascension, the set of all ordinal numbers, to that point, is
a well ordered set, W, which has an ordinal number greater than any member
of W. If there were a set of all ordinal numbers, $2, it would have an ordinal
number greater than any ordinal number in ll, that is, an ordinal number
greater than any ordinal. This contradiction is called the Burali-Forti Paradox,
and shows that there can be no set containing all ordinal numbers.49
In Section 7.6 much was said about transfinite cardinal numbers, but their
definition was postponed to the present section. We did say that a cardinal
number was a representative set, serving as the cardinal number of any set
equivalent to it. We want now tcTspecify which sets are to serve as these
representatives. To do so we mak&Wise of an important theorem proved by
Zermelo, using the Axiom of Choice, that every set can be well ordered/*0 We
know that repeated uses of the Power Set Axiom establish the existence of an
endless series of sets with ever larger cardinality. By Zermelos theorem, every
set is equivalent to some ordinal number. This suggests that we look to ordinal
numbers for a specification of the representative sets to serve as cardinal
numbers.
An infinite set, of course, can be equivalent to a great many different ordinal
numbers. In fact, all of the transfinite ordinal numbers indicated on this page
have cardinality of only’ R0- But all of the ordinal numbers equivalent to a
given set S form a set. The argument here is quite simple. The power set of S,
49See C. Burali-Forti, ‘Una Questione Sui Numeri TransfinitiV fiendiconti del Circolo Male-
matico di Palermo, vol. 11 (1897), pp. 154-64, tr. in van Ileijenoort, op. cit., pp. 104-11. See also
I. Copi, ‘The Burali-Forti Paradox’, Philosophy of Science, vol. 25 (1958), pp. 281-86.
50 E. Zermelo, ‘Beweis, das jede Menge wohlgeordnet werden kann\ Mathematische Annalen,
vol. 59 (1904), pp. 514-516. Tr. in van Heijenoort, op. cit., pp. 139-141,
210 Set Theory [Ch. 7
(PS, has higher cardinality than S. Any ordinal number, say a, that is equivalent
to (PS, has cardinality greater than the cardinality, of S, and a is therefore larger
than all of the ordinal numbers equivalent to S. Since every ordinal number fi
less than o is a member of o, a is a set that contains every ordinal number
equivalent to S. Now the Separation Axiom gives us a subset a' of o containing
all and only ordinal numbers equivalent to S. Since o' is an ordered set, it
contains a smallest or least element a, which can then be specified as the
representative set equivalent to S which is selected to serve as the cardinal
number of S. We can define cardinal number now as an ordinal number a such
that if fi is another ordinal number equivalent to «, then a < fi. It is clear that
this definition accords with the essential characteristic of cardinal numbers,
that card A = card B if and only if A ~ B.
It should be realized that powers and products of ordinals must not be
interpreted cardinally. If A and B are well ordered sets, then, in general, card
((ord B)ordA) < (card B)cardA. For example, in the case of 2* we have card
((ord 2)*) = card to = K0, but (card 2)card* = 2*o = «, and K0 < N. And in
the case of tou, we have (card to)cardw = = K, but card ((ord to)(ordto)) = K0
because all positive integers can be arranged into a well ordered set whose
ordinal number is to*. And the set of all positive integers has cardinality K0.51
The nondenumerable set R of all real numbers with cardinal N has an ordinal
number located beyond all of the ordinal numbers indicated on page 209.
There are ordinal numbers with nondenumerably many members. The set of all
ordinal numbers, up to some point beyond the location of the ordinal number
of the well ordered set of real numbers is, itself, a well ordered set. Therefore,
it has a least member—which is called to1, the least ordinal number with
nondenumerable cardinality, and its cardinal number is Kr There is, in fact, a
whole sequence of alephs with increasing cardinality. Their cardinal num¬
bers are
We know that 2K° = N and that K x is the least cardinal number greater than
K0. From this, we know that < K. The much discussed continuum hypoth¬
esis asserts that Kj = K or Kj = 2*<>. The generalized continuum hypothesis
states that for every ordinal number a, Ka+1 = 2K«. Efforts to prove the
continuum hypothesis were made by such distinguished mathematicians as
Cantor and Hilbert. In 1939, Godel proved that if the axioms of set theory are
consistent, the continuum hypothesis cannot be disprovedl, and in 1963 Paul
Cohen proved that the continuum hypothesis cannot be proved from the
axioms of set theory either.52
The nine axioms for ZF set theory stated thus far are believed to be sufficient
for the development of mathematics. It might seem desirable to add a final
51 See A. A. Fraenkel, Abstract Set Theory. 3rd rev. cd., op. cit., p. 205.
52See the references cited in footnotes 45 and 46 on page 200. Gddel’s and Cohen’s results also
apply to the generalized continuum hypothesis.
Sec. 7.7] Ordinal Numbers and the Axioms of Replacement and Regularity 211
restrictive axiom stating that there are no sets other than those whose existence
is required or entailed by the given axioms. A hint as to the direction that such
an axiom might take is given by another paradox, due to Mirimanoff.53 It
involves the concept of a grounded set, defined as any set, .v, for which there
is no sequence of (not necessarily distinct) sets tjv t/2, i/3, ... such that ...
€ !/3 € */2 £ !/i € *■ Let W be the set of all grounded sets. If W is grounded,
then W £ W> whence ... W £ W £ W, so W is not grounded. But if \V is not
grounded, then there is a sequence of sets yv y2, i/a, ..., such that ...
€ */3 € J/2 € !/i € W, whence yx is not grounded, and therefore cannot be a
member of W. In either case, W involves us in a contradiction. The limited
kind of comprehension stated in the Separation Axiom would seem to prevent
the formation of such a set as W. But the very idea of a nonempty set with no
ultimate members, but containing an infinite descending sequence of sets,
seems repugnant. In view of the need to distinguish between a singleton {.v}
and its only member x, the idea of a set’s being a member of itself also seems
suspect. But for the limitation in the Separation Axiom, the very condition
x £ x would lead to the Russell paradox. This paradox can be generalized into
infinitely many others involving what might be called ‘£ -cycles’: x £ yy £ x,
x £ ymy £ zmz £ x, and so on.54 Such ungrounded sets, self-membercd sets, or
£ -cycle sets, all seem both repugnant to our thinking and of no possible utility
in mathematics or logic. They are all ‘ruled out’ by the final ZF axiom:
That this axiom does rule out the undesirable sets in question is easily shown.
1. If A is a set, then A ^ A. If A = 0 , A has no members whatever so A $ A.
If A ^ 0, by the Regularity Axiom, there is an x in A, such that {A } PI x = 0.
Since {A} is a singleton, only A £ {A} so that x = A and {A} HA = 0 . But
since A £ {A}, it follows that A $ A, which was to have been proved
2. There are no sets A and B such that A £ B-B £ A. Suppose that there
were such sets: then both
By the Regularity Axiom there is an x in {A, B}, such that {A, B} flx = 0. But
{A, B} is the doubleton of A and B, so either x = A or x = B. Therefore, either
{A, B} fl A = 0 or (A,B) PI B = 0, which contradicts (1), proving the
supposition false.
3. There is no ungrounded set i/0, such that there is a sequence of sets y v t/2,
t/3,..., such that... € jfo € ifc € Vi € f/0. For if there were such a set y0 and
sequence of sets y,, t/2» 1/3, • - -, we could use the Axiom of Replacement with X
specified to be <0 and y(x,;/) specified to be yx+ £.yx. The condition that for
each x in <p(x, y) and qp(x, z) implies y = z must be satisfied because our
second result shows that there can be no £ -cycles. Therefore, all of the sets yv
y2, 1/3, ... must be distinct. With its hypothesis thus satisfied, the Axiom of
Replacement would guarantee the existence of a set R = (x: x = yn £ w}.
Hence there would be a function / with to = dom f and R = ran /, where
f(n+) £ f(n) for each n £ to. Then, any member r of R would have to equal f(n)
for some n, from which it would follow that f(n+) £ R H r. This implies that
for every r in R, R D r ^ 0. That, however, would contradict the Axiom of
Regularity.
This single chapter was intended to provide a bare introduction to the
fascinating and important branch of mathematics known as set theory. Only a
few of its topics and problems have been mentioned and, of course, not
everything said has been proved. It is hoped that the readers’ interest and
curiosity have been sufficiently aroused to make them want to learn more
about set theory.
Useful books that can be consulted with pleasure and profit are listed here.
Cohen, Paul J., Set Theory and the Continuum Hypothesis, W. A. Benjamin,
Inc., Menlo Park, Calif., 1966.
Enderton, Herbert, Elements of Set Theory, Academic Press, Inc., New York,
1977.
Fraenkel, Abraham, Abstract Set Theory, 3rd rev. ed., North-Holland Publish¬
ing Company, Amsterdam, 1966.
Fraenkel, Abraham, and Yehoshua Bar-Hillel, Foundations of Set Theory,
North-Holland Publishing Company, Amsterdam, 1958.
Halmos, Paul, Naive Set Theory, D. Van Nostrand Company, New York, 1960.
Hausdorff, Felix, Set Theory, 2nd ed., Chelsea Publishing Company, New York,
1957.
Jech, Thomas J., Lectures in Set Theory, Springer-Verlag, Berlin-Heidelberg-
New York, 1971.
Quine, W. V. O., Set Theory and Its Logic, Harvard University Press, Cam¬
bridge, Mass., 1963.
Stoll, Robert R., Set Theory and Logic, W. II. Freeman and Company, Pub¬
lishers, San Francisco and London, 1963.
Suppes, Patrick, Axiomatic Set Theory, D. Van Nostrand Company, New York,
1960.
8
A Propositional Calculus
will be ordinary English, plus elementary arithmetic, with the addition of some
special symbolic devices that will be introduced and explained as they are
needed. It is assumed, of course, that the reader understands the metalanguage,
for the entire discussion occurs within it. Only this one metalanguage will be
used. It will function in some parts of our inquiry as a semantical meta¬
language, in other parts as a Syntax Language.
It should be emphasized that object language’ and ‘metalanguage* are
relative terms. Any language, no matter how simple or how complex, is an
object language when it is being talked about. And any language (which must
be an interpreted or meaningful one, of course) is a metalanguage when it is
used to discuss an object language. Since our object language is uninterpreted,
we cannot use it as a metalanguage. In another context, however, where the
object language is an interpreted or meaningful language, one and the same
language can function as both object language and metalanguage. Thus in our
first chapter the English language was being discussed (and hence was the
object language), and the discussion was carried on in English (which was,
therefore, the metalanguage, also). A sufficiently rich or complex language can
succeed in formulating the whole of its own syntax, and a good deal of its own
semantics. But no consistent language can express the whole of its own
semantics; certainly not the truth conditions for all of its own statements. That
limitation can easily be shown.
That no language can express exhaustively its own semantics is shown by the
following version of the ‘Paradox of the Liar’.1 Consider the following English
sentence:
The sentence printed on page 214, line 25 of this book is not true.
Let us abbreviate the preceding sentence by the letter ‘S’. Now just as it is
obvious that ‘Snow is white’ is true if, and only if, snow is white, so it is equally
obvious that
‘S’ is true if and only if the sentence printed on page 214, line 25 of this
book is not true.
jm
But counting the lines and looking at the page number verifies that ‘S’ is
identical with the sentence printed on page 214, line 25 of this book. Hence
'Attributed to J.Lukasiewicz.
Sec. 8.2] Primitive Symbols and WeH Formed Formulas 215
were not as innocent as they appeared. The source of the trouble is generally
agreed to lie in the attempt to formulate the truth conditions for the sentences
of a language within that language itself. At least, if we distinguish sharply
between object language and metalanguage, and do not try to make an object
language serve as its own semantical metalanguage, the contradiction does not
arise. An alternative method of avoiding such contradictions is discussed in
Appendix C at the end of this book.
We now proceed to construct our logistic system. There are two kinds of
primitive symbols in our propositional calculus: ‘propositional’ symbols and
‘operator’ symbols. We use just four of the latter kind, these being
(,)•
We want infinitely many propositional symbols, for which we use the first four
capital letters of the alphabet (in boldface type), with and without subscripts:
A Ai A., A,
B Bt B2 B3
C C, C2 Cj
D Dj D2 Dj
These are the only symbols that our propositional calculus will contain,2 and in
proving theorems and deriving conclusions from premisses within the system
they are to be regarded as being completely uninterpreted. They may be
thought of, prior to the assignment of meanings, as being repea.able and
recognizable marks rather than ‘symbols’ at all—though it will be convenient
to refer to them as ‘symbols’.
Of course, we are guided in setting up our logistic system by the interpreta¬
tion we intend eventually to give it. This intended interpretation controls our
choice of primitive symbols and also governs our syntactical definition of ‘well
formed formula’. A formula of our system is defined to be any finite sequence of
primitive symbols of our system. The set of all formulas of our system includes
such sequences as the following:
2 We could use just A and' to provide a finite base for the infinite set of propositional symbols
written as
As will be shown in Chapter 9, we could dispense with the symbols '(’ and *)’.
216 A Propositional Calculus [Ch. 8
Bi
(AHA)
~(D)~(~)
~((Aa)-(C 3))
B2B3A7~( )( )•( )
)))((
Only some of these are to count as well formed formulas, however. Our
definition of ‘well formed formula* will be stated, of course, in our Syntax
Language. It is convenient to introduce some special symbols into our Syntax
Language, as an aid in discussing the logistic system clearly and economically.
Although the first four capital letters of the alphabet, with and without
subscripts, printed in boldface type, are symbols in our object language, those
same letters printed in lightface italic type are symbols in our metalanguage.
Their meanings in the metalanguage follow the convention that a lightface
italic letter that is an element of our metalanguage denotes or means that same
letter printed in boldface, which is an element of our object language. In
addition, we introduce the capital letters ‘P*, *Q\ ‘R*, ‘S’,..., with and without
subscripts, and refer to them as ‘propositional variables*. Whereas the propo¬
sitional symbols in our propositional calculus are uninterpreted and have no
meaning, the propositional variables ‘P*, tQ\ ‘R’, ‘S*, ... in our Syntax Lan¬
guage are interpreted and do have meaning. Every propositional variable of our
Syntax Language, until further notice, denotes any formula of our object
language—subject to the following restriction. In any sentence or sequence of
sentences of our Syntax Language, two distinct propositional variables, say ‘P*
and tQ\ may denote either two distinct formulas of our object language, for
example, and ‘'-'((A1)*(C3))\ or one and the same formula of our object
language. Although a propositional variable may denote any formula of the
object language in any one context, it must continue to denote that same
formula wherever it occurs in that context. Thus the propositional variables ‘P*,
‘R*, ‘S*,... of our metalanguage may have substituted for them any name
in the metalanguage of any formula of the object language. At this point, we
introduce the symbols *•*, ‘~~*, and ‘)* into the Syntax Language, and explain
their meanings. Where any propositional variable, say ‘P\ of our Syntax
Language denotes in some context a particular formula of our object language,
say ‘A*, then the symbol ‘~~(P)* of our Syntax Language will, in that context,
denote the formula ‘~(A)* of our object language. Also, where any proposi¬
tional variables, say ‘P* and ‘Q\ denote in some context two formulas of our
object language, say ‘A* and respectively, then the symbol ‘(P) •(())* of
our Syntax Language will, in that context, denote the formula ‘(A) •(~(B2))’ of
our object language.
We cannot simply give a list .of all well formed formulas of our object
language, since there are infinitely .many of them. It is necessary to give an
inductive or recursive definition of ‘well formed formula*, which can be stated
Sec- 8-2l Primitive Symbols and Weil Formed Formulas 217
~((A)-(~(B»)
The question to be decided is whether the foregoing is a wff or not. By part (b)
of the recursive definition, it is a wff, provided that (A)•( — (R)) is a wff. The
latter is a wff by part (c) of the definition, provided that both A and -~(B) are
wffs. The first of these is a wff by part (a) of the definition, and by part (b), the
second is a wff provided that B is a wff, which it is by part (a) of the definition.
Thus the formula in question is a wff. From here on we shall use the proposi¬
tional variables T\ ‘R\ ‘S’, and so on, of our Syntax Language to denote
only well formed formulas of our object language.
EXERCISES
Using the Recursive Definition for Well Formed Formulas, show which of the
following are wffs of our object language:
1. (~(A1))-(A1)
2. MHW-HCY))))
3. HHW’W))
4. M(~(0.)M~(D4)))
*5. (H(~)-(C2)))-M(B3)-(B3)))
6. ~(((A)*(B)W~(£)))
1 Alternatives are available here, as elsewhere, in developing logistic systems. In place of the
recursive definition in the text, we might have used the following:
This alternative definition of wff would make different'formulas wffs, but the resulting logistic-
system could be developed equally well.
218 A Propositional Calculus [Ch.8
7. H(A)-(B)WH(A)-(A))))
8. (H«(A))*(B)*(C))))*((HA))*(B))*HC)))
9. ^((H(A)-H(B))))*(H(H(B)-(C)))-(^(H(C)-(A))))))
MO. -(((-((A)‘(-(B))))‘(A))*H-((A)‘(HB)1
4 The author is indebted to Professor J. Barkley Rosser of Cornell University for permission to
include the following material. The calculi presented here and in Chapter 9 are early versions of
logistic systems that appear in revised form in Professor Rosser’s Logic for Mathematicians,
McGraw-Hill Book Company, New York, 1953.
5Because ‘P v Q’ is an abbreviation of ‘—((^—(^))*(~(^?)))'* it is proper to say that P v Q is, or
is identical with, ^((~-(P))*(MQ)))» but not that P v Q is an abbreviation of the latter, for there
are no abbreviations in the object language itself. (The symbol V is a significant symbol of the
Syntax Language, but does not even occur in the object language, although we could introduce it
there if we wished.) A parallel may help make this clear. Since ‘U.S.S.R.’ is an abbreviation of
‘Union of Soviet Socialist Republics’, the U.S.S.R. is identical with the Union of Soviet Socialist
Republics. The U.S.S.R. is not an abbreviation of anything because it is a nation covering one-sixth
of the globe. Since ‘U.S.S.R.’ denotes a.large nation, the U.S.S.R. is a large nation and has no
meaning in any literal or semantical sense. Similarly, since in any context ‘P v Qf denotes a wff of
R.S., in that context PvQisa wff of R.S. Because R.S. is uninterpreted, its wffs are meaningless
and P v Q, in particular, is meaningless.
Sec. 8.2] Primitive Symbols and Well Formed Formulas 219
((P)' (~~(Q))) • Where two English statements have the symbolic translations
Pand Q> the statement that they are (materially) equivalent, that is, that they
have the same truth values, or that each (materially) implies the other, has the
symbolic translation (P D Q)-(Q D P). This is identically the same wff of R.S.
as (^((P)*(^(Q))))-(^((P)*(~~(P)))). Since material equivalence too is a
frequently used notion, we introduce the symbol<==> into our Syntax Language,
defining ‘P= Q9 as an abbreviation for ‘(P D £))•(£) D P)\
At this point, two notational conventions will be adopted for our meta¬
language. The first is dispensing with the dot (for conjunction), so that (P)((J) is
identically the same formula of the object language as (P) •(£>). The other
notational convention is to replace parentheses by brackets or braces wherever
such replacement is conducive to easier reading.
Even using brackets and braces, too many punctuation marks make reading
difficult, so two further conventions that permit a minimum use of punctuation
will be adopted. The first convention is to assign the following order of
precedence to the symbols
= D v • —
of our Syntax Language. Of these, each has greater precedence or greater scope
than any listed to its right. What is intended here can be explained by the
following examples. The otherwise ambiguous expression ‘^PQP denotes
(—(P)) •(()) rather than •(())), because the connective 4 • ’ (which we have
agreed to represent by juxtaposition) has greater scope than the operator ■*.
The scope of the connective ‘extends over that of the symbol. The
otherwise ambiguous expression *Pv^)R’ denotes Pv(()*R) rather than
(P v ()) • R, because the connective V has, by our convention, precedence over
the connective *•’, and its scope extends over that of the latter. The otherwise
ambiguous expression ‘P D Qv RS' denotes P D [() v (R • S)], because ‘ D* has
precedence over both V and *•’, and V has precedence over And the
otherwise ambiguous expression 4P D Q = D ~~P denotes [P D ()] =
[(—Q) D (—P)], because *=’ has precedence over ‘D’ and and D* has
precedence over Our second convention is that of association to the left,
which means that where the convention of order of precedence does not suffice
to remove the ambiguity of an expression, its parts should be grouped by
parentheses to the left. That is, when an expression contains two (or more)
occurrences of the same connective, and their relative scopes within the
expression are not otherwise indicated, the occurrence to the right shall be
understood to have the wider (or widest) scope. This, too, is best explained by
means of an example or two. Since all of its connectives have equal order of
precedence, the ambiguity of the expression P D Q D P D P ca ~.ot be
resolved by our first convention. According to our second convention, however,
we interpret it as denoting [(P D Q) ^ P] ^ P- Part, but not all of the ambi¬
guity of the expression T = Q = PQ v ~P ^ Q' is resolved by the convention
regarding order of precedence. Once this convention has been appealed to, we
220 A Propositional Calculus [Ch. 8
EXERCISES
The preceding remarks show how to express some truth functions in R.S. But
to prove that R.S. is functionally complete, that is, adequate to express all
possible truth functions of any number of statements, more is required. We
must have in our semantical metalanguage a method of expressing all possible
truth functions. Then we must prove that all of these, or all their substitution
instances, can be expressed in our object language R.S. also, in its normal or
standard interpretation. Such a method of expressing all possible truth func¬
tions is provided by truth tables, which were introduced in Chapter 2. The
method and notation of truth tables is, therefore, imported into our semantical
metalanguage, and will be used freely in discussing the various semantical
properties possessed by R.S. on its standard interpretation.
Truth functions may have one, two, or any number of arguments (in the
mathematical sense in which an ‘argument’ is an independent variable). Thus
/(P) is a truth function of P if and only if its truth or falsehood is completely
determined by the truth or falsehood of P. Similarly, f(P, Q) is a truth function
of P and Q if and only if its truth value is determined solely by the truth values
of P and Q.
There are exactly four different truth functions of a single argument, and
these may be expressed by the following truth tables:
6 Here we are using ‘truth function’ in the strict and proper sense of a nonlinguistic correlation
between or among tmth values.
Sec. 8.2] Primitive Symbols end Well Formed Formulas 221
the intended or normal interpretations of ~~P and PQ are given by the truth
tables
l_and f P Kt
T F T T T
FT T F F
F T F
F F F
That R.S. is adequate to express ft(P\ f2(P), /3(f>), and f4(P) is proved by
actually formulating them in R.S. The function f2(P) is true when P is true and
false when P is false, and is, therefore, expressible in R.S. as P itself. The
function ft(P) is false when P is true and true when P is false, and is, therefore,
expressible in R.S. as — P. The function f3(P) is false no matter which truth
value Passumes, and is, therefore, expressible in R.S. as ~~PP. The function f4(P)
is true in every case and can therefore be expressed in R.S. as the negation of
/3(P), that is, as —PP). We have thereby shown that all singulary truth
functions are expressible in R.S.
There are, of course, more truth functions of two arguments than of one
argument. These are defined by the following truth tables:
p Q MQ) P Q m q) P MQ) P V m v)
T T F T~ T T T T T T T T
T F T T F F T F T T F T
F T T F T T F T F F T T
F F T F F T F F T F F F
P Q MQ) P Q m Q) P V JjMQ)
P Q m q)
T T F T T F T T F T T T
T F F T F T T F T T F F
F T T F T F F T T F T F
F F T F F T F F F F F T
These are all the truth functions of two arguments, and are called ‘binary* or
‘dyadic* functions. That they are all expressible? in R.S. is easily shown by
actually expressing them by means of the — and * symbols: for example,
/14(P, Q) is expressible as PQ while /^P, Q) is expressible as ~~(PQ)-
EXERCISES
1. Express each of the binary truth functions fx(P, Q), *f2(P, Q), ■ • ■ > */s(P> Q)> • • ■ >
7n(P, <?)>••■> /16(^ Q) ^ wffs of R.S.
2. There are 256 ternary (or triadic) truth functions: ft{P, Qf R), f2(P.Q, fl),...,
/256(P, R), each of which is completely determined (or defined) by a different
eight-row truth table. Take any ten of them and express each as a wff of R.S.
••• *>,-1 Pn
Row 1: T T T T
Row 2: T T T F
The truth function/(P,, P2,..., .,, Pn) must have either an F in just one row
of its truth table, or F’s is more than one row of its truth table, or F’s in no rows
of its truth table. In any case the truth function can be represented by a wff
of R.S.
Sr -(PrP2-...-P^-PJ
S2: ~(P1-P2' - 'P„-I—P„)
S2»_x: . .-~Pn_i'Pn)
S2»: ~(~PX-~P2—Pn)
Sec. 8.2] Primitive Symbols and Well Formed Formulas 223
case 2. /(Fj, P2,..., Pn_v Pn) has F’s in more than one row of its truth
table. If it has F s in the k( 1 < k < 2n) rows il# i2, • • •, ik, then it is represented
in R.S. by the wff . .-S, .
case 3. /(F1# P2» • • •»Pn-v Pn) has F’s in no row of its truth table.
Such a tautologous function is represented in R.S. bv the wff
-'Pn-i-Pn).
The preceding cases exhaust the possibilities, so we have shown how to
express any truth function of any number of arguments in R.S. This is not a
theorem of or in R.S., but a theorem about it, established in our semantical
metalanguage. We may, therefore, list it as
/(l)
for any arbitrary m, if f(m) then f(m + 1)
there, f(m) for every m
1 + 3 + 5 + • • • + (2m -- 1) = m2
which shows that for an arbitrary m, if the sum of the first m odd integers is
m2, then the sum of the first (m + 1) odd integers is (m + l)2. We thus have
established the fi-ca.se; from it and the a-case, by weak induction, we draw the
desired conclusion that for every m, the sum of the first m odd integers is equal
to m2. Weak induction may be thought of as summarizing an unending
sequence of arguments of the form Modus Ponens:
/(1) m /(m)
if /(l) then /(2); if /(2) then /(3); ...; if f(m) then f(m
•‘•/(3) .\/(m -h 1)
a-CASE. Any truth function f(P) of just one argument can be expressed in
R.S., as was shown on pages 220-221.
T F F T V2k_,
T F F F V2fc
F T T T V2fc+1
F T T F V2*+2
F F FT V2fc+i_l
F F F F V2Jfc+i
e. n-fi j i
T T T T V.
T T T F V2
F F F T ^2*-l
F F F F v2*
and by the /?-case assumption can be expressed in R.S. by the m/f S,. A truth
>
function f2(P2, P.„ • ■ ■ ,
Pfc+ of the arguments P.,, P„..., P, . P, , .■
can be
defined by the last 2 rows of the first truth table if we delete its first column:
13 /. .now.)
T T T T V +1
T T T F
F F FT V2fc+i
F F F F V2fc+i
and by the /i-case assumption can be expressed in R.S. by the wff S2.
Now the truth table for the wff l\mSA has the same entries as the original
truth table in its first 2k rows and F’s in its last 2k rows. The truth table tor the
wff has all F’s its first 2k rows and the same entries as the original truth
table in its last 2k rows. From this, it is clear that the truth table for the wff
(1\'S 1)v(^F1*S2) has exactly the same entries as those in the original truth
table, whence the arbitrary truth function #(/*,,P2,. .., Pk, l\ f,) of k -f 1
arguments can be expressed in R.S. by the wff (1\ • S v (—1\ *S2) or
H_(Pi.Si)._(_JVS2)). This establishes the (i-case.
Metatheorem I now follows by weak induction from <v and /f.
The term ‘strong induction’ will be used to reier to the somewhat less
frequently encountered type of mathematical induction whose schema is
m
for any arbitrary m, ii f(k) for every k < m then f(m)
therefore, f(m) for every iu
77m
/(1) and /(2) and ... and f(m - 1)
if/(l) and /(2) and ... and/(in - 1) then /(w);...
_____ . -
226 A Propositional Calculus [Ch.8
rt-cASE: If g(P, p, R,...) contains just one symbol, to be well formed it must
be either P alone, or p alone, or R alone, or .... Where the arguments P, p,
R, ... all have the value false, g(P, p, A,...) will also have the value false,
since it is one of them. Hence any well formed formula of the present system
that contains exactly one symbol cannot have the value true when its argu¬
ments all have the value false.
/?-case: Here we assume that any well formed formula g(P, p, R,...)
containing fewer than m symbols cannot have the value true when all of its
arguments have the value fake. We shall prove, under this assumption, that
any well formed formula containing exactly m symbols cannot have the value
true when all its arguments have the value fake. Consider any formula
g(P, p, R,...) that contains exactly m symbols (where m > 1). To be well
formed, g(P, p, R,...) must be either
where gj(P, p, R,...) and g2(P, p, R,...) are well formed formulas containing
fewer than m symbols. By the /3-case assumption, when P, p, R,... are aMfake,
both gy(P, p, R,...) and g2(P, p, R,. . .) will have the value fake also. Now,
given any two propositions that are fake, both their disjunction and their
conjunction are fake. Hence in this case g(P, p, R,.. .) has the value fake, also.
This establishes the /3-case. *
Having established both the a and /3 cases, by strong induction we infer that
no well formed formula of the system based on •, v, P, p, R, ... can express a
truth function that has the value true when all of its arguments have the value
fake. Hence the system is not functionally complete.
EXERCISES
5. —and +, where 4- is the symbol for exclusive disjunction, defined by the tmth
table for f7 on page 221.
*6. 3 and 4- 8. 4* and == 10. = and
7. v and 4- 9. 4- and • 11. EE and D
12. 13 and where is the symbol for (material) noniinplication.8 defined by the
truth table for /13 on page 221.
13. D and , where (Z! is the symbol for converse nonimplication, defined by the
truth table for f12 on page 221.
14. and </L
15. Which, if any, of the sixteen binary truth functional operators fv f>,. . ., /l6
defined on page 221, can be added to the propositional symbols P, (). ft, S,. . . and
parentheses, to give a functionally complete logistic system with a single operator
symbol? 4
The rules for our system and the proofs of the theorems within it will he
much simplified if we assume infinitely many well formed formulas as axioms or
postulates. Of course, we cannot actually write out an infinite list of axioms
within our object language, but we can use our Syntax Language to specify
exactly which wffs of R.S. are axioms and which are not. Ihere can be no
objection to having an infinite number of axioms if there is an effective process
for determining whether or not any given tuff is an axiom. The infinite list of
axioms of R.S. may be written as
Axiom 1. P 3 (P"P)
Axiom 2. (P-Q) 3 P
Axiom 3. (P =3 Q) 3 IMP**) :j M«**)l
M(AM~((A)*(A))))
M(»)*M(®W®>)))
M(C)-M(C)-(C))))
M(Ai)'(M(A ,)•(.*,))))
~((~(A)HM(~tA))'(~(A)))))
^((^(B))-(M(MB))*(^(*)))))
M((A) -(D))- (—(((A) * (D)) • ((A) • (D)))))
~(((A3)-(B7))-M((As) ■•(*?))'*((Aa)'"(*7)))))
and infinitely many more wffs of R.S. It designates, in fact, every wff of R.S.
that is of the indicated pattern. It is effectively decidable whether any given
finite sequence of symbols of R.S. is of this pattern or not. Three patterns are
set forth, and every wff of R.S. that exemplifies any one of these patterns is
assumed as an axiom.
That these axioms are reasonable assumptions is evidenced by the fact that
on the normal interpretation of the symbols — and •, they are all tautologies.
Our logistic system is intended, upon interpretation, to be adequate for the
formulation of arguments. Arguments, as we know, consist of premisses and
conclusions, all of which are expressed in statements. Corresponding to such
arguments, we have in our logistic system sequences of well formed formulas of
which the last formula is the 'conclusion’. We shall set up a criterion that will
enable us to distinguish formally between two kinds of sequences of wffs within
R.S.: those which become valid arguments when interpreted normally and
those which do not.
In Chapter 3 a valid argument was characterized as one for which a formal
proof or demonstration could be given. A formal proof of an argument’s
validity was defined as a sequence of statements, each of which was either a
premiss or which followed from preceding statements by an elementary valid
argument, and whose last statement was the conclusion of the argument that
was being proved valid. Thus the question of the validity of any argument was
‘reduced’ to the question of the validity of certain elementary argument forms
or rules of inference. Something like that will be proposed for R.S., but it is not
desirable to make such a wholesale set of assumptions about valid elementary
inferences as was made in Chapter 3, where nineteen rules of inference were
postulated, plus rules of Conditional Proof and Indirect Proof. However, some
rule of inference must be assumed for our logistic system or there will be no
inference legitimized within it at all. One rule of inference will suffice for the
validation of all arguments and (hence) for the proof of all theorems within R.S.
We assume Modus Ponens, and state it (in our Syntax Language) as
~((A)-(~B)))
A and (AjWAg)
B ~(C)
^ r*.P„ b V
assorts that there is a demonstration of the validity of the argument that has l\,
P2,..., P„ as premisses and as conclusion. For example, all of the infinitely
many arguments denoted by
P D()
-m)
~~(RP)
S,: (P 3 Q) 3 l3 '-(HP)}
• S2:PD()
S3: -(QR) 3 -(HP)
S4: ~(OT
S5: '-(HP)
Proof: The formula P* ~~Pis a wffof R.S. but is not a tautology; therefore, by
Metatheorem II, it is not a theorem of R.S. Hence R.S. contains a formula that
is not provable as a theorem, so R.S. is consistent by the Post criterion.
_Q_ P-Q
0 2 0 0 IT
110 1 1
2 0 0 2 2
1 0 1
1 1 2
1 2 2
2 0 2
2 1 2
2 2 2
PDQ = df ~(P-~p)
P P=>P
0 0 0
0 1 1
0 2 2
1 0 0
1 1 0
1 2 . 1
2 0 0
2 1 • 0
2 2 ' 0
Sec. 8.4] Independence ol the Axioms 233
The characteristic of wffs that we utilize here is that of having the value 0,
regardless of the values assigned to its component propositional symbols. This
characteristic is easily seen to belong to Ax. 2 and Ax. 3 of R.S. by the following
tables (analogous to the table explained on page 30).
^_V) d r
0 0 0 0 0
0 1 1 0 0
0 2 2 0 o
1 1 0 0 1
1 2 1 0 l
1 2 2 0 1
2 2 0 0 2
2 2 1 0 2
2 2 2 0 2
3 - («_• _P)J
(p Q) 2 _Ll (V_• JO
0 0 0 0 2 0 0 0 0 2 0 0 0
0 0 0 0 1 0 1 1 0 1 110
ooooo 0 2 2 0 0 2 2 0
1 1 0 1 2 0 0 0
0 110 1
0 110 0 J 2 1 11110
0 110 0 1 2 2 0 0 2 2 0
0 2 2 0 0 2 2 0 2 2 0 0 0
0 2 2 0 0 2 2 1 11110
0 2 2 0 0 2 2 2 0 0 2 2 0
1 0 0 0 2 0 0 0 0 10 11
1 0 0 0 1 0 1 1 0 0 12 1
1 0 0 0 0 0 2 2 0 0 2 2 1
1 1 0 0 10 11
10 10 1
1 2 1 0 0 12 1
10 10 0
1 2 2 0 0 2 2 1
10 10 0
2 2 0 110 11
112 0 0
2 2 1 0 0 12 1
112 0 0
2 2 2 0 0 2 2 1
112 0 0
ooo 0 0 0 2 2
2 0 0 0 2
0 0 12 2
2 0 0 0 1 0 1 1
0 2 2 0 0 2 2 2
2 0 0 0 0
1 1 0
0 0 0 2 2
2 0 10 1
1 2 1 0 0 12 2
2 0 10 0
12 2 0 0 2 2 2
2 0 10 0
2 2 0 0 0 0 2 2
2 0 2 0 0
2 2 1 0 0 12 2
2 0 2 0 0
2 2 2 0 0 2 2 2
2 0 2 0 0
[Ch.8
p Q np
2 0 2 0
2 12 0
2 2 2 0
The characteristic of wffs that we utilize here is (again) that of having the
value 0, regardless of the values assigned to its component propositional
symbols. This characteristic is easily seen to belong to Ax. 1 and Ax. 3 of R.S. by
the following tables.
P D (P • P)
0 0 0 0~ 0
10 10 1
2 0 2 2 2
(P D Q) D (g • R) 3 - (« • P)]
0000 20000 2 0 0 0
0000 20010 2 10 0
000000220 0 2 2 0
02 1 0 2 1000 2 0 0 0
0 2 1 0 2 1 0 1 0 2 10 0
021001220 0 2 2 0
022002202 2 0 0 0
022002212 2 10 0
02 2 0 0 2220 0 2 2 0
10 0 0 2 0000 2 0 0 1
1000 2 0010 2 10 1
100000220 0 2 2 1
12 1 0 2 1000 2 0 0 1
12 102 101° 2 10 1
12100 1220 0 2 2 1
122002202 2 0 0 1
122002212 2 10 1
1220 0 2220 0 2 2 1
20 0 0 2 0000 0 0 2 2
2000 2 0010 0 12 2
2000 0 0220 0 2 2 2
20 10 2 1000 0 0 2 2
2 0 1 0 2 1 0 1 0 0 12 2
2 0 1 0 0 122 0 0 2 2 2
2020 0 2200 0 0 2 2
2020 022 1 0 0 12 2
202002220 0 2 2 2
236 A Propositional Calculus [Ch.8
P Q P-Q PDQ
0 0 0 0~
011 1
0 2 2 2
10 2 0
112 0
12 2 0
2 0 2 0
2 12 0
2 2 2 0
The characteristic of wffs that we utilize here is (yet again) that of having
the value 0, regardless of the values assigned to its component propositional
symbols. This characteristic is easily seen to belong to Ax. 1 and Ax. 2 of R.S. by
the following tables.
p D (P • p)
0 0 0 0 0
1 0 1 2 1
2 0 2 2 2
[p • Q) D p
0 ~ <r~ 0 0 0
0 i 1 0 0
0 2 2 0 0
1 2 0 0 1
1 2 1 0 1
1 2 2 0 1
2 2 . 0 0 2
2 2 . i 0 2
2 2 2 0 2
Sec. 8.5]
Development of the Calculus 237
The characteristic is easily seen to be hereditary with respect to R 1 of R.S.
by consulting the table for ‘ D'.In the only row in which P and P D Q have the
value 0, Q also has the value 0. Hence if the characteristic belongs to one or
more wffs, it also belongs to every wff deduced from them by R 1.
Finally, it is readily seen that the characteristic in question does not belong
to Ax. 3. When P and Q are both assigned the value 1 and R is assigned the
value 0, (P D Q) D [~~(QaR) D ~~(ft*P)] has the value 1 rather than 0 for
[—(1*®) ^ ^(0-1)] reduces successively to 0 D [—2 D ^1], to
0 D [0 D 1], to 0 D 1, and finally to 1. Hence Ax. 3 is independent.
EXERCISES
For each of the following sets of axioms for propositional calculi, prove the
independence of each axiom:
Axiom l.(PvP) DP
Axiom 2. P D (P v Q)
Axiom 3. [Pv(pvR)] D ft) v(PvR)]
Axiom 4. (QDR)D [(P\Q) D (Pvfl)]
2. The Gotlind-Rasiowa system P(J has the same primitive operators, definition of D,
and Rule as PN above. Its three axioms are
Axiom 1. (P v P) D P
Axiom 2. P D (PvQ)
Axiom 3. (<? D R) 3 [(Pv Q) D (R v P)\
Axiom 1. P D (Q D P)
Axiom 2. [P D (Q D R)] D [(P D Q) D (P DR) J
Axiom 3. (P D p) D D —P)
Axiom 4. — —P D P
Axiom 5. P D ~~~P
4. Lukasiewicz’s system L.S. has the same primitive operators, definition of v, and
Rule as F.S. above. Its three axioms are
Axiom 1. P D (Q 3 P)
Axiom 2. [F D (Q 3 H)] -* [(^ D Q) ^ (P ^ H)1
Axiom 3. (~P 3 ~Q) D (Q ^ P)
are motivated by the desire for rigor in its development. When interpreted
normally, the formulas ~~RP and P~~R are logically equivalent. But they
cannot be so regarded as wffs of R.S., in its development, and the wff
—RP = P~~R cannot be accepted as a theorem until it has been formally
derived from the axioms of the system.
It is convenient to begin with the demonstration of a derived rule of
inference for R.S., which will validate infinitely many arguments in it. We state
it as
DR 1. P D Q, <? D R h ~~(~RP)
Its demonstration requires a sequence of just five well formed formulas, the
third of which we shall write twice, once in unabbreviated and once in
abbreviated form:
Sr (P 3 Q) 3 [~[Q~R) 3 ^RP)]
■V p => Q
S3; D
S3: (Q 3 R) 3 ~(~RP)
S4: Q D R
S5: ~(~RP)
That this sequence of S’s is a deinonstraction is easily verified. The first line, S1?
is Axiom 3 of R.S. It is true that there is an apparent difference between and
our syntactical formulation of Axiom 3:
because Sx contains ~~R wherever Axiom 3 contains R. The point is that we are
here talking about well formed formulas of R.S. Both Sj and Axiom 3 denote
infinitely many wffs of R.S., and every wff of R.S. which is denoted by S1 is also
denoted by Axiom 3 (though not conversely). We can put the matter another
way. Our first derived rule (DR 1) validates infinitely many arguments formu-
lable within R.S., for example:
as well as
validity is being demonstrated. The first line in the demonstration in R.S. for
the first of die four examples given is the wffoi R.S. denoted by Sx when T\
Q , and R are taken to denote ‘A\ ‘B’, and ‘C\ respectively. But this is
identically the same wff of R.S. denoted by our syntactical formulation of
Axiom 3 when P , and ‘R’ in it are taken to denote ‘A\ 4B\ and ‘~(C)\
respectively. Hence the first wff of the demonstration sequence is one of the
infinitely many axioms of R.S. supplied by Axiom 3. The situation is the same
with respect to the demonstration of the validity of every other argument in
R.S. that is validated by DR 1.
It is readily seen that the other lines of the sequence conform to the
requirements laid down in our definition of demonstration. The two premisses
of the argument occur as S2 and S4, while S3 follows from S, and S2 by R 1; and
S5 follows from S3(S3) and S4 by R 1. It is helpful to write in the ‘justification'
for each line in a demonstration: ‘Ax. 3* to the right of S,, ‘premiss' to the right
of S2, and so on. These labels are not part of the demonstration, but are helpful
to both the writer and the reader.
At this point the question naturally arises as to whether derived rules of
inference can be appealed to in deriving theorems from the axioms of the
system. It is most conveniently discussed in connection with an actual example.
Let us take as our first theorem of R.S. the formula
Th. 1. h~~(~~PP)
This formula follows directly from Axioms 1 and 2 by means of our first derived
rule, DR 1. A sequence of formulas which has already been demonstrated to be
a valid argument (by our demonstration oi DR 1) is
SA: P D PP Ax. 1
S2: PP 13 P Ax. 2
S3: DR 1
That S2 is Ax. 2 should be clear from the previous discussion. Every wff of R.S.
denoted by our syntactical formulation of Ax. 2, P(J D P, is an axiom, and
every wff of R.S. denoted by the syntactical expression ‘PP D P' is (also)
denoted by 'PQ 3 P\ so S2 denotes infinitely many of the axioms of R.S. that
are denoted by our syntactical formulation of Ax. 2. And that S3 really follows
by DR 1 from Sx and S2 is seen by observing that DR 1 validates any argument
of the form
PDQ
QD R
no matter what wffs of R.S. the formulas P, Qf and R may be. Thus DR 1
includes the case in which T and ‘R’ denote identically the same formulas,
while 'Q9 and ‘PP* also denote the same wffs.
[Ch. 8
Sx: P D PP Ax. 1
S2: PP DP Ax. 2
S3: (P D PP) D [~~(PP~P) D —(—^FP)] Ax. 3
S4: —(PP~-P) D ~~(~~PP) R 1
SJ: (PP DP)D ~~(~~PP)
S5: -(-PP) R 1
Th. 2. h-PDP
that follows directly from Theorem 1 by definition. Its proof may be written as:
|~ ~~(-~PP)
which denotes every wff that is of this form, no matter which wff Hie syntacti¬
cal variable ‘P’ denotes. Every wffdenoted by *— (- P~~P)’ is of that form,
and is therefore included among the infinitely many provable formulas of R.S.
SBC. 8.5] Development of the Calculus 241
Compared with demonstrations, proofs are shorter and are therefore easier
to write out. Since any proof can be made into a demonstration by replacing
any line that is a previously established theorem bv the demonstration of that
theorem, and by replacing any line that results from the use of a derived rule
by the demonstration of that rule, proofs can he regarded as shorthand
notations for demonstrations. However, proofs are different, and should not be
confused with demonstrations.
Before proceeding with the development of additional theorems and derived
rules for R.S., it should be remarked that Th. 2 can be equally well expressed as
b P v P, which is a version of the principle of the Excluded Middle. For, by
our definition of the symbol V, Pv P* is an abbreviation of
which has (Th. 2) —P D P' as an alternative abbreviation. In the latter form
it constitutes part of the principle of Double Negation.
Some additional theorems of R.S., together with their proofs (not demon¬
strations) are these:
H. 3. h ~~(QR) D (R D
Proof: b D Q Th. 2
b (—() D <;)) D [~(W 3 ~(R~~C>] Ax. 3
R 1
b ~(C«) D
1 —(QR) 3 (R D -{)) df.
ii. 4. b R D-R
H. 5. b (p d P) d (~p =) ~V)
DR 2. —P D b() D P
-PD-pbpDP and b (P D ~~ Q) D (Q D P)
The first of these asserts that there is a sequence of wffs, each of which is either
~~P D or an axiom or follows from two preceding wffs by R 1 and whose
last wffis Q D P. The second, however, asserts that there is a sequence of wffs,
each of which is an axiom or follows from two preceding wffs by R 1 and whose
last wffis (~~P D ~~Q) D (Q D P). (The second has not yet been established.)
Of course there is some connection between them as there is between any two
statements such as P b Q and b P D Q. Given the latter, the former is easily
established. To the sequence of wffs S^, S2,..., Sk (where Sk is P D Q), which
constitutes a demonstration for b P D Q, we need only add P as S^+1 and derive
Q as Sfc+2* f°r if follows from Sk and Sk+1 by R 1. But although b P D Q follows
from P b Q, the proof that is does is less simple. It will be established as
Development of the Calculus 243
Metatheorem III (the ‘Deduction Theorem ); but until it has been proved, it
cannot be assumed to hold for R.S.
Some additional derived rules, together with their proofs, are these:
DR 3. P D pbflP D QR
DR 4. P D Q, R D S h ~[~(^S)(7^)]
DR 5. P D Q, Q D R, R D S h P D S
*T h . 6. h(R-~P) D (PR)
Th. 7. h PDF
Th. 8. h RPDPR
*DR 6. PDQ,QDR\-PDR
Some additional theorems and derived rules are useful in proving the next
Metatheorem.
DR 7. P D Q, R D S\-PR D QS
DR 7, Cor. 1. P D Q h PR D QR
DR 7, Coh. 2. R D S h PR D PS
DR 8. P D Q, P D R h P D QR
DR 9. P D R, Q D S\-(PvQlD (RvS)
*DR 10. P D R, Q D R\-(PvQ) D R
These last two theorems are the two halves of the principle of Exportation, but
before the principle itself can be derived from them, the principle of Con¬
junction must be established (as DR 14).
DR 11. P ID Q, P D (Q D R) b P D R
TH . 15. I-PD(()D PQ)
Having established these derived rules and theorems, we are now able to
prove the Deduction Theorem for R.S. as
Proof: We assume that Pv P2, ..., ,, Pn b {), that is, that there is a
demonstration or sequence of toffs Sl5 s2,... , SN, such that each S- is either an
axiom, or a P}(i = 1, 2,.. ., n), or follows from two previous S\s by H 1, and SH
is Q. Now consider the sequence of ioffs I’n => S„ Pn DS2.Pn D S„. If we
can ‘fill in’ wffs before each Pn D S4 in such a way that the resulting total
sequence is a demonstration from pvp2,...,pn v so each line of the resulting
total sequence is either an axiom or a Pt (i = 1, 2, ..., n — 1), or follows
from two previous lines by R 1, then since Pn D Ss is Pn D {)% we shall have a
demonstration that Pv P2, .. ., Pn { h P n D {l That we can ‘fill in’ to get the
desired demonstration, is proved by weak induction on the number of formulas
Pn D S( involved.
pn-i
[Ch.8
(/?) Now suppose that we have properly filled in all the lines up to and
including Pn D Sk_v so we have a sequence of wffs> .which is a demonstration
that Pv P2, • • • > Pn~ 1 ^ S/c-1- Under this assumptioh we show how to fill in
to include Pn D Sk in the sequence, which will then be a demonstration that Pv
P2, ...» Pn_j I- Pn D Sk. By assumption, Sk is either an axiom, or a
Pi (i = 1, 2,..., n), or resulted in the original demonstration from the appli¬
cation of R 1 to two previous S’s, say and (i, \ < k).
PDQiPD(QDR)\-PDR
we have
PwDS,.,PnD(S;.DSJhPnDSk
Insert the demonstration of this derived rule, whose last line is Pn D Sk, and the
entire sequence will then be a demonstration that Pv P2, ..., Pn_1 b PM D Sk.
Now by weak induction, we conclude that we can fill in for any number of
lines Pn D in such a way that the resulting sequence will be a demonstration
that Pv P2,..., Pn_j b Pn DSit We can, therefore, do it for the demonstration
of Pv P2,..., Pn_v Pn b Q no matter how many lines Sv S2,..., Ss it contains.
And since Ss is (), we can construct a demonstration that Pv P2, ...,
Pn_x b Pn D Q. This concludes our proof of the Deduction Theorem.
An immediate consequence is
PDQ premiss
P premiss
Q R 1
QDR premiss
R R 1
Then we can prove DR 6 by simply applying the D.T. once, the proof reading-
P D Q, Q D R, Pi- R DR 6'
P D Q, Q D R I- P D R D.T.
DR 15. P=gb-P=-t>
DR 16. P=Q, R = S\-PR = QS
DR 16, Cor. P= Q9 R = S b Pv R = Q v S
case 3. S is a Pt. In this case S* also is the same P{. Since bPi = P{ by
Th. 7 and DR 14, b S = S* wherice Q = R b S = S*.
(fi) Here the Metatheorem is assumed to be true for any S containing < n
symbols. Consider any S containing exactly n symbols (n > 1). It is clear that S
must be either or S1,S2.
The various equivalences that made up the last ten of the nineteen Rules of
Inference can easily be established. The principle of Double Negation has
already been proved as Theorem 19. The various principles of Commutation
and Association are obtained by simply applying DR 14 to already established
theorems and corollaries.
The proofs of the final group of theorems of the R.S. will be left as exercises for
the reader.
With the establishment of this final group of theorems, R.S. has been shown
to contain all of the logical principles appealed to in validating extended
arguments in Chapter 3. Containing also, as it does, the Deduction Theorem
and the principle of Double Negation, it is adequate also for the methods of
Conditional Proof and Indirect Proof that were discussed in Chapter 3. It still
remains to be proved that the system is deductively complete, which will be
established in the next section.
In Theorems 22, 23, and 30 we already have the Association of V and *•’,
and the Distribution of *■’ with respect to V. But as stated, these properties
have been established only for cases involving exactly three wffs. In proving
the deductive completeness of R.S., it is convenient to make use of more
general Association and Distribution principles. These will be established as
our next three Meta theorems. The-first of them will establish the general
Association and Commutation of the conjunction symbol, stating that no
Sec. 8.6] 251
matter in what order or grouping any number of wffs are conjoined, the
resulting wff will be equivalent to the result of conjoining them in any other
order or grouping. We may state this formally as follows:
(/?) Here we assume the Metatheorem to be true for every k < n factors 1\,
P2,. •., Pfc. Now consider and R, each constructed out of n > 1 factors 1\, P2,
..., Pn. Q is S-T and R is X• Y.
Each of the wffs S and T contains at least one of the factors Pv P2, . . ., Pn.
We can assume that Px is a factor of S, because if not we can apply Th. 21 and
relabel to obtain h Q = S • T where S now does contain l\ as a factor.
Because T contains at least one of P2, ..., Pn as a factor, S contains < n
of the factors Pv P2, ..., Pn. Hence either S is I\ and b = I\ • 1\ or by the
j8-case assumption b S == where S' is a wff that contains all the factors of
S except Pv In the latter case, by MT IV, Cor. we have
b Q = (PrS')-T
b Q=I\-(S'-T)
b Q = i\-r
By the same reasoning, we can show that there is a wff, call it Y\ such that
b R = Px-Yf
Each of the wffs T and T contains the n - 1 factors P2, I\,..., Pn, so by the
/?-case assumption
bT'= Y'
b Pl-Tf = P1-Y'
\-Q = R
The next principle concerns the general Association and Commutation of the
disjunction symbol V*. No matter in what order or grouping any tvffs are
connected by V’, the resulting disjunction or ‘logical sum* will be equivalent to
the result of connecting them by V in any other order or grouping. We may
state this formally as
Meta theorem VI. Let Pv P2, ... 9 Pn be any tvffs and let Q and R be
any two wffs constructed out of them by means of V. If each P- (1 < i < n)
occurs exactly once in each of the wffs Q and R, then b Q = R.
(fi) Assume the Meta theorem true for k summands Pv p»... , Pk. Now let Q
be the logical sum or disjunction of Pv P2y..., Pk> Pk+1, and let S be the logical
sum of P^R, P2R9 .... PkR, Pk+iR- Now we argue that
b (Pn v P2 v ... v Pk)R = P.R v P2R v ... v PkR by the j3-case assumption
b Pk+1R = Pk+1R Th. 7 and DR 14
b (Pj v P2 v ... v Pk)R v Pk+1R
(PjR v P2R v ... v P^R) v Pk+1R DR 16, Cor.
Sec. S.6] Deductive Completeness 253
h[(P1vP2v...vPJvPt+1]R =
(PlVP2v...vPfc)flvPk+1R Th. 30, Cor.
h[(P1vP2v...vPk)vPm]R =
(Pjfi v P2R v ... v PkR) v Pk+lR MT IV, Cor.
By our convention of association to the left, the preceding line can also be
written as
Any such truth table has 2n rows, each of which represents a different assign¬
ment of T’s and F’s to the Pt*s, one row for every possible assignment. That
only T’s appear in the column under S indicates that every possible assignment
of T’s and F’s to the P/s must assign a T to S.
To show that b S for every such S, we establish the following:
1. Each row of its truth table, that is, each assignment of truth values to the
P.’s, can be represented by a wff of R.S., the first row by Qv the second
by Q2, ...» and the last or 2nth by Q2n‘>
2. If Qv Q2i..., Q2n are the 2n wffs representing all possible assignments
of T « and F’s to the P/s, then b (Q1 v Q2 v ... v ()r); and
3. If a particular assignment of T’s and F’s to the P/s assigns a T to S, then
where Q represents that particular assignment, we have b D S.
254 A ProposNIonal Calculus [Ch.8
That these will suffice to prove that b S is easily seen. Where the truth table
for S has all T’s in the column under S, then .1- PjDS, b Q2 3 S, ...,
b Q2n D S. From these, by 2n — 1 uses of DR 10> we have b v Q2 v ...
v Q2n) D S. And once we have established that b (Qj v()2v...v @2")> we
obtain b S by R 1.
Now we proceed to attack the problem in detail. First, we must show that
each possible assignment of T’s and F\s to the P^s of a set (Px, P2,..., Pn) can be
represented by a wff. It is convenient to lay down the following:
Where T’s are assigned to every P{, this assignment is represented by the
conjunction Px *P2 • ... *Pn, which we denote by *QX\ Where T’s are assigned to
every Pi except Pn, which is assigned an F, the assignment is represented by the
conjunction PxmP2m • • • * Pn-1' ^Pn, which we denote by ‘Q2 , Similarly, we use
this procedure for every other possible assignment corresponding to every
row of the truth table, ending with the conjunction —Pt • ~~P2 • ... -^Pn,
which we denote by *Q2n’. In this way any row of any truth table can be
represented by a wff of R.S., which establishes the first result mentioned in the
preceding paragraph.
Next, we turn our attention to the second result, expressed as
(/?) Assume that the Metatheorem is true for Px, P2, ..., Pk. Now consider
the set PXi ^2» * * * Jk’^k + V Where ^)1, Q2,..., Q# represent all possible distinct
assignments of truth values to P\> P%> • • • > we have b (Qx vp2v...v Q2k) by
the /?-case assumption.
Then we continue the argument as follows:
case 1. Qi assigns a T to S, that is, to 1\. Hence Pi rather than ^l] must be
a factor of Qjm By MT V, b Qj = P, where K is a conjunction of all the factors
of Qj except Pt. Now we argue
b P{R D Pi Ax. 2
b Qj D ^ MT IV, Cor.
which is b Qj D S.
which is b Qj D —S.
(ft) Assume that the Metatheorem is true for any S containing any number
k < n symbols. Now consider any S containing n ( >1) symbols. The wff S is
either S1*S2 or —S3.
case 1. S is Sj-Sjj.
Subcase A: Qj assigns a T to S. Here Qj must assign a T to St and a T to
S2. Since Sv S2 contain fewer than n symbols each, we argue as follows:
which is b Qi D S.
Subcase B: Qf assigns an F to S. Here Qj must assign an F to S1 or an F to
S2. If to Sx, then b Qj D -Sx by the ft-case assumption, and hence by DR 12
b Qj D —(Si -S2), which is b D — S. If to S2 then b Qi D —S2 by the /1-case
assumption, and hence by DR 13 b Q} D —-(S1 *S2), which is b Qj D —S.
case 2. S is — S-j.
Subcase A: Qj assigns a T to S. Here Qj must assign an F to S3. Hence by
the /1-case assumption, b D ~~S3 which is b Qj D S.
Subcase B: Qi assigns an F to S. Here Qj must assign a T to S3. Hence by
the /1-case assumption, b Qj D S3. But b S3 D-S3 by Th. 4, so by DR 6 we
have b ^ D --S3, which is b Qj D —S.
The deductive completeness of the system now follows easily, and may be
proved as
b JQX ? S
b Q-2 D S
b Q2n DS
Sec- 8a6l Deductive Completeness 257
where Q2,..., Q2„ represent all possible assignments of truth values to Px%
P2, . .., Pn. Now by 2n - 1 uses of DR 10, we have
h v <?2 v .. v C>2») D S
and by MT VIII,
h (Qi v v • • • v y2n)
The decision problem for any deductive system is the problem of stating an
effective criterion for deciding whether or not any statement or well formed
formula is a theorem of the system. In view of the analvticity and deductive
completeness of R.S. (Metatheorems II and X), the method of truth tables
constitutes a solution to the decision problem. Truth tables enable us to decide
effectively whether or not any wff is a tautology. By MT II, only tautologies
are theorems, and by MT X, all tautologies are theorems. Hence truth tables
enable us to decide effectively whether or not any wff is a theorem. Moreover,
the proofs up to and including that for MT X do not merely assure us that for
any tautologous wff there exists a demonstration—they prescribe effectively a
method of actually constructing its demonstration. The demonstration con¬
structed by following the directions contained in the proof of deductive
completeness will usually be longer than one discovered through the exercise of
ingenuity and inventiveness. That is to be admitted. But it is significant and
important that through the use of the directions contained in the proofs up to
and including that of MT X, a demonstration within the logistic system can be
written out for any tautology—without any need for ingenuity or inventive¬
ness. Our effective solution of the decision problem for the system guarantees
that this can be done.
It is clear from the foregoing that any argument whose validity can be
established by the use of truth tables can be proved valid in R.S. In Chapter 3,
the claim was made that any such argument could be proved valid by using the
list of nineteen Rules of Inference augmented by the principles of Conditional
Proof and Indirect Proof. We are now in a position to substantiate that claim,
which is equivalent to the assertion that the method of deduction set forth in
Chapter 3 is deductively complete. We can do so by showing that every
argument which can be proved valid in R.S. can also be proved valid by the
methods of Chapter 3.
Consider any argument Pv ..., Pn Q that can be proved valid in R.S. To
say that it can be proved valid in R.S. is to say that there is a demonstration in
R.S. for the derived rule of inference Pv .. ., Pn b Q. By n uses of the Deduction
Theorem, Exportation, and the Rule of Replacement, we have h P D Q where
P is a conjunction of Pv ..., Pn. By the analyticity of R.S., P D Q is a truth
table tautology, so P- is a contradiction. Now there is a formal proof of
[Ch. 8
validity for the argument
using the methods of Chapter 3 (by iterated uses of the principle of Conjunc¬
tion). Hence there is a formal proof using the methods of Chapter 3 for the
argument.
.\N (2)
qv[(p-~~p)-r] q
the formal proof of validity for (2) can be extended to a formal proof of validity
for
There are three different senses in which the phrase ‘alternative systems of
logic* can he understood. These parallel the three senses of the phrase ‘alter¬
native systems of geometry*, and can most easily be explained by analogy with
the latter. We may speak of Euclidean plane geometry and Euclidean solid
geometry as ‘alternative systems* in the sense that the first can be studied
independently of the second, and they are certainly different in that the second
is more inclusive than the first. Analogously, we may speak of a Propositional
Calculus and a Function Calculus as ‘alternative systems of logic* in that the
first can be studied independently of the second, and that the second is more
inclusive than the first—where a Function Calculus contains all the tautologies
and rules of the Propositional Calculus plus Quantification Axioms, Rules, and
Theorems. It is not this sense of an alternative system with which we shall be
concerned in the present chapter.
A second sense is that in which we can speak of Euclidean geometry and
Riemannian (or Lobachevskian) geometry as alternative systems. These are
alternative in the sense that, although they may possess some theorems in
common, each contains some theorems not included in the other. Parallel to
this situation in geometry, there are alternative systems of logic exhibiting the
same sort of differences. An ordinary ‘two-valued* system of logic, whose
formulas—on interpretation—-are either true or false, can be contrasted with
‘three-valued* or ‘many-valued* systems of logic whose formulas are supposed
to take—on interpretation—either three or n > 3 different ‘truth values’.
Alternative systems of logic in this sense have been extensively developed, first
by Jan Lukasiewicz in Poland and independently by E. L. Post in this country,
more recently by J. B. Rosser and A. R. Turquette.1 A study of these is beyond
iSee ]. Lukasiewicz, O logice trojwartosciowej/, Ruch Filnzoficzny (Lwow), vol. 5 (1920), pp.
169-171.'
E. U-Post, ‘Introduction to a General Theory of Elementary Propositions’, American Journal of
Mathematics, vol. 43 (1921), pp. 163-185.
J. B. Rosser, ‘On the Many-Valued Logics’, American Journal of Physics, vol. 9 (1941), pp.
207-212.
]. B. Rosser and A. R. Turquette, ‘Axiom Schemes for M-Valued Propositional Calculi', Journal
259
260 Alternative Systems and Notations [Ch. 9
the scope of this book. It is not this sense of alternative systems with which we
shall be concerned in the present chapter.
The third sense in which one can speak of alternative systems of geometry is
that in which different axiomatic bases are assumed, but identically the same
theorems are derivable from each of them. Thus many different axiom sets for
Euclidean geometry have been devised, all of which yield the same theorems.
In alternative systems of this sort, different terms may be taken as primitive or
undefined, and different formulas are assumed as axioms or postulates. What
may be a primitive term in one system may be defined by means of other
primitives in the other, and what is assumed as an axiom in one may be derived
as a theorem from the axioms of the other—where those axioms correspond to
theorems of the first. It is this sense of alternative systems that will be discussed
in the present chapter.
The logical truths whose systematization is under consideration are truth
functional tautologies. Any system wholly adequate to their expression and
development must be functionally complete, analytic, and deductively com¬
plete, in the senses in which R.S. was proved to possess these properties in
Metatheorems I, II, and X of the preceding chapter. Any such system will be
called a Model System of Logic, and any other axiom system for logic will be a
genuine or acceptable alternative to R.S. if and only if it is a Model System.
There are many different Model Systems, either different in that they start with
different primitive or undefined terms, or different in that they assume different
formulas as axioms. They are all equivalent, however, first in being able to
express^—on their normal interpretations—all truth functions, second in in¬
cluding all tautologies as theorems, and third in having only tautologies as
theorems. One such alternative system will be proved to be a Model System in
the following section.
A At Ao Aq ...
B Bx B2 B3 ...
C Cx C2 C2 ...
D Dx D2 D3 ...
of Symbolic Logic, vol. 10 (1945), pp. 61-82, and Many-Valued Logics, North-Holland Publishing
Company, Amsterdam, 1952.
See also: Alan Ross Anderson et al., ‘Proceedings of a Colloquium on Modal and Many-Valued
Logics’, Acta Philosophica Fennica, Fasc. 16 (1963); and A. A. Zinov’ev, Philosophical Problems of
Many-Valued Logic, G. Kung and D. D. Corney, eds. and trs., D. Reidel Publishing Company,
Dordrecht-Holland, 1963.
Sec. 9.2] The Hilbert-Ackennenii System 261
P D () = df Vp
P-p = df H^Pv
Pp = dfP-p
P=P = df(PDp)(pDP)
We shall continue to use the same conventions regarding parentheses that were
adopted in the preceding chapter.
Four (patterns of) axioms or postulates are assumed in H.A.
P 1. (PvP) D P
V 2 .PD (PvQ)
V 3. (PvQ) D (QvP)
p 4. (PDQ) 3 [(flvP) D(RvQ))
~PvQ
Q
whereas R 1 legitimizes arguments within R.S. of the form
P
~(P~Q)
Q
and these are clearly different. We may make the contrast more vivid by
writing them as
By MTIV, Cor., and Th. 29 of R.S., any wffin R.S. that follows by R 1 from two
other wffs must also follow by R' 1 and conversely. But this statement cannot
be assumed true of H.A. until it is proved.
A ‘demonstration in H.A.’ of the validity of an argument having premisses
Pv P2.Pn and conclusion () is defined to be a sequence of wffs Sv S2, ...,
St (of H.A.), each of which is either a postulate P 1, P 2, P 3, or P 4 or a
Pi (1 < i < n) or follows from two preceding S’s by R' 1, and such that St is
Q. That there is such a demonstration in H.A. is written
pvp2,...,pn ke
|fr:P
which asserts that there is a sequence of wffs Sv S2,..., St (of H.A.) each of
which is either a postulate P 1, P 2, P 3, or P 4 or follows from two preceding
S’s by R' 1, and such that St is P.
The functional completeness of H.A. is easily established. (It was Exercise 1
on page 226 in the preceding chapter.) A proof of the analyticity of H.A. is
easily given by using truth tables to show that any postulate P 1, P 2, P 3, P 4
is a tautology, and then proving that any wff which follows from tautologies by
repeated applications of R' 1 must be tautologous also.
The independence of the H.A. postulates is established by the following
models.
To prove that Postulate 1 is independent, we use the three-element model
Sec. 9.2] The Hilbert-Ackermann System 263
{0,1,2}, of which 0 is the designated element. Wffs are assigned values in
accordance with the tables:
2 0 0 0 0
0
1 0 0 1 0 2
2 1 0 2 0 2
1 0 0 0
1 1 1 0
1 2 0 0
2 0 0 0
2 1 2 1
2 2 2 0
264 Alternative Systems and Notations [Ch. 9
P ~~P P Q PvQ P D Q
o r~ 0 0 0 0~
1 0 0 10 1
2 3 0 2 0 2
3 0 0 3 0 3
10 0 0
111 0
12 2 0
13 3 0
2 0 0 0
2 12 3
2 2 2 0
2 3 0 3
3 0 0 0
3 13 0
3 2 0 0
3 3 3 0
P-Q
P_ <? Pvp ~P ~Pv ~(~P v ~~Q)
V 0 0 ~5~ 5 5 0
0 1 0 5 5 5 0
0 2 3 5 4 5 0
0 3 3 5 1 0 5
0 4 0 5 0 0 5
0 5 0 5 0 0 5
1 0 0 5 5 5 0
1 1 0 5 5 5 0
1 2 3 5 4 5 0
1 3 3 5 1 0 5
1 4 0 5 0 0 5
1 5 0 5 0 0 5
2 0 3 4 5 5 0
2 1 3 4 5 5 0
2 2 3 4 4 5 0
2 3 3 4 1 0 5
2 4 3 4 0 0 5
2 5 3 4 0 0 5
3 0 3 1 5 0 5
3 1 3 1 5 0 5
3 2 3 1 4 0 5
3 3 3. 1 1 0 5
3 4 3 1 0 0 5
3 5 3 1 0 0 5
4 0 0 0 5 0 5
4 1 0 0 5 0 5
4 2 3 0 4 0 5
4 3 3 0 1 0 5
4 4 5 0 0 0 5
4 5 5 0 0 0 5
5 0 0 0 5 0 5
5 1 0 0 5 0 5
5 2 3 0 4 0 5
5 3 3 0 1 0 5
5 4 5 0 0 0 5
5 5 5 0 0 0 5
~~(Pm ~~Q) to infer Q: and the three H.A. formulations of the R.S. axioms take
only designated values. But for the value 2 for. P, we have P v ~~P =
2 v ^2 = 2 v 4 = 3, which is not a designated value.2'
To prove the deductive completeness of H.A., we first establish some
theorems, derived rules, and metatheorems for it.
DR 1. PDQ,QDr\skPDR
Proof: 1 ,PD(P\Q) P 2
2. (PvQ) D (QvP) P 3
3. PD (Qv P) DR 1
DR 2. Q D r\ha(PvQ) D (PvR)
Theorem 3. |nx:P D P
Proof: l.PD(PvP) P 2
2. (PvP) D P P 1
3. PDP DR 1
DR 3. PvQ IhxQvP
2See Henry Hiz, ‘A Warning About Translating Axioms’, American Mathematical Monthly, vol.
65 (1958), pp. 613 ff.; Thomas W. Scharle, 'Are Definitions Eliminable in Formal Systems?*
(Abstract), The Journal of Symbolic Logic, vol. 35 (1970), pp. 182 ff.; and Alonzo Church,
Introduction to Mathematical Logic, Princetop University Press, Princeton, N.J., 1956, pp.
125-128. On this matter I have benefited from correspondence with Professor ]ean Porte and from
discussion with Professor Anjan Shukla.
Sec. 9.2] The Hllbert-Ackermenn System 267
Theorem 4. IhaPv^P
Proof: 1. P D P Th. 3
2. ~PvP df.
3. Pv~P DR 3
Theorem 5. [haP D —P
DR 4. p, g liiAPg
Proof: l. p d [g d (pg)] Th. 10
2. P premiss
3. g d (pg) R' 1
4. g premiss
5. pg R' 1
DR 5. PDQ |HA~g D ~P
Proof: 1. P D Q premiss
2. Q D Th. 5
3 .PD ~~P DR 1
4. ~Pv -—~Q df.
5. -Q v ~P DR 3
6. —Q D ~~P df.
Sec. 9.2] The Hilbert-Ackermann System 269
Theorem 14. (ha(PQ) D P
Proof: 1. —P D (~Pv ~~Q) P2
2. ~(~Pv 3 ~_^p DR 5
3.-P D P Th. 6
4. ~(~Pv ~<p) D P DR 1
5. (PQ) D P df.
The next two derived rules follow from the two preceding theorems by R' 1.
DR 6. PQ |haP
DR 7. Pq\haQ
DR 8. P=Q biA ~P= ~<p
Proof: 1. PD(PvP) P 2
2. (PvP) D P p 1
3. [P D (Pv P)][(Pv P) DP] DR 4
4. P=(PvP) df.
case 2. S is Q and S* is also. Here [ha () ==£ {) (Th. 13), that is,
[ha S = S*, whence Q = R IiIa S = S*.
case 3. S is a Pr Here S* is Pr also. Here [ha l\ === Pi (Th. 13), that is,
[ha S = S*, whence () = K [ha S = S*.
(/?) Here the Metatheorem is assumed to be true for any S containing fewer
than n symbols. Now consider any S containing exactly n ( >1) symbols. S is
either —Sx or S1 v S2.
A few more theorems and derived rules will move us closer to a proof of
deductive completeness for H.A.
272 Alternative Systems and Notations [Ch.9
Proof: 1. p) D () Th. 14
2. [Pv(pR)] D (Pv()) DR 2
3. (^R) D fl Th. 15
4. [Pv(QR)] D (PvR) DR 2
5. [Pv«?R)J D[(Pv<?)(PvR)] DR 13
DR 15. P D (Q D R) (ha(PQ) DR
Demonstration: 1. P D (Pv Q) P 2
2. P premiss
3. Pvg R' 1
\iiZQ = [(PivS')vT]
k? = |PlV(S'vr)]
By the same reasoning we can show that there is a wff\ call it Y', such that
lifAK = (P1vy')
Each of the wffs T' and Y' contains the n — 1 disjuncts P2, ^3> • • •»Pn , so by the
j8-case assumption
only to propositional symbols; and (3) no disjunct is a conjunction, that is, the
symbol v is nowhere adjacent to a conjunction. In symbols, the ioff S is in
C.N.F. if and only if S is
false. This is contrary to the assumption that S is a tautology and therefore true
for any assignment of truth values to its component propositional symbols.
With these remarks in mind, we can state and prove
Proof: Let SCNF be a wff in C.N.F., which is a tautology. Then SCNF must be
a conjunction Sj S2... Ss each of whose conjunets Si (1 < i < s) is a disjunction
of wffs Tv T2, ..., Tn. each of which is either a propositional symbol or the
negation of a propositional symbol. Since SCNF is a tautology, each of its
conjunets Sj is tautologous, because a conjunction is true if and only if all of its
conjunets are true. We have already remarked that a disjunction of proposi¬
tional symbols and negations of propositional symbols can be a tautology only
if there is a propositional symbol P such that both P and —P are disjuncts of Si.
But (ns: P v ~~P by Theorem 4, from which we derive |fia (P v — P) v () by DR
16, where in this case () is a disjunction of all of the disjuncts of S,j other than P
and — P. But by MT II, fnA Si = \(P v —P) v (_)], whence [iTA Si by MT I, Cor.
Hence every conjunct Si of SCNF is a theorem, that is: [ha S\, |itaS2, ...,
[iTA S8. Now by s — 1 uses of DR 4 we obtain [ha Sl S2 . .. Ss which is [tta Scnf.
This completes our proof of Metatheorem IV.
EXERCISES
Prove that each of the propositional calculi described in the exercises on page 237 is
a model system of logic.
It has been remarked that the language of symlxilic logic requires punctua¬
tion if ambiguity is to be avoided. It shares this characteristic with natural
languages as well as with other artificial languages like (ordinary) algebra. We
have been using three kinds of punctuation marks in our logical language:
parentheses, brackets, and braces. In the following discussion, for convenience,
we shall use the word ‘brackets’ Jto refer to all of these.
Even moderately complicated formulas require many pairs of brackets,
which make them difficult to read. Any use of paired punctuation marks
Sec. 9.3] The Use of Dots as Brackets 277
involves redundancy. In the formula
(PDQ)v(PD
PDQ)v(PD ^Q
P D Q. v .P D
There is no danger of confusing the punctuation dot with the conjunction dot
because the punctuation dot can occur only adjacently to a connective symbol
such as V’, ‘D', or *=', whereas the conjunction dot can never do so.
Dots are symmetrical in shape, in contrast to the asymmetry of brackets.
Thus ‘(’, ‘[’, ‘{' are all concave on the right, which indicates that they group or
‘operate toward' the right, while ‘)\ ‘]\ ‘}’ are concave on and ‘operate toward*
the left. The symmetry of the dot notation is compensated for by introducing
the convention that punctuation dots always operate away from the connec¬
tive symbol to which they are adjacent.
There is another analogy between bracketing logical formulas and punctu¬
ating sentences of natural languages. The latter have punctuation marks of
different degrees of strength, the stronger of which ‘take precedence over* or
‘extend over* the weaker. For example, a period is stronger than a semicolon,
and a semicolon is stronger than a comma. Of the three kinds of brackets used
in logical formulas, the unstated convention has been to regard braces as
stronger than brackets, and brackets as stronger than parentheses. We have
used parentheses to group symbols within brackets, but not conversely, and we
have used brackets to group symbols within braces, but not conversely.
When this convention is adhered to, it permits additional redundant brackets
to be dropped. Thus the formula
(P D Q) D [(R v P) D {R v Q)]
PDQ)D[RvP)D (R v Q
P D Q. D :RvP. D .RvQ
278 Alternative Systems and Notations ‘ [Ch. 9
where the convention is that two dots bind more strongly or have greater scope
than one dot. Using three dots as a punctuation mark, that has greater scope
than either one or two dots, the formula
can be rewritten as
Pv ..Qv R : D Pv : Qv . Pv R
can be written as
For the sake of symmetry and greater ease of reading, one frequently adds dots
which are not strictly necessary for the avoidance of ambiguity. Thus the
formula
P D .QDP
is frequently written
P.D.QDP
The preceding lengthy formula may be more easily read when rewritten
according to this convention as
Now what should be done in case two bracketed expressions are connected
by the conjunction symbol, as in the formula
(P D QHQ 3 P)
Sec. 9.4] A Parenthesis-Free Notation 27S
Applying the technique already described, we should emerge with the awk¬
ward expression
PDQ.-.QDP
P D p.p D P
Here the single dot is to be thought of as operating in both directions, left and
right. This convention is satisfactory, as can be seen by observing that the
different formulas
1. PD[(Q.Q)DP]
2. [P D (p.p)] D P
3. [(PDQ).(J]DP
4. PD[p.(pDF)]
The use of dots as brackets has the advantage of economy, and the further
advantage of providing an infinite number of different punctuation marks of
different degrees of strength, where the strength or scope of each may be
determined by the simple expedient of counting its constituent dots.3
A logical notation that dispenses with brackets entirely has been devised by
the Polish logician, J. Lukasiewicz, and has been extensively used by logicians
of the Polish School. Corresponding to the four most commonly used operator
symbols
~ D • v
3 A more extended discussion of this topic will be found in (,. I. Lewis and (,. II. Langford,
Symbolic Logic The Century Co., Appendix 1, New York, 1932, pp. 486-489. For more technical
discussions' the reader is referred to H. B. Curry, ‘On the Use of Dots as Brackets in Logical
Expressions’, The Journal of Symbolic lagic, vol. 2 (1937), pp. 26-28 and A. M. Turing, ‘The Use
of Dots as Brackets in Church’s System’, ibid., vol. 7 (1942), pp. J46-156.
280 Alternative Systems and Notations [Ch.9
NC KA
In writing their formulas, they use lower-case letters ‘p\ *q\ V, V,... instead
of the capital letters ‘JP\ *Q\ TV, ‘S’, ... Instead of the connective symbols *C\
*K\ ‘A’ being written between the formulas they connect, they are placed
directly to the left of the two formulas to be joined. Thus
—P is written Np
P D Q is written Cpq
P'Q is written Kpq
Pv Q is written Apq
P D (Q D R) is written CpCqr
(P D Q) D fl is written CCpqr
P D (Q'R) is written CpKqr
(P D @)*R is written KCpqr
(Pv Q) D (RmS) is written CApqKrs
Ax. 1. CpKpp
Ax. 2. CKpqp
Ax. 3. CCpqCNKqrNKrp
And the three postulates of Lukasiewicz’s own system are written in his
notation as
P 1. CpCqp
P 2. CCpCqrCCpqCpr
P 3. CCNqNpCpq
The Polish notation has the obvious advantage of dispensing with all special
punctuation marks, for the order in which its symbols are written suffices to
make any formula unambiguous.
EXERCISES
_P\V
T t F
TFT
F T T
F F T
The other operators, ~, •, v, and D can all be defined in terms of the stroke
function. That the following definitions preserve the standard interpretations of
the symbols being defined is easily verified by the method of truth tables:
-P = df P|P
P.Q = df j
P Q. | .P | Q
PvQ= df P|P.|.Q|Q
P D Q = df P.|.Q|Q
The other operator that suffices for a functionally complete logic is that of
‘joint denial’, symbolized by a dagger with its point down. Written ‘PJQ’, its
standard interpretation is to deny that either of the formulas P or Q is true,
which is the same as affirming that they are both false. It is defined by the truth
table
p_q_nv
T T F
T F F
F T F
F F T
tr
4 See W. V. O. Quine, Mathematical Logic, Harvard University Press, Cambridge, Mass., 1947,
pp. 48-49. The stroke function is frequently referred to as the ‘Shcffer stroke function’ after
Professor H. M. Shefler, although it was first used by C. S. Peirce.
282 Alternative Systems and Notations [Ch. 9
The other operators can all be defined in terms of, the dagger function alone.
That the following definitions preserve the standard interpretations of the
symbols that are being defined is easily verified by the. method of truth tables:
= df PIP
PQ = df PlP.l.QlQ
PvQ = di PlQ.l.PlQ
PDQ = di PlP.l.Q.l-.PlP.l.Q
PIQ = df P\P.\.Q\Q-.\iP\P.\.Q\Q
P K? = df PiP^Qm-nP'l QlQ
EXERCISES
Thus far in the text and exercises, a number of alternative Model Systems of
Logic have been set forth. Each of them is based on two primitive operator
symbols, and the number of their axioms or postulates ranges from three (for
R.S.) to five (for F.S.). A system that is more economical both in primitive
operators and postulates is attributable to J. G. P. Nicod.5 The Nicod System
can be set forth as follows.
It should be noted that even the recursive rule for well formed formulas is
simpler in the Nicod System, since it needs only two rather than three clauses.
We have here the first fruit of using only one operator symbol.
The single axiom required (or rather, the single pattern for infinitely many
axioms of the object logic) is stated in our metalanguage, in which we use dots
for brackets, as
Ax. P.\.Q\R:\::T.\/r\T:.\:.S\Q:\:P\S.\.P\S
W. V. Quine, ‘A Note on Nicod’s Postulate , Mind, n.s. vol. 41 (1932), pp. 345-350.
B. A. Bernstein, ‘Remark on Nicod’s Reduction of Principia Mathematical, The Journal of
Symbolic Logic, vol. 2 (1937), pp. 165-166.
Thomas W. Scharle, ‘Axiomatization of Propositional Calculus with Sheffer Functors’, Notre
Dame Journal of Formal Logic, vol. 6 (1965), pp. 209-217.
Several alternative, one-axiom systems are listed in A. N. Prior, Formal Logic, Oxford University
Press, England, 1962, p. 306. Two in particular are noteworthy because they use only four
propositional variables, in contrast to Nicod’s five. They are also different from Nicod’s axiom in
that they contain no proper part that is itself a theorem. These systems are attributable to
Wajsberg (1931) and to Lukasiewicz (1931). I am indebted to Professor P. T. Geach of the
University of Leeds for calling them to my attention.
284 Alternative Systems and Notations [Ch. 9
instrument for deduction. Modus Ponens, which may be stated as
is simply a special case of the Nicod Rule, in which ‘R’ and ‘Q* denote
identically the same well formed formula of the object language. Nicod’s Rule
as well as his axiom is more complicated than those required in less economical
systems of logic.
That the Nicod System is functionally complete has already been indicated
in our discussion of the stroke function in Section 9.5. The analyticity of N is
simply enough shown by establishing via truth tables that the Axiom is
tautologous, and that the Rule can lead only to tautologies from tautologies.
The deductive completeness of N remains to be proved to complete the proof
that the Nicod System is a Model Logic. When this is attempted, we find that
for all the economy of its single axiom and the greater deductive strength of
its Rule, it is very difficult indeed to derive theorems in the Nicod System.
Nevertheless, it is interesting from the point of view of seeing how far one can
go in the direction of reducing the number of postulates and still have a
deductively complete system of logic.
We shall develop seventeen theorems of the Nicod System, and one derived
rule. The last four theorems will be the four axioms of the Ililbert-Ackermann
System, and the derived rule will be the Ililbert-Ackermann System’s rule R' 1.
Of course, these must be stated in their unabbreviated form, and then expressed
in the Nicod System’s notation. The Hilbert-Ackermann axioms, expressed in
terms of that system’s primitive symbols — and v are
1. ^(PvP)vP
2. -Pv(Pv())
3. -(Pv())v(()vP)
4. ~~(^PvQ) v [-~(R v P) v (R v ())]
P:|:0|£.|.P|P.
4. P\P.\.P\P:\:Q\Q:.\:.P\r.\.P\P:\:Q\Q-.:\::P\P.\.r\P:\:Q\Q:.\:.P\
•PI P I :-R \R.\.Q\Q:\:R\ R-[ | ::.R | R. | .P| P:| :R| R.| .P| P:. |
:.R|R.|.P|P:|:fl|R.|.P|P::|::R|fl.|.gig:|:R|R.|.p|^.
R' 1. From P and P | P. | .P | P: | :Q \ Q to infer Q.
Sec. 9.8]
The Nlcod System 285
Our development of the Nicod System will be entirely in terms of the stroke
function. Since the formulas of the proofs are of almost intolerable length, they
will be described rather than written out. Our descriptions, however, will be
sufficiently complete to permit the reader to write them out for himself.
Theorem 1. k p:. |:.T.\.T\T: |:S::. |::.S: |:T. I .T12’:. |:.0;; |;:S: |:T. I .TI
T:.\:.Q.
Proof: Line 1 is the Nicod Axiom with T in place of P, of (), and of R. Line
2 is the Nicod Axiom with T.\.T\T in place of P and of Q, and with
S | T: | :T | S. | .T | S in place of R. Line 3 is the result of applying the Nicod Rule
to lines 1 and 2. Line 4 is the Nicod Axiom with S: | :T. | .T | T in place of P, with
T. | .T | T: | :S in place of £) and of R, and with Q in place of S. Line 5 is the
result of applying the Nicod Rule to lines 3 and 4.
Theorem 2. (nT.|.T|T
Proof: Line 1 is the Nicod Axiom with S|£): | :P |S. | .P |S:. |:.S |(): |:
P |S. | .P |S in place of P, with T. | .T |T:. |:.S \Q: |:P|S. | .P |S in place of Q and
of R, and with P. |.Q|R in place of S. Line 2 is Theorem 5 with S|£):|:
P | S. | .P | S in place of P, and with T. | .T | T in place of S. Line 3 is the result of
applying the Nicod Rule to lines 1 and 2. Line 4 is the Nicod Axiom. Line
5 is the result of applying the Nicod Rule to lines 3 and 4. Line 6 is Theorem 3
withS1^: |:P|S.|.P|Ss.|:.S|p:|:P|S.|.P|SinplaceofS,andwithP. \.Q\Rin
place of P. Line 7 is the result of applying the Nicod Rule to lines 5 and 6.
Theorem 8. |NP.|.^|fi::|::9|S:|:P|S.|.P|S:.|:.g|S:|:f>|S.\.P\S
in place of R. Line 5 is the result of applying the Nicod Rule to lines 3 and 4.
Line 6 is Theorem 2 with S in place of T. Line 7 is the result of applying the
Nicod Rule to lines 5 and 6. Line 8 is Theorem 3 with P. | .S | S: | :P. | .S | S in
place of S, and with S in place of P. Line 9 is the result of applying the Nicod
Rule to lines 7 and 8.
Theorem 12. |n P: | :Q| R. | .0| R:: | ::Q: \ :P\R. \ .P\R:. \ :.Q: \ :P\R. \
■P\R
Deriving all tautologies from a single axiom (axiom form) by a single rule in
terms of a single operator is thus seen to be possible. But it is a tedious business.
As Dr. Samuel Johnson is said to have remarked in quite another connection, it
is ., like a dog’s walking on his hind legs. It is not done well; but you are
surprised to find it done at all.’
io
A First-Order Function
Calculus
10.1 The New Logistic System RS1
290
Sac. 10.1J The New Logistic System RS, SSI
A1. B\ C\ AJ, B\, CJ, A\, B\, C\, ...
A2, Bl Cl Af, Bf, Cl Al Bl Cl-..
Al Bl Cl A\, Bl Cf, Af, Bl C\, ...
These are predicate constants, and on the system’s intended interpretation each
of them will designate a particular attribute or binary relation or ternary
relation... or n-ary relation according as its right superscript is ‘1’ or ‘2’ or ’3’
or ... or Vi\
4. Infinitely many capital letters from the middle part of the alphabet, with
and without subscripts, having right-hand superscripts T, ‘2^, ‘3\ ...
P\ Q\ R\ Pi Ql R\, Pi Ql Rl...
P2, Q\ R\ Pi Ql Rl Pi Ql Rl...
P\ Q*> Rl Pi Ql Rl Pi Ql Rl ...
These are predicate variables, and on the system’s intended interpretation will
be symbols for which names of particular attributes, binary relations, and so
forth, can be substituted. Symbols of the third and fourth categories are
predicate symbols.
5. Infinitely many lower-case letters from the first part of the alphabet, with
and without subscripts:
These are individual variables, and on the system’s intended interpretation will
be individual variables of the kind discussed in Chapter 4. Symbols of the fifth
and sixth categories are individual symbols.
7. Just four additional symbols complete the list of primitive symbols of RS1;
these are the tilde, the dot, and left- and right-hand parentheses:
■, (. )
the object language, and we shall also use brackets and braces to denote the
object language’s parentheses when that is conducive to easier reading. Capital
letters, with and without subscripts, will be used in* the metalanguage as
syntactical variables, that is, as symbols for which designations of any symbols
or sequences of symbols of the object language can be substituted. Lower case
letters, with and without subscripts, will be used as syntactical individual
variables, that is, symbols for which designations of individual symbols of the
object language can be substituted. Finally, we adopt the convention that the
juxtaposition of two symbols of the object language will be denoted in the
metalanguage by the juxtaposition of their names. Thus in any context where
4F’ denotes ‘A1’ and V denotes V, 4F(x)’ will denote *A1(jc)’. We shall also find
it convenient to insert commas into any sequence of symbols of the meta¬
language that designate individual symbols of the object language. Thus where
*F' denotes 4B3’ and4x1’,4x2\ and x3’ denote a2\ and 4a3\ respectively, we
shall use *F(xv x2,x3)’ to denote 'B*(axa2az)\
We introduce the symbols V, ‘D\ * = ’, and 43’ into the object language by
definition, and denote them in the metalanguage by the symbols V,4 D\ 4=\
and 43\ The new symbols of the object language are introduced as definitional
abbreviations:
We shall feel free to drop parentheses (in the metalanguage), retaining only
those required to avoid ambiguity, or to make for greater ease of interpreta¬
tion. We shall also, on occasion, write as *PQ\ Although we shall not
always take advantage of the following convention by dropping unnecessary
parentheses, we set up the following order of precedence among the symbols of
our metalanguage, each symbol having precedence over any that lies in a
column to its right:
= D v • —
(3x)
P=(x)Qv~~RS D (3x)T'U
is understood to denote the same formulas of the object language that are
denoted by ’.
Sec. 10.1] The New Logistic System US, 203
ip)= {{[(*K>M(~fl)-(S)]} D {[(3jt)T]•[[/]}}
Our final convention is that of association to the left, which means that where
the convention of order of precedence does not suffice to remove the ambiguity
of an expression, its parts should be grouped by parentheses to the left. That is,
when an expression contains two (or more) occurrences of the same connective,
and their relative scopes within the expression are not otherwise indicated, the
occurrence to the right shall be understood to have the wider (or widest) scope.
We define formula of RSt as any finite sequence of symbols of RSP Among
these are included such sequences as
)~)(
(x)(Al(x))
(&y)(B2(xy)))D(Cl(a))
(3 v ~ =
Q3(ab)
of which we shall want lo include only the second and third as well formedf,
that is, as meaningful on the system’s intended or normal interpretation.
Now we define well formed formula of RS { by the following recursive rules:
Now that we have an effective criterion for wff\ we shall restrict our
discussion in the remainder of this chapter to wffs, which alone interest us.
It is convenient at this point to introduce and define some additional special
terms. (There is no loss of generality in phrasing our definitioas in terms of
undefined symbols only, for defined symbols are always eliminable.)
We assume infinitely many postulates for our object logic. Every wff of any
of the five following patterns is a postulate:
PI .PD (P-P)
P 2. (P-Q) D P
P3. (PDQ)D[-~(Q-R)D~(R-P)]
P 4. (x)(P D Q) D [P D (*)£>], where x is any individual variable, P is any
wff containing no free occurrences of x, and Q is any wff.
P 5. (x)P D Q, where x is any individual variable, y is any individual
variable or constant, P is any wff, Q is the result of replacing each free
occurrence of x in P by y, and if y is a variable, then it must occur free in Q
at all places where x occurs free in P (that is, no bound occurrence of y in Q
is the result of replacing a free occurrence of x in P by y).
this statement has a true antecedent and a false consequent, but it is not an
instance of P 4, since x occurs free in (x = 1) Also prevented is such a patent
falsehood as
^i> P2> • ■ •» Pn 1“ Q
asserts that there is a finite sequence of wffs Sv S2,..., St such that for every
S,- (1 < / < t) either:
Df. Where P is any wff of RS,, its associated propositional formula (abbre¬
viated to a.p.f. and symbolized F°) is the formula that results from F by first
deleting all occurrences of quantifiers in F, together with all parentheses that
were required by those quantifiers, and then replacing every well formed part
of F of the form Pn(xv x2,..., xw) and every propositional symbol in F by the
propositional constant A.
As an example, where F is
((*1)((^2)(A2(^1r2)))) D (((xl)(B1(xl)))9((x2)(Bl(x2))))
it’s a.p.f. F° is
(A) D ((A)-(A))
In developing the present system, we shall borrow freely from the results
obtained in Chapter 8, in which it was established that every tautology can be
demonstrated as a theorem in the Propositional Calculus based on P 1, P 2, P 3,
and R 1. This is noted in the following theorem and derived rule.
In any proof in RSls any step that is justified by Th. O or DR O will be noted
simply by ® (for ‘Propositional Calculus’).
At this point we proceed to establish the first few derived rules and theorems
of RSX.
DR 3. (*)(P-p) [-(*)/’•(*)?
DR 4. ix)iP=Q)\-ix)P = i.x)Q
i- (*xp-0 =>(*)<?
whence I- (x)(P’Q) D (x)P’(x)Q ®
h (*)P DP P '5
I- (x)Q DQ P 5
I- {x)P-(x)Q D P-Q ®
\-(x)P-(x)Q D (x)(P-Q) DR 1
h[(x)(P.g) D(x)P-(x)Q]-
[(*)?•(*# D (x)(P-Q)] ®
h(*xp-p) = (*)p-(*)g df.
T h . 2. h(x)(PD(?)D[(x)PD(x)g]
Next we state and prove the Deduction Theorem for RSt, which corresponds
to the strengthened method of Conditional Proof used in Section 3.8 and
Chapters 4 and 5.
case 3. Sfc results from applying R 1 to two previous S's, say S4 and Sj where
S. is St D Sk. By the /8-case assumption, since i<kt j <k we already have
Pn D Si and Pn D Sjt which is Pn D (Sf D Sk). Here we insert the demonstra-
tion of’[PB D (St D SJ] D [(Pn 3 S() D (Pn 3 SJ] (a tautology), and PB 3 Sk
follows by two applications of R 1.
300 A First-Order Function Calculus [Ch. 10
From a, [i by weak induction we can thus fill in for any number of steps S;. of
the original demonstration. Moreover, no variable is quantified in the ‘filled in’
sequence which was not quantified in the original sequence. This proves
Metatheorem II (D.T.).
MT II. Corollary: The D.T. as stated above holds for any system of
logic which has only rules R 1 and R 2 and which contains demonstrations for
Proof: Obvious.
Some additional theorems which can be established quite easily using the
Deduction Theorem are
Th. 3. h (x)(P=Q)D[(x)P=(x)Q]
#T h . 4. h (x)(P D Q) D [(3x)P D (3x)Q]
Their proofs will be left as exercises for the reader. We now proceed to some
theorems that state equivalences. Since (3x) was introduced as an abbreviation
for ~~(x)~~, the next theorem,
T ii . 5. h(3x)P= ~~(x)~~P
Th. 7. h ~-(x)P=(3x)~~P
Th. 8. h ~~(lx)P=(x)~~P
Sec. 10.2] v««wv|viiieiii vi nvj wv
T H • 9- 1-(x)(y)P=(y)(x)P
Proof: b (y)P D P P 5
\~(x)[y)PDP] R 2
h (x)(y)P D (:x)P DR 5
b (x)(y)P D (y)(x)P DR 1
b (*)(«f)P=(y)(x)P by®
3Tlie last restriction does not limit the generality of the Metatheorem, since all occurrences of
the defined symbols v, D, =, and 3 can be replaced bv undefined symbols.
302 A First-Order Function Calculus [Ch. 10
(/8) Now suppose the Metatheorem true for any wff W that contains k or
fewer symbols, and consider a tvff W that contains k + 1 symbols. W must be
^L, (x)L, or MmN.
10.3 Duality
1. W: P-Q
WA: •—P v —f)
2. W:(x)(PvQ)
WA: (3x)(~P*~p)
3. W: (y)(3z)[Pv(~£>vR-S)]
WA: (3t/)(z)[~P-(>(~R v ~S)]
1. W = WAA
1. W is ~R.
case
Subcase A: R contains more than one symbol. Then (~R)A is ~(RA). By
the /8-case assumption, h ~~R = RA, hence by ® I- ~~R = ~(RA), or
|-R = (~R)A which is h ~W = WA.
Subcase B: R contains just one symbol, that is, R is a P{. Then W is ~P,
and WA is Pv By ® b ~~P( = Pt, whence h ~W = WA.
The familiar theorems of De Morgan are also special cases of the duality
theorem. And the general duality theorem permits us to formulate negations
with ease. For example, the proposition
may be symbolized as
(lx){C\x)-(y)[S'(y) D T*(y,x)])
%i)]}
and is the symbolic translation of the English sentence
For every course there is some student who does not take it.
Having established the Rule of Replacement and the Duality Theorem, some
additional theorems are very easily proved.
The proofs of the next two theorems are equally simple, and may be left as
exercises.
Proof: h (x)Q D Q P 5
\-QDQ ©
H Q => MV DR 1
©
i-(*)e=p
Th. 15.
Th. 16. h (x)(P-Q) = (x)P-Q
* D R 6. P D Q h (3x)P D <?
For the next two theorems we make the general assumptions that x and y are
any two individual variables, and F(x) and F(y) are exactly alike except that x
is free in F(x) at all and only those places that y is free in F(y):
Again in the presence of R 1, EG follows from P 5 via Th. 27. The restric¬
tions on EG are exactly the same as those on P 5. We may establish EG as a
derived rule of RS,:
Proof: Q premiss
Q D (3x)P Th. 27
(3 x)P R 1
DR 9 . If F(x) and F(y) are exactly alike except that x is free in F(x) at all
and only those places that y is free in F(t/), then F(x) h (y)F(y).
We define S' to be (r-) ... (.r. )(*vi ) • • • )S.. , which is (x-. ) .. . (X: )
,) • • • (xj')(XjJS^. Since S£ b by / — 1 uses of P 5 and R 1, it is obvious
from the existence of the original quasi-demonstration that there exists a
genuine demonstration that
(1) p„P2,...,Pn,S',Si,/-,(«/,),/•;(!/,),...,/-;„(!/,„)I-(,)
O') .
si,, /',(</,). /',< </,).!/,„ ,) i /■;„(*/„,) => y
Now by successive uses of R 2, Theorems 21 and 20, and the Rule of Replace¬
ment, we obtain from those same premisses first
then
I- (3?3 y
and finally
and finally by R 1
h Q
310 A First-Order Function Calculus [Ch. 10
in which R 2 is not used to quantify any variable free in any of the premisses
Sfc2> • • •, S^, F2(y2),..., The foregoing argument can
be used m times to obtain a demonstration that
(m) F1,P2,...,Pn,S'i,S'2,...,S;<hp
in which R 2 is not used to quantify any variable free in any of the premisses
S'ki, Sko, ..., Skt. Hence we can use the Deduction Theorem again to obtain
r-r>..s;,.,Ks;,Dt>
Since preceded in the original quasi-demonstration, we obtain also from
those same premisses
and finally by R 1
in which R 2 is not used to quantify any variable free in any of the premisses
Ski, S;2,..,, S;( t. The foregoing af^ument can be used t times to obtain a
genuine demonstration that
(m+t) PvP29...,Pn\-Q
(x) (A'(x))
(3 x)((B'(x))D(C*(xy)))
(y) ((3 z)(~(A3(xyz))))
We now define the term ‘scope’ as follows: If ((i)x)B is a well formed part (i.e.,
a wff that is a part) of a wff A, then the scope of that particular occurrence of
(Qx) in A is the particular occurrence of B which immediately follows ({)*).'*
And we define the phrase ‘initially placed’ in this way: A quantifier is initially
placed in a wff F if it*either stands at the beginning of For is preceded (except
for parentheses) only by other quantifiers, and its scope extends to the end of
F. It should be clear that an equivalent definition of ‘prenex normal form’ can
be stated as: A ivff is in prenex normal form if and only if all its quantifiers are
initially placed, no two quantifiers are on the same variable, and every variable
occurring in a quantifier occurs also at least once within the scof)e of that
quantifier.
We can now state and prove our next Metatheorem.
6This definition provides a more precise formulation of the notion of scope that was discassed
informally in Section 4.4. See pp. 86-87.
312 A First-Order Function Calculus [Ch. 10
case 1. All quantifiers in F are initially placed.
Subcase Li. At least two quantifiers are on tbe.same variable. Here F is
of the form
(QxJ(Qx2)...(Qxi)...(Qxi)...(Qxn)G
If we denote all of F that immediately follows the first (QxJ by A, then xt does
not occur free in A, and by Th. 14 or Th. 15, h (QxJA = A. Hence by the Rule
of Replacement the first of any initially placed quantifiers on the same variable
can be simply dropped out to yield an equivalent formula. So if all quantifiers
of F are initially placed, then there is an equivalent formula all of whose
quantifiers are initially placed and which has at most one quantifier on any
variable.
Subcase l.ii. There is a variable in F that occurs only in a quantifier.
Here we have F of the form
(Qxl)...(Qxi)...(Qxn)G
We define two new terms at this point: In any wff in prenex normal form
the group of quantifiers (Qxx)(Qx2)... ({)xn) is the prefix and the quantifier-free
formula G is the matrix.
EXERCISES
A wff in prenex normal form may contain free variables. We wish to be able,
without any loss in generality, to confine our attention to wffs containing no
314 A First-Order Function Calculus [Ch. 10
case 2. If P contains n free variables xv x2, ..., xni then G is the cwff
(Xi)(x2) ... (xn)P. For if b F, then b P\ and by R 2, b (xn)P,\ and by n uses
of R 2, we have b G. And if b G, that is, b (xl)(x2) ... (xn)P, then since
b (xjixz) ... (xn)P D (x2) ... (xJP by P 5, R 1 gives b (x2) ... (xJP, and by n
uses of P 5 and R 1, we have b P, and hence b F.
We prove now a further result that gives for any wff a. still more specialized
normal form wff that is a theorem if and only if the original wff is a theorem.
b G D {G-[D{t) D D(t)]} by ®
b(3t){C D {G-[D(t) D D(t)]}) by DR 8, (EG)
hGD(3t){G‘[D(()DD(t)]} by Th. 19 and R.R.
We also have
l-{G*[D(0 DD(()]) DG P 2
Kt){{G-[D(l)DD(f)]}DG) R 2
b(3t){G‘[D(t) DD(()]) DG Th. 21 and R.R.
Hence
h(3t){G-[D(i)DD(f)]} = G by ®
See. 10.S]
Normal Forms SIS
Now the prenex normal form of (3f) {G • [ D(t) D D(t)]} is the formula oftypeR
that was desired. It is dosed, it is in prenex normal form, it begins with an
existential quantifier, and 1- F if and only if I- R. Where G is (p*j) (Ox,) ...
(Qxn)G', then
I-(3t){G*[f)(f) D D(t)]) or
h (3tMQ*i)(Qx2) • • • (<?xn)G']-~[D(t)-~D(f)]}
(3t)(Q*d(Qx.i)... «,\){G'-~~[D(t)-~D(t)].}
By definition, any cwff in prenex normal form that begins with an existential
quantifier will be said to be of type R.
EXERCISES
For each of the following wffs, construct a formula of type R that is a theorem if
and only if the given wff is a theorem:
*1. (x)(y)(z){\H(x) v H(y)\ v II(z))
2. F(xvx»...,xn) D G(z)
*3. (3x)G(xyy)vF(z)
4. (3z)F(z,z) D (x)G(x1 y)
5. (x)F(x, y) D G(z)
Next, we present three new definitions, of which the third defines the
‘Skolem normal form’, which is a still more specialized type of well formed
formula, about which we shall establish a further Metatheorem.
Proof; For any wff F we can construct a formula of type R, call it Fv such
that b F if and only if b Fv (If F is already of type R, then F = Fv) If l<\ is of
rank 0, then it is in the desired Skolem normal form. If Fx is of rank k > 0, then
316 A First-Order Function Calculus [Ch. 10
we show how to construct a formula of type R of rank less than fc, call it F2.
such that h Fl if and only if h F2. We thus embark on a process that will yield
formulas F3, F4, f5, .. . all of type R and of decreasing rank. Ultimately, then,
we arrive at a formula F0 of type R and of rank 0 such that h F if and only
ifhF(,
If F\ is not in Skolem normal form, then its rank is greater than 0, that is, Fx
has the structure
First, we want to show that b F, if and only if b C. The two implications are
Hence, if b Fv then b C.
Now we substitute B(x.... x', y) for the predicate variable II{x\.x', r/)7
7The rule governing such substitution (rule of substitution for functional variables) is difficult
and complicated to express. Since this is our only use of it, we shall not state it here. A discussion
of alternative formulations can be found in A. Church, Introduction to Mathematical Logic /,
Princeton University Press, Princeton, N.J., 195b, pp. 193 ff., 289, and 297.
318 A First-Order Function Calculus [Clt. 10
The right-hand formula is the formula F2 of type R, such that h F if and only
if h F2. Now the rank of F2 is lower than the rank of Fv for their prefixes are the
same except that F2 has an additional universal quantifier in the final position.
This does not affect the rank. Another exception is that one of the universal
quantifiers ((y)) which preceded an existential quantifier has been replaced by
an existential quantifier, which lowers the rank by one. If F2 has rank 0 then it
is the Skolem normal form of F. If it is not, then the argument can be repeated
to obtain a formula F3 of type R of still lower rank. Hence we can ultimately
arrive at a formula of type R of rank 0, which is the desired Skolem normal
form.
EXERCISES
10.6 Completeness of
Nr (3x)F(x) D (x)F(x)
N2: (3x)(3y){F(x)-F(y)-[G(x) = ~~G(y)]} D (x)F(x)
N3: (3x)(3y)(lz){F(x)-F(y)-F(z)-[G(xj= ~G(i/)]-
[H(y) = *H(z)]-[l(x) = —/(«)]} ^ (x)F(x) .
asserts (respectively) that there is at most one individual, that there are at most
two individuals, that there are at most three individuals,... But if we want our
logical system to be applicable to any possible nonempty universe, regardless
of the exact number of individuals it contains, there is no Ni such that we want
either Nj or ^'Ni to be provable as theorems. A different kind of completeness
was proved for the propositional calculus presented in Chapter 8. That logistic
system was proved to be complete in the sense that every wff that (on its
normal interpretation) is a truth table tautology is provable as a theorem in the
system. That kind of completeness is admirably suited for a propositional
calculus, but it will not do for a functional calculus. A first-order function
calculus that did not have the wff
(3x)(y)F{x9 y) D (y)(3x)F(x, y)
8 The following discussion and proof of completeness is an adaptation to RSj of the completeness
proof for an alternative first-order function calculus given by Leon Henkin in his article The
Completeness of the First-Order Functional Calculus’, Vie Journal of Symbolic Logic, vol. 14
(1949), pp. 159-166. Professor Henkin simplified the published proof and he communicated it to
this writer in the spring of 1951. It is reproduced here with his kind permission.
The first proof of completeness for a first-order function calculus was published by Kurt Godel,
‘Die Vollstandigkeit der Axiomc der logischen Funktionenkalkuls, Monalshefte fiir Mathematik
und Physik, vol. 37 (1930), pp. 349-360.
320 A First-Order Function Calculus [Ch. 10
(a) Any well formed part — W is assigned the truth value T or F according
as the well formed part W is assigned the truth value F or T.
(b) Any well formed part of the form X* Y is assigned the truth value T if
and only if both X and Y are assigned the truth value T.
(e) Any well formed part (x)R is assigned the truth value T if and only if
R is assigned the value T, regardless of which element of the model is
assigned to all free occurrences of x in R.
(d) Any well formed part x2,. .., xn) is assigned the truth value T if
and only if the elements of the model assigned to xv x2,..., xn, in that
order, stand to each other in the n-ary relation assigned to Pn.
this it follows that (xn )S is valid, and (xn_ )S is valid,..., and finally that Sc
is valid. On the other hand, if Sc is valid, (x2)(x3)... (xH )S is assigned the value T,
regardless of which element of the model x1 is taken to denote, from which it
follows that (x3)... (xn )S is assigned the value T regardless of which elements xx
and x2 are taken to denote, from which it follows ... finally that S is assigned
the value T regardless of which elements of the model the free individual
variables xv x2,... ,xn are taken to denote, which means that S is valid. Hence
with no real loss in generality we can confine our attention to closed tvffs (tuffs
containing no free variables), which we shall refer to as ctvffs.
To prove our system complete, it is useful to introduce the notion of a set of
cwffs. We shall use the capital Greek letters Gamma and Lambda, with and
without subscripts, to denote sets of cwffs of RSt. We shall also make use of
braces to represent sets of cwffs: for example, the set whose only member is the
cwffS will be written {S}, the set whose only two members are the cwffs Sx
and S2, will be written {SA, S2), and so on. To assert that there is a demonstra¬
tion of the wff(? from the closed well formed premisses P,, P2,..., Pn, we may
write either
Pv P2.P„ h Q
or
{PvP2.?„}!-<?
If F is a set of cwffs and S is a cwff not belonging to F, then the new set
containing all members of V and S will be written as F U {S}. To assert that
there is a demonstration of Q from premisses belonging to V U {S} we may
write either
F U {S}\-Q or r, S\~Q
If there is a demonstration of the wff Q from the premisses Pv P2,..., Pn, then
there is, of course, a demonstration of Q from any set of cwffs F which contains
{Pv P2,..., Pn } as a subset (which will be written {Pv P2,..., Pn } C T), no
matter what other cwffs F may contain. The set F may even be infinite: There
is nothing in our proof of the Deduction Theorem that requires any revision to
accommodate this possibility. Of course, in any demonstration of Q from an
infinite set of premisses only a finite number of them will actually be used, for
a demonstration is always a finite sequence of wffs.
We defined a first-order function calculus to be complete if and only if every
Sec. 10.6] Completeness of RSt 329
valid wff is provable in it as a theorem, and then showed that the first-order
function calculus RSj is complete if and only if every valid cwff is provable in
it as a theorem. To say that every valid cwff is provable in RSt as a theorem is,
by transposition, to say that for any cwff S, if S is not a theorem, then S is not
valid.
In defining the terms ‘valid' and ‘satisfiable’, we remarked that for any wffS,
either S is valid or is satisfiable. Hence to say that S is not valid is to say
that —S is satisfiable. And so we can say that RSt is complete if and only if, for
any cwff S, if S is not a theorem, then — S is satisfiable. We can establish this
result by introducing a characteristic (p such that both
for from (1) and (2) the completeness of RSj will follow bv a Hypothetical
Syllogism.
We first define two attributes of sets of cwffs. A set A of cuffs is inconsistent
if A I—-PmP, otherwise A is consistent. It is obvious that any wff can be
inferred from an inconsistent set, for where S is any wff* b (~~P'P) v S by
®, which is logically equivalent to — PmP\- ~~(~~P*P) D S. Since b — (^PmP)
by ®, we have by R i that — P-P b S, whence A b S where A is any inconsist¬
ent set of cwffs and S is any wff whatever. Now the characteristic (j> referred to
above can be defined as follows: A cwff' S has characteristic q> if and only if
{—S} is a consistent set.
And we demonstrate (1) above as the following:
~Sh -~P-P
h D ^P-P
from which
b ~~(*P-P) 3-S
which by ® gives
bS
we prove a somewhat more general result, for the statement of which the term
‘simultaneously satisfiable' must be defined. The definition is this: A set A of
cwffs of a formal system is simultaneously satisfiable if and only if there is a
model and a normal interpretation of all formulas of A with respect to that
model which satisfies all the wffs of A. Now we state and prove
A b (*){[(3*)P,. 3 Pi] 3 by R 2
A b (3*)[(3*)p, 3 ^,] D ~~P'P by Th. 21 and R.R.
A h [(3*)p, 3 (3x)pt] D ~P*P by Th. 19 and R.R.
A h [(3*)p, D (3x)Pj] by ®
and finally
A h -f’-P
(a) Any cwff is assigned the truth value T or F according as the cwff S
is assigned the truth value F or T.
(b) Any cwffX • Y is assigned the truth value T if both X and Y are assigned
the truth value T; otherwise it is' assigned F.
(c) Any cwff (x)R is assigned the truth value T if the truth value T is
assigned to every result of replacing all free occurrences of x in R by an
individual constant of RS*; otherwise it is assigned F.
Sec. 10.6] Completeness of RS1 327
(d) Every cwff Pn(av a2,..., an) is assigned the truth value T if the
elements of the model assigned to av «2,..., an, in that order, stand to
each other in the n-ary relation assigned to PM, otherwise it is as¬
signed F.
We now can prove that all wffs of I' are satisfied on this interpretation. What
we shall actually prove is the stronger result that every cwff S of RSt* is
assigned the truth value T or F according as S belongs or does not belong to F.
Our proof is by strong induction on the number of symbols in S, counting each
propositional symbol, each cwff of the form Pn(av a2,..., an), each occurrence
of a universal quantifier (x), and each occurrence of • and — as a single symbol.
(a) In case n = 1, S is either a propositional symbol or a cwff of the form
Pn(ai, a2,..., an). If S is a propositional symbol, by paragraph 1 of the original
assignment of meanings, S is assigned the truth value T or F according as it is
contained or not contained in I". If S is the cwff Pn(al,a2.an) then by
paragraph 3 and condition (d), S is assigned the truth value T or F according as
it is contained or not contained in V.
(/3) Here we assume that every cwff of RSj* which contains fewer than k
symbols is assigned the truth value T or F according as it is contained or not
contained in T. Now we consider any cwff S of RS/ that contains exactly
k > 1 symbols. Here S must be either — W or X• V or (x)R.
case 1. S is —W.
If — W is in T, then since V is a maximal consistent set, W is not in F. Here,
by the /3-case assumption, since W contains fewer than k symbols, W is assigned
the truth value F. By condition (a), — W must be assigned T. Hence if S is in
r, it is assigned the truth value T.
If — W is not in T, then W is in F. Here W is assigned T, so by condition (a),
—W must be assigned F. Hence if S is not in l\ it is assigned the truth value F.
case 2. S is X-Y.
If X • y is in F, then by ® T I- X and V b y, so X and y are both in F. Since
each contains fewer than k symbols, by the /3-case assumption they are both
assigned the truth value T, and by condition (b), S is also.
If X* y is not in F, then not both X and y can be in V (for if they were, then
by ® T h X • y and X • Y would have to be contained in F), so not both X and y
are assigned the truth value T, and by condition (b), S is assigned the truth
value F.
case 3. S is (x)R.
If (x)R is in F then T h (x)R, and where Q is any result of replacing all free
occurrences of x in R by an individual constant, h \ x)R 3 ^ by P 5, so by R 1,
F h Q, Since F is a maximal consistent set, every such Q is contained in T, and
since each of them contains fewer than k symbols, they are all assigned the
328 A First-Order Function Calculus [Ch. 10
truth value T by the /3-case assumption. Now by condition (c), (x)R is assigned
T also. Hence if (x)R is in F, then (x)R is assigned.the truth value T.
If (x)R is not in F, then ~~(x)R is in F, whence F b ^(x)R, and by Th. 7 and
the Rule of Replacement, F h (3x)-^R. Since F is a maximal consistent set, it
must contain (3x)~~R. But (3x)-— R is a cwff of RS,* which begins with an
existential quantifier, and so the formula (3 x)~~R D ~~R* must have been
added to some to form A;-—where ~~R* is the result of replacing all free
occurrences of x in ^R by u{ . Since the cwff (3x)~~R D —R* belongs to
some such A;., it must belong to F. Hence, by R 1, F b — R*, so ^R* belongs to
F also. Since F is consistent, R* does not belong to T, and since it contains
fewer than k symbols, by the /i-case assumption, it is assigned the truth value F.
Hence by condition (c), (x)R is assigned the truth value F.
This completes the induction, and proves that all formulas of F, and hence all
those of A, are simultaneously satisfiable. Metatheorem X, together with the
discussion which preceded it, completes our proof of
From this new postulate, in the presence of the other postulates, rules, theo¬
rems, derived rules, and metatheorems of RSX already at hand, we can derive
the following results concerning identity.
9Thc idea of using this one axiom schema is credited to Hao Wang by W. V. O. Quine in Set
Theory and Its Logic, Harvard University Press, Cambridge, Mass., 1963, p. 13.
Sec. 10.7] RSj with Identity 329
Theorem I — 1: h [(* = y)*Fx] D Fy
Theorem 1-2: \- y = y *
R 2
II
II
Theorems 1-1 and 1-2 are the usual axioms for identity.
Proof: We begin with an instance of Th. 1-1 in which ‘Ft/ is *^(u = *)’
n
7T
Th. 1-1
II
I
X
II
H
I
II
H
II
II
The four ‘Principles of Identity’ stated in Section 5.4 can be established here as
derived rules of inference.
DR I - 1. x = y\- y = x
Proof: x = y premiss
y =* Th. 1-5 & R.R.
DR I - 2. Fx, y = x\- Fy
Proof: y =x premiss
x = y DR l-l
Fx premiss
(x = y)‘Fx ®
[(* = y)’Fx1 Fy Th. 1-1
Fy R 1
called extended function calculi, and if they are consistent, they are demon¬
strably incomplete.10 But these more advanced parts of symbolic logic lie
beyond the scope of the present book.
Of course, we shall need the four logical symbols —% *, (, and ). Those are all
the primitive or undefined terms we need.
We can define the additional logical symbols v, D, =, and 3, for our object
logic, as in Section 10.1. Then we can define wff for our system recursively as
follows:
We assume the familiar five postulates of RSt together with the axiom
schema for identity, P 6 of Section 10.7. We assume the two familiar rules of
inference R 1 and R 2 of Section 10.1, and the definition of demonstration. It is
easy enough to define = in terms of £ as in ZF-1, and then to define * and*
as obvious abbreviations. All of the ZF axioms can be symbolized m this
system. Then, we can either add the ZF axioms as additional axioms or else
take them all together as premisses to every argument whose conclusion is
specifically set theoretic or mathematical.
So far so good.
There is a new paradox lurking, however. We have 6een, in our completeness
proof for RSj, how that system is satisfied by a denumerable model. On the
assumption that the ZF Set Theory is consistent, it, too, has a denumerable
model. This was proved by Thoralf Skolem. But the ZF system contains both an
axiom of infinity and a power set axiom, which permits proof that there are
nondenumerable sets. Here is the Skolem Paradox: A system which has a
denumerable model contains a theorem that asserts the existence of nonde¬
numerable sets.
How shall we understand this situation? It cannot be said that there is
general agreement about what has gone wrong. Here, for what they may be
worth, are two of the proposed resolutions.
One thing that can be said is that the set membership relation simply has not
been (cannot be?) completely described or defined formally. So the binary re¬
lation in our model, which serves as interpretation for £, is not set member¬
ship but something only formally parallel to it. In short, this proposed resolu¬
tion says that the possibility of a denumerable model for a nondenumerable
theory has shown only that our model is ‘nonstandard’, and is not what we were
trying to talk about or specify.
Another way out of the Skolem Paradox is based on the realization that
equinumerousness or equivalence of sets is established only by actually pro¬
ducing a function that is or effects a one-to-one correspondence between them.
It may be that the function needed to map a set onto its power set simply
cannot be defined in the system. If the set is denumerable, then its power set
can be proved nondenumerable in the system only because the function that
would enumerate its members does not exist in the system. It is as though the
power set, seen from outside the system, can be proved equinumerqus with the
denumerable set, but to specify the one-to-one correspondence requires a
different, more powerful system than ZF. This proposed resolution has more
adherents than one might have thought. Intuitively, sets are the size they are,
quite independently of what functions are available to map one onto another.
This view seems to me to be an extreme kind of operationalism—which
appears to suggest that if you canno^t measure something, then it does not exist!
It is heartening to be able to end a textbook that has no prerequisites with a
problem that has not yet been solved.
APPENDIX
Incompleteness of the
Nineteen Rules
The nineteen Rules of Inference presented in Sections 3.1 and 3.2 of Chapter
3 are incompletewhich is to say that there are truth functionally valid
arguments whose validity cannot he proved using only those nineteen Rules. To
discuss and establish this incompleteness, it is useful to introduce the notion of
a characteristic that is ‘hereditary with respect to a set of rules of inference’.
We offer this definition: A characteristic {\> is hereditary with respect to a set of
rules of inference if and only if whenever <1> belongs to one or more statements,
it also belongs to every statement deduced from them by means of those Rules
of Inference. For example, truth is a characteristic that is hereditary with
respect to the nineteen Rules of Inference presented in the first two sections of
Chapter 3. As we have already said, any conclusion must be true if it can be
deduced from true premisses bv means of our nineteen Rules of Inference.
Indeed, we should not want to use any rules of inference with respect to which
truth was not hereditary.
Nowt, to prove that a set of rules of inference is incomplete, we must find a
characteristic d> and a valid argument a such that
(1) <l> is hereditary with respect to the set of rules of inference; and
(2) <I> belongs to the premisses of (x but not to the conclusion of (x.
iThe following proof of incompleteness was devised by my friend Professor Leo Simons of the
University of Cincinnati. See Leo Simons, ‘Logic Without Tautologies’, Notre Dame Journal of
Formal Logic, vol. 15 (1974), pp. 41J-431, and More Logics Without Tautologies’, ibid., vol. 19
(1978), pp. 543-557. For some discussion of the matter, see Hubert L. Armstrong, ‘A Question
about Completeness’, ibid., vol. 17 (1976), pp. 295-296, and Paul J. Campbell, ‘An Answer to
Armstrong’s Question about Incompleteness in Copi\ ibid., vol. 18 (1977), pp. 262-264.
333
To obtain such a characteristic, we introduce a three-element model in terms
of which the symbols in our nineteen Rules can be, interpreted. The three
elements are the numbers 0,1, and 2, which play roles analogous to those of the
truth values true (T) and false (F) introduced in Chapter 2. Every statement
will have one of the three elements of the model assigned to it, and it will be
said to assume, take on, or have one of the three values 0, 1, or 2. Just as in
Chapter 2 the statement variables p, </, r,..., were allowed to range over the
two truth values T and F, so here we allow the statement variables p, q, r,... to
range over the three values 0, 1, and 2.
The five symbols *•’, V’, ‘ D’, and that occur in our nineteen Rules
can be redefined for (or in terms of) our three-element model by the following
three-valued tables:
—p =2 — p
p-q = max(p, q)
pvq = min(p, q)
p D q = min(2 — p, q)
p=q = max(min(2 — p, q), min(2 — q, p))
anf Wi? S!*0W that havin8 the value 0 is hereditary with respect to
0 us o ns an disjunctive Syllogism also. That it is hereditary with respect
to Hypothetical Syllogism can be shown by the following table:
P 9 r p Da t/Dr n ~
0 0 0 0 0 0
0 0 1 0 1 1
0 0 2 0 2 2
0 1 0 1 0 0
0 1 1 1 1 1
0 1 2 1 1 ' 2
0 2 0 2 0 0
0 2 1 2 0 1
0 2 2 2 0 2
1 0 0 0 0 0
1 0 1 0 ] 1
1 0 2 0 2 1
1 1 0 I 0 0
1 1 1 1 1 1
1 1 2 1 1 1
1 2 0 1 0 0
1 2 1 1 0 1
1 2 2 1 0 1
2 0 0 0 0 0
2 0 1 0 1 0
2 0 2 0 2 0
2 1 0 0 0 0
2 1 1 0 1 0
2 1 2 0 1 0
2 2 0 0 0 0
2 2 1 0 0 0
2 2 2 0 0 0
value 0, the expressions flanking the equivalence sign necessarily have the same
value. For example, in the table appropriate to. the first of De Morgan’s
Theorems, * '
the equivalent expressions (p*r/)* and p v ~~ q' have the same value in
every row even though the statement of their equivalence fails to possess the
value 0 in rows two, four, and five. It should be obvious, however, that having
the value 0 is hereditary with respect to the replacement of all or part of any
statement by any other statement that is equivalent to the part replaced.
Alternative proofs that having the value 0 is hereditary with respect to the
nineteen Rules make use of our analytical definitions of the logical symbols. For
example, that having the value 0 is hereditary with respect to Constructive
Dilemma can be shown by the following argument. By assumption,
‘(p D q)'(r D s)’ and ‘p v r both have the value 0. Hence both p D q and
V D s’ have the value 0, so either p = 2 or q = 0 and either r = 2 or $ = 0.
Since p v r has the value 0, either p = 0 or r = 0. If p = 0, then p ^ 2,
whence q = 0, and if r = 0, then r ^ 2 whence s = 0. Therefore, either q = 0
or s = 0 whence ‘q v s’ has the value 0, which was to be shown.
Once it has been established that the characteristic of having the value 0 is
hereditary with respect to the nineteen Rules, to prove the incompleteness of
those Rules, one need only exhibit a valid argument whse premisses have the
value 0, but whose conclusion does not have the value 0. Such an argument is
ADB
.‘.A D (A*B)
whose validity is easily established by a truth table. Where ‘A’ has the value 1
and ‘B’ has the value 0, the premiss ‘A D B’ has the value 1 D 0 = 0 but the
conclusion ‘A D (A#B)’ has the value 1 D (1*0) = 1 D 1 = 1. Therefore, the
nineteen Rules of Inference are incomplete.
APPENDIX
The capital Greek letters pi and sigma are often used to express generalized
logical products (conjunctions) and generalized logical sums (disjunctions). We
define
n n n n
p‘S^=and pvii^=n^v^)
i=l i= l i=l i=l
p=fp-{q\q)]
p=[pv(q-q)]
P'[(q v r)*(s*t)]
pv[(qvr)-(s-t)]
P v [(q v r) v (s • £)]
p v [(q v r) v (sv t)]
pvqvrvsvt
In this expression, the negation symbol is applied only to the single variables q
and s.
The principle of Distribution permits us to change any given formula into an
equivalent one in which conjunction symbols occur—if at all—only between
single variables or their negations, or else into a different equivalent form in
which disjunction symbols occur—if at all—only between single variables or
their negations. Thus the expression
(1) (P v t/)*(rvs)
in which conjunction symbols connect single variables only. And the expression
(3) (p-q)v(r-s)
(p v 7) ^ (p = <7)
and then to
(pvq)v(p = q)
(pv<7)v[(p.<j)v(p^)]
(p-q)v[(p-q)v(p-q)]
Now, by the principle that the conjunction of any statement p with a tautology
of the form qvq is logically equivalent to the statement itself, we obtain first
pv[(pvp)-q\
and finally
pvq
which is logically equivalent to, but much simpler than the form with which
we began.1
The term ‘normal form’ is sometimes reserved for more specific types of
expressions. These more specific types are also called ‘Boolean Expansions’ or
‘Boolean normal forms’, after the British logician George Boole (1815-1864). A
formula containing the variables p, </, r,... is said to be in disjunctive Boolean
normal form, or a disjunctive Boolean Expansion, when it is in disjunctive
normal form, when every disjunct contains exactly one occurrence of every
variable (either the variable or its negation), when the variables occur in
alphabetical order in each disjunct, and when no two disjuncts are the same.
The disjunctive Boolean Expansion of pvq is formed by first replacing p by
p*(qv q) and q by (p v p)*q to obtain the equivalent expression p• (q v q) v
(p v p)-c/, then using the Disjunction and Association rules to obtain (pmq) v
(p9q) v (puq) v (pmq), and finally cancelling out any repetitions of the same
disjunct by the rules of Commutation and Tautology, which results in
(p*q)v(puq)v(p9q). The disjunctive Boolean Expansion of any noncontra¬
dictory formula can be obtained by the same general method.
A formula containing the variables p, q, r, ... is said to be in conjunctive
Boolean Expansion asserts only that at least one of its disjuncts is true, any
disjunctive Boolean Expansion containing n variables and 2n disjuncts is
tautologous. This point is made in somewhat different terms in Chapter 8,
Section 8.6, and again in Chapter 9, Section 9.2.
It was pointed out in Chapter 2 that a truth-functional argument is valid if
and only if its corresponding conditional statement (whose antecedent is the
conjunction of the argument’s premisses and whose consequent is the argu¬
ment s conclusion) is a tautology. Since counting the number of disjuncts of its
disjunctive Boolean Expansion permits us to decide whether or not a given
form is a tautology, this provides us with an alternative method of deciding the
validity of arguments. Thus the argument form p v qt ~**p . \ ^is proved valid
by constructing the disjunctive Boolean Expansion of its corresponding condi¬
tional [(p vq)-p] D q, and observing that the number of its disjuncts is 22.
Since the negation of a tautology is a contradiction, an argument is valid if
and only if the negation of its corresponding conditional is a contradiction.
Hence another method of deciding the validity of an argument is to form the
conjunctive Boolean Expansion of the negation of its corresponding conditional
and count the number of its conjuncts. If it contains n distinct variables and has
2n conjuncts, then the argument is valid; otherwise it is invalid.
EXERCISES
*1. p = p 9. pv(q-r-q)
2. pv(q-p) *10. p D [(p D q) D </l
3. pv(p-q) 11. (pvq)-(qvr)-(pvr)-r
4. q -(p v q) 12. (p-q) v (q-r)v (p-r) v r
*5. p = (q-p) 13. p • {pD [</•(</ => r)\)
6. p-(p v q) 14. p v {p D [q v (q D r)l>_=
7. p-q-p *15. {pvq) 3 [Ip 3 q) D (</ v*/)l
8. (pv q)*q 16. (p-q) D [(p ^ q) ^ (p*q) 1
17. [p'[(q-r) vp]}v {(p-f/)-[r D (p-F)|) v (p-[p D (p-q^F)]}
18. (p-q) v (p-q-r) v (p-q) v (p-q-r) v (p-q)v (p-q-r) v (p-q-r)
APPENDIX
c
The Ramified Theory
of Types1
The simple theory of types, which was expounded briefly on pages 153-154,
suffices to eliminate such logical paradoxes as that of the alleged attribute
impredicable. But there is another kind of paradox or contradiction that is not
prevented by the simple theory of types. One example of this other kind is the
paradox of the liar, which was discussed on page 214. These two kinds of
paradoxes were first explicitly distinguished by F. P. Ramsey in 1926.2 Since
then those of the first kind have been known as ‘logical paradoxes’, the second
kind either as ‘epistemological paradoxes’ or, more usually, as ‘semantical
paradoxes’.
A singularly clear example of a semantical paradox is due to Kurt Grelling.3
The Grelling paradox can be stated informally as follows. Some words desig¬
nate attributes that are exemplified by the words themselves, for example,
‘English’ is an English word, and ‘short’ is a short word. Other words designate
attributes not exemplified by the words themselves, for example, ‘French’ is not
a French word, and ‘long’ is not a long word. We shall use the word ‘hetero-
logical’ to designate the attribute a word has just in case it designates an
attribute not exemplified by the word itself. Thus the words ‘French’ and ‘long’
are both heterological. The contradiction now arises when we ask if the word
‘heterological’ is heterological. If it is heterological, then it designates an
attribute that it does not exemplify, and since it designates heterological, it is
not, itself, heterological. But if it is not heterological, then the attribute that it
designates is exemplified by itself, so that it is heterological. Here the contra¬
diction is made explicit: if it is, then it isn’t, and if it isn’t, then it is.
A somewhat more formal derivation of the Grelling paradox, patterned after
'The reader who desires a more comprehensive discussion of this topic should consult Chapter
3 of I. M. Copi, The Theory of Logical Types, Routledge & Kegan Paul Ltd., London, 1971,
pp. 76-114.
2F. P. Ramsey, ‘The Foundations of Mathematics’, Proceedings of the London Mathematical
Society, 2nd series, vol. 25 (1926), pp. 338-384. Reprinted in The Foundations of Mathematics,
Harcourt Brace Jovanovich, New York, and Kegan Paul, Trench, Trubner & Co. Ltd., London,
1931, pp. 1-61.
3K. Grelling and L. Nelson, ‘Bemerkursgen jpu den Paradoxien von Russell und Burali-Forti’,
Abhandlung der Fries*schen Schulet n.s. vol. 2 (1907-1908), pp. 300-324.
344
The Rammed Theory of Types 34s
Ramsey s version of it,* is the following. Where Des’ designates the name
relation, so that s designates <p’ is symbolized as sDes</>’, we begin with the
definition
CASE 1
1. Het(‘Het’) D KSs^^Het’Desc/) • Tlet’Desi// = ^ $ = c/> • ~~<>(‘Het’)
2. D :*Het'Des^> • Tlet’Desi^ == ^ = </>• —</,(‘Het’)
3.
D :%flet’Des$• ‘Ilet’DesIIcl El Ilet = </>• -~<;>(‘Ilet’)
4. 3 :‘Het’DesHet EE Ilet = <>-^<;>(4 5Het’) «
5. D :Het = (/>• ^(‘Hct’)
6. 3 :-Het(‘IIef)
CASE 2
1. —Het(‘ilet’ D :(</>) -1‘Ilet’ 1 )es</>• ‘Het’Desi// 7= ^4, = —^(‘1 let')]
2. 3 :‘IIet’DesHet D ~[‘Het’Dea£ ~ v,
4' = 1 let • ~1 Iet(‘I let’)]
3. 3 :‘IIet’Des^ = ^ 4, = Ilet- D •Ilet('Het’)
4. D :‘I Iet’Desy = ^ y = Met (assuming ‘Ilet’ univocal)
5. D :Het(‘llet’)
Hence
llet(TLet’) ~ -Het(‘HeO
which is a contradiction.
The derivation of fhe preceding contradic tion did not violate any restrictions
imposed by the simple theory of logical types, so that theory does not prevent
such contradictions. Contradictions of this second kind can be eliminated,
however, by the adoption of a more complicated version of the theory of
logical types, known as the ‘ramified' (or ‘branching') theory of logical types.
Now, according to the simple theory of logical types, all entities are divided
into different logical types, of which the lowest contains all individuals, the
next lowest consists of all attributes of individuals (designated by functions of
individuals), the next, of all attributes of attributes of individuals (designated by
functions of functions of individuals), and so on. There are also hierarchies of
relations among individuals and attributes, and of attributes of relations, but
they need not be considered here. Where we use lower-case letters to denote
individuals and capital letters to designate attributes, then we can represent
4Op cit., p. 358 and pp. 369-372; reprint p. 27 and pp. 42-46.
5Where writing the psi as a subscript to the equivalence symbol abbreviates the universal
quantification of the equivalence with respect to psi, so that ‘sDesi! ~ ^ ^ = </>’ is an abbreviation
for 4(t//)[sDcs^ = $ = 4>]\
this hierarchy as follows, where the subscript attached to a function indicates
its proper level in the hierarchy: . .
Just as the simple hierarchy of types prevents us from speaking about all
functions or attributes and permits lis to speak only about all functions of
individuals or all functions of functions of individuals, and so forth, so the
The RamHM Theory of Typos 347
miCUFJ-FM D FJjb)}
Instead, we can say that Bob has all of Al’s good first-order qualities, sym¬
bolized as
(%){[xG2(%y%(a)] D %(b)}
D%m
or that Bob has all of Al's good nth order qualities, for some specified n. It
should be noticed that the attribute of having all of Al\s first-order attributes,
symbolized
=> %(*)}
is a second-order attribute.
The preceding formulation of the ramified theory of logical types is crude,
but it suffices to eliminate the Grelling paradox. In Case 1, the step from 2 to
3 is rejected on the grounds that the function ‘Met’ is of higher order than the
function variable V/A and may not be instantiated in its place. In Case 2, the
step from 1 to 2 is likewise rejected on the grounds that the function ‘Het’ is
of higher order than the function variable ‘</>* and may not be instantiated in its
place.
Another feature of the ramified theory of logical types is that it divides
propositions into a hierarchy of propositions of different orders. Just as any
function (of any type) is a first-order function if it makes no reference to any
totality of functions of that type (i.e., if it contains no quantifier on any function
variable of that type), so any proposition is said to be a first-order proposition if
it makes no reference to any totality of propositions (i.e., if it contains no
quantifier on any propositional variable). In general, a proposition is of order
n -f 1 if it contains a quantifier on a propositional variable of order n but
contains no quantifier on any propositional variable of order m where m > n.®
The restriction here is that we can never refer to all propositions, but only to
all propositions of this or that specified order. Thus we cannot say that ‘None
6^ principle Mathematics propositions are also divided into different orders on the basis of
differences in the orders of the functions they contain. We shall ignore that subtlety in the present
exposition, however.
of the propositions uttered by Smith tends to incriminate him’, which we might
partially symbolize as
but can say instead either that ‘None of the first-order propositions uttered by
Smith tends to incriminate him’, or that ‘None of the second-order propositions
uttered by Smith tends to incriminate him’, and so forth. We would partially
symbolize the second of these alternative propositions as
of logical types, at least of its ramified version, would seem to be its prevention
of the paradoxes.
There are difficulties as well as advantages with the ramified theory of
logical types. One of these concerns the notion of identity. The usual definition
of identity (of individuals) is
(x = y) =df(F1)[F1(x) = Fl(y)]
from which definition all of the usual attributes of the identity relation can be
deduced. But that definition violates the ramified theory of logical types, since
reference is made in it to all functions of type 1. Were we to replace it by the
definition
(x = y) = di('F1)['I<\(x)='Fl(,j)]
which provides that x and y are identical if they have all of their first-order
attributes in common, the possibility arises that x and y might be identical (in
this sense) and still have different second-order attributes. For any n, defining
identity as the sharing of all attributes of order n would permit “identical”
individuals to differ with respect to attributes of order m where m > n. If we
accept the restrictions of the ramified theory of logical types, then we cannot
define the identity relation, and even if we accept the relation of identity as
primitive or undefined, we could not state all the rules for its usage.
Other disadvantages of the ramified theory of logical types are more tech¬
nical, and will merely be mentioned. Mathematicians wish to establish their
theorems for all functions (of numbers), but they cannot do so if they are bound
by the hierarchy of orders. Moreover, certain existence theorems in analysis,
such as that of the Least Upper Bound, cannot be proved within the restrictions
of the ramified theory of types. The Cantorean theory of the infinite, which is
basic to nearly all of modern mathematics, cannot be established within the
rigid framework of the ramified type theory. And even the principle of
mathematical induction must be abandoned in its full generality, for its
complete statement is prevented by the ramified theory of logical types.
To relax the excessive restrictions of the hierarchy of orders, the ‘Axiom of
Reducibility’ was introduced. That axiom asserts that to any function of any
order and any type there corresponds a formally equivalent first-order function
of the same type (two functions are formally equivalent when, for any admis¬
sible argument, they are either both true or both false). In the presence of this
axiom, the identity relation can be satisfactorily defined in terms of first-order
functions and all of the disadvantages mentioned in the preceding paragraph
disappear. The question naturally arises: Does the Axiom of Reducibility relax
the restrictions of the ramified theory of logical types sufficiently to permit the
reintroduction of the paradoxes? If all of the semantical terms such as ‘true*,
‘false’, and ‘designates’ are added to a logical system like that of Principia
Mathematica, which contains both the ramified type theory and the Axiom of
Reducibility, and names for all functions are added also, then at least some of
the paradoxes reappear.9 On the other hand, if the names of some functions are
left out, then even in the presence of the semantical terms, the paradoxes do
not seem to be derivable even with the aid of the Axiom of Reducibility.10
It may not be out of place here to indicate briefly how the ‘levels of
language* method of avoiding the semantical paradoxes11 is remarkably similar
to the ramified type theory’s hierarchy of orders.12 Confining our remarks to
the Grelling paradox, we note that this paradox does not arise in an object
language (like the extended function calculus, for example) when we assume
that there are no symbols in it that designate symbols. Now does it arise in the
metalanguage of that object language. Since the metalanguage contains syno¬
nyms for all symbols of the object language and names for all symbols of the
object language as well as its own variables and the name relation (which we
write as ‘Des’), the symbol ‘Het’ can be defined in it. By definition:
The way in which the safeguards of the levels of language theory serve to
prevent this contradiction makes it very similar to the theory of orders. The
definition of Het in the meta-metalanguage that was written above requires the
addition of subscripts to resolve its ambiguity. Once these ambiguities are
pointed out and resolved, the contradiction vanishes.
In the first place, the meta-metalanguage contains two symbols for the name
relation, DeSj and Des2 .13 The first of these is the meta-metalanguage’s
synonym for the name relation in the metalanguage. The full sentence
vDes^
4
fDes2c/>
Het/f)
Het2(u)
Het/c) = df (3<t>):vDesrf>-vDesx\p = ^
and
It is clear that we cannot define He^ in terms of Des2 because the values of the
arguments of the two functions are terms of different languages, of the object
13As was suggested by Ramsey in a somewhat different context, op. cit., p. 370, in reprint,
p. 43.
language for He^ and of the metalanguage for Dcs2. For the same reason, Het2
cannot be defined in terms of Desr
No version of the Grelling can be derived fron? the definition of Hetls
because the only values of its arguments are terms of the object language, and
there is no term of the object language analogous to either ‘Het^ or ‘Het2\ The
only possibility lies in the direction of deriving a contradiction from the
definition of Het2, and this is thwarted by something remarkably like Prin-
cipias theory of orders.
In the definition of Het2 we cannot substitute for V the name of the symbol
for that function, because although the function symbol 4Het2’ occurs in the
meta-metalanguage, no name of that function symbol occurs there. The best
we can do is to substitute the meta-metalanguage’s name of the function
symbol of the metalanguage that is synonymous with ‘Het1’, for which we do
have a name in the meta-metalanguage (call it “Het”). Making the substitution,
we have
Since “Het” is the meta-metalanguage’s name for the function symbol of the
metalanguage that is synonymous with *Het1’, we have
and consequently
Het2(‘Het’fD -Het^Het )
pp. 52-84. See especially p. *50 in vol. 4 and pp. 00-61 m reprmt.
"*
Solutions to
II. I. 1. A D B
2. C D D
3. Hv-D)*(-Av-B) /.*. —A v —C
4. (A D B)-(C D D) 1, 2, Conj
5. —B v 3, Simp.
6. ~A v —C 4, 5, D.D.
5. 1. (R D ~S)-(T d ~r/)
2. (V D ~W)-(X D ~~Y)
3. (T D W)-(t/ =3 S)
4. Vvfl /:,. —Tv — 17
5. V D ~W 2, Simp.
6. R D ~S 1, Simp.
7. (V D <—W)*(R D ~S) 5, 6, Conj.
8. ^Wv —S 7, 5, C.D.
9. ~~Tv 3, 8, D.D.
IV. 1. 1. (A vG)DS
2. A-T /.\s
3. A 2, Simp.
4. A v G 3, Add.
5. S 1, 4, M.P.
1. (—K mP) D (B v R)
. 2. —K D (B D D)
3. K v(R D J£)
4. — K-P /.*. DvE
5. —K 4, Simp.
6. B D D 2, 5, M.P.
7. R D E 3, 5, D.S.
8. (B D D)-(fl D E) 6, 7, Conj.
9. BvB 1, 4, M.P.
10. DvE 8, 9, C.D.
10. 1. Tv(EDD)
2. T D C
3. (E D G) D (D D /)
4. (r-T ~C) D (D D C)
20. 1. OD(FDQ)
2. P D (Q D R) /:. OD(PD R)
3. (OP) 3 Q 1. Exp.
4. (P-Q) DR 2, Exp.
5. (Q-P) DR 4, Com.
6. QD (PDR) 5, Exp.
7. (O-P) D(PDR) 3, 6, Fl.S.
8. OD[PD(PD R)] 7, Exp.
9.0: [(P-P) D R] 8, Exp.
10. O D (P D R) 9, Taut.
25. 1. J-(K-L)
2. (K-J) D [Mv(NvO)]
3. ~M-(~P—0)/.\N
4. (J-K)-L 1, ASSOC.
5. J’K 4, Simp.
6. K-J 5. Com.
7. Mv(NvO) 2, 6, M.P.
8. ~M 3, Simp.
9. NvO 7, 8, D.S.
10. 0\N 9, Com.
11. (~M-~P)—O 3. Assoc.
Ml SotuHons to Saiectad Exercises on Page* 45-92
IV. 1. 1. S D G 5. 1. (G-L)v(W-T)
2. ~S D E /.'.GvE 2. —G /.\T
3. ~G D ~S 1, Trans. 3. v ~L 2, Add.
4. ~G D E 3, 2, H.S. 4. ~(G-L) 3, De M.
5. --G v E 4, Impl. 5. W’T 1, 4, D.S.
6. GvE 5, D.N. 6. T>W 5, Com.
7. T 6, Simp.
10. 1. BvT 15. 1. (WDM)*(/DE)
2. (BvC)D (L-M) 2. Wv/
3.-l /.*.r 3. (W D -£)•(/ D ~~Af) /.’. E =
4. -Lv-Af 3, Add. 4. MvE 1, 2, C.D.
5. 4, De M. 5. —E v 3, 2, C.D.
6. -(BvC) 2, 5, M.T. 6. £ D -Af 5, Impl.
7. -B--C 6, De M. 7.--MvE 4, D.N.
8. ~~B 7, Simp. 8. ~~Af D E 7, Impl.
9. T 1, 8, D.S. 9. (E D D E) 6, 8, Conj
10. E=~~M 9, Equiv.
21. 1 .(TDE)-(ADL) /.'. (TvA) D (EvL)
2. TDE 1, Simp.
3. -TvE 2, Impl.
. 4. Kv£)vL 3, Add.
5. -Tv(EvL) 4, Assoc.
6. (EvL)v-T 5, Com.
7. (AD L)-(T D E) 1, Com.
8.ADL 7, Simp.
9. -AvL 8, Impl.
10. (-AvL)vE 9, Add.
11. —A v (L v E) 10, Assoc.
12. -Av(EvL) 11, Com.
13. (EvL)v-A 12, Com.
14. [(E vL)v-!]•[(£ vL)v —A] 6, 13, Conj.
15. (EvL)v(~~T-~A) 14, Dist.
16. (~~T-~A)v(EvL) 15, Com.
17. ~(TvA) v(£vL) 16, De M.
18. (Tv A) D (EvL) 17, Impl.
5. 1. (V3 ~W)-(X 3 Y)
2. (~W 3 Z)-(Y 3 ~A)
3. (Z 3 ~B)-(—A 3 C)
4. V-X /. ’. —8-C
5. V D — W 1, Simp. 5'. -(-fl-C) I.p.
6. V 4, Simp. 6'.-B v —C 5', De M.
7. —W 5, 6, M.P. 7'. -Z v-A 3, 6', D.D.
8. —W D Z 2, Simp. 8'. ——vv v — y 2, 7', D.D.
9. Z 8, 7, M.P. 9'. — V v —X 1, 8', D.D.
10. Z D —8 3, Simp. 10'. -(V-X) 9', De M.
11. —8 10, 9, M.P. 11'. (V-X)--(V-X) 4, 10', Conj
12. (X D Y)-( V D W) 1, Com.
13. XDY 12, Simp.
14. X-V 4, Com.
15. X 14, Simp.
16. Y 13, 15, M.P.
17. (Y D -A)*(-W D Z) 2, Com.
18. Y D -A 17, Simp.
19. —A 18, 16, M.P.
20. (—A D C)-(Z D ~B) 3, Com. -
21. -ADC 20, Simp.
22. C 21, 19, M.P.
23. — B-C 1J, 22, Conj.
7.
r
6. C
8. C 3
6, D.N.
6-7, C.P.
9. D 5, 8, M.P.
10.ADD 3-9, C.P.
p> 2. By
3. (By v Vy) 3 (OyDy) U UI
2, Add.
4. By v Vy
3, 4, M.P.
5. OyDy
5, Com.
6. DyOy
0, Simp.
7. Dy
2-7, C.P.
8. By 3 Dy
8, UG
9. (x)(Bx 3 Dx)
382 Sohittoiis to Selected Exercises on Poges 77-83
20. 1. (x){(Bx v Wx) 3 [(Ax v Fx) 3 Sx]} (x)[Bx 3 (Ax 3 Sx)]
-*2. By
—> 3. Ay
4. (ByvWy) 3 [(Ay v Fy) 3 Sy] 1, UI
5. ByvWy 2, Add.
6. (Ay v Fy) 3 Sy 4, 5, M.P.
7. AyvFy 3, Add.
| 8- Sy 6, 7, M.P.
9. Ay 3 Sy_ 3-8, C.P.
10. By 3 (Ay 3 Sy) 2-9, C.P.
11. (x)[Bx 3 (Ax 3 Sx)] 10, UG
Aa Ac Ba Be
"T T T F~
Ka Kb La Lb
T F T F
or F T F T
Ka Kb La Lb Ma Mb
T F F T T T
or F T T F T T
5. ~~Ca-Ra 3, Com.
6. ~~Ca 5, Simp.
7. —Va 4, 6, M.T.
8. Ra 3, Simp.
9. Ra'-^Va 8, 7, Conj.
10. (3x)(Rx-~Vx) 9, EG
10. (*)[T* 3 (Ox v Ex)] B Ta 3 (iOa v Ea) Ta Oa Ea
(x)[(Ox-Tx) 3 ~£x] (Ofl'Ta) D — Ea "X F T"
(3x)(7x-£'x) Ta-Ea
.'. (3x)(71c*Ox) Ta-Oa
Exercises on page 88:
I. (3x)Dx 3 (3x)(Py-By)
5. (*)[Bx 3 (Yx 3 Rx)]
10. (*){(Ox-Px) 3 [(y)(Mj/ 3 —Py) v Afx]}
15. (*){[Sx-(t,)(Sy 3 Wy)] 3 Wx}
Exercises on page 92:
I. 1. d and /
5. none
II. 1. a, b, and c
2- g
Exercises on pages 100-101:
3. The use of UG at line 4 is wrong, because v (V) occurs free in the assumption
'Fx = Gy’ within whose scope the premiss Ok (‘Fx = Gy) lies.
6. There are three mistakes here:
a. The use of UG at line 5 is wrong, because v (V) does not occur free in
(‘(3x)[(Fx*Gz) D Hy]') at all places that ju (*y) occurs free in O/a
C(3x)[(Fx-Gy) D Hy]').
b. The use of UG at line 5 is wrong, because the variable v (%z) occurs free in line
2 within whose scope (‘(3x)[(Fx*Gz) D Hy]') lies.
c. The use of UI at line 7 is wrong, because v (V) does not occur free in Qv
(‘(3x)[(Fx-Gx) D Hx]’) at all places that /a (Y) occurs tree in 4>/a
(\3x)[(Fx-Gy) D Hy]').
9. There are three (or four) mistakes here:
Line 3, ‘Fx'Gy’, is claimed at line 5 to be a corresponding to the (‘Fx-Gx) in
line 1. This claim is false regardless of whether we take v to be Y or to be *y . So
in either case we note that
a. The use of El at line 5 is wrong, because v (x* or 'y') does not occur free in <S>v
(‘Fx'Gy’) at all places that /a (Y) occurs free in <!>/a (‘Fx-Gx’).
If we take v in $v ({Fx'Gy ) to be x , then we note that
b. The use of El at line 5 is wrong, because v (Y) occurs free in p (*Fx’)t the last
line lying within the scope of the assumption (‘Fx-Gy').
In any case, there are two additional mistakes present:
c. The use of El at line 8 is wrong, because v (Y) occurs free in line 5 which
precedes (4~Fx*Gx').
d. The use of El at line 8 is wrong, because v (Y) occurs free in p ( —Fx ), the last
line lying within the scope of the assumption <t>v (‘—Fx-Gx*).
364 Solutions to Selected Exercises on 103-104
8. 1. (x)[Rx D (SxvMx)]
2. (x)[(Ux-Rx) D -Sx] /:.(x)(Ux 3 Rx) 3 (y)(Uy 3 My)
-> 3. (x)(Ux D Rx)
r-> 4. Uz
5. Uz D Rz 3, UI
| 6. Rz 5, 4, M.P.
7. Uz-Rz 4, 6, Conj.
8. (Uz*Rz) D —Sz 2, UI
9. 8, 7, M.P.
10. Rz D (Sz v Mz) I, UI
11. SzvMz 10, 6, M.P.
12. Mz II, 9, D.S.
13. Uz D Mz 4-12, C.P.
14. (yXUy D My) 13, UG
15. (x)(Ux D Rx) D (y)(Uy D My) 3-14, C.P.
Aa Ab Ba Bb Ca Cb Da Dh
~~T T T F F T F F"
orTTFTTFFF
5. (*)(3y)(Fx=Gy) _
,’.(3y)(x)(Fx=Gy) [aj>\
[(Fa = Ga) v (Fa = Gb)] • [(Fb == Ga) y (Fb = Gb)]
.'. [(Fa = Ga)’(Fb = Ga)] v [(Fa = Gb) • (Fb = Gb)]
Solutions to Mooted Exercises on Pages IOS-115 367
Fa Fb Ca Gb
T F T F
or T F F T
or F T T F
or F T F T
5. ->1. (3*)(Fx-Gx)
p* 2. FyGy
3. Fy 2, Simp.
4. (3x)Fx 3, EG
5. Gy 2, Simp.
6. (3x)Gx 5, EG
7. (3x)Fx*(lx)Gx 4, 6, Conj.
8. (lx)Fx-(3x)Gx 1, 2-7, El
9. (3x)(Fx-Gx) D [(3x)Fx-(3x)Gx] 1-8, C.P.
15 . -* 1. (3x)Fx 3 (y)Cy
2. F*
3. (3x)Fx 2, EG
1, 4, M.P.
4. (y)Gy
4, UI
5. Gy
2-5, C.P.
6. Fx 3 Gy
7. ij/)(F* 3 Gy) 6, UG
8. (x)(y)(Fx 3 Gy)__ 7, UG
17. CZ 3, 5- 16, El
4-17, C.P.
18. 4 3 17
3-18, C.P.
19. 3 3 [18]
370 Solution* to Selected Exercises on Pages 134-139
9. 1. {x)[{Px‘~~Rxx) 3 (y)(Py 3
2. (y){Py D (x)[(Px- ^Ryx) D *Hyx]) /. .[x){[Px-(z)(Pz 3
. • D (y)(Py D
-> 3. Px-(z)(Pz D ~~Rxz)
r-» 4. Py
5. (z)(Pz D ~~Rxz) 3, Simp.
6. Px D ~~Rxx 5, UI
7. Px 3, Simp.
8. ~~Rxx 6, 7, M.P.
9. Px-~~Rxx 7, 8, Conj.
10. (Px—Rxx) D (y)(Py D ~~Ryx) 1, UI
11. (y)(Py => ~Ryx) 10, 9, M.P.
12. Py D — Ryx 11, UI
13. ~~Ryx 12, 4, M.P.
14. Px*~~Ryx 7, 13, Conj.
15. Py D (x)[(Px-~~Ryx) D ~Hyx\ 2, UI
16. (x)[(Px-~~Ryx) D Hyx\ 15, 4, M.P.
17. (Px'~~Ryx) D — Hyx 16, UI
18. ~Hyx 17, 14, M.P
19. 4 D 18 4-18, C.P.
20. (y)(19) 19, UG
21. 3 3 20 3-20, C.P.
22. (jc){21} 21, UG
L (x)(y)(z)l(RxyRyz) 3 Rxz]-(x)—Rxx
2. (x)(y)(z)[(RxyRyz) 3 Rxz\ 1, Simp.
3. (y)(z)[(Rxy-Ryz) 3 Rxz] 2, UI
4. (z)[(RxyRyz) 3 Rxz] 3, UI
5. (RxyRtjx) 3 Rxx 4, UI
6. (x)—Rxx 1, Simp.
7. —Rxx 6, UI
8. -(Rxy-Ryx) 5, 7, M.T.
9. ~~Rxy v — Ryx 8, Dc M.
10. Rxy D ~~Ryx 9, Impl.
11. (y)(Rxy 3 -Ryx) 10, UC
12. (x)(y)(Rxy 3 —Ryx) 11, UG
13. {1} 3 12 1-12, C.P.
14. (fl){13) 13, UG
1.
r>2. Fx
3. ^~~~Fx 2, D.N.
4. Fx 3, D.N.
5. Fx 3 Ft 2-4, C.P.
6. (Fx 3 Ft)-(Ft 3 Ft) 5, Taut.
7. Fx = Ft 6, Equiv.
8. (F)(Ft = Ft) 7, UG
9. ((/)[(* = y) = (F}(Fx=Fy)i 1, UI
10. (t = t) = (F)(Ft = Ft) 9, UI
11. [(t =t) 3 (F)(Ft = Ft)l-
[(F)(Ft = Ft) 3 (t = *)] 10, Equiv.
11, Simp.
12. (F)(Ft= Ft) 3(t = t)
12, 8, M.P.
13. X = X
13, UG
14. (t)(t = t)
Exercises on pages 175-176:
T h . 2. There is at most one entity 1 in C such that a H 1 = a
Proof: Suppose there are entities 1, and 12 such that « n 1, = a, a D 12 a.
Then 1, n 12 = 1, and 12 0 1, = 12. 1, H 12 = 1, n 1, by Ax « 6 .
Hence lt = 12 by two substitutions of equals for equals.
374 Solutions to Selected Exorcises on Pages 175-176
Th. 7. 0^1
Lemma 2. «n(aUj8) = a
L. 2
s
?-
II
Solutions to Selected Exercises on Pages 176-226 S7S
F. o D [(a U ft) U y] = a L. 1
P n [(« u ft) u y] = [ft n (« u ft)] u (p n y) Ax. 8
= [ft n (ft U a)] u (ft n y) Ax. 5
= pu(ftny) L. 2
G. ft n [(a U ft) U y] = ft L. 1
y ft [(a U /}) U y] = y n [y U (a U ft)] Ax. 5
II. y n [(a U ft) U yj = y L. 2
I. (ft U y) n [(« Uft)Uy\=ftL)y by L. 3 from G, H.
J. [a U (ft U y)] fl [(« U ft) U y] = n U (ft U y) by L. 3 from F, /.
« U (ft U y) = [a U (ft U y)] D [(« U ft) U yj J
= [(«Uj8)Uy]n[« U (ft U y)j Ax. 6
a U (ft U y) = (a U ft) U y E
2. S(D,«) is functionally incomplete, for it contains no wff that can express a truth
function whose value is false when all its arguments have the value true. Proof by
strong induction on the number of symbols in the wff g(P, Q> R,...) of S( D,*).
a) If g(P, p, R,.. ) contains just one symbol, it is either P or Q or R or ...: If these
are all true, then g(P, p, R,...) cannot have the value false.
P) Assume that any wff containing <m symbols cannot be false when all its
376 Solutions to Selected Exercises on Pages 226-244
arguments are true. Now consider any wff g(P, Q, fl>...) containing exactly m
symbols (m > 1). It must be either gj(P, Q, R,.%..) D g2(Pf Q,R,...) or
g i{P, Q, K,.. .)-g2(P, QRt.. .).Butg x(Pt Qy R,.. .)andg2(P, Qy R,...) each contains
<m symbols, hence they must have the value true if all their arguments are true.
And because T D T and T*T are both T, g(P, Q,Ry...) cannot be false when
all of its arguments have the value true. Hence no wff of S( D, •) can be false when
its arguments are all truet so S(D,*) is functionally incomplete.
6. S(D, + ) is functionally complete, for the of R.S. is definable in S(D,+) as
~~P = df P -|- (P D P), and the *•* of R.S. is then definable in S(D, + ) as
P*Q = df ~~[P D (P 4- £)]. Hence any truth function expressible in R.S. is ex¬
pressible in S(D, + ), whose functional completeness now follows from that of R.S.
DR 6. PDQ,QDR\-PD R
Proof PDP Th. 7
PDQ pr.
Q D R pr.
PDR DR 5
DR 10. PDRyQDRh(PvQ)DR
Proof: PDR Pr-
QDR pr.
(PvQ) D (Rv R) DR 9
~~fl D '*-'R~*'R Ax. 1
[~~R D ~~R^R] D [~~(-^R~~R) D ~R] Th. 5
-(~~fl-R) D-~fl R 1
(fl v R) D-~R df.
-~fl D R Th. 2
(PvQ) DR DR 5
Solutions to Selected Exercises on Pages 244*290 377
Th. 14. !-[Pp 3 R] 3 [P 3 (() 3 R)]
1)R 14. P, () h PQ
Q Pi¬
ll 1
‘V
Th. .3 0. P(Qv R) ~ PQ v PR
Proof: h PQ 3 P Ax. 2
I- PR 3 P Ax. 2
|-(P<;)vPR) D P DR 10
h PP 3 ()P Th. 8
h PP 3 £> Ax. 2
1- P(1 3 0 DR 6
h PR 3 HP Th. 8
h RP 3 R Ax. 2
1- PR 3 R DR 6
f-(PpvPR) 3 (()vR) DR 9
h(PpvPH) 3 P(QvR) DR 8
l-P«DvR) 3 (^)v R)P Th. 8
378 Solutions to Selected Exercises on Pages 250-282
h (QvR)PDQPvRP Th. 18
hP(QvR) D QPyRP DR 6
\-QP=PQ Th. 21
hP(QvR) D PQvRP MT IV, Cor.
h RP=PR Th. 21
hP{QvR) D PQvPR MT IV, Cor.
hP(QvR) = PQvPR DR 14, df.
DR 1. PDQ, Q D R \prP D R
rH. 1. |jr(Pv<?)D(evP)
Proof: 1. [Q D (pvP)] D {(PvQ) D [Pv(QvP)]) Ax . 4
2. Q D (QvP) Ax. 2
3. (PvQ)D [Pv(pvP)] Rule
4. [Py(QvP)] D [Qv(PvP)] Ax. 3
5. (PvQ) D [Qv(PvP)] DR 1
6. [(PvP) DP] D {[pv(PvP)] D {QvP)} Ax. 4
7. (PvP) DP Ax. 1
8. [Qv(PvP)] D (QyP) Rule
9. (PyQ) D(QyP) DR 1
DR 6. P 3 £> h (3x)P 3 Q
Proof: P D Q premiss
(x)(P 3 (?) ' R 2
(3x)P 3 Q Th. 21 and R.R.
5. p = (q*p)_ _
(P*9’P) vp-(q*p)=
(P'(I‘P)v [p*(q v P)]_
(P’9'P)v (P'9) v (p'p)
(p-q-p) v (p-q)
(r/'p-p) v(p-q)
(q-p-p^v (q-q)vjp-q)
q-[(p’p) vq]v(p-q)
q-[qv (p^pj\v (p-q)
(<T9)v(p-q)
(p-q)v(q-q)
p-q
10. p 3 Up 3 9) 3 9]
p v (p vq) v q
pv (p * q) v q
p_v(p*q)vq
[p*(q vq)] v(p-q) v [q-(p vp)J
(p-q) v (p-q) v (p-q) v (q -p) v (q-p)
(p-q) v (p-q) v (p-q) v (p-q) v (p-q)
(p-q) v (p-q) v (p-q) v (p-q) v (p-q)
(p-q)v(p-q)v(p-q)v(p-q)
[p-(q vq)] v [p-(q vq)]
pvp
PAGE PAGE
>
0, A, 0 172, 192
58, 97
1, v 172
(*) 65
c 175
(3*) 65
e 177
66
t 177
* 68
c 178
V 71, 91
{ } 178
f*
91
{*: l'x) 179
—-
141, 171, 328
<P 179
* 143, 173
MM
382
PAGE
PAGE
XsY, X ~ Y 203
ha, n<j4, n{,o£ A},
n{x: * € A}, 183
s{a) 204
184 207
A - B <p(x, y)
185 q 209
A\B
210
<*>y> 185 «„ K
261, 292
X~ Y 188, 215, 261, 291 V
hiA 262
u+ 192, 207
281
193 1
281
m <n 196 1
196 hi 283
Hn
D 292
m + n 196, 206, 227
198 = 292
311
199 F° 295
X Mj
296
199 ®
i
200 W* 301
B4
w4 302
R 201
311
« 202 (Q*i)
PAGE PAGE
sc 321 345
<t>
345
K 325 t\ \
337 345
V =*
f[Pi
337
i= 1 !P 348
338
i=l
INDEX
385
386 Index
Association (cont.) B
principles of, 39, 102, 171, 244, 249, 250-
252, 273, 337 Balboa, 27 * *
Assumption Baldwin, James Mgrk, In.
discharged, 58, 60 Ball, W. W. Rouse, 162n.
hidden, 167 Bar-Hillel, Yehoshua, 180n., 199n., 212
of limited scope, 57-59, 75, 90, 98 Bar symbol, 171, 337
Asymmetrical relations, 134-135 Base of a system, 166, 264
Atkin, Eugene, ix Bass, Walter A., ix
Attribute variables, 150 Bauer-Mengelberg, Stefan, 179n., 191n.
Attributes, 63, 320, 345 Beard, Robert W., ix
of attributes, 150-155, 345 Beaulieu, Marc, ix
of different orders, 346-347 Beaulieu, Richard, ix
of formal deductive systems, 164-166 Belfiglio, Anthony, ix
predicable and impredicable, 152-154 Beman, W. W., 191n.
of relations, 134-136, 143 Benacerraf, Paul, viiin., 195n.
and sets, 177-178 Bernays, Paul, 81n.
Aussonderung Axiom, 179-180 Bernstein, B. A., 283n.
Austin, J. L., 192n. Beta case, 223
Auswahl, Axiom der, see Choice axiom Biconditional, 29, 335-336
Aut, 11 Binary
Axiom, see Chapter 6 fractions, 201-202
of F.S., 237 functions, 222
of H.A., 261 relations, 117, 134
of L.S., 237 Binding of free variables, 90, 111; see also
of N, 283 EG, UG
of PQ, 237 Blackwood, R. T., ix
of Pn, 237 Blum, Alex, ix, 8n.
of R.S., 227 Bohnert, Herbert G., ix
of RSlf 294 Boole, George, 7, 341
of ZF, 178, 180, 181, 182, 184, 185, 193, Boolean
198, 199, 207, 211 algebra, 175-176
Abstraction, 180 expansion, 172n., 341-343
of Boolean algebra, 175 Booth, John Wilkes, 141, 146
Choice, 198, 199 Both, 13
Class algebra, 175 Bound
Comprehension, 180 least upper, 349
Empty Set, 181 occurrence of a variable, 84, 293
Extensionality 178 Brackets, 276-279; see also Parentheses
of Identity, 328 Bradley, M. C., 258n.
Infinite list, 227 * Branching theory of types, 345
of Infinity, 193 Browning, Lorin, ix
Multiplicative, 198 Brutus, 140
Pairing, 182 Burali-Forti, C., 209n.
Peano’s, 191 Burali-Forti Paradox, 209, 21 In.
Power Set, 185 Bybee, Michael, x
Reducibility, 349
Regularity, 211 c
Replacement, 207
schema, 180 C D., 34
Separation, 180 C.'N.F., 274
Specification, 180 C.P., 51, 57-58
of Subsets, 180 .cwff, 314, 322
Union, 184 Caesar, 140
Calculus of the method of deduction, 54, 257-258
extended function, 331, 350 of the natural deduction’ apparatus, 328
first-order function, see Chapter 10 Compound statement, 8-9, 63, 218
function, 259 Comprehension, 180, 190
predicate, 290n. Commutation, principles of, 39, 42, 102,
propositional, 164, 169, 176, 259; see also 171, 249, 250-252, 273, 338
Chapters 8 and 9 Conclusion, 3-5, 55, 56
sentential, 169 Condition
Campbell, Paul J., 333n. necessary, 18
Cantor, Georg, viii, 176, 177, 179, 189, 190, sufficient, 18
195, 201, 210, 349 Conditional
paradox, 190 corresponding, 29-30, 50, 54-55, 343
Canty, John Thomas, 258n. proof, 49-52, 54-55, 56-60, 250, 298, 307
Card, 195 statement, 16-18, 29-80, 49-51, 54-55,
Cardinal number, 190, 195-202, 210 56-60, 67, 218, 343
Carnap, Rudolf, 153n. Conj., 34
Carroll, Lewis, 141 Conjunct, 8
Cartesian Product, 186, 197 Conjunction, 8-9, 69, 116, 176
Castor and Pollux, 8 principle of, 34, 102, 242, 245, 247
Categorical Conjunctive normal form, 274-276, 340
proposition, 170, 173 Boolean, 341-343
syllogism, 174 Connective, 9, 13
Characteristic function, 201 Consequent, 16
Choice Axiom, 198, 199-200 Consistency, 61-62, 79n., 164-165, 231,
Church, Alonzo, 227n., 237, 266n., 279n., 295-296, 323-328
317n., 350n. Constant
Circle, vicious, 158, 160 individual, 63, 64, 85, 191, 320, 324, 326
Class, see Chapter 7 predicate, 291
addition, 175 propositional, 290
Algebra, 163, 170-176 Constructive dilemma, 34, 249, 335
complement, 171 Contingent, 28, 61-62
empty, 172, 175 Continuum hypothesis, 210
expression, 172-173 Contradiction, 28, 53, 55, 59-60, 61-62, 153,
inclusion, 170, 174 164, 176, 179-180, 214-215, 342; see
intersection, 170 also Appendix C
multiplication, 175 Contradictory, 10, 28, 61-62, 66, 68, 173
product, 170 Contraries, 66, 68
simple-term, 172 Convention
union, 170 of association to the left, 50, 219-220, 293
universal, 172 governing 12, 219, 292
Clemens, Samuel, 141 governing ‘ 219, 292
Closed, 314, 322 governing and ‘^p’, 91-92
Closure, 321 governing juxtaposition, 292
Cohen, Paul J., 180n., 200, 210, 212 governing order of precedence, 219-220,
Com., 39 292-293
Comey, D. D.t 260n. governing order of statement variables, 20,
Complement, 171, 184-185 27
absolute, 182 governing punctuation, 277
relative, 184-185 governing scopes of quantifiers, 67n.
Completeness Converse nonimplication, 227
deductive, 165-166, 231, 250, 253, 257- Conversion, 173-174
258, 260, 264, 274-276, 318-328 Coordinates, 186
expressive, 164, 218 Copi, I. M., 60n., 146n., 153n., 209n., 258n.,
functional, 218. 220-225, 281-282 331n., 333n., 344n., 350n.
388 Index
M N, 282-289
Nagel, Ernest, 331n.
M.P., 33, 34, 228 Napoleon, 155
M.T., 33, 34, 249 Natural deduction, 71n., 114, 307-311, 328
Many-one, 187 Natural numbers, 188, 190-195
Many-valued logics, 259 Negation, 10, 13, 65, 176, 337
Mapping, 190 double, 29, 40, 102, 171, 241-242, 247,
Mars, 9 249, 250, 337
Marsh, Robert Charles, 153n. quantifier, 109-110
Martin, James A., 8n. Negative proposition
Material particular, 66
equivalence, 28, 40, 176, 219, 250 singular, 64
implication, 17, 21-24, 28, 40, 102, 250 universal, 66
nonimplication, 227 nEgO, 66
Mathematical induction, 185, 204, 223-226, Neither, 13
349 Nelson, L., 344n.
Matrix, 313 Newman, James R., 33 In.
Matthew, 130 Newton, Isaac, 157
Max, 334 Nicod, J. G. P., 282-289
Maximal consistent set, 325 Nondenumerable, 189, 210, 332
Mediate inference, 174 Non-Euclidean geometry, 161-162, 166
Membership, 177 Nonimplication
Metalanguage, 213-215, 350-353 converse, 227
semantical, 213-215 material, 227
syntactical, 213-214 Nonreflexive relations, 136
Metatheorem, 223 Nonsymmetrical relations, 135
Meyers, C. Mason, 8n. Nontransitive relations, 135
Micah, 71 Normal form, 311-318, see also Appendix B
Middle, excluded, 241 Boolean, 341-343
Min, 334 conjunctive, 274-276, 340
Minogue, Brendan P., ix disjunctive, 258, 340
Mirimanoff, D., 211 prenex, 311
Model, 78-81, 117, 105-107, 121-122, 136- Skolem, 315
137, 232-236, 320-322, 326 of type R, 315
three-element, 232-237, 262-263, 334 ^ Normal interpretation, 218, 220, 221, 228,
four-element, 234, 264 230, 238, 254, 261, 320, 321, 326
six-element, 265 Nu, 71, 91, 141n.
system of logic, 260, 276, 284 Null-class, 181
Modus Number, 190, 191, 192
Ponens, 22-24, 33-34, 40-41, 89, 224, 225, cardinal, 190, 195-202, 210
228, 249, 283-284 finite, 78, 136-137, 184, 185, 188, 190,
Tollens, 22-24, 32-34, 40-41, 249 195, 196-197, 205, 322
Monadic function, 220 infinite, 184, 188, 190, 196-197, 205, 227,
Montague, R., 21 In. 322
Mu, 91, 130, 141n. , natural, 189, 190-195
Multiplication ordinal, 190, 202-210
cardinal, 197-199 . transfinite, 205, 208, 209
ordinal, 206 • Numerals, Arabic and Roman, 6-7
Multiplicative Axiom, see Choice Axiom Numerical propositions, 145
Index 393
0 Phi, 65, 68, 90-92
Pi, 337-3.38
O proposition, 66-69, 170, 173-174, 304 Place marker, 64
Object, 177 Plato, 116, 117, 118, 151
language, 213-215, 227, 350-353 Plus symbols, 192, 227
Obverse, 173 Poincar£, Henri, 157
One-to-one, 188 Polish notation, 279-280
Operationalism, 332 Pollux, 8
Operators, 215, 223, 279-280 Ponewiy see Modus Ponens
stroke and dagger, 281-282 Porte, Jean, 266n.
Opposition, Square of, 66, 68, 304 Porter, William Sidney, 141
Or, 10-14, 16 Possible nonempty universe, 78-81, 319-320,
Order, 185-186, 190 321
of functions, 346 Post, E. L., 161, 259
partial, 202-203 criterion for consistency, 164, 231, 296
of precedence, 219, 277, 292-293 Postulate, sec Axiom
of propositions, 347-348 Power set, 185, 189-190
of quantifiers, 121 Axiom, 185
serial, 196 Precedence, order of, 219, 277, 292-293 ■
simple, 196, 203, 201 Predicable, 152
total, 196, 203 Predicate
type, 204 calculus, 290m.
of words, 13 constant, 150, 291
Ordinal number, 190, 202-210 symbol, 291, 320, 326
term, 63, 170
P variable, 1.50, 291
Prefix, 313
P0, 237 Premiss, 3
PN, 237 suppressed, 137
Pair set, 178, 192 understood, 229
ordered, 185 Prenex normal form, 311
unordered, 182 Primary occurrence, 149
Paradox Primitive
Cantor, 190 operators, 223, 281
epistemological, 344 symbols, 215, 260, 290
Crclliug, 344, 347, 3.50, 352 terms, 166, 260
heterological, 344 Principal Matheuuitica, 7, J53».. 260, 331 n.,
liar, 214. 348 347n., 348ri., 349-350, 352
logical, 344 Prior, A. N., 283n.
of material implication, 28 Proclus, 161
Mirimanoff, 211 Product of classes, 170
Russell, 152-153, 179, 180 Proof
semantical, 344 of analyticity, 230-231, 284, 296, 321
Skolem, 332 circular, 158
Parallel postulate, 160-161, 166 conditional, 49-52, 54-55, 56-60, 250, 298,
Parentheses, 12, 64n., 67, 219, 276-279, 307
292-293 of consistency, 164-165, 231, 295-296
Parenthesis-free notation, 279-280 of deductive completeness, 253-257, 264,
Part, well formed, 293, 311 274-276, 318-328
Particular propositions, 66-68 of deductive incompleteness, 264-265
Peano, Guiseppe, 7, 191, 192, 193, 194 formal 32-34, 39-42, 50, 59, 89-90, 228
Peirce, Charles, 1, 7, 281 n. of functional completeness, 220-225, 281-
law, 62 282
Pfeiffer, John E., 341 n. of functional incompleteness, 226
394 Index
Proof (cont.) universal, 65, 85-86
of incompleteness of rules, 333-336 words, 126
of independence of axioms, 165, 231-237, Quantifier
262-266 existential, 65
indirect, 52-54, 55, 59-60, 61-62, 250 initially placed, 311
of invalidity, 20, 24-25, 48-49, 78-81, negation, 109-110
105-107 order of, 121
in R.S., 240-241 rank, 315
reductio ad absurdum, 52-54, 55, 60n., scope, 67, 84, 86-87, 111, 311
61-62, 161 universal, 65
shorter, 52, 61-62, 102 Quaternary relations, 117
of tautology, 54-55, 61-62 Quine, W. V. O., 194n., 21 In., 212, 281n.,
of validity, 32-34, 39-42, 61-62 283n., 328n.
versus demonstration, 240-241
Proposition, 2, 5; see also Statement
categorical, 170, 173-174 R
general, 64-69, 78
multiply general, 83-88 R.R., 301
negative, 64, 66 R.S., 218, 264
numerical, 145 RSlf 290
orders of, 347-348 RS/, 324
particular, 66-69 Ramified theory of types, see Appendix C
relational, 116-127 Ramsey, F. P., 344, 345, 348n., 351n.
singly general, 83 Ran, 187
singular, 63-64, 84 Range, 187
subject-predicate; 66-69, 170 Rank, 315
universal, 66 Rankin, Terry L., ix
Propositional Rasiowa, 237
calculus, 164, 169, 176, 259; see also Real variable, 84n.
Chapters 8, 9 Reasoning, 1-2
constant, 290 Recursive
function, 64, 67, 84-55, 89-92, 118 definition, 216-217, 261, 283, 293, 331
symbol, 215, 290, 320, 326 rule, 261, 283, 293
variable, 216, 290 Reductibility axiom, 349-350, 353
Protasis, 16 Reductio ad absurdum, 52-54, 55, 60n., 61-
Proverbs, 15, 71, 130 62, 161
Psalmsy 15, 71, 130 Redundance, 40-41, 165
Psiy 68, 90-92 in punctuation, 277
Psychology, 1 Referring, 145, 146, 148
Punctuation, 12-13, 67, 219, 276-279, 292- Reflexive relations, 135-136, 202
293 Refutation by logical analogy, 19-21
Putnam, Hilary, viii, 146n. Regression, 158, 160
Pythagoras, 158-159 Regularity Axiom, 211
Relations, see Chapter 5
many-one, 187
Q in set theory, 185-187
Relative complement, 184-185
QN, 109-110 Replacement
Quantification, 64-69, 85-86, 90, 150-155, Axiom Schema of, 207
259; see also Chapter 10 mle of, 39-40, 248, 271, 301
existential, 65, 85-86 Representation, 253-254
order of, 121 Representative, 192, 195-196, 209-210
rules, 71-76, 89-100, 113-114; see also Restricted function calculus, 290n.
EG, EX, UG, UI Riemann, 161, 259
Index 395
Rigor, 162-163, 166-168 function, 346-347
Ring, Elizabeth Muir, ix proposition, 347-348
Robinson, R. M., 205n. Self-evidence, 160-161
Rockefeller, 5, 20 Sellars, W., 353n.
Romans, 130 Semantical, 168-169, 213, 218n., 220
Rosser, J. Barkley, 218, 259 paradoxes, 214-215; see also Appendix C
Rule terms, 349
of conditional proof, 49-52, 54-55, 56-60, Sentence, 2, 5
250, 298, 307 Sentential calculus, see Calculus,
derived, 229, 238-239 propositional
of indirect proof, 52-54, 55, 59-60, 61-62, Separation, 180, 190
250 Serial order, 196
of inference, 33-34, 39-42, 72-75, 89-100, Set, 176-212
102, 113-114, 228, 261-262, 283, 294, empty, 180-182
333-336 grounded, 211
of logic, 167 inductive, 193
quantification, 71-76, 89-100, 113-114; see infinite, 209
also EG, El, UG, UI selection, 198
recursive, 216, 283, 293 successor, 191, 192, 205
of replacement, 39-40, 248, 271, 301 transitive, 194
of thumb, 42, 159 well ordered, 203
Russell, Bertrand, 78, 146, 147, 149, 153, Set Theory, see Chapter 7
179, 181, 182, 192, 194, 195, 198, 199n., as a first-order theory, 331-332
204, 211, 260, 348, 350n., 353 Shannon, Claude E., 341n.
Russell paradox, 152-153, 179, 180 Sheffer, II. M., 281n., 282n.
Shoes and socks illustration, 198-199
s Shotwell, Jennifer M., ix
Shukla, Anjan, 266n.
Saccheri, Gerolami, 161 Sigma, 337-338
Samuel, 15 Similar, 203
Sanford, David H., 9n. Simons, Leo, 333n.
Satisfiable, 321 Simp., 34
simultaneously, 324 ' Simple
Satisfy, 179-180, 321, 332 class terms, 172
Saul, 15 order, 196, 203
Scharle, Thomas W., 266n., 283n. propositional functions, 79n.
Schema, 180, 207, 223, 225 statement, 8-9, 78
Scherer, Donald, 60n. theory of types, 153-154; see also
Schdnfinkel, Moses, 81n. Appendix C
Science Simplification, principle of, 34, 41, 42, 90,
of geometry, 158-164, 166, 259-260 102, 249
as organized knowledge, 157 Simultaneously satisfiable, 324
of physics, 157, 163 Singleton, 182
of reasoning, 1 Singly general proposition, 83
Scope Singulary proposition, 63-64, 84, 112
of an assumption, 57-59, 73, 75, 96, 99- Singular function, 220
100 Size, 195, 196
indicator, 149 Skolem, Thoralf A., 179n., 332
of operator symbols, 219, 292 normal form, 315
of a quantifier, 67, 84, 86-87, 111, 311 paradox, 332
Scott, Kenneth J., x Smaller than, 189
Scott, Sir Walter, 140, 146 Smith, Ronald C., ix
Secondary occurrence, 149 Socrates, 3, 63, 64, 71, 75, 95, 116, 117, 118,
Second-order 151
396 Index
Solomon, Song of, 15 199n„ 212
Sound, 5 Syllogism * •
Specific form categorical, 174
of an argument, 21, 25, 34, 48 disjunctive, 11-12, 16, 20-21, 33, 34, 102,
of a statement, 27 249
Spinoza, 159 hypothetical, 22-24, 33, 34, 243-244, 249,
Square of opposition, 66, 68, 304 335
Statement, 2, 5; see also Proposition Symbols, 6
arithmetical, 145 advantages of, 6-7
component, 8 operator, 215
conditional, 16-J8, 29-30 primitive, 215
contingent, 28, 62 propositional, 215
contradictory, 28, 62 quantifier, see Chapter 4
exceptive, 143 relational, see Chapter 5
form, 27-30 truth-functional, see Chapter 2
hypothetical, 16 uninterpreted, 163, 215
implicative, 16 Symmetrical relations, 134
simple and compound, 8-9, 78, 218 Syntactical, 168-169, 213
specific form of a, 27 formula, 227
tautologous, 27, 62 variable, 244
truth-functional component, 9 Syntax language, 213-214, 227
truth-fiinctionally compound, 9 System
variable, 20 deductive, see Chapter 6
Stoll, Robert R., ix, 176n., 182, 195n., 212 logistic, 166-169, 223; see also Chapters 8,
Strengthened rule of conditional proof, 9, 10, passim
56-60
Stroke function, 281-282
Strong T
disjunction, 11, 218
induction, 225-226 Table
Subclass, 173 three-valued, 232-236, 263, 334
Subcontraries, 66, 68 four-valued, 264
Subject term, 63, 84, 170 six-valued, 265
Subject-predicate propositions, 22, 66-69, truth, 10, 17, 19-24, 27-30, 32, 41, 48-49,
170 61-62, 108, 220-225, 253-254, 257
Subset, 178, 180, 185, 322, 325 Tapscott, Bangs 1.., ix
Axiom, 180 Tarski, A., 353n.
proper, 178 Taut., 40
Substitution, 42, 85, 176 Tautologous, 27-28, 55, 61-62
different from replacement, 42 Tautology, 27-28, 55, 232, 275-276, 295,
instance, 20, 27, 33-34, 04, 67, 71, 84-86, 296, 319, 342
90 principles of, 40, 171, 250, 338
rule for functional variables, 317n. proof of, 54-55, 61-62
Successor, 191, 192, 205 and validity, 29-30, 49-50, 54-55, 343
such that, 179 Temporal words, 126
Sum Terms, 63-64, 157-158, 166, 170, 172
cardinal, 196 defined, 154-155
logical, 170, 252 logical, 167
ordinal, 205-206 predicate, 63, 170
set-theoretic, 170, 183, 192, 196 primitive, 166
Summand, 252, 255 relative, 214
Superlative, 148 semantical, 349-350
Suppes, Patrick, 179n., 180n., 182n., 191n., simple class, 172
Index 397
subject, 63, 170 u
undefined, 159-160, 164, 166, 293
Ternary UG, 72-73, 75, 90, 99-100, 114, 130-131,
functions, 222 307-308
relations, 117 UI, 71, 75, 90, 93-94, 113, 130-131, 307
Tetradic relations, 117 Unary function, 220
The, 132, 146-149 Undefined terms, 159-160, 164, 166, 293
Theorem, 229, see also Chapter 6 Union, 170, 183
Thing, 63 Axiom, 184
Thomas, Norman L., ix Unit set, 182
Three-valued Universal
logic, 259 affirmative, 66-68
tables, 232-236, 263, 334 class, 172, 175
Tilde, 10 generalization, 72-73, 75, 90, 99-100; see
Timothy, 71 also UG
Todhunter, 159n. instantiation, 71, 75, 90, 93-94, 146, 148;
Tollens, see Modus Tollens see also UI
Total order, 196, 203 negative, 66-68
Totally reflexive relation, 135 proposition, 66-67
Townsend, Burke, ix quantification, 65, 85-86, 345n.
Trans., 40 quantifier, 65, 293
Transfinite set, 180
induction, 204 Universe
numbers, 205, 208, 209 possible nonempty, 78-81, 319-321
Transitive proof of its nonexistence, 180
relations, 135, 202 Unless, 13
set, 194 Unordered, 182
Translation, 69, 124-127
Transposition, principle of, 40, 174, 241,
242, 250 V
Triadic, see Ternary
Truth, 4-5, 320, 326-327 Valid, 3, 4n., 20, 21, 24-25, .33, 79, 80-81,
extrasystematic, 162 89-90, 169, 174, 321
formal, 27 categorical syllogisms, 321
function, 220-223 and tautologous, 29-30, 49-50, 55, 343
-functional component, 9 Validity, 45, 63, 321
-functional connective, 9, 13 conditional proof of, 49-52, 54-55, 56-60,
functionally compound statement, 9, 14, 250, 298, 307
39 formal nature of, 20
hereditary, 231-232, 333 formal proof of, 32-34, 39-42, 50, 59, 89-
logical, 79n., 108-115, 120, 319, 321 90, 228
necessary, 27 indirect proof of, 52-54, 55, 59-60, 61-62,
self-evident, 160-161 250
table, 10, 17, 19-24, 27-30, 32, 41, 48-49, semantical and syntactical, 169, 213
61-62, 108, 220-225, 253-254, 257 strengthened conditional proof of, 56-60
unconditional, 55 and tautology, 55, 343
value, 9, 48-49, 61-62, 320, 326-327 Value, 188, £34
Turing, A. M., 279n. Van Heijenoort, Jean, 179n., 190n., 191 n.,
Turquette, A. R., 259 192n„ 198n., 209n., 331n.
Twain, Mark, 141 Variable
Two-valued logic, 259 apparent, S4n.
Type R, 315 attribute, 150
Types, logical, 153-154; see also Appendix C binding, 90, 111; see also EG, UG
Variable (cont.) Weak
bound occurrence, 84, 293 disjunction, 11, 176, 218
dangling, 87 induction, 223-^25'
free occurrence, 84, 85, 111-112, 180, 294, Wedge symbol, 11-12, 218, 219, 261, 292
320 Well formed
freeing, 90; see also El, U1 formula, 167-169, 215-226, 261, 283, 293,
functional, 317n. 331
independent, 220 part, 293, 311
individual, 64, 84-85, 291 Wetterau, Elaine, x
interchange of bound, 307 Whitehead, Alfred North, 7, 153n., 182n.,
predicate, 150, 291 260, 348
propositional, 216, 290 Wiener, N., 186n.
real, 84n. Wilder, Raymond L., 190n., 204n.
relation, 150 Williams, Frank, ix
statement, 20, 290 Wittgenstein, L., 350n.
syntactical, 292
Vee symbol, 11-12, 218, 219, 261, 292
Vel, 11
Vicious circle, 158, 160
Y
principle, 348
Yields symbol, 229, 242, 262, 283, 294-295
Von Neumann, John, 192, 194, 205n.
Yuting, S., 21 In.
z
w
ZF, 176-212
wff, 168-169, 215-226, 234, 261, 283, 293, Zermelo, Ernst, 153n., 179, 180, 181, 192,
331 198, 209
closed, 314, 322 Zerinelo-Fraenkel Set Theory, vii, 176-212
Wajsberg, 283n. Zero, 191, 192, 193
Wang, Hao, 191n., 328n. Zinov’ev, A. A., 260n.