Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
At this point, we’ve talked a lot about mathematical logic, arguments, and some common terminology. In this section, we introduce a method of providing proof for propositions. This will be the most straight-forward technique we have at our disposal.
Consider a statement such as
p → q.
How would we be able to show that implication to always be true? In other words, how do we show that
p ⟹ q?
Suppose that we (somehow) knew that
p ⟹ r
r ⟹ q.
This basically reduces down to the Law of the Syllogism. Since we have p ⟹ r, we know that the proposition p → r is always true. Similarly, we know that the proposition r → q is always true. Hence:
p | |
p → r | |
r → q | |
∴ | q |
The argument
p | |
p → r | |
r → q | |
∴ | p → q |
is the basis for any direct proof.
Figure 2.9.1: Any proof based on this style of argument is called a direct proof.
This proof strategy is called a direct proof because the proposition we’re trying to show has p as the hypothesis and q as the conclusion. The first premise has p as the hypothesis, and the last premise has q as the conclusion. We’re essentially chaining a bunch of implications together to get from p to q.
Let’s break down a simple example of a proof that can be supplied for a mathematical proposition. Since we’re providing a proof for this mathematical proposition, we can call the proposition a theorem.
Consider the following statement:
If n is an even integer, then n + 1 is an odd integer.
First, let’s make clear what our universe is. Because we have an implication where the hypothesis and conclusion are talking about integers, our universe of discourse will be the collection of all integers, which is commonly denoted by ℤ (we are not using the general symbol 𝒰 for this universe since the collection of integers has a pretty well-established symbol at this point in time).
Implicit as usual is the universal quantifier in the implication, so the proposition
If n is an even integer, then n + 1 is an odd integer
can be rewritten as
∀n [e(n) → o(n + 1)]
Also worth noting is that, as usual, we didn’t write ∀n ∈ ℤ [e(n) → o(n + 1)] because it should be clear from the problem’s context that we are only considering integers, and so when we choose a value for n, we will choose integer numbers.
So, how do we show that ∀n [e(n) → o(n + 1)] is always true? In other words, how do we show that
e(n) ⟹ o(n + 1)?
Something we can try is to appeal to the definition of even number and odd number. Since we are asserting that n is an even integer (if n were not an even integer, then the above implication is trivially true,) we know that there exists some other integer k such that
n = 2k.
But wait, if n is equal to 2k, then n + 1 would have to be equal to 2k + 1. This is always true because we can substitute 2k for n whenever we see the letter n in an equation.
Now notice that we have 2k + 1, which satisfies the definition of an odd integer. Thus, since n + 1 is equal to 2k + 1, and 2k + 1 is an odd integer, then n + 1 is an odd integer. We don’t necessarily know what particular integer k is, but we also don’t know what particular integer n is either, except that n must be even. Thus, everything we did for n was applicable to all even integers, meaning anytime we add one to an even integer, we get an odd integer. This completes our proof, and so we have a theorem. Whenever n is an even integer, n + 1 must be an odd integer.
So, how would we write this argument out in mathematical logic notation? First, let’s define some open statements. We can pick out four open statements in particular:
e(n): | n is even |
o(n): | n is odd |
p(n): | ∃k [n = 2k] |
q(n): | ∃k [n = 2k + 1] |
Note that the n used in defining the open statements is not referring to the n used throughout this proof. The letter n is being used in the open statements to show what happens when a number is substituted in for n. It is a common occurrence in mathematics to reuse certain letters for different purposes. Usually, we strive to pick different letters, but that can also be a hassle. It should be clear from context which variables are being used, but it is something to get used to. For example, we have that
e(x): | x is even |
o(α): | α is odd |
p(n0): | ∃k [n0 = 2k] |
q(m + 1): | ∃k [m + 1 = 2k + 1] |
The statement we’re trying to prove is
∀n [e(n) → o(n + 1)].
When trying to write out our argument in tabular form, we need some premises. One premise we can assume is e(n0) where n0 is a specific integer that is arbitrarily chosen. This is because as discussed in the section on Universal Generalization, there’s no point in working with ∀n [e(n) → o(n + 1)] if we’re working with some integer n0 where e(n0) = 0 because then the implication e(n0) → o(n0 + 1) is trivially true. We want to make sure that when e(n0) is true, then so is o(n0 + 1), and so any situation where e(n0) is false is pointless. Hence, we have the argument
∀n [e(n)] | |
∴ | ∀n [o(n + 1)] |
Now, we’ll validate the above argument using the rules of inference, along with the definitions of even integer and odd integer:
Step | Proposition | Reason |
---|---|---|
(1) | ∀n [e(n) ↔ p(n)] | Definition of Even Integer |
(2) | e(n0) ↔ p(n0) | Universal Specification on (1) |
(3) | ((e(n0) → p(n0)) ∧ (p(n0) → e(n0)) | (p ↔ q) ⟺ (p → q ∧ q → p) |
(4) | e(n0) → p(n0) | Conjunctive Simplification on (3) |
(5) | e(n0) | Assumed Premise |
(6) | p(n0) | Modus Ponens on (4) and (5) |
(7) | ∀n [p(n) → q(n + 1)] | (n = 2k) ⟺ (n + 1 = 2k + 1) for all integers n |
(8) | p(n0) → q(n0 + 1) | Universal Specification on (7) |
(9) | q(n0 + 1) | Modus Ponens on (6) and (8) |
(10) | ∀n [o(n) ↔ q(n)] | Definition of Odd Integer |
(11) | o(n0 + 1) ↔ q(n0 + 1) | Universal Specification on (10) |
(12) | (o(n0 + 1) → q(n0 + 1)) ∧ (q(n0 + 1) → o(n0 + 1)) | (p ↔ q) ⟺ (p → q ∧ q → p) |
(13) | q(n0 + 1) → o(n0 + 1) | Conjunctive Simplification on (12) |
(14) | o(n0 + 1) | Modus Ponens on (9) and (13) |
(15) | ∴ ∀n [e(n) → o(n + 1)] | Universal Generalization on (5) and (14) |
There are a couple of things to observe about this argument. The reasons listed for steps (3), (7), and (12) use the letters p, q, n, and k. Note that in the given reasons, the letters do not necessarily refer to the exact same variables being used in the propositions. The letters in the reasons are just general placeholders. We could have used other letters such as w, x, y, and z. However, letters n and k are very commonly used to represent integers, and letters e and o seem convenient to denote even and odd integers respectively.
In steps (3) and (12), we made use of the logical equivalence between a biconditional and the conjunction of two implications:
x ↔ y ⟺ (x → y) ∧ (y → x)
Remember, logical equivalencies can be used as reasons in arguments. This is because in any logical expression, two propositions that are logically equivalent can always be substituted for one another.
The reason given in step (7) is a result of basic algebra. Remember if any two numbers are equal to each other, then we can add one to both numbers, and those two new numbers will also be equal to each other. The truth of this result is near-axiomatic. For example, we obviously have that
5 = 5
We can’t just add 1 to only a single side of the equation:
5 + 1 ≠ 5 (6 ≠ 5)
We must add 1 to both sides in order for the equation to be true:
5 + 1 = 5 + 1 (6 = 6)
We’ll re-write this proof in a more conventional style later in this section where we won’t explicitly point out this fact. We only listed this reason for the sake of total completeness in the tabular argument provided above. Usually in a conventionally written proof, these kinds of obvious facts are left out of the detail because most readers will just assume this knowledge to be true (in further, more deep and nuanced study of mathematical logic, these facts are not assumed, but must also be proven, though that is well beyond the scope of this book). Usually, basic arithmetic and algebra doesn’t need explicit mention, though trickier and more careful arithmetic and algebra should be fully worked out.
One last thing to note is that when we invoked the rule of Universal Specification in steps (2), (8), and (11), along with the assumed premise in step (5), we used the same placeholder n0. In this validation, we are using the exact same arbitrarily chosen integer in all steps because we want to show that when an integer is even, the integer that is one more is an odd integer. Hence, we must do all of our manipulations on the exact same number. If we used different numbers throughout the argument, then we wouldn’t know that the result holds.
And there we have it, a fully worked out proof for a simple result. Whenever we want to provide a proof for a theorem, we can always carefully write out several propositions and layout a formal argument as we did above. Most of the time, proof-writing will use a mixture of English language, arithmetic, and algebraic manipulation written out in a paragraph format. We showed how to write out a proof using formal logic because if we’re ever unsure of the correctness of a given proof, we can use the tools of mathematical logic for analysis.
From here on out, fully written out tabular arguments will be rarely presented, if at all. Instead, we will write out proofs for theorems using a more conventional paragraph style.
Let’s rewrite the above, very lengthy proof in a more conventional style.
If n is even, then n + 1 is odd.
Because n is even, we know that there exists some integer, which we’ll call k, such that
n = 2k
Adding 1 to both sides yields the equation
n + 1 = 2k + 1.
But 2k + 1 is an odd integer by definition, and since n + 1 = 2k + 1, we must have that n + 1 is odd as desired.
The proof given is much more compact and much easier to follow. Again, if we’re ever unsure if a given proof is correct, we can write out the open statements, refer to the rules of inference, and write out an argument in tabular form.
As noted in this style guide, we will present theorems and their proofs in specially marked magenta boxes. If you wish to try and provide a proof before seeing one (always an excellent learning exercise,) you can leave the proof hidden until you are ready to see one. This is the preferred style for these books. Other books and authors have their own styles for presenting proofs in books and papers.
Our next few theorems expand on the idea of even and odd numbers.
If n is odd, then n + 1 is even.
Because n is odd, there exists some integer k such that
n = 2k + 1.
Adding 1 to both sides yields the equation
n + 1 = 2k + 1 + 1 = 2k + 2.
Notice that a factor of 2 can be brought out on the right-hand side of the equation:
n + 1 = 2(k + 1).
But k + 1 is an integer, which we can refer to as ℓ. Thus, there exists some integer ℓ such that
n + 1 = 2ℓ.
This means that n + 1 is even as desired.
If n is even, then n + 2 is even.
By Theorem 2.9.1, we know that since n is even, n + 1 is odd. Then by Theorem 2.9.2, (n + 1) + 1 must be even.
Notice that (n + 1) + 1 = n + 2, meaning n + 2 is even as desired.
If n is odd, then n + 2 is odd.
This result is proved using nearly identical logic as that of Theorem 2.9.3.
Let’s quickly discuss Theorem 2.9.1 and Theorem 2.9.2. These results seem obvious to anyone with a high school education, but they give us a chance to practice writing proofs using easy results. This means we can easily check that the logic used in the proofs is valid. The main tool we used in the proof were the definitions of even integer and odd integer. Of course, there was a little bit of algebra involved too, but not much. We just had to make sure that the relevant equations remained balanced after adding 1 to n.
Theorem 2.9.3 had an interesting proof. There, we used two previous theorems. This is completely valid, since those are theorems (meaning they are already proven to be true). Hence, since they are true, we can use those as facts in our proof.
Finally, the proof given for Theorem 2.9.4 is perhaps the simplest proof given so far. Of course, we could just invoke Theorem 2.9.4 first on n, and then invoke Theorem 2.9.3 on n + 1 to get the desired result, but that seems like a minor enough detail that we can almost simply copy and paste the proof from Theorem 2.9.3. Sometimes this is warranted, other times there may be enough differences to necessitate writing out an original proof.
Let’s expand our understanding of even and odd numbers even further!
If m and n are both even, then m + n is even.
Since m is even, we know there exists some integer a such that
m = 2a.
Likewise, there is some integer b such that
n = 2b.
Thus, for m + n, we have that
m + n = 2a + 2b = 2(a + b).
But since a and b are integers, the number a + b must also be an integer, which we’ll refer to as c, hence we have that
m + n = 2c,
which satisfies the definition of even.
Thus, m + n is even as desired.
If m is even, and n is odd, then m + n is odd.
We know that m is even and n is odd, therefore, there exists integers a and b such that
m = 2a
n = 2b + 1.
Adding m and n together yields
m + n = 2a + 2b + 1 = 2(a + b) + 1.
Notice that since a and b are integers, the number a + b is also an integer, which we’ll call c. Thus, we have that
m + n = 2c + 1,
meaning m + n satisfies the definition of an odd integer. Hence, m + n is odd as desired.
If m and n are both odd, then m + n is even.
Since m and n are both odd, we know there exists some integers a and b such that
m = 2a + 1
n = 2b + 1.
Again, adding m and n together yields the equation
m + n | = | 2a + 1 + 2b + 1 |
= | 2a + 2b + 2 | |
= | 2(a + b + 1) |
In the last step, we simply factored out a 2 from all three terms in the sum.
Notice that since a, b, and 1 are all integers, we have that a + b + 1 is an integer, which we can simply refer to as c, but notice that this means
m + n = 2c
which means m + n is even as desired.
It’s usually a good idea to play around with a theorem to see how it can work in practice, so let’s go over a quick example.
Both 6 and 18 are even (6 = 2*3 and 18 = 2*9), and adding 6 and 18 together yields
6 + 18 | = | 24 |
= | 2*12 |
So, the sum of 6 and 18, two even integers, is also even.
We have that -3 is odd and 98 is even, so according to Theorem 2.9.6, the sum should be odd:
-3 + 98 | = | 95 |
= | 94 + 1 | |
= | 2*47 + 1 |
So, the sum of -3 and 98 is odd as predicted by Theorem 2.9.6.
Both -17 and 1983 are odd, so by Theorem 2.9.7, the sum of these two numbers should be even:
-17 + 1983 | = | 1966 |
= | 2*983 |
So, Theorem 2.9.7 comes through for us as expected.
The next proof involves a product instead of a sum.
If m and n are both odd, then mn is odd.
Since m and n are both odd, we know there exists integers a and b such that
m = 2a + 1
n = 2b + 1.
As such, when we multiply m and n together, we get the following:
mn | = | (2a + 1)(2b + 1) |
= | 2a*2b + 2z*1 + 1*2b + 1*1 | |
= | 4ab + 2a + 2b + 1 | |
= | 2(2ab + a + b) + 1 | |
= | 2c + 1 |
Because there exists an integer c such that mn = 2c + 1, mn is odd by definition as desired.
Notice that in the last step in the chain of equations in the proof for Theorem 2.9.8, we simply replaced the quantity (2ab + a + b) with the single letter c. Since 2, a, and b are integers, we also know that the quantity (2ab + a + b) must also be an integer. We can just use the letter c to refer to that integer, which makes the subsequent steps a bit easier.
Just like we can always make substitutions in algebra, we can also make substitutions for algebraic and arithmetic expressions in proofs. We should just make clear what substitution is being used.
Let’s explore some more theorems and proofs that involve square numbers along with even and odd numbers.
If n is even, then n2 is even.
Because n is even, there’s some integer k such that
n = 2k.
Thus, squaring n yields the following equation:
n2 | = | (2k)2 |
= | 22k2 | |
= | 4k2 | |
= | 2*2k2 |
Since k and 2 are both integers, we know that 2k2 is also an integer, which we can just refer to as k0. This means that we have
n2 = 2k0
meaning n2 is an even integer as desired.
The algebra in the above proof can be handled however we feel comfortable. Simple steps can be omitted or combined as desired, but again, anything tricky should be clearly laid out.
If n is odd, then n2 is odd.
Because n is odd, we have that
n2 = n*n,
which is a product of two odd integers.
Theorem 2.9.8 thus guarantees that n*n is odd, and so n2 is odd as desired.
Theorem 2.9.10 provides another demonstration of how previous theorems can be used in the proofs of other theorems.
The next theorem’s proof has some tricky algebra, and as such, the algebra and arithmetic will be more clearly laid out. The next theorem is actually just the converse of Theorem 2.9.10. Remember, just because a theorem is true, does not mean its converse is automatically true. A proposition is not necessarily logically equivalent to its converse.
If n2 is odd, then n is odd.
Since n2 is odd, we know there exists some integer k such that
n2 = 2k + 1.
We can move the 1 to the left-hand side of the equation:
n2 – 1 = 2k.
We have a difference of squares (since 1 = 12) meaning we can factor the left-hand side of the equation as follows:
(n – 1)(n+1) = 2k.
Notice that (n – 1) + 2 = (n + 1) meaning (n – 1) and (n + 1) are either both even or are both odd.
However, since (n – 1)(n + 1) = 2k, we have that whatever integer (n – 1)(n + 1) equals, it must be even, meaning at least one of (n – 1) and (n + 1) are even, but since both must be even or both must be odd, we have that both (n – 1) and (n + 1) are even.
Since (n – 1) is even, and (n – 1) + 1 = n, Theorem 2.9.1 guarantees that n must be odd.
Hence, n is odd as desired.
The proof for Theorem 2.9.11 required a common algebraic trick (factoring a difference of squares), but just because it is common, does not mean it’s an obvious step for the proof. It also required knowing some more about even and odd integers than what we showed here, though those results are not too hard to prove. Furthermore, why that algebraic trick? Why not one of the other myriad of tricks that are available to us?
If we’re lucky or clever enough to squint at the problem just right, we may have a flash of insight on how to proceed as we did. But something else we could have tried doing is taking the square root of both sides of the first equation in the proof:
n = √(2k + 1)
What exactly do we do with that? Nothing seems like a good next step, so we’re stuck here. Yeah, we have n on the left-hand side, which is good, because we’re trying to show that n satisfies some property. In general, it’s actually a good strategy in proofs to try and express the quantity of interest in an equation where it is isolated on one side of the equation. However, in this particular problem, it leaves us stuck with no way to proceed. Perhaps if we’re really good at handling roots in equations, then we could proceed, but nothing comes to mind immediately.
Sometimes, we’ll have no choice but to try and deal with these sticky algebraic expressions, but other times, we can sidestep a lot of algebra. As a matter of fact, the use of clever logic frees us from having to rely on clever algebra, which may be too clever for us to grasp in a timely manner. One such logical strategy is the subject of the next section.