An-Introductory-Course-in-Elementary-Number-Theory (Raji)
An-Introductory-Course-in-Elementary-Number-Theory (Raji)
Number Theory
Wissam Raji
2
Preface
These notes serve as course notes for an undergraduate course in number the-
ory. Most if not all universities worldwide offer introductory courses in number
theory for math majors and in many cases as an elective course.
The notes contain a useful introduction to important topics that need to be ad-
dressed in a course in number theory. Proofs of basic theorems are presented in
an interesting and comprehensive way that can be read and understood even by
non-majors with the exception in the last three chapters where a background in
analysis, measure theory and abstract algebra is required. The exercises are care-
fully chosen to broaden the understanding of the concepts. Moreover, these notes
shed light on analytic number theory, a subject that is rarely seen or approached
by undergraduate students. One of the unique characteristics of these notes is the
careful choice of topics and its importance in the theory of numbers. The freedom
is given in the last two chapters because of the advanced nature of the topics that
are presented.
Thanks to professor Pavel Guerzhoy from University of Hawaii for his contri-
bution in chapter 6 on continued fraction and to Professor Ramez Maalouf from
Notre Dame University, Lebanon for his contribution to chapter 8.
Contents
1 Introduction 7
1.1 Algebraic Operations With Integers . . . . . . . . . . . . . . . . 8
1.2 The Well Ordering Principle and Mathematical Induction . . . . . 11
1.2.1 The Well Ordering Principle . . . . . . . . . . . . . . . 11
1.2.2 The Pigeonhole Principle . . . . . . . . . . . . . . . . . 11
1.2.3 The Principle of Mathematical Induction . . . . . . . . 12
1.3 Divisibility and the Division Algorithm . . . . . . . . . . . . . . 14
1.3.1 Integer Divisibility . . . . . . . . . . . . . . . . . . . . . 15
1.3.2 The Division Algorithm . . . . . . . . . . . . . . . . . . 16
1.4 Representations of Integers in Different Bases . . . . . . . . . . . 18
1.5 The Greatest Common Divisor . . . . . . . . . . . . . . . . . . . 21
1.6 The Euclidean Algorithm . . . . . . . . . . . . . . . . . . . . . . 25
1.7 Lame’s Theorem and Binet’s Formula . . . . . . . . . . . . . . . 29
1.7.1 Lame’s Theorem . . . . . . . . . . . . . . . . . . . . . . 29
1.7.2 Binet’s Formula . . . . . . . . . . . . . . . . . . . . . . . 31
2 Prime Numbers 35
2.1 The Sieve of Eratosthenes . . . . . . . . . . . . . . . . . . . . . . 35
2.2 Alternate definition of prime number . . . . . . . . . . . . . . . . 38
2.3 The infinitude of Primes . . . . . . . . . . . . . . . . . . . . . . 39
2.4 The Fundamental Theorem of Arithmetic . . . . . . . . . . . . . 41
3
4 CONTENTS
3 Classical questions 57
3.1 Geometric numbers . . . . . . . . . . . . . . . . . . . . . . . . . 57
3.2 Irrational numbers . . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.3 Gaussian integers . . . . . . . . . . . . . . . . . . . . . . . . . . 61
3.3.1 Ring properties of Z[i] . . . . . . . . . . . . . . . . . . . 61
3.3.2 Division . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
3.3.3 Primality . . . . . . . . . . . . . . . . . . . . . . . . . . 68
3.4 Algebraic and transcendental numbers . . . . . . . . . . . . . . . 70
3.4.1 The algebraic numbers form a ring . . . . . . . . . . . . . 70
3.4.2 Liouville’s number . . . . . . . . . . . . . . . . . . . . . 72
4 Congruences 77
4.1 Introduction to congruences . . . . . . . . . . . . . . . . . . . . 77
4.2 Residue Systems and Euler’s φ-Function . . . . . . . . . . . . . . 84
4.2.1 Residue Systems . . . . . . . . . . . . . . . . . . . . . . 84
4.2.2 Euler’s φ-Function . . . . . . . . . . . . . . . . . . . . . 86
4.3 Linear Congruences . . . . . . . . . . . . . . . . . . . . . . . . . 87
4.4 The Chinese Remainder Theorem . . . . . . . . . . . . . . . . . 89
4.4.1 Direct solution . . . . . . . . . . . . . . . . . . . . . . . 89
4.4.2 Incremental solution . . . . . . . . . . . . . . . . . . . . 91
4.5 Field properties of residues, primality . . . . . . . . . . . . . . . 93
CONTENTS 5
Introduction
Integers are the building blocks of the theory of numbers. This chapter contains
somewhat very simple and obvious observations starting with properties of inte-
gers and yet the proofs behind those observations are not as simple. In this chapter
we introduce basic operations on integers and some algebraic definitions that will
be necessary to understand basic concepts in this book. We then introduce the
Well ordering principle which states basically that every set of positive integers
has a smallest element. Proof by induction is also presented as an efficient method
for proving several theorems throughout the book. We proceed to define the con-
cept of divisibility and the division algorithm. We then introduce the elementary
but fundamental concept of a greatest common divisor (gcd) of two integers, and
the Euclidean algorithm for finding the gcd of two integers. We end this chap-
ter with Lame’s Lemma on an estimate of the number of steps in the Euclidean
algorithm needed to find the gcd of two integers.
7
8 CHAPTER 1. INTRODUCTION
a+b=b+a
a·b=b·a
(a + b) + c = a + (b + c)
(a · b) · c = a · (b · c)
a · (b + c) = a · b + a · c.
1.1. ALGEBRAIC OPERATIONS WITH INTEGERS 9
In the set Z there are ”identity elements” for the two operations + and ·, and these
are the elements 0 and 1 respectively, that satisfy the basic properties
a+0=0+a=a
a·1=1·a=a
for every a ∈ Z.
The set Z allows additive inverses for its elements, in the sense that for every
a ∈ Z there exists another integer in Z, denoted by −a, such that
a + (−a) = 0. (1.3)
While for multiplication, only the integer 1 has a multiplicative inverse in the
sense that 1 is the only integer a such that there exists another integer, denoted by
a−1 or by 1/a, (namely 1 itself in this case) such that
a · a−1 = 1. (1.4)
From the operations of addition and multiplication one can define two other
operations on Z, namely subtraction (denoted by −) and division (denoted by
/). Subtraction is a binary operation on Z, i.e. defined for any two integers in Z,
while division is not a binary operation and thus is defined only for some specific
couple of integers in Z. Subtraction and division are defined as follows:
Some of the sets we study in Number Theory contain all these properties. Let
R be a set, with two operations called addition and multiplication. We say that R
is a ring if
• addition satisfies
10 CHAPTER 1. INTRODUCTION
while
• multiplication satisfies
The Well Ordering Principle: A least element exist in any non empty set of pos-
itive integers.
This principle can be taken as an axiom on integers and it will be the key to
proving many theorems. As a result, we see that any set of positive integers is
well ordered while the set of all integers is not well ordered.
The Pigeonhole Principle: If s objects are placed in k boxes for s > k, then at
least one box contains more than one object.
Proof. Suppose that none of the boxes contains more than one object. Then there
are at most k objects. This leads to a contradiction with the fact that there are s
objects for s > k.
12 CHAPTER 1. INTRODUCTION
We use the well ordering principle to prove the first principle of mathematical
induction
Proof. Let S be the set of positive integers containing the integer 1, and the integer
k + 1 whenever it contains k. Assume also that S is not the set of all positive
integers. As a result, there are some integers that are not contained in S and thus
those integers must have a least element α by the well ordering principle. Notice
that α 6= 1 since 1 ∈ S. But α − 1 ∈ S and thus using the property of S, α ∈ S.
Thus S must contain all positive integers.
and thus the the statement is true for n = 1. For the remaining inductive step,
suppose that the formula holds for n, that is nj=1 j = n(n+1)
P
2
. We show that
n+1
X (n + 1)(n + 2)
j= .
j=1
2
n! ≤ nn
Proof. Let T be a set of integers containing 1 and such that for every positive
integer k, if it contains 1, 2, ..., k, then it contains k + 1. Let S be the set of all
positive integers k such that all the positive integers less than or equal to k are in
T . Then 1 is in S, and we also see that k + 1 is in S. Thus S must be the set of
all positive integers. Thus T must be the set of all positive integers since S is a
subset of T .
Exercises
1. Prove using mathematical induction that n < 3n for all positive integers n.
Pn n(n+1)(2n+1)
2. Show that j=1 j2 = 6
.
Pn j−1 2
3. Use mathematical induction to prove that j=1 (−1) j = (−1)n−1 n(n+
1)/2.
Pn
4. Use mathematical induction to prove that j=1 j 3 = [n(n+1)/2]2 for every
positive integer n.
Pn
5. Use mathematical induction to prove that j=1 (2j − 1) = n2
Example 3. a) Note that any even integer has the form 2k for some integer k,
while any odd integer has the form 2k + 1 for some integer k. Thus 2|n if n is
even, while 2 - n if n is odd.
b) ∀a ∈ Z one has that a | 0.
c) If b ∈ Z is such that |b| < a, and b 6= 0, then a - b.
Proof. Since a | b and b | c, then there exist integers k1 and k2 such that b = k1 a
and c = k2 b. As a result, we have c = k1 k2 a and hence a | c.
The following theorem states that if an integer divides two other integers then
it divides any linear combination of these integers.
Proof. Since c | a and c | b, then by definition there exists k1 and k2 such that
a = k1 c and b = k2 c. Thus
a | b1 , a | b2 , ..., a | bn
then n
X
a| kj bj (1.6)
j=1
Theorem 5. The Division Algorithm If a and b are integers such that b > 0, then
there exist unique integers q and r such that a = bq + r where 0 ≤ r < b.
r > r − b = a − bq − b = a − b(q + 1) =≥ 0.
b(q1 − q2 ) + (r1 − r2 ) = 0.
As a result we have
b(q1 − q2 ) = r2 − r1 .
1.3. DIVISIBILITY AND THE DIVISION ALGORITHM 17
Exercises
2. Use the division algorithm to find the quotient and the remainder when 76
is divided by 13.
3. Use the division algorithm to find the quotient and the remainder when -100
is divided by 13.
4. Show that if a, b, c and d are integers with a and c nonzero, such that a | b
and c | d, then ac | bd.
6. Prove that the sum of two even integers is even, the sum of two odd integers
is even and the sum of an even integer and an odd integer is odd.
7. Show that the product of two even integers is even, the product of two odd
integers is odd and the product of an even integer and an odd integer is even.
10. Show that the square of any integer is of the form 3m or 3m + 1 but not of
the form 3m + 2.
18 CHAPTER 1. INTRODUCTION
In this section, we show how any positive integer can be written in terms of any
positive base integer expansion in a unique way. Normally we use decimal nota-
tion to represent integers, we will show how to convert an integer from decimal
notation into any other positive base integer notation and vise versa. Using the
decimal notation in daily life is simply better because we have ten fingers which
facilitates all the mathematical operations.
Notation An integer a written in base b expansion is denoted by (a)b .
Theorem 6. Let b be a positive integer with b > 1. Then any positive integer m
can be written uniquely as
m = bq0 + a0 , 0 ≤ a0 < b.
q0 = bq1 + a1 , 0 ≤ a1 < b.
1.4. REPRESENTATIONS OF INTEGERS IN DIFFERENT BASES 19
q1 = bq2 + a2 , 0 ≤ a2 < b,
.
.
.
ql−2 = bql−1 + al−1 , 0 ≤ al−1 < b,
ql−1 = b · 0 + al , 0 ≤ al < b.
Note that the sequence q0 , q1 , ... is a decreasing sequence of positive integers with
a last term ql that must be 0.
Now substituting the equation q0 = bq1 + a1 in m = bq0 + a0 , we get
m = b(bq1 + a1 ) + a0 = b2 q1 + a1 b + a0 ,
m = b 3 q 2 + a2 b 2 + a1 b + a0 ,
.
.
.
= bl ql−1 + al−1 bl−1 + ... + a1 b + a0 ,
= al bl + al−1 bl−1 + ... + a1 b + a0 .
What remains to prove is that the representation is unique. Suppose now that
where if the number of terms is different in one expansion, we add zero coeffi-
cients to make the number of terms agree. Subtracting the two expansions, we
get
If the two expansions are different, then there exists 0 ≤ j ≤ l such that cj 6= aj .
As a result, we get
We now get
aj − cj = (al − cl )bl−j + ... + (aj+1 − cj+1 )b,
we do the following
214 = 3 · 71 + 1
71 = 3 · 23 + 2
23 = 3 · 7 + 2
7 = 3·2+1
2 = 3·0+2
Example 7. To find the base 10 expansion, i.e. the decimal expansion, of (364)7 :
Exercises
Definition 2. The greatest common divisor of two integers a and b is the greatest
integer that divides both a and b.
We denote the greatest common divisor of two integers a and b by (a, b). We
also define (0, 0) = 0.
There are couples of integers (e.g. 3 and 4, etc...) whose greatest common
divisor is 1 so we call such integers relatively prime integers.
Example 9. The greatest common divisor of 9 and 16 is 1, thus they are relatively
prime.
Note that every integer has positive and negative divisors. If a is a positive
divisor of m, then −a is also a divisor of m. Therefore by our definition of the
greatest common divisor, we can see that (a, b) = (| a |, | b |).
We now present a theorem about the greatest common divisor of two integers.
The theorem states that if we divide two integers by their greatest common divisor,
then the outcome is a couple of integers that are relatively prime.
Proof. We will show that a/d and b/d have no common positive divisors other
than 1. Assume that k is a positive common divisor such that k | a/d and k | b/d.
As a result, there are two positive integers m and n such that
The next theorem shows that the greatest common divisor of two integers does
not change when we add a multiple of one of the two integers to the other.
Proof. We will show that every divisor of a and b is also a divisor of a + cb and
b and vise versa. Hence they have exactly the same divisors. So we get that the
greatest common divisor of a and b will also be the greatest common divisor of
a + cb and b. Let k be a common divisor of a and b. By Theorem 4, k | (a + cb)
and hence k is a divisor of a+cb. Now assume that l is a common divisor of a+cb
and b. Also by Theorem 4 we have ,
We now present a theorem which proves that the greatest common divisor of
two integers can be written as a linear combination of the two integers.
Theorem 9. The greatest common divisor of two integers a and b, not both 0 is
the least positive integer such that ma + nb = d for some integers m and n.
Proof. Assume without loss of generality that a and b are positive integers. Con-
sider the set of all positive integer linear combinations of a and b. This set is non
empty since a = 1 · a + 0 · b and b = 0 · a + 1 · b are both in this set. Thus this set
has a least element d by the well-ordering principle. Thus d = ma + nb for some
integers m and n. We have to prove that d divides both a and b and that it is the
greatest divisor of a and b.
By the division algorithm, we have
a = dq + r, 0 ≤ r < d.
24 CHAPTER 1. INTRODUCTION
Thus we have
Definition 4. Let a1 , a2 , ..., an be integers, not all 0. The greatest common divisor
of these integers is the largest integer that divides all of the integers in the set. The
greatest common divisor of a1 , a2 , ..., an is denoted by (a1 , a2 , ..., an ).
Example 11. The integers 3, 6, 7 are mutually relatively prime since (3, 6, 7) = 1
although (3, 6) = 3.
Definition 6. The integers a1 , a2 , ..., an are called pairwise prime if for each i 6= j,
we have (ai , aj ) = 1.
Example 12. The integers 3, 14, 25 are pairwise relatively prime. Notice also that
these integers are mutually relatively prime.
Notice that if a1 , a2 , ..., an are pairwise relatively prime then they are mutually
relatively prime.
1.6. THE EUCLIDEAN ALGORITHM 25
Exercises
6. Show that if m and n are integers such that (m, n) = 1, then (m+n,m-n)=1
or 2.
8. Show that if a and b are relatively prime integers, then (a+2b, 2a+b) = 1or
3.
9. Show that if a1 , a2 , ..., an are integers that are not all 0 and c is a positive
integer, then (ca1 , ca2 , ..., can ) = c(a1 , a2 , ...an ).
Lemma 1. If a and b are two integers and a = bq + r where also q and r are
integers, then (a, b) = (r, b).
The above lemma will lead to a more general version of it. We now present the
Euclidean algorithm in its general form. It states that the greatest common divisor
of two integers is the last non zero remainder of the successive division.
r0 = r1 q1 + r2 0 ≤ r2 < r1 ,
r1 = r2 q2 + r3 0 ≤ r3 < r2 ,
.
.
.
rn−2 = rn−1 qn−1 + rn 0 ≤ rn < rn−1 ,
rn−1 = rn qn .
Notice that, we will have a remainder of 0 eventually since all the remainders
are integers and every remainder in the next step is less than the remainder in the
previous one. By Lemma 1, we see that
Example 13. We will find the greatest common divisor of 4147 and 10672. Note
that
We now use the steps in the Euclidean algorithm to write the greatest common
divisor of two integers as a linear combination of the two integers. The following
example will actually determine the variables m and n described in Theorem 9.
The following algorithm can be described by a general form but for the sake of
simplicity of expressions we will present an example that shows the steps for
obtaining the greatest common divisor of two integers as a linear combination of
the two integers.
28 CHAPTER 1. INTRODUCTION
29 = 551 − 9 · 58,
= 551 − 9(609 − 551 · 1),
= 10.551 − 9.609,
= 10 · (1769 − 609 · 2) − 9 · 609,
= 10 · 1769 − 29 · 609,
= 10 · 1769 − 29(2378 − 1769 · 1),
= 39 · 1769 − 29 · 2378,
= 39(4147 − 2378 · 1) − 29 · 2378,
= 39 · 4147 − 68 · 2378,
= 39 · 4147 − 68(10672 − 4147 · 2),
= 175 · 4147 − 68 · 10672,
For any two integers a and b, we can always find integers m and n
such that gcd(a, b) = am + bn.
We can always find this expression by reversing the results of the Euclidean algo-
rithm.
Exercises
1. Use the Euclidean algorithm to find the greatest common divisor of 412 and
32 and express it in terms of the two integers.
2. Use the Euclidean algorithm to find the greatest common divisor of 780 and
150 and express it in terms of the two integers.
4. Let a and b be two positive even integers. Prove that (a, b) = 2(a/2, b/2).
5. Show that if a and b are positive integers where a is even and b is odd, then
(a, b) = (a/2, b).
Proof. We use the second principle of mathematical induction to prove our result.
It is easy to see that this is true for n = 3 and n = 4. Assume that αk−2 < fk
30 CHAPTER 1. INTRODUCTION
Theorem 11. using the Euclidean algorithm to find the greatest common divisor
of two positive integers has number of divisions less than or equal five times the
number of decimal digits in the minimum of the two integers.
Proof. Let a and b be two positive integers where a > b. Applying the Euclidean
algorithm to find the greatest common divisor of two integers with a = r0 and
b = r1 , we get
r0 = r1 q1 + r2 0 ≤ r2 < r1 ,
r1 = r2 q2 + r3 0 ≤ r3 < r2 ,
.
.
.
rn−2 = rn−1 qn−1 + rn 0 ≤ rn < rn−1 ,
rn−1 = rn qn .
1.7. LAME’S THEOREM AND BINET’S FORMULA 31
Notice that each of the quotients q1 , q2 , ..., qn−1 are all greater than 1 and qn ≥ 2
and this is because rn < rn−1 . Thus we have
rn ≥ 1 = f2 ,
rn−1 ≥ 2rn ≥ 2f2 = f3 ,
rn−2 ≥ rn−1 + rn ≥ f3 + f2 = f4 ,
rn−3 ≥ rn−2 + rn−1 ≥ f4 + f3 = f5 ,
.
.
.
r2 ≥ r3 + r4 ≥ fn−1 + fn−2 = fn ,
b = r1 ≥ r2 + r3 ≥ fn + fn−1 = fn+1 .
Thus notice that b ≥ fn+1 . By Lemma 2, we have fn+1 > αn−1 for n > 2. As a
result, we have b > αn−1 . Now notice since
1
log10 α > ,
5
we see that
log10 b > (n − 1)/5.
Thus we have
n − 1 < 5 log10 b.
Now let b has k decimal digits. As a result, we have b < 10k and thus log10 b < k.
Hence we conclude that n − 1 < 5k. Since k is an integer, we conclude that
n ≤ 5k.
the sequence. An elegant technique gives us a concise formula for such relations,
and we illusrate this using the Fibonacci sequence.
In general, fn = fn−1 + fn−2 , and of course fn−1 = fn−1 , giving us the matrix
equation ! ! !
fn 1 1 fn−1
= .
fn−1 1 0 fn−2
Let’s give this 2 × 2 matrix a special name,
!
1 1
F = .
1 0
As usual, matrices carry more information than you might expect at first glance,
and the characteristic polynomial of this one has very interesting roots:
!
1−λ 1
0 = det
1 −λ
= −λ(1 − λ) − 1
= λ2 − λ − 1
⇓
p √
−(−1) ± (−1)2 − 4 · 1 · (−1) 1± 5
λ= = .
2·1 2
√
These results have several wonderful aspects. For instance, (1 + 5)/2
There is more! The whole point of any eigenvalue λ of F is that, for any eigen-
vector e,
F e = λe =⇒ F (e2 e1 )T = (λe2 λe1 )T .
The eigenvectors are a basis for the eigenspace, so any solution to F x = λx has
the form x = c1 e1 + c1 e2 , where c1 and c2 are arbitrary constants, while e1 and e2
√ √
are the eigenvectors corresponding to (1 + 5)/2 and (1 − 5)/2, respectively.
1.7. LAME’S THEOREM AND BINET’S FORMULA 33
(Notice that the columns of Q are the eigenvectors that correspond to the diagonal
elements of Λ, the eigenvalues.) By substitution,
Combine this with the relationship in (1.7), and we have the relationship
! √ !n−2 √ !
1+ 5 1+√ 5 √1
1 1 2
0 2 √5 5
fn = √
1− 5
√
1+ 5
√
1− 5 1−√ 5 1
f2
− 2 − 2 0 2
− 2 5
− √
5
where fn = (fn fn−1 )T and f2 = (f2 f1 )T = (1 1)T . The first row of the simplified
product yields a “closed” form relationship between fn , f1 , and f2 ,
√ !n−1 √ !n−1
1 1+ 5 1 1− 5
fn = √ −√
5 2 5 2
√ n−2
! √ !n−2
1 1+ 5 1 1− 5
+√ −√ .
5 2 5 2
Theorem 12 (Binet’s Formula). The nth Fibonacci number has the form
" √ !n √ !n #
1 1+ 5 1− 5
fn = √ − .
5 2 2
2. What happens when you add the squares of consecutive Fibonacci numbers?
That is, find a pattern to the sequence 12 + 12 , 12 + 22 , 22 + 32 , 32 + 52 , . . . .
Prove the property you find by induction. (One way to solve this requires
proving two claims simultaneously, by induction.)
3. Find an upper bound for the number of steps in the Euclidean algorithm that
is used to find the greatest common divisor of 38472 and 957748838.
4. Find an upper bound for the number of steps in the Euclidean algorithm that
is used to find the greatest common divisor of 15 and 75. Verify your result
by using the Euclidean algorithm to find the greatest common divisor of the
two integers.
5. Using a computational aid, test Binet’s formula for some large values of n.
Prime Numbers
Prime numbers, the building blocks of integers, have been studied extensively
over the centuries. Being able to present an integer uniquely as product of primes
is the main reason behind the whole theory of numbers and behind the interesting
results in this theory. Many interesting theorems, applications and conjectures
have been formulated based on the properties of prime numbers.
In this chapter, we present methods to determine whether a number is prime
or composite using an ancient Greek method invented by Eratosthenes. We also
show that there are infinitely many prime numbers. We then proceed to show that
every integer can be written uniquely as a product of primes.
We introduce as well the concept of diophantine equations where integer so-
lutions from given equations are determined using the greatest common divisor.
We then mention the Prime Number theorem without giving a proof of course in
addition to other conjectures and major results related to prime numbers.
35
36 CHAPTER 2. PRIME NUMBERS
Note that any integer greater than 1 that is not prime is said to be a composite
number.
Proof. We present the proof of this Lemma by contradiction. Suppose that there
is an integer greater than one that has no prime divisors. Since the set of integers
with elements greater than one with no prime divisors is nonempty, then by the
well ordering principle there is a least positive integer n greater than one that has
no prime divisors. Thus n is composite since n divides n. Hence
Notice that a < n and as a result since n is minimal, a must have a prime divisor
which will also be a divisor of n.
Theorem 13. If n is a composite integer, then n has a prime factor not exceeding
√
n.
Proof. Since n is composite, then n = ab, where a and b are integers with 1 <
√
a ≤ b < n. Suppose now that a > n, then
√
n<a≤b
and as a result
√ √
ab > n n = n.
2.1. THE SIEVE OF ERATOSTHENES 37
√
Therefore a ≤ n. Also, by Lemma 3, a must have a prime divisor a1 which is
√
also a prime divisor of n and thus this divisor is less than a1 ≤ a ≤ n.
We now present the algorithm of the Sieve of Eratosthenes that is used to de-
termine prime numbers up to a given integer.
1. Write a list of numbers from 2 to the largest number n you want to test.
Note that every composite integer less than n must have a prime factor less
√
than n. Hence you need to strike off the multiples of the primes that are
√
less than n
2. Strike off all multiples of 2 greater than 2 from the list . The first remaining
number in the list is a prime number.
4. Repeat the above steps until no more multiples are found of the prime inte-
√
gers that are less than n
Exercises
1. Use the Sieve of Eratosthenes to find all primes less than 100.
2. Use the Sieve of Eratosthenes to find all primes less than 200.
Definition 9. Let p be a positive integer, greater than 1. We say that p is prime if,
whenever p divides the product of two integers a and b, it also divides at least one
of a or b.
Definition 9 might not appeal to you: why would someone want to define
primality this way? To see why this definition is useful, consider the following
examples.
Example 16. For instance, 6 divides the product 2 · 3, but 6 divides neither 2 nor
3. Hence, 6 is not prime.
That example might not inspire you so much, so try this one on for size. We
know that 5 is prime. Suppose 5 divides the product of 2 and an integer m; since
5 is prime and it does not divide 2, it must divide m.
Definition 9 also has advantages when we apply the notion of a prime number
to other sets; we will look at that later. For now, though, we have to ask ourselves:
are Definitions 8 and 9 equivalent? After all, they say different things, so there
is a possibility that they classify different numbers as prime. That would cause
problems!
In fact, the two definitions are equivalent. To see this, let p, a, and b be positive
integers.
Assume first that p is irreducible; that is, whenever it factors as ab, either p = a
or p = b. We need to show that this implies Euclid’s criterion. By way of con-
tradiction, suppose there exist integers a and b such that p divides ab, but divides
neither a nor b. Choose positive a and b such that this product is minimized. By
Exercise 5, p ≤ ab. We consider two cases.
2.3. THE INFINITUDE OF PRIMES 39
Elements.
Proof. We present the proof by contradiction. Suppose there are finitely many
primes p1 , p2 , ..., pn , where n is a positive integer. Consider the integer Q such
that
Q = p1 p2 ...pn + 1.
By Lemma 3, Q has at least a prime divisor, say q. If we prove that q is not one
of the primes listed then we obtain a contradiction. Suppose now that q = pi for
1 ≤ i ≤ n. Thus q divides p1 p2 ...pn and as a result q divides Q − p1 p2 ...pn .
Therefore q divides 1. But this is impossible since there is no prime that divides 1
and as a result q is not one of the primes listed.
The following theorem discusses the large gaps between primes. It simply
states that there are arbitrary large gaps in the series of primes and that the primes
are spaced irregularly.
Theorem 15. Given any positive integer n, there exists n consecutive composite
integers.
Notice that every integer in the above sequence is composite because k divides
(n + 1)! + k if 2 ≤ k ≤ n + 1 by 4.
Exercises
Lemma 4. If a,b,c are positive integers such that (a, b) = 1 and a | bc, then a | c.
42 CHAPTER 2. PRIME NUMBERS
Proof. We present the proof of this result by induction. For k = 1, the result
is trivial. Assume now that the result is true for k. Consider n1 n2 ...nk+1 that is
divisible by p. Notice that either
We now state the fundamental theorem of arithmetic and present the proof
using Lemma 5.
n = p1 p2 ...ps = q1 q2 ...qr
Cancel out all common primes from the factorizations above to get
Thus all the primes on the left side are different from the primes on the right side.
Since any pjl (l = 1, · · · , n) divides pj1 pj2 ...pju , then pjl must divide qi1 qi2 ...qiv ,
and hence by Lemma 5, pj1 must divide qjk for some 1 ≤ k ≤ v which is impos-
sible. Hence the representation is unique.
a
n = pa11 pa22 pa33 ...pj j (2.1)
where all the pi are distinct for 1 ≤ i ≤ j. One can also write a formal product
Y
n= pαi i , (2.2)
all primes pi
Example 18. The prime factorization of 120 is given by 120 = 2·2·2·3·5 = 23 ·3·5.
Notice that 120 is written in the two ways described in 2.
44 CHAPTER 2. PRIME NUMBERS
where we exclude in these expansions any prime p with power 0 in both a and b
(and thus some of the powers above may be 0 in one expansion but not the other).
Of course, if one prime pi appears in a but not in b, then ai 6= 0 while bi = 0, and
vise versa. Then the greatest common divisor is given by
Lemma 6. Let a and b be relatively prime positive integers. Then if d divides ab,
there exists d1 and d2 such that d = d1 d2 where d1 is a divisor of a and d2 is a
divisor of b. Conversely, if d1 and d2 are positive divisors of a and b, respectively,
then d = d1 d2 is a positive divisor of ab.
Proof. Let d1 = (a, d) and d2 = (b, d). Since (a, b) = 1 and writing a and b in
terms of their prime decomposition, it is clear that d = d1 d2 and (d1 , d2 ) = 1.
Note that every prime power in the factorization of d must appear in either d1 or
d2 . Also the prime powers in the factorization of d that are prime powers dividing
a must appear in d1 and that prime powers in the factorization of d that are prime
powers dividing b must appear in d2 .
Now conversely, let d1 and d2 be positive divisors of a and b, respectively.
Then
d = d1 d2
is a divisor of ab.
2.4. THE FUNDAMENTAL THEOREM OF ARITHMETIC 45
This result had been conjectured by Gauss but was first proved by Dirichlet.
Dirichlet proved this theorem using complex analysis, but the proof is so chal-
lenging. As a result, we will present a special case of this theorem and prove that
there are infinitely many primes in a given arithmetic progression. Before stating
the theorem about the special case of Dirichlet’s theorem, we prove a lemma that
will be used in the proof of the mentioned theorem.
Lemma 7. If a and b are integers both of the form 4n + 1, then their product ab
is of the form 4n + 1
where n3 = 4n1 n2 + n1 + n2 .
Theorem 18. There are infinitely many primes of the form 4n + 3, where n is a
positive integer.
Proof. Suppose that there are finitely many primes of the form 4n + 3, say p0 =
3, p1 , p2 , ..., pn . Let
N = 4p1 p2 ...pn + 3.
Notice that any odd prime is of the form 4n + 1 or 4n + 3. Then there is at least
one prime in the prime factorization of N of the form 4n + 3, as otherwise, by
46 CHAPTER 2. PRIME NUMBERS
Lemma 7, N will be in the form 4n + 1. We wish to prove that this prime in the
factorization of N is none of p0 = 3, p1 , p2 , ..., pn . Notice that if
3 | N,
which is impossible since pi 6= 3 for every i. Hence 3 doesn’t divide N . Also, the
other primes p1 , p2 , ..., pn don’t divide N because if pi | N , then
pi | (N − 4p1 p2 ...pn ) = 3.
Hence none of the primes p0 , p1 , p2 , ..., pn divides N. Thus there are infinitely
many primes of the form 4n + 3.
Exercises
3. Show that all the powers of in the prime factorization of an integer a are
even if and only if a is a perfect square.
Definition 10. The least common multiple (l.c.m.) of two positive integers is the
smallest positive integer that is a multiple of both.
2.5. LEAST COMMON MULTIPLE 47
We denote the least common multiple of two positive integers a an b by ha, bi.
We can figure out ha, bi once we have the prime factorization of a and b. To
do that, let
a = pa11 pa22 ...pamn and b = pb11 pb22 ...pbmn ,
where (as above) we exclude any prime with 0 power in both a and b. Then
max(a1 ,b1 ) max(a2 ,b2 ) max(a ,b )
ha, bi = p1 p2 ...pm n n , where max(a, b) is the maximum of
the two integers a and b. We now prove a theorem that relates the least common
multiple of two positive integers to their greatest common divisor. In some books,
this theorem is adopted as the definition of the least common multiple. To prove
the theorem we present a lemma
min(a, b) + max(a, b) = a + b
1. ha, bi ≥ 0;
and
max(a1 ,b1 ) max(a2 ,b2 ) max(an ,bn )
ha, bi = p1 p2 ...pm ,
then
max(a1 ,b1 ) max(a2 ,b2 ) max(an ,bn ) min(a1 ,b2 ) min(a2 ,b2 )
ha, bi(a, b) = p1 p2 ...pm p1 p2 ...pmin(a
n
n ,bn )
max(a1 ,b1 )+min(a1 ,b1 ) max(a2 ,b2 )+min(a2 ,b2 ) max(an ,bn )+min(an ,bn )
= p1 p2 ...pm
= pa11 +b1 pa22 +b2 ...p(a
n
n +bn )
Note also that we used Lemma 8 in the above equations. For part 3, it would be a
nice exercise to show that ab/(a, b) | m (Exercise 6). Thus ha, bi | m.
Exercises
3. Find the least common multiple and the greatest common divisor of 25 56 72 11
and 23 58 72 13.
4. Show that every common multiple of two positive integers a and b is divis-
ible by the least common multiple of a and b.
5. Show that if a and b are positive integers then the greatest common divisor
of a and b divides their least common multiple. When are the least common
multiple and the greatest common divisor equal to each other.
x = x0 + (b/d)t y = y0 − (a/d)t
Proof. Suppose that the equation ax + by = c has integer solution x and y. Thus
since d | a and d | b, then
d | (ax + by) = c.
Now we have to prove that if d | c, then the equation has integral solution. Assume
that d | c. By theorem 9, there exist integers m and n such that
d = am + bn.
c = dk
50 CHAPTER 2. PRIME NUMBERS
x0 = km and y0 = kn.
We have to prove now that x and y are solutions for all integers t. Notice that
We now show that every solution for the equation ax + by = c is of the form
x = x0 + (b/d)tand y = y0 − (a/d)t.
a(x − x0 ) + b(y − y0 ) = 0.
Hence
a(x − x0 ) = b(y − y0 ).
Dividing both sides by d, we get
a/d(x − x0 ) = b/d(y − y0 ).
Notice that (a/d, b/d) = 1 and thus we get by Lemma 4 that a/d | y − y0 . As a
result, there exists an integer t such that y = y0 − (a/d)t. Now substituting y − y0
in the equation
a(x − x0 ) = b(y − y0 ).
We get
x = x0 + (b/d)t.
2.7. THE FUNCTION [X] , THE SYMBOLS ”O”, ”O” AND ”∼” 51
Example 20. The equation 3x+6y = 7 has no integer solution because (3, 6) = 3
does not divide 7.
Example 21. There are infinitely many integer solutions for the equation 4x +
6y = 8 because (4, 6) = 2 | 8. We use the Euclidean algorithm to determine m
and n where 4m + 6n = 2. It turns out that 4(−1) + 6(1) = 2. And also 8 = 2.4.
Thus x0 = 4.(−1) = −4 and y0 = 4.1 = 4 is a particular solution. The solutions
are given by
x = −4 + 3t y = 4 − 2t
Exercises
1. Either find all solutions or prove that there are no solutions for the diophan-
tine equation 21x + 7y = 147.
2. Either find all solutions or prove that there are no solutions for the diophan-
tine equation 2x + 13y = 31.
3. Either find all solutions or prove that there are no solutions for the diophan-
tine equation 2x + 14y = 17.
4. A grocer orders apples and bananas at a total cost of $8.4. If the apples cost
25 cents each and the bananas 5 cents each, how many of each type of fruit
did he order.
2.7 The function [x] , the symbols ”O”, ”o” and ”∼”
We start this section by introducing an important number theoretic function. We
proceed in defining some convenient symbols that will be used in connection with
the growth and behavior of some functions that will be defined in later chapters.
52 CHAPTER 2. PRIME NUMBERS
1. [x + n] = [x] + n, if n is an integer.
Using the definition of [x], it will be easy to see that the above properties are
direct consequences of the definition.
We now define some symbols that will be used to estimate the growth of number
theoretic functions. These symbols will be not be really appreciated in the context
of this book but these are often used in many analytic proofs.
Now, the relation g(x) = o(f (x)), pronounced ”small-oh” of f (x), is used to
indicate that f (x) grows much faster than g(x). It formally says that
g(x)
lim = 0. (2.4)
x→∞ f (x)
Example 23. sin(x) = o(x) at ∞, and xk = o(ex ) also at ∞ for every constant
k.
f (x)
lim = 1. (2.6)
x→∞ g(x)
Example 24. [x] ∼ x.
There are some other properties that we did not mention here, properties that are
rarely used in number theoretic proofs.
Exercises
Lemma 9. Let p be a prime and let m ∈ Z+ . Then the highest power of p dividing
m! is
∞
X m
i=1
pi
h i
Proof. Among all the integers from 1 till m, there are exactly mp integers that
h i h i
are divisible by p. These are p, 2p, ..., p p. Similarly we see that there are m
m
pi
integers that are divisible by pi . As a result, the highest power of p dividing m! is
X m m X m
i − i+1 =
i≥1
pi p i≥1
pi
π(x) ∼ x/logx
So this theorem says that you do not need to find all the primes less than x to
find out their number, it will be enough to evaluate x/logx for large x to find an
estimate for the number of primes. Notice that I mentioned that x has to be large
enough to be able to use this estimate.
Several other theorems were proved concerning prime numbers. many great
mathematicians approached problems that are related to primes. There are still
many open problems of which we will mention some.
Conjecture 1. Twin Prime Conjecture There are infinitely many pairs primes p
and p + 2.
Conjecture 4. Polignac Conjecture For every even number 2n are there infinitely
many pairs of consecutive primes which differ by 2n.
Classical questions
This chapter visits some of the classical questions of number theory, which are a
vital part of mathematical culture.
Did you ever pause to count how many pebbles were in each pile?
1, 3, 6, 10, 15, . . .
57
58 CHAPTER 3. CLASSICAL QUESTIONS
These numbers are called triangular, for a reason you’ll probably never guess.1
As you can see, the ith triangular number is built from the one before it in a reli-
able pattern: t1 = 1, and if we know ti , then the (i + 1)th number is ti + (i + 1).
This is another example of a recursive sequence. Sure, you saw them earlier
with the Fibonacci numbers, but this one’s a little easier to deal with: the recursion
only requires knowledge of one previous number. Still, it would be nice to com-
pute the ith triangular number without having to know the one before it, which
would require us to determine the one before it, and so forth and so on, until we
finally descended back down to t1 . Doing that all the time is boring. Wouldn’t life
be nicer if we had a concise little formula for it?
Indeed, it would! Let’s try to find one. One way to look at this is by redrawing
the picture. After all, a triangle is usually half a square:
. . . well, maybe not quite half a square. Our triangle seems to cover the entire
diagonal. Well, a triangle is also half a rectangle. . .
That works out very nicely! The nth rectangle has area n(n + 1), so it makes
sense that the ith triangle has area n(n+1)/2. This is a perfectly reasonable explana-
tion, but if you prefer, we can resort to induction: It is clear that t1 = 1. Assume
that tn = n(n+1)/2; we obtain tn+1 by adding n + 1 to tn . Simplifying the sum, we
1
Hope that gave you a chuckle.
3.2. IRRATIONAL NUMBERS 59
see that
Exercises
1. The nth pentagonal number is the number of pebbles you get when ar-
ranged in a pentagon with n pebbles on a side; the first few are 1, 6, 16, 31,
. . . . (See the diagrams below.) Conjecture a concise formula, and prove that
it is correct.
Hint: To find a conjecture for the formula, look for triangular numbers.
2. The nth hexagonal number is the number of pebbles you get when arranged
in a hexagon with n pebbles on a side. Find the first few hexagonal numbers,
conjecture a concise formula, and prove that it is correct.
Remark 3. Remember how we said the Pythagoreans believed that every mea-
surement could be represented as the ratio of two integers. According to lore, it
3.3. GAUSSIAN INTEGERS 61
was a Pythagorean who discovered this fact. Once he told his companions, they
sent him on a one-way cruise to the bottom of the Mediterranean ocean. The
Pythagoreans have a well-deserved reputation for mathematical excellence, but
even they were only human.
Exercises
√
1. Show that if p is prime, then we cannot write p as the ratio of two integers.
2. Show that if n = pm, where p is prime and does not divide m, then we
√
cannot write n as the ratio of two integers.
• closure, because
and closure of integer addition implies that this number is a Gaussian inte-
ger;
62 CHAPTER 3. CLASSICAL QUESTIONS
• associativity, because
• identity, as 0 + 0i satisfies
and
• associativity, because
Exercises
1. Complete the explanation that the Gaussian integers satisfy the properties
of a ring.
3.3.2 Division
We can think of Gaussian integers as vectors on the plane: a + bi corresponds to
the vector (a, b).
64 CHAPTER 3. CLASSICAL QUESTIONS
15
10
5 10
Call the square of the Euclidean length of this vector the norm of the Gaussian
integer, written as, N (a + bi) = a2 + b2 . Typically, one does not multiply two
vectors to each other, but here it makes sense to multiply them in a way that
imitates the product of the corresponding Gaussian integers:
15
10
5 10 15
If we can multiply Gaussian integers, then there’s a good bet that we can divide
them, too — but how should we go about doing it? In particular, we’d like to do
so in a way that gives us the smallest remainder possible — where, by “smallest”
remainder, we refer of course to a Gaussian integer’s norm.
This approach implies that we can divide two Gaussian integers: let a + bi, c +
di ∈ Z[i], and put
Let’s start with two Gaussian integers that lie on the same line, but aren’t
multiples of each other: 4 + 2i and 10 + 5i. It should be pretty clear that we can
obtain the smallest possible remainder using either 2 or 3, since
10
3(4 +2i)
5
10 +5i
2(4 +2i)
5 10 15
You should notice right away that there can be more than one remainder: in this
example, we have ±(2+i). So that’s one difference from ordinary integer division.
On the other hand, the norm seems to be unique, at 5.
So what if the two integers aren’t on the same line? Inasmuch as products of
Gaussian integers consist of adding one scaling to the rotation of another scaling,
it seems best to adopt the following approach for two Gaussian integers α and β:
• Find the integer c such that the distance between cα and β is minimal.
• Find the integer d such that the distance between diα and β is minimal.
10
10 +8i
6(2 + i)
i(2 + i)
5 10 15
If you look carefully, you will see that there is more than one way to obtain a
remainder that like within the circle: for instance, both γ = 5 + i and γ = 6 + 2i
serve this purpose. However, they do not minimize N ((10 + 8i) − γ(2 + i)), and
we will use this fact to prove the general case.
Sketch of proof. Our proof relies highly on geometry, so it may help to draw some
pictures while reading this. In particular, we frequently use the norm of a Gaussian
integer as the radius of a circle around another one.
Suppose, to the contrary, that it is not; then the endpoint of γα lies on or
outside a circle of radius N (α) with the endpoint of β at the center. For simplicity,
we assume that α has positive real and imaginary parts; we can modify the easily
otherwise, as indicated below. Without loss of generality, suppose γα is closer to
the origin than β. Extend γ by adding to it one of γ̌ = γ + α or γ̂ = γ + iα. If
the distance to β remains unchanged in either direction, then these three points lie
on a circle with β at the center. By hypothesis, the circle has radius greater than
N (α), so that γ + (1 + i)α lies within it, contradicting the choice of both c and
d. Otherwise, supposed that both γ̌ and γ̂ lie further from β than γ; in this case, β
must lie within a circle of radius N (α) from γ, contradicting the hypothesis that
it does not. Thus, either γ̌ or γ̂ lies closer to β than γ, which again contradicts the
68 CHAPTER 3. CLASSICAL QUESTIONS
Exercises
1. Divide 30 + 23i by
(a) 4,
(b) 2i, and
(c) 4 + 2i.
Use both the geometric approach, and the method of simplifying a complex
fraction, then rounding. Notice that you don’t always get the same answer.
3.3.3 Primality
The only integers which have integral multiplicative inverses are ±1. Well, which
Gaussian integers have multiplicative inverses? Suppose a + bi has an inverse
c + di. You will show in the exercises that, in this case, b = −d/c2+d2. Because c
and d are integers, the sum of their squares must be 1 (it is the only way we can
get c2 + d2 to divide d) so c = ±1 and d = 0 or c = 0 and d = ±1. In short, the
only Gaussian integers with multiplicative inverses are ±1 and ±i.
Another question to ask is, what makes a Gaussian integer “prime”? Accord-
ing to the irreducibility criterion, an integer p is prime if p is divisible only by 1
3.3. GAUSSIAN INTEGERS 69
and itself. As an integer, 2 is prime because the only numbers that divide it are 1
and itself.
What about the Gaussian integers? Rather surprisingly, many prime integers
are not prime Gaussian integers! For example, 5 = (1 + 2i)(1 − 2i), 13 =
(2 + 3i)(2 − 3i), 17 = (1 + 4i)(1 − 4i), and so forth.
Does this happen to all prime integers? We pass over 2 for the moment, but
suppose there exist integers a and b such that 7 = (a + bi)(a − bi). That would
mean 7 = a2 + b2 . The fact that a and b are integers means that their squares
have to be positive integers, which means that they have to be smaller than 7.
That limits our options, and it’s easy to verify that no integer squares add up to 7:
1 + 1 = 2, 1 + 4 = 5, 4 + 4 = 8. So, 7 remains prime even as a Gaussian integer.
We see that the question of whether a number is prime depends very much on
the ring!
Exercises
2. Show that if an integer p factors as (a + bi)(c + di), then the factors are
conjugate.
a0 α0 + a1 α1 + . . . + an αn = 0.
3.4. ALGEBRAIC AND TRANSCENDENTAL NUMBERS 71
α a1 + a2 α1 + . . . + an αn−1 = −a0 .
So, we have found a multiplicative inverse, after all — but is it in E? The quotients
−ai/a0 are all rational numbers, and E is the smallest ring that contains both Q and
α, implying that, indeed,
a1 a2 1 an n−1
− − α − ... − α ∈ E.
a0 a0 a0
and
d0 + d1 (αβ)1 + ... + dn (αβ)n = 0.
These equations remain true even if we multiply both sides by the greatest com-
mon denominators of the ci and the di , so we may assume that these coefficients
are actually integers! (If the first choice was wrong, just reassign them to the
coefficients obtained by clearing the denominators from the equations above.)
72 CHAPTER 3. CLASSICAL QUESTIONS
So, let
fˆ = c0 + c1 x + · · · + cn xn , and
ĝ = d0 + d1 x + · · · + dn xn .
Per the discussion above, fˆ and ĝ are polynomials with integer coefficients. Since
α + β is a root of fˆ and αβ is a root of ĝ, we see that α + β and αβ are algebraic.
Our choice of α and β was arbitrary in A, so the algebraic numbers are closed
under addition and multiplication.
We have shown that A is closed under addition and multiplication. The re-
maining properties of a ring are immediate, as A is a subset of the set C of complex
numbers, which is itself a ring.
0.110001000000000000000001 . . . .
As the digits proceed on to the right, the number of 0’s between two 1’s grows
huge, thanks to the factorial. Since the decimal expansion neither terminates nor
repeats, λ must be irrational. What’s more, this particular pattern of non-repetition
is critical to transcendence. The require two steps, neither of which is obvious.
3.4. ALGEBRAIC AND TRANSCENDENTAL NUMBERS 73
The first step is to show Liouville’s inequality, which states that an irrational
algebraic number is not “especially close” to any rational number. What does that
mean? Suppose that α is algebraic and irrational, while a/b is rational. Choose a
minimal polynomial f of degree n that has α as a root; since f is minimal, it does
not factor. We will show that only finitely many rational numbers a/b are closer to
α than 1/bn+1 .
The second step is to show that λ does not satisfy Liouville’s inequality. Were
λ algebraic, only finitely many rational numbers a/b would be closer to α than
1/bn+1 , regardless of the choice of a/b. Remember that λ has all those increasing
lengths of 0’s: that will give us an infinite sequence of rational numbers that are
closer to λ than 1/10n+1 , regardless of the choice of n.
Multiply both sides of the opposite ends of the inequality to find that
b ≤ c.
Since b is positive, there are only finitely many b that we can choose to be smaller
than c. For each of these, only finitely many a satisfy 1/bn+1 ≤ |a/b − α|. Hence,
there are only finitely many rational numbers a/b closer to α than 1/bn+1 .
Exercises
√
1. Find a polynomial f whose roots include 2. Try to give f as low a degree
as possible.
√
2. (a) Let E be the smallest ring that contains both Q and 2. What is the
dimension of E as a vector space over Q?
√
4
(b) Let F be the smallest ring that contains both E and 3. What is the
dimension of F as a vector space over E, and as a vector space over
Q?
Congruences
77
78 CHAPTER 4. CONGRUENCES
There are many common properties between equations and congruences. Some
properties are listed in the following theorem.
Theorem 24. Let a, b, c and d denote integers. Let m be a positive integers. Then:
m | ((a + c) − (b + c))
and as a result
a + c ≡ b + c(mod m).
and as a result
a − c ≡ b − c(mod m).
5. If a ≡ b(mod m), then m | (a − b). Thus there exists integer k such that
a − b = mk and as a result ac − bc = m(kc). Thus
m | (ac − bc)
and hence
ac ≡ bc(mod m).
6. If a ≡ b(mod m), then m | (a − b). Thus there exists integer k such that
a − b = mk and as a result
ac − bc = mc(k).
Thus
mc | (ac − bc)
and hence
ac ≡ bc(mod mc).
80 CHAPTER 4. CONGRUENCES
(a − b) + (c − d) = (a + c) − (b + d) = m(k + l).
As a result,
m | ((a + c) − (b + d)),
hence
a + c ≡ b + d(mod m).
(a − b) − (c − d) = (a − c) − (b − d) = m(k − l).
As a result,
m | ((a − c) − (b − d)),
hence
a − c ≡ b − d(mod m).
As a result,
m | (ac − bd),
hence
ac ≡ bd(mod m).
4.1. INTRODUCTION TO CONGRUENCES 81
Example 29.
We now present a theorem that will show one difference between equations
and congruences: we cannot cancel across congruence in all cases. For instance,
8 × 6 ≡ 8(mod 20) and 16 × 8 ≡ 8(mod 20), so the transitive property implies
that 2 × 3 ≡ 4 × 3(mod 6). However, it is obviously a bad idea to cancel 8 from
both sides of this congruence, as 6 is not congruent to 16 modulo 10.
In general, this means that we cannot solve congruences in quite the same
way as we solve equations: the congruence 2x ≡ 0(mod m) does not force x ≡
0(mod m)! Nevertheless, there are some cases where this is possible, and we can
also find some similar properties that do hold. In other words, dividing both sides
of the congruence by the same integer doesn’t preserve the congruence.
82 CHAPTER 4. CONGRUENCES
Theorem 25.
Proof. Part 2 follows immediately from Part 1. For Part 1, if ac ≡ bc(mod m),
then
m | (ac − bc) = c(a − b).
Hence there exists k such that c(a − b) = mk. Dividing both sides by d, we get
(c/d)(a − b) = k(m/d). Since (m/d, c/d) = 1, it follows that m/d | (a − b).
Hence a ≡ b(mod m/d).
Theorem 26. If
hm1 , m2 , ..., mt i | (a − b)
Exercises
3. Show that if a, b, m and n are integers such that m and n are positive, n | m
and a ≡ b(mod m), then a ≡ b(mod n).
6. Show that a number is divisible by three if and only if the sum of its digits
is divisible by 3.
Hint: Write 3 base 10, and use Theorem 24.
7. Show that a number is divisible by nine if and only if the sum of its digits is
divisible by 9.
8. Show that a number is divisible by four if and only if its last two digits (tens
and ones place) make a number that is divisible by four.
9. Show that a number is divisible by eight if and only if its last three digits
make a number that is divisible by eight.
10. Show that a number is divisible by eleven if and only if the alternating sum
of its digits is divisible by 11. For instance, the alternating sum of 112 = 121
is 1 − 2 + 1 = 0, and the alternating sum of 46 × 11 = 506 is 5 − 0 + 6 = 11.
11. Using techniques similar to those of the previous exercises, formulate and
prove rules of divisibility for 6 and 7.
84 CHAPTER 4. CONGRUENCES
Definition 14. A complete residue system modulo m is a set of integers such that
every integer is congruent modulo m to exactly one integer of the set.
The easiest complete residue system modulo m is the set of integers 0, 1, 2, ..., m−
1. Every integer is congruent to one of these integers modulo m. This is important
enough that mathematicians call it the set of canonical residues modulo m.
Example 31. The set of integers {0, 1, 2, 3, 4} form a complete residue system
modulo 5. Another complete residue system modulo 5 could be 6, 7, 8, 9, 10.
Definition 15. A reduced residue system modulo m is a set of integers ri such that
(ri , m) = 1 for all i and ri 6= rj (mod m) if i 6= j.
Example 32. The set of integers {1, 5} is a reduced residue system modulo 6.
The following lemma will help determine a complete residue system modulo
any positive integer m.
Proof. We will prove this lemma by contradiction. Suppose that the set of m
integers does not form a complete residue system modulo m. Then we can find at
least one integer a that is not congruent to any element in this set. Hence non of
the elements of this set is actually congruent to the remainder when a is divided
by m. Thus dividing by m yields to at most m − 1 remainders. Therefore by the
pigeonhole principle, at least two integers in the set that have the same remainder
modulo m. This is a contradiction since the set of integers is formed of m integers
that are incongruent modulo m.
Proof. Let us prove first that no two elements of the set {ka1 +b, ka2 +b, ..., kam +
b} are congruent modulo m. Suppose there exists i and j such that
ai ≡ aj (mod m)
Example 33. Since 1 and 3 are the only two integers that are relatively prime to
4 and less than 4, then φ(4) = 2. Also, 1,2,...,6 are the integers that are relatively
prime to 7 that are less than 7, thus φ(7) = 6.
Now we can say that the number of elements in a reduced residue system
modulo n is φ(n).
Theorem 28. If a1 , a2 , ..., aφ(n) is a reduced residue system modulo n and (k, n) =
1, then ka1 , ka2 , ..., kaφ(n) is a reduced residue system modulo n.
Proof. The proof proceeds exactly in the same way as that of Theorem 24.
Exercises
4. Show that any reduced residue system satisfies the properties of a ring.
4.3. LINEAR CONGRUENCES 87
Theorem 29. Let a, b and m be integers such that m > 0 and let c = (a, m). If c
does not divide b, then the congruence ax ≡ b(mod m) has no solutions. If c | b,
then
ax ≡ b(mod m)
x = x0 + (m/c)t
Thus the above values of x are solutions of the congruence ax ≡ b(mod m). Now
we have to determine the number of incongruent solutions that we have. Suppose
that two solutions are congruent, i.e.
Thus we get
(m/c)t1 ≡ (m/c)t2 (mod m).
t1 ≡ t2 (mod c).
Example 34. Let us find all the solutions of the congruence 3x ≡ 12(mod 6).
Notice that (3, 6) = 3 and 3 | 12. Thus there are three incongruent solutions
modulo 6. We use the Euclidean algorithm to find the solution of the equation
3x − 6y = 12 as described in chapter 2. As a result, we get x0 = 6. Thus the
three incongruent solutions are given by x1 = 6(mod 6), x1 = 6 + 2 = 2(mod 6)
and x2 = 6 + 4 = 4(mod 6).
Example 35. The modular inverse of 7 modulo 48 is 7. Notice that a solution for
7x ≡ 1(mod 48) is x ≡ 7(mod 48).
Exercises
x ≡ b1 (mod n1 ),
x ≡ b2 (mod n2 ),
..
.
x ≡ bt (mod nt ),
Since
Nj ≡ 0(mod nk ) for all j 6= k,
x0 ≡ x1 (mod N ).
We now present an example that will show how the Chinese remainder theo-
rem is used to determine the solution of a given system of congruences.
x ≡ 1(mod 2)
x ≡ 2(mod 3)
x ≡ 3(mod 5).
y1 ≡ 1(mod 2).
4.4. THE CHINESE REMAINDER THEOREM 91
As a result, we get
Exercises
2. Find all integers that leave a remainder of 4 when divided by 11 and leaves
a remainder of 3 when divided by 17.
Solving this in the usual fashion for linear Diophantine equations, we find that all
solutions of the equation have the form q = −2 + 5a and r = −1 + 3a, where a
is an integer. By substitution, x = 6(−2 + 5a) + 4 = 30a − 8.
Substitute this into the third congruence, and we have 30a − 8 ≡ 15s + 7.
Solving this again in the usual fashion for linear Diophantine equations, we find
that all solutions of the equation have the form a = 1 + b, s = 1 + 2b, where b is
an integer. By substitution, x = 30(1 + b) − 8 = 30b − 8. We can now verify that
These are not especially useful for testing whether an integer is prime. To start
with, Euclid’s criterion isn’t even finite, as we’d have to test every product of
integers. At least the irreducibility criterion requires us to check only finitely
√
many factors (2, 3, 4, . . . , p) but even this gets tedious and wasteful as the
numbers grow beyond toy size. This section gives a third criterion for primality,
also finite, that takes a different approach.
Guess what? If p is prime, then the set Fp = {0, 1, ..., p − 1} is a field, where
addition and multiplication are performed modulo p. How so?
All the properties of a field are fairly clear, except multiplicative inverses. We turn
our attention to that now.
• 2 · 25 ≡ 2 · 32 ≡ 2 · 4 ≡ 8 ≡ 1 (mod 7);
• ...
Alas, we cannot test it for all possible prime numbers, because (as we saw
earlier) there are infinitely many primes. To show that this is true for all primes,
we adopt a different approach.
On the other hand, Theorem 27 tells us that {x, 2x, . . . , (p − 1)x} is a complete
system of residues, so its elements are congruent to {1, 2, . . . , p − 1}. By substi-
tution, then,
Combining equations (4.1) and (4.2) via the transitive property, we see that
The nonzero elements of Fp are all relatively prime to p. Thus, their product is
also relatively prime to p. By Theorem 25, we can cancel (p − 1)! from both sides
of this last equation to obtain the desired result:
1. Use Fermat’s Little Theorem to show that 12 and 1001 are not prime. As a
hint, when computing ap−1 , try to group products so that you minimize the
number of multiplications necessary.
4.6. THEOREMS OF FERMAT, EULER, AND WILSON 97
Theorem 32. Let p be a prime. A positive integer m is its own inverse modulo p
if and only if p divides m + 1 or p divides m − 1.
Hence p | m2 − 1. As a result,
p | (m − 1)or p | (m + 1).
Thus
m2 ≡ 1(mod p).
Proof. When p = 2, the congruence holds. Now let p > 2. Using Theorem
26, we see that for each 1 ≤ m ≤ p, there is an inverse 1 ≤ m̄ ≤ p such that
mm̄ ≡ 1(mod p). Thus by Theorem 28, we see that the only two integers that
have their own inverses are 1 and p − 1. Hence after coupling the integers from 2
to p − 2 each with its inverse, we get
Thus we get
1.2.3.....(p − 2)(p − 1) ≡ (p − 1)(mod p)
As a result, we have (p − 1)! ≡ −1(mod p).
Note also that the converse of Wilson’s theorem also holds. The converse tells
us whether an integer is prime or not.
(m − 1)! + 1 ≡ 0 (mod m)
then m is prime.
Example 39. Note that 34 = 81 ≡ 1(mod 5). Also, 2φ(9) = 26 = 64 ≡ 1(mod 9).
Thus, we get
ap ≡ a(mod p).
Now if p | a, we have
ap ≡ a ≡ 0(mod p).
Theorem 37. If p is a prime number and a is an integer such that p - a, then ap−2
is the inverse of a modulo p.
Hence
ap−2 a ≡ 1(mod p).
Exercises
In this chapter, we study functions, called multiplicative functions, that are defined
on integers. These functions have the property that their value at the product of
two relatively prime integers is equal to the product of the value of the functions at
these integers. We start by proving several theorems about multiplicative functions
that we will use later. We then study special functions and prove that the Euler
φ-function that was seen before is actually multiplicative. We also define the sum
of divisors and the number of divisors functions.
Later define the Mobius function which investigate integers in terms of their
prime decomposition. The summatory function of a given function takes the sum
of the values of f at the divisors of a given integer n. We then determine the
Mobius inversion of this function which writes the values of f in terms of the
values of its summatory function. We end this chapter by presenting integers with
interesting properties and prove some of their properties.
103
104 CHAPTER 5. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS
Proof. We prove this theorem by induction on the number of primes in the factor-
ization of n. Suppose that n = pa11 . Thus the result follow easily. Suppose now
that for s
Y
n= pakk ,
k=1
5.1. DEFINITIONS AND PROPERTIES 105
we have s
Y
f (n) = f (pakk ).
k=1
So we have to prove that if
s+1
Y
n= pakk ,
k=1
then
s+1
Y
f (n) = f (pakk ).
k=1
Notice that for
s+1
Y
n= pakk ,
k=1
Qs ak as+1
we have ( k=1 pk , ps+1 )= 1. Thus we have
s+1
! s
!
Y Y as+1
f (n) = f pakk = f pakk f ps+1
k=1 k=1
From the above theorem, we can see that to evaluate a multiplicative function
at an integer, it will be enough to know the value of the function at the primes that
are in the prime factorization of the number.
We now define summatory functions which represents the sum of the values
of a given function at the divisors of a given number.
This function determines the sum of the values of the arithmetic function at
the divisors of a given integer.
Exercises
1. Determine whether the arithmetic functions f (n) = n! and g(n) = n/2 are
completely multiplicative or not.
5.2. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS 107
Proof. The first part is obvious since every positive integer less than p is relatively
prime to p. Conversely, suppose that p is not prime. Then p = 1 or p is a composite
number. If p = 1, then φ(p) 6= p − 1. Now if p is composite, then p has a positive
divisor. Thus φ(p) 6= p − 1. We have a contradiction and thus p is prime.
Theorem 41. Let p be a prime and m a positive integer, then φ(pm ) = pm −pm−1 .
Proof. Note that all integers that are relatively prime to pm and that are less than
pm are those that are not multiple of p. Those integers are p, 2p, 3p, ..., pm−1 p.
There are pm−1 of those integers that are not relatively prime to pm and that are
less than pm . Thus
φ(pm ) = pm − pm−1 .
108 CHAPTER 5. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS
Theorem 42. Let m and n be two relatively prime positive integers. Then φ(mn) =
φ(m)φ(n).
Proof. Denote φ(m) by s and let k1 , k2 , ..., ks be a reduced residue system modulo
m. Similarly, denote φ(n) by t and let k10 , k20 , ..., kt0 be a reduced residue system
modulo n. Notice that if x belongs to a reduced residue system modulo mn, then
(x, m) = (x, n) = 1.
Thus
x ≡ ki (mod m)and x ≡ kj0 (mod n)
for some i, j. Conversely, if
some i, j then (x, mn) = 1 and thus x belongs to a reduced residue system modulo
mn. Thus a reduced residue system modulo mn can be obtained by by determin-
ing all x that are congruent to ki and kj0 modulo m and n respectively. By the
Chinese remainder theorem, the system of equations
has a unique solution. Thus different i and j will yield different answers. Thus
φ(mn) = st.
Theorem 43. Let n = pa11 pa22 ...pas s be the prime factorization of n. Then
1 1 1
φ(n) = n 1 − 1− ... 1 − .
p1 p2 ps
5.2. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS 109
Theorem 44. Let n be a positive integer greater than 2. Then φ(n) is even.
a a −1−1
φ(pj j ) = pj j (pj − 1).
a
We see then φ(pj j )is even if pj is an odd prime. Notice also that if pj = 2, then it
a
follows that φ(pj j ) is even. Hence φ(n) is even.
110 CHAPTER 5. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS
Proof. Split the integers from 1 to n into classes. Put an integer m in the class Cd
if the greatest common divisor of m and n is d. Thus the number of integers in the
Cd class is the number of positive integers not exceeding n/d that are relatively
prime to n/d. Thus we have φ(n/d) integers in Cd . Thus we see that
X
n= φ(n/d).
d|n
Theorem 47. Let p be a prime and let n = pa11 pa22 ...pat t be a positive integer. Then
pa+1 − 1
σ(pa ) = ,
p−1
and as a result,
t a +1
Y pj j − 1
σ(n) =
j=1
pj − 1
24 −1 53 −1
Example 45. σ(200) = σ(23 52 ) = 2−1 5−1
= 15.31 = 465.
Theorem 49. Let p be a prime and let n = pa11 pa22 ...pat t be a positive integer. Then
τ (pa ) = a + 1,
and as a result,
t
Y
τ (n) = (aj + 1).
j=1
τ (pa ) = a + 1
Exercises
7. Find the sum of positive integer divisors and the number of positive integer
divisors of 35
8. Find the sum of positive integer divisors and the number of positive integer
divisors of 25 34 53 73 13.
Note that if n is divisible by a power of a prime higher than one then µ(n) = 0.
In connection with the above definition, we have the following
114 CHAPTER 5. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS
Example 48. Notice that µ(1) = 1, µ(2) = −1, µ(3) = −1 and µ(4) = 0.
Proof. Let m and n be two relatively prime integers. We have to prove that
µ(mn) = µ(m)µ(n).
µ(mn) = 0 = µ(m)µ(n).
What remains to prove that if m and n are square-free integers say m = p1 p2 ...ps
where p1 , p2 , ..., ps are distinct primes and n = q1 q2 ...qt where q1 , q2 , ..., qt . Since
(m, n) = 1, then there are no common primes in the prime decomposition be-
tween m and n. Thus
In the following theorem, we prove that the summatory function of the Mobius
function takes only the values 0 or 1.
5.3. THE MOBIUS FUNCTION AND THE MOBIUS INVERSION FORMULA115
P
Theorem 51. Let F (n) = d|n µ(d), then F (n) satisfies
(
1 if n = 1;
F (n) =
0 if n > 1.
Proof. For n = 1, we have F (1) = µ(1) = 1. Let us now find µ(pk ) for any
integer k > 0. Notice that
Thus by Theorem 36, for any integer n = pa11 pa22 ...pat t > 1 we have,
We now define the Mobius inversion formula. The Mobius inversion formula
expresses the values of f in terms of its summatory function of f .
Theorem 52. Suppose that f is an arithmetic function and suppose that F is its
summatory function, then for all positive integers n we have
X
f (n) = µ(d)F (n/d).
d|n
Proof. We have
X X X
µ(d)F (n/d) = µ(d) f (e)
d|n d|n e|(n/d)
X X
= µ(d)f (e)
d|n e|(n/d)
X X
= µ(d)f (e)
e|n d|(n/e)
X X
= f (e) µ(d)
e|n d|(n/d)
116 CHAPTER 5. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS
P
Notice that d|(n/e) µ(d) = 0 unless n/e = 1 and thus e = n. Consequently we
get
X X
f (e) µ(d) = f (n).1 = f (n).
e|n d|(n/d)
Example 49. A good example of a Mobius inversion formula would be the in-
version of σ(n) and τ (n). These two functions are the summatory functions of
f (n) = n and f (n) = 1 respectively. Thus we get
X
n= µ(n/d)σ(d)
d|n
and
X
1= µ(n/d)τ (d).
d|n
Exercises
2. Find the value of µ(n) for each integer n with 100 ≤ n ≤ 110.
P
3. Use the Mobius inversion formula and the identity n = d|n φ(n/d) to
show that φ(pt ) = pt − pt−1 where p is a prime and t is a positive integer.
In other words, a perfect number is a positive integer which is the sum of its
proper divisors.
Example 50. The first perfect number is 6, since σ(6) = 12. You can also view
this as 6 = 1 + 2 + 3. The second perfect number is 28, since σ(28) = 56 or
28 = 1 + 2 + 4 + 7 + 14.
The following theorem tells us which even positive integers are perfect.
Theorem 53. The positive integer n is an even perfect number if and only if
Proof. We show first that if n = 2l−1 (2l − 1) where l is an integer such that
l ≥ 2 and 2l − 1 is prime then n is perfect. Notice that 2l − 1 is odd and thus
(2l−1 , 2l − 1) = 1. Also, notice that σ is a multiplicative function and thus
σ(n) = 2n.
Notice now that (2r+1 − 1, 2r+1 ) = 1 and thus 2r+1 | σ(s). Therefore there exists
an integer q such that σ(s) = 2r+1 q. As a result, we have
So we get that q | s. We add q to both sides of the above equation and we get
We have to show now that q = 1. Notice that if q 6= 1, then s will have three
divisors and thus σ(s) ≥ 1 + s + q. Hence q = 1 and as a result s = 2r+1 − 1.
Also notice that σ(s) = s + 1. This shows that s is prime since the only divisors
of s are 1 and s. As a result,
n = 2r (2r+1 − 1),
Proof. Suppose that ` is composite, that is ` = rs where 1 < r < ` and 1 < s < `.
Thus after factoring, we get that
Notice that the two factors above are both greater than 1. Thus 2` − 1 is not prime.
This is a contradiction.
The above theorem motivates the definition of interesting numbers called Mersenne
numbers.
We prove a theorem that help decide whether Mersenne numbers are prime.
Since p1 is a common divisor of 2p − 1 and 2p1 −1 − 1 and thus not relatively prime.
Hence (p, p1 − 1) = p. Hence p | (p1 − 1) and thus there exists a positive integer
k such that p1 − 1 = kp. Since p1 is odd, then k is even and thus k = 2m. Hence
p1 = kp + 1 = 2mp + 1.
We now define Fermat numbers and prove some theorems about the properties
of these numbers.
n
Definition 27. Integers of the form Fn = 22 + 1 are called Fermat numbers.
Fermat conjectured that these integers are primes but it turned out that this is
not true. Notice that F0 = 3, F1 = 5, F2 = 17, F3 = 257 and F4 = 65, 537 while
F5 is composite. It turned out the F5 is divisible by 641. We now present a couple
of theorems about the properties of these numbers.
120 CHAPTER 5. MULTIPLICATIVE NUMBER THEORETIC FUNCTIONS
F0 F1 F2 ...Fn−1 = Fn − 2
Proof. We will prove this theorem by induction. For n = 1, the above identity is
true. Suppose now that
F0 F1 F2 ...Fn−1 = Fn − 2
Notice that
n n n+1
F0 F1 F2 ...Fn = (Fn − 2)Fn = (22 − 1)(22 + 1) = 22 − 1 = Fn+1 − 2.
Using Theorem 53, we prove that Fermat numbers are relatively prime.
Proof. Assume without loss of generality that s < t. Thus by Theorem 52, we
have
F0 F1 F2 ...Fs ...Ft−1 = Ft − 2
Assume now that there is a common divisor d of Fs and Ft . thus we see that d
divides
Ft − F0 F1 F2 ...Fs ...Ft−1 = 2.
Exercises
4. We say n is abundant if σ(n) > 2n. Prove that if n = 2m−1 (2m − 1) where
m is a positive integer such that 2m − 1 is composite, then n is abundant.
123
124 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
by the well ordering principle, there is a least positive integer x that satisfies this
congruence ax ≡ 1(mod n).
To find all integers x such that ax ≡ 1(mod b), we need the following theorem.
Theorem 58. If (a, b) = 1 with b > 0, then the positive integer x is a solution of
the congruence ax ≡ 1(mod b) if and only if ordb a | x.
Proof. Having ordb a | x, then we have that x = k.ordb a for some positive integer
k. Thus
ax = akordb a = (aordb a )k ≡ 1(mod b).
Now since ax ≡ 1(mod b),we have ar ≡ 1(mod b). Since ordb a, we get r = 0.
Thus x = q.ordb a and hence ordb a | x.
Example 54. Since ord7 2 = 3, then 215 ≡ 1(mod 7) while 10 is not a solution
for 2x ≡ 1(mod 7).
ai ≡ aj (mod b)
6.1. THE ORDER OF INTEGERS AND PRIMITIVE ROOTS 125
i ≡ j(mod ordb a)
ai ≡ aj ai−j ≡ aj (mod b)
We introduce now primitive roots and discuss their properties. We are inter-
ested in integers whose order modulo another integer is φ(b). In one of the exer-
cises, one is asked to prove that if aand b are relatively prime then ordb a | φ(b).
Example 55. Notice that φ(7) = 6 hence 2 is not a primitive root modulo 7. While
ord7 3 = 6 and thus 3 is a primitive root modulo 7.
Theorem 60. If (r, m) = 1 with m > 0 and if r is a primitive root modulo n, then
the integers {r1 , r2 , ...rφ(m) } form a reduced residue set modulo m.
126 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
Proof. To prove that the set {r1 , r2 , ...rφ(m) } form a reduced residue set modulo
m we need to show that every two of them are relatively prime and that no two
of them are congruent modulo m. Since (r, m) = 1, it follows that (rn , m) = 1
for all positive integers n. Hence all the powers of r are relatively prime to m. To
show that no two powers in the above set are equivalent modulo m, assume that
ri ≡ rj (mod m).
Proof. Let
Thus ordm ru = φ(m) and ru is a primitive root if and only if (u, φ(m)) = 1.
Exercises
1. Determine ord13 10.
2. Determine ord11 3.
5. Show that if n is a positive integer, and a and b are integers relatively prime
to n such that (ordn a, ordn b) = 1, then ordn (ab) = ordn a.ordn b.
7. Show that if a and n are relatively prime with n > 0, then ordn a | φ(n).
128 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
Example 57. Notice that x ≡ 3(mod 11) is a root for f (x) = 2x2 + x + 1 since
f (3) = 22 ≡ 0(mod 11).
We now introduce Lagrange’s theorem for primes. This is modulo p, the fun-
damental theorem of algebra. This theorem will be an important tool to prove that
every prime has a primitive root.
m(x) = b1 x + b0 and p - b1 .
A root of m(x) is a solution for b1 x+b0 (mod p). Since p - b1 , then this congruence
has exactly one solution by Theorem 26.
Suppose that the theorem is true for polynomials of degree n − 1, and let
m(x) be a polynomial of degree n with integer coefficients and where the leading
coefficient is not divisible by p. Assume now that m(x) has n + 1 incongruent
roots modulo p, say x0 , x1 , ..., xn . Thus
m(xk ) ≡ 0(mod p)
6.2. PRIMITIVE ROOTS FOR PRIMES 129
Thus f (xk ) ≡ 0(mod p) for all 1 ≤ k ≤ n and thus x1 , x2 , ..., xn are roots of
f (x). This is a contradiction since we a have a polynomial of degree n − 1 that
has n distinct roots.
Theorem 64. Consider the prime p and let p − 1 = kn for some integer k. Then
xn − 1 has exactly n incongruent roots modulo p.
We now prove a lemma that gives us how many incongruent integers can have
a given order modulo p.
Lemma 12. Let p be a prime and let m be a positive integer such that p − 1 = mk
for some integer k. Then
for all positive integers k. By Theorem 60, we know that xm − 1 has exactly m
incongruent roots modulo p, so that every root is congruent to one of these powers
of a. We also know by Theorem 57 that the powers of ak with (k, m) = 1 have
order m. There are exactly φ(m) such integers with 1 ≤ k ≤ m and thus if there
is one element of order m modulo p, there must be exactly φ(m) such positive
integers less than p. Hence S(m) ≤ φ(m).
Proof. Let p be a prime and let m be a positive integer such that p − 1 = mk for
some integer k. Let F (m) be the number of positive integers of order m modulo
p that are less than p. The order modulo p of an integer not divisible by p divides
p − 1, it follows that
X
p−1= F (m).
m|p−1
6.2. PRIMITIVE ROOTS FOR PRIMES 131
we see that F (m) = φ(m) for each positive divisor m of p − 1. Thus we conclude
that F (m) = φ(m). As a result, we see that there are p − 1 incongruent integers
of order p − 1 modulo p. Thus p has φ(p − 1) primitive roots.
Exercises
9. Show that if p is a prime and p ≡ 1(mod 4), then there is an integer x such
that x2 ≡ −1(mod p).
132 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
Theorem 66. If p is an odd prime with primitive root r, then one can have either
r or r + p as a primitive root modulo p2 .
ordp r = φ(p) = p − 1.
Thus
rm ≡ 1(mod p).
m | φ(p2 ).
rp−1 ≡ 1(mod p2 ).
6.3. THE EXISTENCE OF PRIMITIVE ROOTS 133
Hence
p2 | sp−1 − (1 − prp−2 .
then
p2 | prp−2 .
Thus we have
p | rp−2
We now show that any power of an odd prime has a primitive root.
Theorem 67. Let p be an odd prime. Then any power of p is a primitive root.
Moreover, if r is a primitive root modulo p2 , then r is a primitive root modulo pm
for all positive integers m.
134 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
Proof. By Theorem 62, we know that any prime p has a primitive root r which is
also a primitive root modulo p2 , thus
for all integers m ≥ 2. Once we prove the above congruence, we show that r is
also a primitive root modulo pm . Let n = ordpm r. By Theorem 54, we know that
n | φ(pm ). Also, we know that φ(pm ) = pm (p − 1). Hence n | pm (p − 1). On the
other hand, because
pm | (rn − 1),
Since φ(p) = p − 1, we see that by Theorem 54, we have n = l(p − 1). also
n | pm−1 (p − 1), we have that n = ps (p − 1), where 0 ≤ s ≤ m − 1. If
n = ps (p − 1) with s ≤ m − 2, then
m−2 (p−1)
pk | r p − 1,
ordpm r = φ(pm ).
We prove now (8.5) by induction. Assume that our assertion is true for all
m ≥ 2. Then
m−2 (p−1)
pm - (rp − 1).
Because (r, p) = 1, we see that (r, pm−1 ) = 1. We also know from Euler’s
theorem that
m−2 (p−1)
pm−1 | (rp − 1).
6.3. THE EXISTENCE OF PRIMITIVE ROOTS 135
Because p - k, we have
m−1 (p−1)
pm+1 - (rp − 1).
Example 59. Since 3 is a primitive root of 7, then 3 is a primitive root for 7k for
all positive integers k.
m2 = 4n2 + 4n + 1 = 4n(n + 1) + 1.
Thus
k−1
2k+1 | (m2 − 1).
Theorem 69. If m is not pa or 2pa , then m does not have a primitive root.
Proof. Let m = ps11 ps22 ...psi i . If m has a primitive root r then r and m are relatively
prime and ordm r = φ(m). We also have, we have (r, ps ) = 1 where ps is of the
primes in the factorization of m. By Euler’s theorem, we have
s
ps | (rφ(p ) − 1).
Now let
L = [φ(ps11 ), φ(ps22 ), ..., φ(psi i )].
We know that
rL ≡ 1(mod pskk )
m | (rL − 1),
are not relatively prime unless m = ps or m = 2ps where p is an odd prime and t
is any positive integer.
We now show that all integers of the form m = 2ps have primitive roots.
Theorem 70. Consider a prime p 6= 2 and let s is a positive integer, then 2ps has
a primitive root. In fact, if r is an odd primitive root modulo ps , then it is also a
primitive root modulo 2ps but if r is even, r + ps is a primitive root modulo 2ps .
and no positive exponent smaller than φ(ps ) has this property. Note also that
φ(2ps ) = φ(ps ),
so that
s
ps | (rφ(2p ) − 1).
If r is odd, then
s
2 | (rφ(2p ) − 1).
s
2 | ((r + ps )φ(2p ) − 1).
s
ps | ((r + ps )φ(2p ) − 1).
s
As a result, we see that 2ps | ((r + ps )φ(2p ) − 1) and since for no smaller power of
r + ps is congruent to 1 modulo 2ps , we see that r + ps is a primitive root modulo
2ps .
Theorem 71. The positive integer m has a primitive root if and only if n = 2, 4, ps
or 2ps
Exercises
1. Which of the following integers 4, 12, 28, 36, 125 have a primitive root.
4. Show that there are the same number of primitive roots modulo 2ps as there
are modulo ps , where p is an odd prime and s is a positive integer.
6. Show that the integer n has a primitive root if and only if the only solutions
of the congruence x2 ≡ 1(modn) are x ≡ ±1(mod n).
6.4. INTRODUCTION TO QUADRATIC RESIDUES AND NONRESIDUES139
x2 ≡ a(mod p)
Hence
x0 ≡ x00 (mod p) or x0 ≡ −x00 (mod p).
140 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
The following theorem determines the number of integers that are quadratic
residues modulo an odd prime.
Proof. To find all the quadratic residues of p among all the integers 1, 2, ..., p − 1,
we determine the least positive residue modulo p of 12 , 22 , ..., (p − 1)2 . Consider-
ing the p − 1 congruences and because each congruence has either no solution or
two incongruent solutions, there must be exactly (p − 1)/2 quadratic residues of
p among 1, 2, ..., p − 1. Thus the remaining are (p − 1)/2 quadratic nonresidues
of p.
Exercises
4. Show that if p is prime and p ≥ 7, then there are always two consecutive
quadratic residues of p. Hint: Show that at least one of 2, 5 or 10 is a
quadratic residue of p.
5. Show that if p is prime and p ≥ 7, then there are always two quadratic
residues of p that differ by 3.
(
a 1 if a is a quadratic residue of p
=
p −1 if a is a quadratic nonresidue of p.
Example 61. Notice that using the previous example, we see that
1 2 4
= = =1
7 7 7
3 5 6
= = = −1
7 7 7
3
Example 62. Let p = 13 and a = 3. Then 13
= −1 ≡ 36 (mod 13).
Corollary 3. If p 6= 2 is a, then
(
−1 1 if p ≡ 1(mod 4)
=
p −1 if p ≡ −1(mod 4).
Proof. By Euler’s criterion, we know that
a
= (−1)φ(p)/2 (mod p)
p
If 4 | (p − 1), then p = 4m + 1 for some integer m and thus we get
(−1)φ(p)/2 = (−1)2m = 1.
p − 1 ≡ 1(−1)1 (mod p)
2 ≡ 2(−1)2 (mod p)
p − 3 ≡ 3(−1)3 (mod p)
4 ≡ 4(−1)4 (mod p)
.
.
.
p−1
r ≡ (−1)(p−1)/2 (mod p),
2
144 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
p − m1 , p − m2 , ..., p − mk , p − n1 , p − n2 , ..., p − nt
which implies
s p−1
(−1) m1 m2 ...(p − mk )n1 n2 ...nt ≡ !(mod p),
2
Simplifying, we get
Thus we get
a(p−1)/2 ≡ (−1)k (mod p).
5
Example 63. To find 13
using Gauss’s lemma, we calculate
6
X
[5i/13] = [5/13] + [10/13] + [15/13] + [20/13] + [25/13] + [30/13] = 5
i=1
5
Thus we get 13
= (−1)5 = −1.
Exercises
5. Let a and b be integers not divisible by p. Show that either one or all three
of the integers a, b and ab are quadratic residues of p.
7.
Show
2
that if p is an odd prime and a is an integer not divisible by p then
a
p
= 1.
Proof. Consider the least positive residues of the integers a, 2a, ..., ((p − 1)/2)a;
let m1 , m2 , ..., ms be integers of this set such that mi > p/2 for all i and let
n1 , n2 , ..., nt be those integers where ni < p/2. Using the division algorithm, we
see that
ia = p[ia/p] + r
(p−1)/2 (p−1)/2 s t
X X X X
ia = p[ia/p] + mi + ni . (6.3)
i=1 i=1 i=1 i=1
p − m1 , p − m2 , ..., p − ms , p − n1 , p − n2 , ..., p − nt
are precisely the integers 1, 2, ..., (p − 1)/2, in the same order. Now we obtain
(p−1)/2 s t s t
X X X X X
i= (p − mi ) + ni = ps − mi + ni . (6.4)
i=1 i=1 i=1 i=1 i=1
Now since we are taking the following as exponents for −1, it suffice to look at
them modulo 2. Thus
(p−1)/2
X
0≡ [ia/p] − s(mod 2).
i=1
148 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
(p−1)/2
X
[ia/p] ≡ s(mod 2)
i=1
Using Gauss’s lemma, we get
a P(p−1)/2
= (−1)s = (−1) i=1 [ia/p] .
p
Theorem 77. The Law of Quadratic Reciprocity Let p and q be distinct odd
primes. Then
p q p−1 q−1
= (−1) 2 . 2
q p
Proof. We consider now the pairs of integers also known as lattice points (x, y)
with
1 ≤ x ≤ (p − 1)/2and 1 ≤ y ≤ (q − 1)/2.
p−1 q−1
The number of such pairs is 2
. 2 . We divide these pairs into two groups de-
pending on the sizes of qx and py. Note that qx 6= py for all pairs because p and
q are distinct primes.
We now count the pairs of integers (x, y) with
1 ≤ x ≤ (p − 1)/2and 1 ≤ y ≤ qx/p.
For each fixed value of x with 1 ≤ x ≤ (p − 1)/2, there are [qx/p] integers
satisfying 1 ≤ y ≤ qx/p. Consequently, the total number of pairs with are
is
(p−1)/2
X
[qi/p].
i=1
6.6. THE LAW OF QUADRATIC RECIPROCITY 149
Exercises
3
1. Evaluate 53
.
31
2. Evaluate 641
.
3. Using the law of quadratic reciprocity, show that if p is an odd prime, then
(
3 1 if p ≡ ±1(mod 12)
=
p −1 if p ≡ ±5(mod 12).
Example 64. Notice that from the prime factorization of 45, we get that
2 2 2
= = (−1)(−1) = 1
55 5 11
We now prove some properties for Jacobi symbol that are similar to the prop-
erties of Legendre symbol.
Theorem 78. Let n be an odd positive integer and let a and b be integers such
that(a, n) = 1 and (b, n) = 1. Then
1. if n | (a − b), then
a b
= .
n n
2. a b
ab
= .
n n n
As a result, we have
a m ci m ci
Y a Y b
= =
n i=1
pi i=1
pi
ab a b
Proof of 2: Note that by Theorem 71, we have p
= p p
for any prime p
appearing in the prime factorization of n. As a result, we have
m c
ab Y ab i
=
n i=1
pi
c m c
m
a iY b i
Y
=
i=1
pi i=1
pi
a b
= .
n n
−1 2
In the following theorem, we determine n
and n
.
1.
−1
= (−1)(n−1)/2 .
n
2.
2 2
= (−1)(n −1)/8 .
n
Pm
= (−1) i=1 ci (pi −1)/2 .
152 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
As a result, we have
m
X
(n − 1)/2 ≡ ci (pi − 1)/2 (mod 2).
i=1
Hence
2 Pm 2
= (−1) i=1 ci (pi −1)/8 .
n
Because 8 | p2i − 1, we see similarly that
and thus
m
X
2
n ≡1+ ci (p2i − 1)(mod 64),
i=1
We now show that the reciprocity law holds for Jacobi symbol.
6.7. JACOBI SYMBOL 153
Y n Ym cj di
b a pj qi
=
a b i=1 j=1
qi pj
and n
X qi − 1 b−1
di ≡ (mod 2).
i=1
2 2
Thus we conclude that
m n
X pj − 1 X qi − 1 a−1 b−1
cj di ≡ . (mod 2).
j=1
2 i=1
2 2 2
Exercises
258
1. Evaluate 4520
.
1008
2. Evaluate 2307
.
3. For which positive integers n that are relatively prime to 15 does the Jacobi
symbol 15
n
equal 1?
4. Let n be an odd square free positive integer. Show that there is an integer a
such that (a, n) = 1 and na = −1.
154 CHAPTER 6. PRIMITIVE ROOTS AND QUADRATIC RESIDUES
Chapter 7
In this chapter, we introduce continued fractions, prove their basic properties and
apply these properties to solve some problems. Being a very natural object, con-
tinued fractions appear in many areas of Mathematics, sometimes in an unex-
pected way. The Dutch mathematician and astronomer, Christian Huygens (1629-
1695), made the first practical application of the theory of ”anthyphaeiretic ratios”
(the old name of continued fractions) in 1687. He wrote a paper explaining how
to use convergents to find the best rational approximations for gear ratios. These
approximations enabled him to pick the gears with the best numbers of teeth. His
work was motivated by his desire to build a mechanical planetarium. Further
continued fractions attracted attention of most prominent mathematicians. Euler,
Jacobi, Cauchy, Gauss and many others worked with the subject. Continued frac-
tions find their applications in some areas of contemporary Mathematics. There
are mathematicians who continue to develop the theory of continued fractions
nowadays, The Australian mathematician A.J. van der Poorten is, probably, the
most prominent among them.
155
156 CHAPTER 7. INTRODUCTION TO CONTINUED FRACTIONS
Notation 1. We write
1
[a0 ; a1 , a2 , . . . , an ] = a0 +
a1 + a + . 1. .
+ a1n
2
Still, in the case of infinite number of terms a certain amount of work must be
carried out in order to make the above formula meaningful. At the same time, for
the finite number of terms the formula makes sense.
Example 65.
1 1 1 16 26
[−2; 1, 3, 5] = −2 + 1 = −2 + 5 = −2 + 21 = −2 + =− .
1 + 3+ 1 1 + 16 16
21 21
5
7.1. BASIC NOTATIONS 157
we have
Example 66. Consider the continued fraction expansion for 42/31. We obtain
a0 = [42/31] = 1, δ = 42/31 − 1 = 11/31. Now r1 = 1/δ = 31/11 and
a1 = [α1 ] = [31/11] = 2. The new δ = 31/11 − 2 = 9/11. Now r2 = 1/δ = 11/9
and a2 = [α2 ] = [11/9] = 1. It follows that δ = 11/9 − 1 = 2/9. Now
r3 = 1/δ = 9/2 and a3 = [α3 ] = [9/2] = 4. It follows that δ = 9/2 − 4 = 1/2.
Now r4 = 1/δ = 2 and a4 = [α4 ] = [2] = 2. It follows that δ = 2 − 2 = 0 and we
are done.
158 CHAPTER 7. INTRODUCTION TO CONTINUED FRACTIONS
The above example shows that the algorithm stops after finitely many steps.
This is in fact quite a general phenomenon. In order to practice with the introduced
notations let us prove a simple but important proposition.
However, we require that an > 1, where an is the last element of a finite continued
fraction. Then the answer is ”yes”.
Hint. Make use of the formulas (7.5) below.
From now on we assume that an > 1.
Another natural question is about infinite continued fractions and (as one can
easily guess) real numbers. The proof of the corresponding result is slightly more
involved, and we do not give it here. In this brief introduction we just formulate
the result and refer to the literature ([12, Theorem 14]) for a complete proof. We,
however, provide some remarks concerning this result below. In particular, we
will explain at some point, what the convergence means.
Theorem 81. An infinite continued fraction converges and defines a real number.
There is a one-to-one correspondence between
• all (finite and infinite) continued fractions [a0 ; a1 , a2 , . . .] with an integer a0
and positive integers ak for k > 0 (and the last term an > 1 in the case of finite
continued fractions)
and
• real numbers.
Note that the algorithm we developed above can be applied to any real number
and provides the corresponding continued fraction.
Theorem 81 has certain theoretical significance. L.Kronecker (1823-1891)
said, ”God created the integers; the rest is work of man”. Several ways to represent
real numbers out of integers are well-known. Theorem 81 provides yet another
way to fulfill this task. This way is constructive and at the same time is not tied to
any particular base (say to decimal or binary decomposition).
We will discuss some examples later.
Exercises
(a) 2/3
(b) 2/51
(c) 2/101
(d) 3/7
(e) 7/3
2. Find a pattern for the continued fraction expansion of every rational number
of the form 1/a.
3. Suppose gcd(2, a) = 1. Show that 2/a has the continued fraction expansion
[0; b, a] where b = [a/2].
6. Prove that under the assumption an > 1 the continued fraction representa-
tion given in Proposition 1 is unique. In other words, the correspondence
between
• finite continued fractions [a0 ; a1 , a2 , . . . an ] with an integer a0 , positive
integers ak for k > 0 and an > 1
and
• rational numbers
is one-to-one.
pk = ak pk−1 + pk−2
(7.5)
qk = ak qk−1 + qk−2 .
Remark. It does not matter here whether we deal with finite or infinite con-
tinued fractions: the convergents are finite anyway. Proof. We use the induction
argument on k. For k = 2 the statement is true.
Now, assume (7.5) for 2 ≤ k < l. Let
pl
α = [a0 ; a1 , a2 , . . . al ] =
ql
be an arbitrary continued fraction of length l + 1. We denote by pr /qr the r-th
convergent α. Consider also the continued fraction
β = [a1 ; a2 , . . . , al ]
and denote by p0r /qr0 its r-th convergent. We have α = a0 + 1/β which translates
as
pl = a0 p0l−1 + ql−1
0
(7.6)
ql = p0l−1 .
Also, by the induction assumption,
and
ql = al p0l−2 + p0l−3 = al ql−1 + ql−2 ,
pk−1 pk (−1)k
− = (7.9)
qk−1 qk qk qk−1
and
pk−2 pk (−1)k ak
− = .
qk−2 qk qk qk−2
Since all the numbers qk and ak are positive, the above formulas imply the follow-
ing.
making sense out of an infinite continued fraction: this should be common limit
of these two subsequences. It is somehow more technically involved (although
still fairly elementary!) to prove that these two limits coincide.
1 pk 1
≤ α− ≤
qk (qk+1 + qk ) qk qk qk+1
Proof.
Another inequality, which provides the lower bound for the distance between
the number α and k-th convergent is slightly more involved. To prove it we first
consider the following way to add fractions which students sometimes prefer.
Lemma 16. If
a c
≤
b d
then
a a+c c
≤ ≤ .
b b+d d
Consider now the sequence of fractions
pk pk + pk+1 pk + 2pk+1 pk + ak pk+1 pk+2
, , ,..., = , (7.10)
qk qk + qk+1 qk + 2qk+1 qk + ak qk+1 qk+2
where the last equality follows from (7.5).
It follows that the sequence (7.10) is increasing if k is even and is decreasing
if k is odd. Thus, in particular, the fraction
pk + pk+1
(7.11)
qk + qk+1
164 CHAPTER 7. INTRODUCTION TO CONTINUED FRACTIONS
is between the quantities pk /qk and α. Therefore the distance between pk /qk and
the fraction (7.11) is smaller than the distance between pk /qk and α:
pk pk + pk+1 1
α− ≥ = .
qk qk + qk+1 qk (qk + qk+1 )
The second (right) inequality in Theorem 83 is now proved. This finishes the
proof of Theorem 83.
Exercises
Hint. Introduce formally p−1 = 1 and q−1 = 0, check that then formulas
7.5 are true also for k = 1.
4. Prove Proposition 2
pk 1
α− ≤ .
qk qk qk+1
6. Prove Lemma 16
7. Use (7.5) to show that the sign of the difference between two consecutive
fractions in (7.10) depends only on the parity of k.
7.3. VERY GOOD APPROXIMATION 165
Thus we have a/b > p1 /q1 or a/b lies between two consecutive convergents
pk−1 /qk−1 and pk+1 /qk+1 for some k. Assume the latter. Then
a pk−1 1
− ≥
b qk−1 bqk−1
and
a pk−1 pk pk−1 1
− < − = .
b qk−1 qk qk−1 qk qk−1
It follows that
b > qk . (7.12)
Also
a pk+1 a 1
α− ≥ − ≥ ,
b qk+1 b bqk+1
which implies
1
|bα − a| ≥ .
qk+1
At the same time Theorem 83 (it right inequality multiplied by qk ) reads
1
|qk α − pk | ≤ .
qk+1
It follows that
|qk α − pk | ≤ |bα − a| ,
and the latter inequality together with (7.12) show that a/b is not a ”good” ap-
proximation of α in this case.
This finishes the proof of Theorem 84.
Exercises
2. Show that if a/b > p1 /q1 then a/b is not a ”good” approximation to α.
7.4. AN APPLICATION 167
7.4 An Application
Consider the following problem which may be of certain practical interest. As-
sume that we calculate certain quantity using a computer. Also assume that we
know in advance that the quantity in question is a rational number. The com-
puter returns a decimal which has high accuracy and is pretty close to our desired
answer. How to guess the exact answer?
To be more specific consider an example.
with some two hundred digits of accuracy which, of course come short to help in
guessing the period and the exact denominator of 121169.
We are not going to check all convergents, because we notice the irregularity:
one element, 68110 is far more than the others. In order to explain this we use the
168 CHAPTER 7. INTRODUCTION TO CONTINUED FRACTIONS
left inequality from Theorem 83 together with the formula (7.5). Indeed, we have
an approximation of α which is unexpectedly good: |α − pk /qk | is very small (it
is around 10−15 ) and with a modest qk too. We have
and
pk 1
α− ≥ 2 .
qk qk (ak+1 + qk−1 /qk )
It follows that 1/qk2 (ak+1 + qk−1 /qk ) is small (smaller than 10−15 ) and therefore,
ak+1 should be big. This is exactly what we see. Of course, our guess is correct:
123456
= [1, 52, 1, 53, 2, 4, 1, 2, 1].
121169
Theorem 85. For almost every real α the probability for a number k to appear as
an element in the continued fraction expansion of α is
1 1
ck = ln 1 + . (7.13)
ln 2 k(k + 2)
Remarks. 1. The words ”for almost every α” mean that the measure of the set
of exceptions is zero.
2. Even the existence of pk (defined as a limit) is highly non-trivial.
Theorem 85 may (and probably should) be considered as a result from ergodic
theory rather than number theory. This constructs a bridge between these two ar-
eas of Mathematics and explains the recent attention to continued fractions of the
mathematicians who study dynamical systems. In particular, V.I.Arnold formu-
lated the following open problem. Consider the set of pairs of integers (a, b) such
that the corresponding points on the plane are contained in a quarter of a circle of
radii N :
a2 + b 2 ≤ N 2 .
Expand the numbers p/q into continued fractions and compute the frequencies
sk for the appearance of k in these fractions. Do these frequencies have limits
as N → ∞? If so, do these limits have anything to do with the probabilities,
given by (7.13)? These questions demand nothing but experimental computer
investigation, and such an experiment may be undertaken by a student. Of course,
it would be extremely challenging to find a phenomena experimentally in this way
and to prove it after that theoretically.
Of course, one can consider more general kinds of continued fractions. In
particular, one may ease the assumption that the elements are positive integers
and consider, allowing arbitrary reals as the elements (the question of conver-
gence may usually be solved). The following identities were discovered inde-
pendently by three prominent mathematicians. The English mathematician R.J.
Rogers found and proved these identities in 1894, Ramanujan found the iden-
tities (without proof) and formulated them in his letter to Hardy from India in
170 CHAPTER 7. INTRODUCTION TO CONTINUED FRACTIONS
1913. Independently, being separated from England by the war, I. J. Schur found
the identities and published two different proofs in 1917. We refer an interested
reader to [2, 1] for a detailed discussion and just state the amazing identities here.
√ √
s
5+ 5 5 + 1 2π/5
[0; e−2π , e−4π , e−6π , e−8π , . . .] = − e
2 2
√ √
s
5− 5 5 − 1 π/5
[1; e−π , e−2π , e−3π , e−4π , . . .] = − e
2 2
Exercises
The distribution of prime numbers has been the object of intense study by many
modern mathematicians. Gauss and Legendre conjectured the prime number the-
orem which states that the number of primes less than a positive number x is
asymptotic to x/logx as x approaches infinity. This conjecture was later proved
by Hadamard and Poisson. Their proof and many other proofs lead to the what is
known as Analytic Number theory.
In this chapter we demonstrate elementary theorems on primes and prove el-
ementary properties and results that will lead to the proof of the prime number
theorem.
8.1 Introduction
P∞ 1
It is well known that the harmonic series diverges. We therefore deter-
n=1 n
mine some asymptotic formulas that determines the growth of the n≤x n1 . We
P
start by introducing Euler’s summation formula that will help us determine the
asymptotic formula.
171
172 CHAPTER 8. INTRODUCTION TO ANALYTIC NUMBER THEORY
We might ask the following question. What if the sum is taken over all the
primes. In this section, we show that the sum over the primes diverges as well.
We also show that an interesting product will also diverge. From the following
theorem, we can actually deduce that there are infinitely many primes.
For the proof of Euler’s summation formula see [3, Chapter 3].
Proof. We use Euler’s summation formula by taking f (t) = 1/t. We then get
Z x Z x
X1 1 {t} 1
= dt − 2
dt + 1 + O
n≤x
n 1 t 1 t x
Z ∞ Z ∞
{t} {t} 1
= log x + 1 − 2
dt + 2
dt + O
1 t x t x
Notice now that {t} ≤ t and hence the two improper integrals exist since they are
dominated by integrals that converge. We therefore have
Z ∞
{t} 1
0≤ 2
dt ≤ ,
x t x
we also let ∞
{t}
Z
γ =1− dt
1 t2
8.1. INTRODUCTION 173
and we get the asymptotic formula. Notice that γ is called Euler’s constant. Notice
also that similar steps can be followed to find an asymptotic formulas for other
sums involving powers of n.
We now proceed to show that if we sum over the primes instead, we still get a
divergent series.
1
− p1 ) diverge.
P Q
Theorem 86. Both p p and p (1
Thus we have
∞
1X 1 1X 1
log P (x) − S(x) < <
2 p≤x p(p − 1) 2 n=1 n(n − 1)
where A(x) = 0 for x < 1. Assume also that g has a continuous derivative on the
interval [y, x], where 0 < y < x. Then we have
X Z x
f (n)g(n) = A(x)g(x) − A(y)g(y) − A(t)g 0 (t)dt.
y<n≤x y
8.2. CHEBYSHEV’S FUNCTIONS 175
1. Show that one gets every n1 , n ∈ Z+ where each prime factor of n is less
than or equal to x in the proof of Theorem 1.
We define also the following functions, the last two functions are called Cheby-
shev’s functions.
P
1. π(x) = p≤x 1.
P
2. θ(x) = p≤x logp
P
3. ψ(x) = n≤x Ω(n)
Notice that
X ∞
X X ∞
X X
m
ψ(x) = Ω(n) = Ω(p ) = logp.
n≤x m=1, pm ≤x p m=1 p≤x1/m
Notice that the above sum will be a finite sum since for some m, we have that
x1/m < 2 and thus θ(x1/m ) = 0.
We use Abel’s summation formula now to express the two functions π(x) and
θ(x) in terms of integrals.
Now let g(x) = log x in Theorem 84 with y = 1 and we get the desired result for
the integral representation of θ(x). Similarly we let g(x) = 1/ log x with y = 3/2
and we obtain the desired result for π(x) since θ(t) = 0 for t < 2.
We now prove a theorem that relates the two Chebyshev’s functions θ(x) and
ψ(x). The following theorem states that if the limit of one of the two functions
θ(x)/x or ψ(x)/x exists then the limit of the other exists as well and the two limits
are equal.
8.3. GETTING CLOSER TO THE PROOF OF THE PRIME NUMBER THEOREM177
where m ≤ log2 x. Moreover, we have that θ(x) ≤ x log x. The result will follow
after proving the inequality in Exercise 2.
Exercises
1. Show that
where m ≤ log2 x.
√ √
2. Show that 0 ≤ ψ(x) − θ(x) ≤ (log2 (x)) x log x and thus the result of
Theorem 86 follows.
ψ(x)
lim = 1. (8.3)
x→∞ x
Proof. We have proved in Theorem 86 that (8.2) and (8.3) are equivalent, so if we
show that (8.1) and (8.2) are equivalent, the proof will follow. Notice that using
the integral representations of the functions in Theorem 85, we obtain
π(x) log x 1 x π(t)
Z
θ(x)
= − dt
x x x 2 t
and Z x
π(x) log x θ(x) log x θ(t)
= + dt.
x x x 2 t log2 t
Now to prove that (8.1) implies (8.2), we need to prove that
1 x π(t)
Z
lim dt = 0.
x→∞ x 2 t
Notice also that (8.1) implies that π(t)
t
= O 1
log t
for t ≥ 2 and thus we have
Z x Z x
1 π(t) 1 dt
dt = O
x 2 t x 2 log t
Now once you show that (Exercise 1)
Z x √ √
dt x x− x
≤ + √ ,
2 log t log 2 log x
then (8.1) implies (8.2) will follow. We still need to show that (8.2) implies (8.1)
and thus we have to show that
Z x
log x θ(t)dt
lim = 0.
x→∞ x 2 t log2 t
8.3. GETTING CLOSER TO THE PROOF OF THE PRIME NUMBER THEOREM179
π(x) π(x)
l1 = lim inf , L1 = lim sup ,
x→∞ x/logx x→∞ x/logx
θ(x) θ(x)
l2 = lim inf , L2 = lim sup ,
x→∞ x x→∞ x
and
ψ(x) ψ(x)
l3 = lim inf , L3 = lim sup ,
x→∞ x x→∞ x
then l1 = l2 = l3 and L1 = L2 = L3 .
where m ≤ log2 x
Also,
X log x X log x
ψ(x) = log p ≤ log p = log xπ(x).
p≤x
log p p≤x
log p
Thus we have
θ(x) ≤ ψ(x) ≤ π(x) log x
180 CHAPTER 8. INTRODUCTION TO ANALYTIC NUMBER THEORY
As a result, we have
θ(x) ψ(x) π(x)
≤ ≤
x x x/ log x
and we get that L2 ≤ L3 ≤ L1 . We still need to prove that L1 ≤ L2 .
Let α be a real number where 0 < α < 1, we have
X X
θ(x) = log p ≥ log p
p≤x xα ≤p≤x
X
> α log x (log p > α log x)
xα ≤p≤x
= αlogx{π(x) − π(xα )}
As a result,
θ(x) απ(x)
> − αxα−1 log x
x x/ log x
Since limx→∞ α log x/x1−α = 0, then
π(x)
L2 ≥ α lim sup
x→∞ x/ log x
As a result, we get that
L2 ≥ αL1
As α → 1, we get L2 ≥ L1 .
Proving that l1 = l2 = l3 is left as an exercise.
We now present an inequality due to Chebyshev about π(x).
Proof. Put
π(x) π(x)
l = lim inf , L = lim sup ,
x→∞ x/ log x x→∞ x/ log x
It will be sufficient to prove that L ≤ 4 log 2 and l ≥ log 2. Thus by Theorem 2,
we have to prove that
θ(x)
lim sup ≤ 4 log 2 (8.4)
x→∞ x
and
ψ(x)
lim inf ≥ log 2 (8.5)
x→∞ x
To prove (8.4), notice that
(n + 1)(n + 2)...(n + n)
N = C(2n, n) = < 22n < (2n + 1)N
n!
Suppose now that p is a prime such that n < p < 2n and hence p | N . As a result,
Q
we have N ≥ n<p<2n p. We get
N ≥ θ(2n) − θ(n).
Since N < 22n , we get that θ(2n) − θ(n) < 2n log 2. Put n = 1, 2, 22 , ..., 2m−1
where m is a positive integer. We get that
µp
X 2n n
sp = −2 i .
i=11
pi p
182 CHAPTER 8. INTRODUCTION TO ANALYTIC NUMBER THEORY
h i
log 2n
. Thus we have N = p≤2n psp . If x is a positive integer then
Q
where µp = log p
Hence we get
log N ≤ ψ(2n).
Using the fact that 22n < (2n + 1)N , we can see that
As a result, we get
ψ(x)
lim inf ≥ log 2.
x→∞ x
Exercises
2. Show that x √ √
x− x
Z
dt x
≤ + √ ,
2 log t log 2 log x
8.3. GETTING CLOSER TO THE PROOF OF THE PRIME NUMBER THEOREM183
3. Show that x √ √
x− x
Z
dt x
≤ + √
2 log2 t log2 2 log2 x
4. Show that
(n + 1)(n + 2)...(n + n)
N = C(2n, n) = < 22n < (2n + 1)N
n!
2n
2√ 22n
5. Show that 2 n
< N = C(2n, n) < √
2n
.
Hint: For one side of the inequality, write
This chapter discusses various topics that are of profound interest in number the-
ory. Section 1 on cryptography is on an application of number theory in the field
of message decoding, while the other sections on elliptic curves and the Riemann
zeta function are deeply connected with number theory. The section on Fermat’s
last theorem is related, through Wile’s proof of Fermat’s conjecture on the non-
existence of integer solutions to xn + y n = z n for n > 2, to the field of elliptic
curves (and thus to section 2).
9.1 Cryptography
In this section we discuss some elementary aspects of cryptography, which con-
cerns the coding and decoding of messages.
185
186 CHAPTER 9. OTHER TOPICS IN NUMBER THEORY
The security of this approach is based on the fact that even though f , g, e, and
m are publicly known, and the method of computing m from e and d is well-
established, it is practically impossible to reverse-engineer d from all this infor-
mation. This gives a level of security so high that not even the sender can decrypt
the message: only the recipient!
Let’s look at an example that relates to Number Theory.
Lemma 17. Let a and m be two integers, with m positive and (a, m) = 1. If k
and k̄ are positive integers with k k̄ ≡ 1(mod φ(m)), then akk̄ ≡ a(mod m).
and hence that akk̄ ≡ a(mod m), and the result follows.
Before giving the proof, one has to note that the above lemma is in fact an
if-and-only-if statement, i.e. (k, φ(m)) = 1 if and only if r1k , r2k , · · · , rnk forms a
reduced residue system modulo m. However we only need the if part, as in the
lemma.
Proof. Assume first that (k, φ(m)) = 1. We show that r1k , r2k , · · · , rnk is a reduced
residue system modulo m. Assume otherwise, i.e. assume that ∃i, j such that
rik = rjk (mod m), in which case rik and rjk would belong to the same class and thus
r1k , r2k , · · · , rnk would not form a reduced residue system. Then, since (k, φ(m)) =
1, ∃k̄ with k k̄ = 1(mod φ(m)), and so
by the previous lemma. But if rik = rjk (mod m) then (rik )k̄ = (rjk )k̄ (mod m), and
since rikk̄ = ri (mod m) and rjkk̄ = rj (mod m), then ri = rj (mod m) giving that ri
and rj belong to the same class modulo m, contradicting that r1 , r2 , · · · , rn form a
reduced residue system. Thus ri 6= rj implies that rik 6= rjk if (k, φ(m)) = 1.
1. Choose a couple p1 and p2 of very large prime numbers, each (for exam-
ple) of the order of a hundred digit integer, and these should be strictly kept
known only to the recipient. Then form the product m = p1 p2 , which is
itself a very large number to the point that the chances of an eavesdropper’s
discovering the prime number factorization p1 p2 of m is incredibly small,
even if they know this integer m. Now one has, by standard results concern-
ing the φ-function, that φ(p1 ) = p1 − 1 and φ(p2 ) = p2 − 1, and that, since
p1 and p2 are relatively prime, φ(m) = φ(p1 )φ(p2 ) = (p1 − 1)(p2 − 1).
Thus φ(m) is a very large number, of the order of m itself, and hence m
9.1. CRYPTOGRAPHY 189
has a reduced residue system that contains a very large number of integers
of the order of m itself. Hence almost every integer smaller than m, with a
probability of the order 1 − 1/10100 (almost 1), is in a reduced residue system
r1 , r2 , · · · , rφ(m) of m. Thus almost every positive integer smaller than m is
relatively prime to m, with probability of the order 1 − 1/10100 .
3. The sender transforms S into a (large) integer a by replacing each letter and
each space between words by a certain representative integer (e.g. three
or four digit integers for each letter). a is formed by concatenating the
representative integers that are produced.
4. Now given that almost every positive integer smaller than m is relatively
prime with m, the integer a itself is almost certainly relatively prime with
m, and hence is in a reduced residue system for m. Hence, by Lemma 17
above, if k is a (large) integer such that (k, φ(m)) = 1, then ak belongs to
a reduced residue system for m, and there exists a unique positive b smaller
than m with b = ak (mod m).
5. The sender sends b to the original broadcaster, where the original prime
numbers, and hence φ(m), are known. With this information, which was
never broadcast, the destination can determine a k̄ such that k k̄ = 1(mod φ(m)),
and then finds the unique c such that c = bk̄ (mod m). Now since, almost
certainly, (a, m) = 1, then almost certainly c = a since c = bk̄ (mod m) =
(ak )k̄ (mod m) = akk̄ (mod m) ≡ a(mod m) by Lemma 17. Now the desti-
nation translates a back to letters and spaces to reveal the sentence S.
6. Note that if any third party intercepts b, they almost certainly cannot reveal
the integer a since the chance of them knowing φ(m) = p1 p2 is almost zero,
even if they know m and k. In this case they practically won’t be able to
determine a k̄ with k k̄ = 1(mod φ(m)), to retrieve a and transform it to S.
190 CHAPTER 9. OTHER TOPICS IN NUMBER THEORY
Exercises
1. You will want the assistance of a computer for this. Pick two large primes,
and compute the modulus, an encryption key, and a decryption key. Use
these to confirm your understanding of the RSA algorithm: practice by in-
venting a message, encrypting it, then trying to decrypt it again. Be sure you
use a computer that knows how to take large exponents modulo a number
quickly, or you could be waiting a long time. . .
You might wonder if RSA is really safe. After all, the method is well-known, and
is based on the fact that k̄ is the multiplicative inverse of k modulo φ(m). But
this is easy if you know φ(m); the fact that gcd(φ(m), k) = 1 means you can just
apply the Euclidean algorithm to find two integers k̄, ` such that
k k̄ + `φ(m) = 1.
.Even worse, m is the product of two primes p and q, so φ(m) = (p − 1)(q − 1).
Thus, breaking RSA is as simple as factoring m into primes — and you already
know that there are only two primes!
Example 69. To drive home how simple this can be, consider that 6 = 2 × 3,
14 = 2 × 7, and so forth are fairly easy to factor. Then again, they’re pretty small
numbers. . .
22601385262034057849416540486101975135080389157197
76718321197768109445641817966676608593121306582577
25063156288667697044807000181114971186300211248792
81994874820660701310665866460833279828035603792053
91980139946496955261.
Although they no longer offer the prize, you might want to give it a go if you have
a lot of free time coming up.
However, one can’t just pick any two large primes. We can illustrate this with
10403: an obvious approach to factor it (and usually a very bad one) is to start
√ √
at the floor of 10403 and work one’s way down; since 10403 ≈ 102, the first
number to try is 101. Oops!
There is a large body of scientific work dedicated to finding good primes for
the RSA algorithm, which is a good thing, because commerce on the internet
(such as that One-click purchase at Amazon!) is based on its security. Recently,
scientists working in the strange world of quantum computing have developed al-
gorithms that factor primes very, very quickly — but quantum computers work
only with very, very small numbers. It is not yet clear whether quantum com-
putation will advance to the point where this will become practical for cracking
formerly secure communications.
Exercises
1. Ask a mathematically literate friend to choose two “large” primes, but not
to tell you what they are. Instead, your friend should tell you what their
product is. See if you can determine the two prime numbers. (Here, “large”
means two to three digits long — not RSA grade!)
192 CHAPTER 9. OTHER TOPICS IN NUMBER THEORY
Of course one can also similarly define the algebraic curve Cf (Q) over a field Q,
where Q is either a subfield of the field F where the coefficients of f exist, or is
an extension field of F. Thus if f ∈ F[x, y], and if Q is either an extension or a
subfield of F, then one can define Cf (Q) = {(x, y) ∈ Q × Q : f (x, y) = 0}. Our
main interest in this section will be in third order polynomials (cubic curves)
with coefficients in R, with the associated curves Cf (Q) over the field of rational
numbers Q ⊂ R. Thus, basically, we will be interested in points (x, y) ∈ R2
that have rational coordinates x and y, and called rational points, that satisfy
f (x, y) = 0. Of course one can first imagine the curve f (x, y) = 0 in R2 , i.e.
the curve Cf (R) over R, and then choosing the points on this curve that have ra-
tional coordinates. This can simply be expressed by writing that Cf (Q) ⊂ Cf (R).
It has to be mentioned that ”rational curves” Cf (Q) are related to diophantine
equations. This is in the sense that rational solutions to equations f (x, y) = 0
produce integer solutions to equations f 0 (x, y) = 0, where the polynomial f 0 is
very closely related to the polynomial f , if not the same one in many cases. For
example every point in Cf (Q), where f (x, y) = xn + y n , i.e. every rational solu-
9.2. ELLIPTIC CURVES 193
where f is a polynomial of degree three, we shall require that this curve be such
that any straight line that passes through two points (x1 , y1 ), (x2 , y2 ) ∈ Cf (R),
where the two points could be the same point if the curve at one of them is differ-
entiable with the tangent at that point to the curve having same slope as that of the
line, will also pass through a unique third point (x3 , y3 ). By the above theorem,
if a line intersects the curve Cf (R) associated with the third order polynomial f
in more than three points, then the line itself is a subset of Cf (R). This will be
excluded for the kind of third degree polynomials f whose associated algebraic
curves shall be called elliptic curves.
One other thing to be excluded, to have third order curves characterized as
elliptic curves, is the existence of singular points on the curve, where a singular
point is one where the curve does not admit a unique tangent.
It has to be mentioned that in the previous discussion, the points on the curve
Cf (R) may lie at infinity. To deal with this situation we assume that the curve is
in fact a curve in the real projective plane P2 (R). We now can define an elliptic
curve Cf (R) as being such that f (x, y) is an irreducible third order polynomial
with Cf (R) having no singular points in P2 (R).
The main idea behind the above definition for elliptic curves is to have a curve
whereby any two points A and B on the curve can determine a unique third point,
to be denoted by AB, using a straight line joining A and B. The possibilities
are as follows: If the line joining A and B is not tangent to the curve Cf (R) at
any point, then the line intersects the curve in exactly three different points two of
which are A and B while the third is AB. If the line joining A and B is tangent
to the curve at some point p then either this line intersects Cf (R) in exactly two
points, p and some other point p0 , or intersects the curve in only one point p. If the
line intersects Cf (R) in two points p and p0 , then either p = A = B in which case
AB = p0 , or A 6= B in which case (irrespective of whether p = A and p0 = B or
vice-versa) one would have p = AB. While if the line intersects Cf (R) in only
one point p then p = A = B = AB.
9.2. ELLIPTIC CURVES 195
The above discussion establishes a binary operation on elliptic curves that pro-
duces, for any two points A and B a uniquely defined third point AB. This binary
operation in turn produces, as will be described next, another binary operation,
denoted by +, that defines a group structure on Cf (R) that is associated with the
straight-line construction discussed so far.
A group structure on an elliptic curve Cf (R) is defined as follows: Consider
an arbitrary point, denoted by 0, on Cf (R). We define, for any two points A and
B on Cf (R), the point A + B by
A + B = 0(AB), (9.5)
meaning that we first determine the point AB as above, then we determine the
point 0(AB) corresponding to 0 and AB. Irrespective of the choice of the point 0,
one has the following theorem on a group structure determined by + on Cf (R).
Theorem 94. Let Cf (R) be an elliptic curve, and let 0 be any point on Cf (R).
Then the above binary operation + defines an Abelian group structure on Cf (R),
with 0 being the identity element and −A = A(00) for every point A.
The proof is very lengthy and can be found in [18]. We first note that if 0 and
00 are two different points on an elliptic curve with associated binary operations
+ and +0 , then one can easily show that for any two points A and B
A +0 B = A + B − 00 . (9.6)
This shows that the various group structures that can be defined on an elliptic curve
by considering all possible points 0 and associated operations +, are essentially
the same, up to a ”translation”.
Lemma 19. Consider the group structure on an elliptic curve Cf (R), correspond-
ing to an operation + with identity element 0. If the cubic polynomial f has
rational coefficients, then the subset Cf (Q) ⊂ Cf (R) of rational solutions to
f (x, y) = 0 forms a subgroup of Cf (R) if and only if 0 is itself a rational point
(i.e. a rational solution).
196 CHAPTER 9. OTHER TOPICS IN NUMBER THEORY
Proof. If Cf (Q) is a subgroup of Cf (R), then it must contain the identity 0, and
thus 0 would be a rational point. Conversely, assume that 0 is a rational point.
First, since f has rational coefficients, then for any two rational points A and B
in Cf (Q) one must have that AB is also rational, and thus (since 0 is assumed
rational) that 0(AB) is rational, making A + B = 0(AB) rational. Thus Cf (Q)
would be closed under +. Moreover, since for every A ∈ Cf (Q) one has that
−A = A(00), then −A is also rational, which makes Cf (Q) closed under inver-
sion. Hence Cf (Q) is a subgroup.
Thus by lemma 18, the set of all rational points on an elliptic curve form
a subgroup of the group determined by the curve and a point 0, if and only if
the identity element 0 is itself a rational point. In other words, one finds that if
the elliptic curve Cf (R) contains one rational point p, then there exists a group
structure on Cf (R), with 0 = p and the corresponding binary operation +, such
that the set Cf (Q) of all rational points on Cf (R) is a group.
One thing to note about rational solutions to general polynomial functions
f (x, y), is that they correspond to integer solution to a corresponding homoge-
neous polynomial h(X, Y, Z) in three variables, and vice-verse, where homoge-
neous practically means that this function is a linear sum of terms each of which
has the same power when adding the powers of the variables involved in this term.
For example XY 2 − 2X 3 + XY Z + Z 3 is homogeneous.
In fact a rational solution x = a/b and y = c/d for f (x, y) = 0, where
a, b, c, d are integers, can first be written as x = ad/bd and y = cb/bd, and thus
one can always have this solution in the form x = X/Z and y = Y /Z, where
X = ad, Y = cb and Z = bd. If x = X/Z and y = Y /Z are replaced in
f (x, y) = 0, one obtains a new version h(X, Y, Z) = 0 of this equation written
in terms of the new variables X, Y, Z. One can immediately see that this new
polynomial function h(X, Y, Z) is homogeneous in X, Y, Z. The homogeneous
function h(X, Y, Z) in X, Y, Z is the form that f (x, y) takes in projective space,
where in this case the transformations x = X/Z and y = Y /Z define the projec-
9.2. ELLIPTIC CURVES 197
h(X, Y, Z) = aX 3 + bX 2 Y + cXY 2 + dY 3 + eX 2 Z
+ f XY Z + gY 2 Z + hXZ 2 + iY Z 2 + jZ 3 , (9.7)
Which, by using the projective transformation again, and using new coefficients,
gives that points on the curve Cf (R) are precisely those on the curve Ch (R), where
i.e. that Cf (R) = Cg (R). The equation g(x, y) = 0, where g is given in (8.10),
is said to be the Weierstrass normal form of the equation f (x, y) = 0. Thus, in
particular, any elliptic curve defined by a cubic f , is birationally equivalent to an
elliptic curve defined by a polynomial g(x, y) as above. Birational equivalence
between curves is defined here as being a rational transformation, together with
its inverse transformation, that takes the points on one curve to another, and vice-
versa.
198 CHAPTER 9. OTHER TOPICS IN NUMBER THEORY
The Riemann zeta function ζ(z) is an analytic function that is a very important
function in analytic number theory. It is (initially) defined in some domain in the
complex plane by the special type of Dirichlet series given by
∞
X 1
ζ(z) = , (9.11)
n=1
nz
where Re(z) > 1. It can be readily verified that the given series converges locally
uniformly, and thus that ζ(z) is indeed analytic in the domain in the complex
plane C defined by Re(z) > 1, and that this function does not have a zero in this
domain.
We first prove the following result which is called the Euler Product Formula.
Theorem 95. ζ(z), as defined by the series above, can be written in the form
∞
Y 1
ζ(z) = , (9.12)
1
n=1 1− pzn
∞
1 X
= xk , (9.13)
1 − x k=0
1
one finds that each term 1− p1z
in ζ(z) is given by
n
∞
1 X 1
= , (9.14)
1 − p1z pkz
k=0 n
n
9.3. THE RIEMANN ZETA FUNCTION 199
since every |1/pzn | < 1 if Re(z) > 1. This gives that for any integer N
N N
Y 1 Y 1 1
= 1 + z + 2z + · · ·
n=1 1− 1
n=1
pn pn
pzn
X 1
= k z
(9.15)
pkn11z · · · pnji
X 1
=
nz
where i ranges over 1, · · · , N , and j ranges from 0 to ∞, and thus the integers n
in the third line above range over all integers whose prime number factorization
consist of a product of powers of the primes p1 = 2, · · · , pN . Also note that each
such integer n appears only once in the sum above.
Now since the series in the definition of ζ(z) converges absolutely and the
order of the terms in the sum does not matter for the limit, and since, eventu-
ally, every integer n appears on the right hand side of 8.15 as N −→ ∞, then
= ζ(z). Moreover, limN →∞ N
P 1
1 exists, and the re-
Q
limN →∞ nz N n=1 1
1− pz
n
sult follows.
The Riemann zeta function ζ(z) as defined through the special Dirichlet series
above, can be continued analytically to an analytic function through out the com-
plex plane C except to the point z = 1, where the continued function has a pole
of order 1. Thus the continuation of ζ(z) produces a meromorphic function in C
with a simple pole at 1. The following theorem gives this result.
Given this continuation of ζ(z), and also given the functional equation that is
satisfied by this continued function, and which is
πz
ζ(z) = 2z π z−1 sin Γ(1 − z)ζ(1 − z), (9.16)
2
200 CHAPTER 9. OTHER TOPICS IN NUMBER THEORY
(see a proof in [3]), where Γ is the complex gamma function, one can deduce that
the continued ζ(z) has zeros at the points z = −2, −4, −6, · · · on the negative
real axis. This follows as such: The complex gamma function Γ(z) has poles at
the points z = −1, −2, −3, · · · on the negative real line, and thus Γ(1 − z) must
have poles at z = 2, 3, · · · on the positive real axis. And since ζ(z) is analytic at
these points, then it must be that either sin πz
2
or ζ(1 − z) must have zeros at
the points z = 2, 3, · · · to cancel out the poles of Γ(1 − z), and thus make ζ(z)
analytic at these points. And since sin πz
2
has zeros at z = 2, 4, · · · , but not at
z = 3, 5, · · · , then it must be that ζ(1 − z) has zeros at z = 3, 5, · · · . This gives
that ζ(z) has zeros at z = −2, −4, −6 · · · .
It also follows from the above functional equation, and from the above men-
tioned fact that ζ(z) has no zeros in the domain where Re(z) > 1, that these zeros
at z = −2, −4, −6 · · · of ζ(z) are the only zeros that have real parts either less
that 0, or greater than 1. It was conjectured by Riemann, The Riemann Hypothe-
sis, that every other zero of ζ(z) in the remaining strip 0 ≤ Re(z) ≤ 1, all exist on
the vertical line Re(z) = 1/2. This hypothesis was checked for zeros in this strip
with very large modulus, but remains without a general proof. It is thought that
the consequence of the Riemann hypothesis on number theory, provided it turns
out to be true, is immense.
Bibliography
[2] George E. Andrews, The Theory of Partitions. Reprint of the 1976 original.,
Cambridge Mathematical Library. Cambridge University Press, Cambridge,
1998
201
202 BIBLIOGRAPHY
[8] H.M. Edwards, Riemann’s Zeta Function, Dover, New York, 2001.
[9] E. Grosswald, Topics from the Theory of Numbers. New York: The Macmil-
lan Co. (1966).
[10] G.H. Hardy and E.M. Wright, An Introduction to the Theory of Numbers,
5th ed. Oxford University Press, Oxford, 1979.
[15] W.J. Leveque, Elementary Theory of Numbers, Dover, New York, 1990.
BIBLIOGRAPHY 203
[16] W.J. Leveque, Fundamentals of Number Theory, Dover, New York, 1996.
[21] Kenneth H. Rosen, Elementary Number Theory and its Applications. Fifth
Edition. Pearson, Addison Wesley, USA, 2005.
204 BIBLIOGRAPHY