
Cryptography

Markus Kuhn

Computer Laboratory, University of Cambridge

https://www.cl.cam.ac.uk/teaching/1920/Crypto/

Lent 2020 – CST Part II

crypto-slides.pdf 2020-02-20 22:09 60c0312 1 / 230


What is this course about?
Aims
This course provides an overview of basic modern cryptographic
techniques and covers essential concepts that users of cryptographic
standards need to understand to achieve their intended security goals.

Objectives
By the end of the course you should
I be familiar with commonly used standardized cryptographic building
blocks;
I be able to match application requirements with concrete security
definitions and identify their absence in naive schemes;
I understand various adversarial capabilities and basic attack
algorithms and how they affect key sizes;
I understand and compare the finite groups most commonly used with
discrete-logarithm schemes;
I understand the basic number theory underlying the most common
public-key schemes, and some efficient implementation techniques.
2 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

3 / 230
Related textbooks
Main reference:
I Jonathan Katz, Yehuda Lindell:
Introduction to Modern Cryptography
2nd ed., Chapman & Hall/CRC, 2014
Further reading:
I Christof Paar, Jan Pelzl:
Understanding Cryptography
Springer, 2010
http://www.springerlink.com/content/978-3-642-04100-6/
http://www.crypto-textbook.com/
I Douglas Stinson:
Cryptography – Theory and Practice
3rd ed., CRC Press, 2005
I Menezes, van Oorschot, Vanstone:
Handbook of Applied Cryptography
CRC Press, 1996
http://www.cacr.math.uwaterloo.ca/hac/

The course notes and some of the exercises also contain URLs with more
detailed information.
4 / 230
Common information security targets
Most information-security concerns fall into three broad categories:

Confidentiality: ensuring that information is accessible only to those
authorised to have access
Integrity: safeguarding the accuracy and completeness of
information and processing methods
Availability: ensuring that authorised users have access to
information and associated assets when required

Basic threat scenarios:

Eavesdropper (passive): Alice ↔ Bob, with Eve listening in on the channel
Middle-person attack (active): Alice ↔ Mallory ↔ Bob, with Eve also listening
Storage security: Alice ↔ disk, with Mallory able to access the disk
5 / 230
Encryption schemes
Encryption schemes are algorithm triples (Gen, Enc, Dec) aimed at
facilitating message confidentiality:

Private-key (symmetric) encryption scheme


I K ← Gen private-key generation
I C ← EncK (M ) encryption of plain-text message M
I DecK (C) = M decryption of cipher-text message C

Public-key (asymmetric) encryption scheme


I (PK , SK ) ← Gen public/secret key-pair generation
I C ← EncPK (M ) encryption using public key
I DecSK (C) = M decryption using secret key

Probabilistic algorithms: Gen and (often also) Enc access a random-bit


generator that can toss coins (uniformly distributed, independent).
Notation: ← assigns the output of a probabilistic algorithm, := that of a deterministic algorithm.
6 / 230
Message integrity schemes
Other cryptographic algorithm triples instead aim at authenticating the
integrity and origin of a message:

Message authentication code (MAC)


I K ← Gen                          private-key generation
I T := MacK (M )                   message tag generation
I M ′ ≠ M ⇒ MacK (M ′) ≠ T         MAC verification: recalculate and compare tag

Digital signature
I PK , SK ← Gen                    public/secret key-pair generation
I S ← SignSK (M )                  signature generation using secret key
I VrfyPK (M, S) = 1,               signature verification using public key
  M ′ ≠ M ⇒ VrfyPK (M ′, S) = 0

7 / 230
Key exchange
Key-agreement protocol
I (PK A , SK A ) ← Gen public/secret key-pair generation by Alice
I (PK B , SK B ) ← Gen public/secret key-pair generation by Bob
I K := DH(SK A , PK B ) key derivation from exchanged public keys
= DH(PK A , SK B )

Diffie–Hellman protocol:
Alice and Bob standardize suitably chosen very large public numbers g, p and q.
Alice picks a random number 0 < x < q and Bob a secret number 0 < y < q as
their respective secret keys. They then exchange the corresponding public keys:
A→B: PK A = g^x mod p
B→A: PK B = g^y mod p
Alice and Bob each now can calculate
    K = (g^y mod p)^x mod p = (g^x mod p)^y mod p
and use that as a shared private key. With suitably chosen parameters, outside
observers will not be able to infer x, y, or K.
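
A minimal Python sketch of the exchange above, with toy parameters chosen only for
illustration (real deployments use standardized groups, e.g. 2048-bit MODP groups or
elliptic curves):

import secrets

p = 2**127 - 1        # toy prime modulus, far too small for real use (assumption)
g = 3                 # illustrative generator of a subgroup mod p
q = p - 1             # upper bound for the secret exponents

x = secrets.randbelow(q - 1) + 1      # Alice's secret key, 0 < x < q
y = secrets.randbelow(q - 1) + 1      # Bob's secret key,   0 < y < q

PK_A = pow(g, x, p)                   # Alice -> Bob:  g^x mod p
PK_B = pow(g, y, p)                   # Bob -> Alice:  g^y mod p

K_A = pow(PK_B, x, p)                 # Alice: (g^y)^x mod p
K_B = pow(PK_A, y, p)                 # Bob:   (g^x)^y mod p
assert K_A == K_B                     # both now hold the same shared key K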
Why might one also want to sign or otherwise authenticate PK A and/or PK B ?
8 / 230
Key types
I Private keys = symmetric keys
I Public/secret key pairs = asymmetric keys
Warning: this “private” vs “secret” key terminology is not universal in the literature

I Ephemeral keys / session keys are only used briefly and often
generated fresh for each communication session.
They can be used to gain privacy (observers cannot identify users from public keys
exchanged in clear) and forward secrecy (if a communication system gets compromised in
future, this will not compromise past communication).

I Static keys remain unchanged over a longer period of time (typically


months or years) and are usually intended to identify users.
Static public keys are usually sent as part of a signed “certificate” SignSK_C (A, PK A ),
where a “trusted third party” or “certification authority” C certifies that PK A is the public
key associated with user A.

I Master keys are used to generate other derived keys.


I By purpose: encryption, message-integrity, authentication, signing,
key-exchange, certification, revocation, attestation, etc. keys

9 / 230
When is a cryptographic scheme “secure”?
For an encryption scheme, if no adversary can . . .
I . . . find out the secret/private key?
I . . . find the plaintext message M ?
I . . . determine any character/bit of M ?
I . . . determine any information about M from C?
I . . . compute any function of the plaintext M from ciphertext C?
⇒ “semantic security”
For an integrity scheme, should we demand that no adversary can . . .
I . . . find out the secret/private key?
I . . . create a new message M 0 and matching tag/signature?
I . . . create a new M 0 that verifies with a given tag/signature?
I . . . modify or recombine a message+tag so they still verify?
I . . . create two messages with the same signature?
10 / 230
What capabilities may the adversary have?
I access to some ciphertext C
I access to some plaintext/ciphertext pairs (M, C) with
C ← EncK (M )?
I ability to trick the user of EncK into encrypting some plaintext of
the adversary’s choice and return the result?
(“oracle access” to Enc)
I ability to trick the user of DecK into decrypting some ciphertext of
the adversary’s choice and return the result?
(“oracle access” to Dec)?
I ability to modify or replace C en route?
(not limited to eavesdropping)
I how many applications of EncK or DecK can be observed?
I unlimited / polynomial / realistic (≪ 2^80 steps) computation time?
I knowledge of all algorithms used

Wanted: Clear definitions of what security of an encryption scheme


means, to guide both designers and users of schemes, and allow proofs.
11 / 230
Kerckhoffs’ principles (1883)
Requirements for a good traditional military encryption system:
1 The system must be substantially, if not mathematically,
undecipherable;
2 The system must not require secrecy and can be stolen by the
enemy without causing trouble;
3 It must be easy to communicate and remember the keys without
requiring written notes, it must also be easy to change or modify the
keys with different participants;
4 The system ought to be compatible with telegraph communication;
5 The system must be portable, and its use must not require more
than one person;
6 Finally, regarding the circumstances in which such system is applied,
it must be easy to use and must neither require stress of mind nor
the knowledge of a long series of rules.
Auguste Kerckhoffs: La cryptographie militaire, Journal des sciences militaires, 1883.
http://petitcolas.net/fabien/kerckhoffs/
12 / 230
Kerckhoffs’ principle today
Requirement for a modern encryption system:
1 It was evaluated assuming that the enemy knows the system.
2 Its security relies entirely on the key being secret.

Note:
I The design and implementation of a secure communication system is
a major investment and is not easily and quickly repeated.
I Relying on the enemy not knowing the encryption system is
generally frowned upon as “security by obscurity”.
I The most trusted cryptographic algorithms have been published,
standardized, and withstood years of cryptanalysis.
I A cryptographic key should be just a random choice that can be
easily replaced, by rerunning a key-generation algorithm.
I Keys can and will be lost: cryptographic systems should provide
support for easy rekeying, redistribution of keys, and quick
revocation of compromised keys.
13 / 230
A note about message length
We explicitly do not worry in the following about the adversary being
able to infer something about the length m of the plaintext message M
by looking at the length n of the ciphertext C.
Therefore, we will consider here in security definitions for encryption
schemes only messages of fixed length m.
Variable-length messages could be extended to a fixed length, by
padding, but this can be expensive. It will depend on the specific
application whether the benefits of fixed-length padding outweigh the
added transmission cost.
Nevertheless, in practice, ciphertext length must always be considered as
a potential information leak. Examples:
I Encrypted-file lengths often permit unambiguous reconstruction of
what pages a HTTPS user accessed on a public web site.
G. Danezis: Traffic analysis of the HTTP protocol over TLS.
http://www0.cs.ucl.ac.uk/staff/G.Danezis/papers/TLSanon.pdf
I Data compression can be abused to extract information from an
encrypted message if an adversary can control part of that message.
J. Kelsey: Compression and information leakage of plaintext.
http://www.iacr.org/cryptodb/archive/2002/FSE/3091/3091.pdf
Also: CVE-2012-4929/CRIME
14 / 230
Demo: leaking plaintext through compressed data length
$ cat compression-leak
#!/bin/bash
PLAINTEXT=cafe ←
KEY="N-32m5qEj/emdVr.69w1fX"
ENC="openssl enc -aes-128-ctr -pass pass:$KEY"
for t in {a,b,c,d,e,f}{a,b,c,d,e,f}{a,b,c,d,e,f}{a,b,c,d,e,f} ; do
echo -n "$t "
echo $t $PLAINTEXT | gzip -c | $ENC | wc -c
done | sort -nk2
$ ./compression-leak
aafe 44
acaf 44
bafe 44
bcaf 44
cafe 44 ←
ccaf 44
dafe 44
dcaf 44
eafe 44
ecaf 44
fafe 44
fcaf 44
aaaa 46
aaab 46
[. . . remaining 1282 lines not shown . . . ] 15 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

16 / 230
Historic examples of simple ciphers
Shift Cipher: Treat letters {A, . . . , Z} like integers {0, . . . , 25} = Z26 .
Choose key K ∈ Z26 , encrypt each letter individually by addition modulo
26, decrypt by subtraction modulo 26.
Example with K = 25 ≡ −1 (mod 26): IBM→HAL.
K = −3 known as Caesar Cipher, K = 13 as rot13.
The tiny key-space size 26 makes brute-force key search trivial.
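
A small Python sketch of the shift cipher and the trivial brute-force key search
(function names are illustrative):

def shift_encrypt(msg, k):
    # treat A..Z as 0..25 and add the key modulo 26
    return "".join(chr((ord(c) - ord('A') + k) % 26 + ord('A')) for c in msg)

def shift_decrypt(ct, k):
    return shift_encrypt(ct, -k)

assert shift_encrypt("IBM", 25) == "HAL"       # the example above, K = 25 ≡ −1

for k in range(26):                            # brute force: only 26 keys to try
    print(k, shift_decrypt("KHOOR", k))        # k = 3 (Caesar) yields HELLO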

Transposition Cipher: K is permutation of letter positions.


Key space is n!, where n is the permutation block length.
(Diagram: the plaintext ATTACKATDAWN written into a grid under key K and read
out in permuted order as ciphertext; picture of a skytale.)

Substitution Cipher (monoalphabetic): Key is permutation


K : Z26 ↔ Z26 . Encrypt plaintext M = m1 m2 . . . mn with ci = K(mi )
to get ciphertext C = c1 c2 . . . cn , decrypt with mi = K −1 (ci ).
Key space size 26! > 4 × 10^26 makes brute-force search infeasible.
17 / 230
Statistical properties of plain text
English letter frequency
(Bar chart: relative letter frequencies in %, from E ≈ 13% and T, A, O, I, N
down to J, Q, X, Z below 1%.)

The most common letters in English:


E, T, A, O, I, N, S, H, R, D, L, U, . . .
The most common digrams in English:
TH, HE, IN, ER, AN, RE, ED, ON, ES, ST, EN, AT, TO, . . .
The most common trigrams in English:
THE, ING, AND, HER, ERE, ENT, THA, NTH, WAS, ETH, . . .
English text is highly redundant: very roughly 1 bit/letter entropy.
Monoalphabetic substitution ciphers allow simple ciphertext-only attacks based on
digram or trigram statistics (for messages of at least few hundred characters).
18 / 230
Vigenère cipher
(Table: the 26 cyclically shifted alphabets ABCDEFGHIJKLMNOPQRSTUVWXYZ,
BCDE. . . A, CDEF. . . AB, . . . of the tabula recta.)

Inputs:
I Key word K = k1 k2 . . . kl
I Plain text M = m1 m2 . . . mn

Encrypt into ciphertext:
    ci = (mi + k[(i−1) mod l]+1 ) mod 26

Example: K = SECRET
    S E C R E T S E C ...
    A T T A C K A T D ...
    S X V R G D S X F ...

The modular addition can be replaced with XOR:
    ci = mi ⊕ k[(i−1) mod l]+1        mi , ki , ci ∈ {0, 1}

Vigenère is an example of a polyalphabetic cipher.
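
A Python sketch of the encryption rule above, reproducing the SECRET/ATTACKATD example:

def vigenere(text, key, decrypt=False):
    # c_i = (m_i + k_{((i-1) mod l)+1}) mod 26, over the alphabet A..Z
    sign = -1 if decrypt else 1
    out = []
    for i, c in enumerate(text):
        k = ord(key[i % len(key)]) - ord('A')
        out.append(chr((ord(c) - ord('A') + sign * k) % 26 + ord('A')))
    return "".join(out)

assert vigenere("ATTACKATD", "SECRET") == "SXVRGDSXF"
assert vigenere("SXVRGDSXF", "SECRET", decrypt=True) == "ATTACKATD"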


19 / 230
Attacking the Vigenère cipher
First determine the key length l. For each candidate keylength l:
I Treat each l-th ciphertext character as part of a separate message
M1 , M2 , . . . , Ml encrypted with just a (monoalphabetic) shift cipher, resulting in
separate ciphertexts C1 , C2 , . . . , Cl .
I Consider the l letter-frequency histograms for these Ci (1 ≤ i ≤ l).
I If choice of l is incorrect, the letter-frequency histograms of each of
C1 , C2 , . . . , Cl will be more even/flatter (as they are the average of several
rotated histograms) than if l was correct.
I If pa,i is the relative frequency of letter a in Ci (for all a in alphabet A), then
the index of coincidence
    IC(Ci ) = Σ a∈A p²a,i
is the probability that two randomly chosen letters from Ci are identical. IC is a
measure of the unevenness of a histogram (minimal if ∀a ∈ A : pa,i = |A|⁻¹ ).
I Pick the key length l that leads to the highest l⁻¹ Σ i=1..l IC(Ci ). In other words,
maximise the probability of two letters being identical when looking only at
letters that are a multiple of l characters apart in C.

Once the correct key length l is known, compare the histograms of C1 , C2 , . . . , Cl .


They will just be shifted versions of each other (pa,2 = p(a−k2 +k1 ) mod 26,1 , etc.), and
the shift offsets reveal the differences between the corresponding key characters.
Finally, try decryption with all possible first key characters k1 .
20 / 230
(Figure: letter-frequency histograms of the ciphertext columns Ci. For a correctly
guessed key length, each column is a shifted English histogram with IC = 0.065;
for a wrong key length, the averaged histogram is much flatter, with IC = 0.046.)
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

21 / 230
Perfect secrecy

Computational security
The most efficient known algorithm for breaking a cipher would require
far more computational steps than all hardware available to any adversary
can perform.

Unconditional security
Adversaries have not enough information to decide (from the ciphertext)
whether one plaintext is more likely to be correct than another, even with
unlimited computational power at their disposal.

22 / 230
Perfect secrecy II
Consider a private-key encryption scheme

Enc : K × M → C, Dec : K × C → M

with DecK (EncK (M )) = M for all K ∈ K, M ∈ M, where M, C, K are


the sets of possible plaintexts, ciphertexts and keys, respectively.
Let also M ∈ M, C ∈ C and K ∈ K be values of plaintext, ciphertext
and key. Let P(M ) and P(K) denote an adversary’s respective a-priori
knowledge of the probability that plaintext M or key K are used.
The adversary can then calculate the probability of any ciphertext C as
    P(C) = Σ K∈K P(K) · P(DecK (C)).

and can also determine the conditional probability

    P(C|M ) = Σ {K∈K | M =DecK (C)} P(K)

23 / 230
Perfect secrecy III
Having eavesdropped some ciphertext C, an adversary can then use
Bayes’ theorem to calculate for any plaintext M ∈ M
    P(M |C) = P(M ) · P(C|M ) / P(C)
            = P(M ) · Σ {K|M =DecK (C)} P(K)  /  Σ K P(K) · P(DecK (C)).

Perfect secrecy
An encryption scheme over a message space M is perfectly secret if for
every probability distribution over M, every message M ∈ M, and every
ciphertext C ∈ C with P(C) > 0 we have

P(M |C) = P(M ).

In other words: looking at the ciphertext C leads to no new information


beyond what was already known about M in advance ⇒ eavesdropping
C has no benefit, even with unlimited computational power.
C.E. Shannon: Communication theory of secrecy systems. Bell System Technical Journal, Vol 28,
Oct 1949, pp 656–715. http://netlab.cs.ucla.edu/wiki/files/shannon1949.pdf
24 / 230
Vernam cipher / one-time pad I
Shannon’s theorem:
Let (Gen, Enc, Dec) be an encryption scheme over a message space M
with |M| = |K| = |C|. It is perfectly secret if and only if
1 Gen chooses every K with equal probability 1/|K|;

2 for every M ∈ M and every C ∈ C, there exists a unique key K ∈ K


such that C = EncK (M ).

The standard example of a perfectly-secure symmetric encryption scheme:

One-time pad
K = C = M = {0, 1}m
I Gen : K ∈R {0, 1}m (m uniform, independent coin tosses)
I EncK (M ) = K ⊕ M (⊕ = bit-wise XOR)
I DecK (C) = K ⊕ C

Example:
0xbd4b083f6aae ⊕ “Vernam” = 0xbd4b083f6aae ⊕ 0x5665726e616d = 0xeb2e7a510bc3
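
The example can be checked with a few lines of Python (Gen would normally draw the
key uniformly at random, e.g. with secrets.token_bytes; it is fixed here to match):

def otp(key, msg):
    # Enc and Dec are the same operation: bit-wise XOR with the key
    assert len(key) == len(msg)
    return bytes(k ^ m for k, m in zip(key, msg))

key = bytes.fromhex("bd4b083f6aae")
ct = otp(key, b"Vernam")
assert ct.hex() == "eb2e7a510bc3"        # matches the example above
assert otp(key, ct) == b"Vernam"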
25 / 230
Vernam cipher / one-time pad II

The one-time pad is a variant of the Vigenère Cipher with l = n: the


key is as long as the plaintext. No key bit is ever used to encrypt more
than one plaintext bit.
Note: If x is a random bit with any probability distribution and y is one with uniform probability
distribution (P(y = 0) = P(y = 1) = 1/2), then the exclusive-or result x ⊕ y will have uniform
probability distribution. This also works for addition modulo m (or for any finite group).

For each possible plaintext M , there exists a key K = M ⊕ C that turns


a given ciphertext C into M = DecK (C). If all K are equally likely, then
also all M will be equally likely for a given C, which fulfills Shannon’s
definition of perfect secrecy.
What happens if you use a one-time pad twice?
One-time pads have been used intensively during significant parts of the 20th century for
diplomatic communications security, e.g. on the telex line between Moscow and Washington. Keys
were generated by hardware random bit stream generators and distributed via trusted couriers.

In the 1940s, the Soviet Union encrypted part of its diplomatic communication using recycled
one-time pads, leading to the success of the US decryption project VENONA.
http://www.nsa.gov/public_info/declass/venona/

26 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

27 / 230
Making the one-time pad more efficient
The one-time pad is very simple, but also very inconvenient:
one key bit for each message bit!
Many standard libraries contain pseudo-random number generators
(PRNGs). They are used in simulations, games, probabilistic algorithms,
testing, etc.
They expand a “seed value” R0 into a sequence of numbers R1 , R2 , . . .
that look very random:
Ri = f (Ri−1 , i)
The results pass numerous statistical tests for randomness (e.g. Marsaglia’s “Diehard” tests).

Can we not use R0 as a short key, split our message M into chunks
M1 , M2 , . . . and XOR with (some function g of) Ri to encrypt Mi ?

Ci = Mi ⊕ g(Ri , i)

But what are secure choices for f and g?


What security property do we expect from such a generator, and what
security can we expect from the resulting encryption scheme?
28 / 230
A non-secure pseudo-random number generator
Example (insecure)
Linear congruential generator with secret parameters (a, b, R0 ):

Ri+1 = (aRi + b) mod m

Attack: guess some plain text (e.g., known file header), obtain for
example (R1 , R2 , R3 ), then solve system of linear equations over Zm :

R2 ≡ aR1 + b (mod m)
R3 ≡ aR2 + b (mod m)

Solution:

a ≡ (R2 − R3 )/(R1 − R2 ) (mod m)


b ≡ R2 − R1 (R2 − R3 )/(R1 − R2 ) (mod m)

Multiple solutions if gcd(R1 − R2 , m) ≠ 1: resolved using R4 or just by


trying all possible values.
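
A Python sketch of this attack with made-up parameters (the modulus m is assumed
known; choosing it prime guarantees the needed inverse exists):

m = 2**31 - 1                                # known modulus (a prime, for illustration)
a_sec, b_sec, R0 = 48271, 12345, 42          # hypothetical secret parameters and seed

R1 = (a_sec * R0 + b_sec) % m                # three outputs recovered via known plaintext
R2 = (a_sec * R1 + b_sec) % m
R3 = (a_sec * R2 + b_sec) % m

a = (R2 - R3) * pow(R1 - R2, -1, m) % m      # a ≡ (R2 − R3)/(R1 − R2)  (mod m)
b = (R2 - a * R1) % m                        # b ≡ R2 − a·R1            (mod m)
assert (a, b) == (a_sec, b_sec)              # parameters recovered; future Ri predictable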
29 / 230
Private-key (symmetric) encryption
A private-key encryption scheme is a tuple of probabilistic
polynomial-time algorithms (Gen, Enc, Dec) and sets K, M, C such that
I the key generation algorithm Gen receives a security parameter `
and outputs a key K ← Gen(1` ), with K ∈ K, key length |K| ≥ `;
I the encryption algorithm Enc maps a key K and a plaintext
message M ∈ M = {0, 1}m to a ciphertext message
C ← EncK (M );
I the decryption algorithm Dec maps a key K and a ciphertext
C ∈ C = {0, 1}n (n ≥ m) to a plaintext message M := DecK (C);
I for all `, K ← Gen(1` ), and M ∈ {0, 1}m : DecK (EncK (M )) = M .

Notes:
A “polynomial-time algorithm” has constants a, b, c such that the runtime is
always less than a · `b + c if the input is ` bits long. (think Turing machine)
Technicality: we supply the security parameter ` to Gen here in unary encoding (as a sequence of `
“1” bits: 1` ), merely to remain compatible with the notion of “input size” from computational
complexity theory. In practice, Gen usually simply picks ` random bits K ∈R {0, 1}` .
30 / 230
Security definitions for encryption schemes
We define security via the rules of a game played between two players:
I a challenger, who uses an encryption scheme Π = (Gen, Enc, Dec)
I an adversary A, who tries to demonstrate a weakness in Π.
Most of these games follow a simple pattern:
1 the challenger uniformly picks at random a secret bit b ∈R {0, 1}
2 A interacts with the challenger according to the rules of the game
3 At the end, A has to output a bit b′.
The outcome of such a game XA,Π (`) is either
I b = b′ ⇒ A won the game, we write XA,Π (`) = 1
I b ≠ b′ ⇒ A lost the game, we write XA,Π (`) = 0

Advantage
One way to quantify A’s ability to guess b is

    AdvX A,Π (`) = P(b = 1 and b′ = 1) − P(b = 0 and b′ = 1)


31 / 230
Negligible advantage
Security definition
An encryption scheme Π is considered “X secure” if for all probabilistic
polynomial-time (PPT) adversaries A there exists a “negligible” function
negl such that
    P(XA,Π (`) = 1) < 1/2 + negl(`).
Some authors prefer the equivalent definition with

AdvXA,Π (`) < negl(`).

Negligible functions
A function negl(`) : N → R is “negligible” if, as ` → ∞, it converges
faster to zero than 1/poly(`) does for any polynomial poly(`).

In practice: We want negl(`) to drop below a small number (e.g., 2^−80 or
2^−100 ) for modest key lengths ` (e.g., log10 ` ≈ 2 . . . 3). Then no realistic
opponent will have the computational power to repeat the game often enough
to win at least once more than what is expected from random guessing.
32 / 230
“Computationally infeasible”
With good cryptographic primitives, the only form of possible
cryptanalysis should be an exhaustive search of all possible keys (brute
force attack).
The following numbers give a rough idea of the limits involved:
Let’s assume we can later this century produce VLSI chips with 10 GHz
clock frequency and each of these chips costs 10 $ and can test in a
single clock cycle 100 keys. For 10 million $, we could then buy the chips
needed to build a machine that can test 10^18 ≈ 2^60 keys per second.
Such a hypothetical machine could break an 80-bit key in 7 days on
average. For a 128-bit key it would need over 10^12 years, that is over
100× the age of the universe.
Rough limit of computational feasibility: 2^80 iterations
(i.e., < 2^60 feasible with effort, but > 2^100 certainly not)
For comparison:
I The fastest key search effort using thousands of Internet PCs (RC5-64, 2002) achieved in
the order of 2^37 keys per second.
http://www.cl.cam.ac.uk/~rnc1/brute.html
http://www.distributed.net/
I Since January 2018, the Bitcoin network has been searching through about 10^19 ≈ 2^63
cryptographic hash values per second, mostly using ASICs.
http://bitcoin.sipa.be/ 33 / 230
Indistinguishability in the presence of an eavesdropper
Private-key encryption scheme Π = (Gen, Enc, Dec), M = {0, 1}m , security parameter `.

Experiment/game PrivKeav A,Π (`):

(Diagram: the challenger, holding b ∈R {0, 1} and K ← Gen(1` ), interacts with the
adversary A: A sends M0 , M1 , receives C ← EncK (Mb ), and outputs b′.)

Setup:
1 The challenger generates a bit b ∈R {0, 1} and a key K ← Gen(1` ).
2 The adversary A is given input 1`
Rules for the interaction:
1 The adversary A outputs a pair of messages: M0 , M1 ∈ {0, 1}m .
2 The challenger computes C ← EncK (Mb ) and returns C to A
Finally, A outputs b′. If b′ = b then A has succeeded ⇒ PrivKeav A,Π (`) = 1
34 / 230
Indistinguishability in the presence of an eavesdropper

Definition: A private-key encryption scheme Π has indistinguishable


encryption in the presence of an eavesdropper if for all probabilistic,
polynomial-time adversaries A there exists a negligible function negl,
such that
    P(PrivKeav A,Π (`) = 1) ≤ 1/2 + negl(`)

In other words: as we increase the security parameter `, we quickly


reach the point where no eavesdropper can do significantly better than
just randomly guessing b.

35 / 230
Pseudo-random generator I

G : {0, 1}n → {0, 1}e(n) where e(·) is a polynomial (expansion factor)

Definition
G is a pseudo-random generator if both
1 e(n) > n for all n (expansion)
2 for all probabilistic, polynomial-time distinguishers D there exists a
negligible function negl such that

|P(D(r) = 1) − P(D(G(s)) = 1)| ≤ negl(n)

where both r ∈R {0, 1}e(n) and the seed s ∈R {0, 1}n are chosen at
random, and the probabilities are taken over all coin tosses used by
D and for picking r and s.

36 / 230
Pseudo-random generator II
A brute-force distinguisher D would enumerate all 2^n possible outputs of
G, and return 1 if the input is one of them.
It would achieve

    P(D(G(s)) = 1) = 1,     P(D(r) = 1) = 2^n / 2^e(n) ,

the difference of which converges to 1, which is not negligible.
But a brute-force distinguisher has an exponential run-time O(2^n ), and is
therefore excluded!

We do not know how to prove that a given algorithm is a pseudo-random


generator, but there are many algorithms that are widely believed to be.
Some constructions are pseudo-random generators if another well-studied
problem is not solvable in polynomial time.

37 / 230
Encrypting using a pseudo-random generator

We define the following fixed-length private-key encryption scheme:

ΠPRG = (Gen, Enc, Dec):


Let G be a pseudo-random generator with expansion factor e(·),
K = {0, 1}` , M = C = {0, 1}e(`)
I Gen: on input 1` chose K ∈R {0, 1}` randomly
I Enc: C := G(K) ⊕ M
I Dec: M := G(K) ⊕ C

Such constructions are known as “stream ciphers”.
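
A sketch of ΠPRG in Python, using SHA-256 in counter mode as a stand-in for G
(an assumption made for illustration only, not a proven pseudo-random generator):

import hashlib, secrets

def G(seed, outlen):
    # heuristic PRG stand-in: hash seed || counter until outlen bytes are produced
    out, ctr = b"", 0
    while len(out) < outlen:
        out += hashlib.sha256(seed + ctr.to_bytes(8, "big")).digest()
        ctr += 1
    return out[:outlen]

K = secrets.token_bytes(16)                            # Gen: short random key
M = b"a message much longer than the key"
C = bytes(g ^ m for g, m in zip(G(K, len(M)), M))      # Enc: C = G(K) XOR M
assert bytes(g ^ c for g, c in zip(G(K, len(M)), C)) == M   # Dec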

We can prove that ΠPRG has “indistinguishable encryption in the


presence of an eavesdropper” assuming that G is a pseudo-random
generator: if we had a polynomial-time adversary A that can succeed
with non-negligible advantage against ΠPRG , we can turn that using a
polynomial-time algorithm into a polynomial-time distinguisher for G,
which would violate the assumption.

38 / 230
Security proof for a stream cipher
Claim: ΠPRG has indistinguishability in the presence of an eavesdropper
if G is a pseudo-random generator.
Proof: (outline) If ΠPRG did not have indistinguishability in the presence
of an eavesdropper, there would be an adversary A for which
    ε(`) := P(PrivKeav A,ΠPRG (`) = 1) − 1/2
is not negligible.
Use that A to construct a distinguisher D for G:
I receive input W ∈ {0, 1}e(`)
I pick b ∈R {0, 1}
I run A(1` ) and receive from it M0 , M1 ∈ {0, 1}e(`)
I return C := W ⊕ Mb to A
I receive b0 from A
I return 1 if b0 = b, otherwise return 0
Now, what is |P(D(r) = 1) − P(D(G(K)) = 1)|?
39 / 230
Security proof for a stream cipher (cont’d)

What is |P(D(r) = 1) − P(D(G(K)) = 1)|?


I What is P(D(r) = 1)?
Let Π̃ be an instance of the one-time pad, with key and message
length e(`), i.e. compatible to ΠPRG . In the D(r) case, where we
feed it a random string r ∈R {0, 1}e(n) , then from the point of view
of A being called as a subroutine of D(r), it is confronted with a
one-time pad Π̃. The perfect secrecy of Π̃ implies P(D(r) = 1) = 1/2.
I What is P(D(G(K)) = 1)?
In this case, A participates in the game PrivKeav A,ΠPRG (`). Thus we
have P(D(G(K)) = 1) = P(PrivKeav A,ΠPRG (`) = 1) = 1/2 + ε(`).
Therefore
    |P(D(r) = 1) − P(D(G(K)) = 1)| = ε(`),
which we have assumed not to be negligible, which implies that G is not
a pseudo-random generator, contradicting the assumption.
Katz/Lindell (1st ed.), pp 73-75

40 / 230
Security proofs through reduction
Some key points about this style of “security proof”:
I We have not shown that the encryption scheme ΠPRG is “secure”.
(We don’t know how to do this!)
I We have shown that ΠPRG has one particular type of security
property, if one of its building blocks (G) has another one.
I We have “reduced” the security of construct ΠPRG to another
problem X:
(Diagram: an instance of problem X is fed to a reduction that simulates an
instance of scheme Π towards adversary A; A's attack on Π is converted into a
solution to X, yielding an algorithm A′ for problem X.)
Here: X = distinguishing output of G from random string
I We have shown how to turn any successful attack on ΠPRG into an
equally successful attack on its underlying building block G.
I “Successful attack” means finding a polynomial-time probabilistic
adversary algorithm that succeeds with non-negligible success
probability in winning the game specified by the security definition.
41 / 230
Security proofs through reduction

In the end, the provable security of some cryptographic construct (e.g.,


ΠPRG , some mode of operation, some security protocol) boils down to
these questions:
I What do we expect from the construct?
I What do we expect from the underlying building blocks?
I Does the construct introduce new weaknesses?
I Does the construct mitigate potential existing weaknesses in its
underlying building blocks?

42 / 230
Security for multiple encryptions
Private-key encryption scheme Π = (Gen, Enc, Dec), M = {0, 1}m , security parameter `.

Experiment/game PrivKmult A,Π (`):

(Diagram: as before, but A sends two sequences M01 , . . . , M0t and M11 , . . . , M1t ,
receives C 1 , . . . , C t with C i ← EncK (Mbi ), and outputs b′.)

Setup:
1 The challenger generates a bit b ∈R {0, 1} and a key K ← Gen(1` ).
2 The adversary A is given input 1`
Rules for the interaction:
1 The adversary A outputs two sequences of t messages:
M01 , M02 , . . . , M0t and M11 , M12 , . . . , M1t , where all Mji ∈ {0, 1}m .
2 The challenger computes C i ← EncK (Mbi ) and returns
C 1 , C 2 , . . . , C t to A
Finally, A outputs b′. If b′ = b then A has succeeded ⇒ PrivKmult A,Π (`) = 1
43 / 230
Security for multiple encryptions (cont’d)
Definition: A private-key encryption scheme Π has indistinguishable
multiple encryptions in the presence of an eavesdropper if for all
probabilistic, polynomial-time adversaries A there exists a negligible
function negl, such that
    P(PrivKmult A,Π (`) = 1) ≤ 1/2 + negl(`)

Same definition as for indistinguishable encryptions in the presence of an eavesdropper, except for
referring to the multi-message eavesdropping experiment PrivKmultA,Π (`).

Example: Does our stream cipher ΠPRG offer indistinguishable multiple


encryptions in the presence of an eavesdropper?
No:
Adversary A4 outputs four messages M01 = M11 = M02 ≠ M12 , and
returns b′ = 1 iff C 1 ≠ C 2 . P(PrivKmult A4 ,ΠPRG (`) = 1) = 1

Actually: Any deterministic encryption scheme is going to fail here!


44 / 230
Securing a stream cipher for multiple encryptions I

How can we still use a stream cipher if we want to encrypt multiple


messages M1 , M2 , . . . , Mt using a pseudo-random generator G?

Synchronized mode
Let the PRG run for longer to produce enough output bits for all
messages:

    G(K) = R1 ∥ R2 ∥ . . . ∥ Rt ,    Ci = Ri ⊕ Mi
(∥ is concatenation of bit strings)


I convenient if M1 , M2 , . . . , Mt all belong to the same
communications session and G is of a type that can produce long
enough output
I requires preservation of internal state of G across sessions

45 / 230
Securing a stream cipher for multiple encryptions II

Unsynchronized mode
Some PRGs have two separate inputs, a key K and an “initial vector”
IV . The private key K remains constant, while IV is freshly chosen at
random for each message, and sent along with the message.

for each i: IVi ∈R {0, 1}n , Ci := (IVi , G(K, IVi ) ⊕ Mi )

But: what exact security properties do we expect of a G with IV input?


This question leads us to a new security primitive and associated security
definition: pseudo-random functions and CPA security.

46 / 230
Security against chosen-plaintext attacks (CPA)
Private-key encryption scheme Π = (Gen, Enc, Dec), M = {0, 1}m , security parameter `.

Experiment/game PrivKcpa A,Π (`):

(Diagram: as before, but A additionally has oracle access to EncK , both before and
after receiving the challenge ciphertext C ← EncK (Mb ).)

Setup: (as before)
1 The challenger generates a bit b ∈R {0, 1} and a key K ← Gen(1` ).
2 The adversary A is given input 1`

Rules for the interaction:
1 The adversary A is given oracle access to EncK :
A outputs M 1 , gets EncK (M 1 ), outputs M 2 , gets EncK (M 2 ), . . .
2 The adversary A outputs a pair of messages: M0 , M1 ∈ {0, 1}m .
3 The challenger computes C ← EncK (Mb ) and returns C to A
4 The adversary A continues to have oracle access to EncK .

Finally, A outputs b′. If b′ = b then A has succeeded ⇒ PrivKcpa A,Π (`) = 1
47 / 230
Security against chosen-plaintext attacks (cont’d)

Definition: A private-key encryption scheme Π has indistinguishable


multiple encryptions under a chosen-plaintext attack (“is CPA-secure”) if
for all probabilistic, polynomial-time adversaries A there exists a
negligible function negl, such that
    P(PrivKcpa A,Π (`) = 1) ≤ 1/2 + negl(`)

Advantages:
I Eavesdroppers can often observe their own text being encrypted,
even where the encrypter never intended to provide an oracle.
(WW2 story: Midway Island/AF, server communication).
I CPA security provably implies security for multiple encryptions.
I CPA security allows us to build a variable-length encryption scheme
simply by using a fixed-length one many times.

48 / 230
Random functions and permutations
Random function
Consider all possible functions of the form
f : {0, 1}m → {0, 1}n
How often do you have to toss a coin to fill the value table of such a
function f with random bits? n · 2^m
How many different such f are there? 2^(n·2^m)
An m-bit to n-bit random function f is one that we have picked
uniformly at random from all these possible functions.

Random permutation
Consider all possible permutations of the form
g : {0, 1}n ↔ {0, 1}n

How many different such g are there? 2^n !


An n-bit to n-bit random permutation g is one that we have picked
uniformly at random from all these possible permutations.
49 / 230
Pseudo-random functions and permutations
Basic idea:
A pseudo-random function (PRF) is a fixed, efficiently computable
function
F : {0, 1}k × {0, 1}m → {0, 1}n
that (compared to a random function) depends on an additional input
parameter K ∈ {0, 1}k , the key. Each choice of K leads to a function

FK : {0, 1}m → {0, 1}n

For typical key lengths (e.g., k, m ≥ 128), the set of all possible functions
FK will be a tiny subset of the set of all possible random functions f .
For a secure pseudo-random function F there must be no practical way
to distinguish between FK and a corresponding random function f for
anyone who does not know key K.

We can similarly define a keyed pseudo-random permutation.


In some proofs, in the interest of simplicity, we will only consider PRFs with k = m = n.
50 / 230
Pseudo-random function (formal definition)
F : {0, 1}n × {0, 1}n → {0, 1}n efficient, keyed, length preserving
key input output |input|=|output|

Definition
F is a pseudo-random function if for all probabilistic, polynomial-time
distinguishers D there exists a negligible function negl such that

F (·) n f (·) n
P(D K (1 ) = 1) − P(D (1 ) = 1) ≤ negl(n)

where K ∈R {0, 1}n is chosen uniformly at random and f is chosen uniformly


at random from the set of functions mapping n-bit strings to n-bitstrings.

Notation: Df (·) means that algorithm D has “oracle access” to function f .

How does this differ from a pseudo-random generator?


The distinguisher of a pseudo-random generator examines a string. Here, the
distinguisher examines entire functions FK and f .
Any description of f would be at least n · 2^n bits long and thus cannot be read
in polynomial time. Therefore we can only provide oracle access to the
distinguisher (i.e., allow D to query f a polynomial number of times).
51 / 230
CPA-secure encryption using a pseudo-random function
We define the following fixed-length private-key encryption scheme:

ΠPRF = (Gen, Enc, Dec):


Let F be a pseudo-random function.
I Gen: on input 1` choose K ∈R {0, 1}` randomly
I Enc: read K ∈ {0, 1}` and M ∈ {0, 1}` , choose R ∈R {0, 1}` randomly,
then output
C := (R, FK (R) ⊕ M )
I Dec: read K ∈ {0, 1}` , C = (R, S) ∈ {0, 1}2` , then output

M := FK (R) ⊕ S

Strategy for proving ΠPRF to be CPA secure:


1 Show that a variant scheme Π̃ in which we replace FK with a random
function f is CPA secure (just not efficient).
2 Show that replacing f with a pseudo-random function FK cannot make it
insecure, by showing how an attacker on the scheme using FK can be
converted into a distinguisher between f and FK , violating the
assumption that FK is a pseudo-random function. 52 / 230
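
A sketch of ΠPRF in Python, with HMAC-SHA256 standing in for the pseudo-random
function FK (a common heuristic assumption made only for illustration; the proof
below does not depend on this choice):

import hmac, hashlib, secrets

def F(K, R):
    # PRF stand-in: HMAC-SHA256 (assumed pseudo-random for illustration)
    return hmac.new(K, R, hashlib.sha256).digest()

def enc(K, M):
    assert len(M) == 32                        # fixed message length, here 256 bits
    R = secrets.token_bytes(32)                # fresh randomness for every encryption
    return R, bytes(a ^ b for a, b in zip(F(K, R), M))

def dec(K, C):
    R, S = C
    return bytes(a ^ b for a, b in zip(F(K, R), S))

K = secrets.token_bytes(32)                    # Gen
M = b"attack at dawn".ljust(32, b"\x00")
assert dec(K, enc(K, M)) == M
assert enc(K, M) != enc(K, M)                  # probabilistic: same M, different C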
Security proof for encryption scheme ΠPRF
First consider Π̃, a variant of ΠPRF in which the pseudo-random function
FK was replaced with a random function f . Claim:
    P(PrivKcpa A,Π̃ (`) = 1) ≤ 1/2 + q(`)/2`        with q(`) oracle queries

Recall: when the challenge ciphertext C in PrivKcpa A,Π̃ (`) is computed, the
challenger picks RC ∈R {0, 1}` and returns C := (RC , f (RC ) ⊕ Mb ).
Case 1: RC is also used in one of the oracle queries. In which case
A can easily find out f (RC ) and decrypt Mb . A makes at most q(`)
oracle queries and there are 2` possible values of RC , this case happens
with a probability of at most q(`)/2` .
Case 2: RC is not used in any of the oracle queries. For A the value
RC remains completely random, f (RC ) remains completely random, Mb
is returned one-time pad encrypted, and A can only make a random
guess, so in this case P(b′ = b) = 1/2.

P(PrivKcpa A,Π̃ (`) = 1)
  = P(PrivKcpa A,Π̃ (`) = 1 ∧ Case 1) + P(PrivKcpa A,Π̃ (`) = 1 ∧ Case 2)
  ≤ P(Case 1) + P(PrivKcpa A,Π̃ (`) = 1 | Case 2) ≤ q(`)/2` + 1/2.
53 / 230
Security proof for encryption scheme ΠPRF (cont’d)
Assume we have an attacker A against ΠPRF with non-negligible
    ε(`) = P(PrivKcpa A,ΠPRF (`) = 1) − 1/2
Its performance against Π̃ is also limited by
    P(PrivKcpa A,Π̃ (`) = 1) ≤ 1/2 + q(`)/2`
Combining those two equations we get
    P(PrivKcpa A,ΠPRF (`) = 1) − P(PrivKcpa A,Π̃ (`) = 1) ≥ ε(`) − q(`)/2`
which is not negligible either, allowing us to distinguish f from FK :
Build distinguisher DO using oracle O to play PrivKcpa A,Π (`) with A:
1 Run A(1` ) and for each of its oracle queries M i pick Ri ∈R {0, 1}` ,
then return C i := (Ri , O(Ri ) ⊕ M i ) to A.
2 When A outputs M0 , M1 , pick b ∈R {0, 1} and RC ∈R {0, 1}` , then
return C := (RC , O(RC ) ⊕ Mb ) to A.
3 Continue answering A’s encryption oracle queries. When A outputs
b0 , output 1 if b0 = b, otherwise 0.
54 / 230
Security proof for encryption scheme ΠPRF (cont’d)
How effective is this D?
1 If D’s oracle is FK : A effectively plays PrivKcpa A,ΠPRF (`), because if
K was chosen randomly, DFK behaves towards A just like ΠPRF ,
and therefore
    P(DFK (·) (1` ) = 1) = P(PrivKcpa A,ΠPRF (`) = 1)

2 If D’s oracle is f : likewise, A effectively plays PrivKcpa A,Π̃ (`) and
therefore
    P(Df (·) (1` ) = 1) = P(PrivKcpa A,Π̃ (`) = 1)
if f ∈R ({0, 1}` )^({0,1}` ) is chosen uniformly at random.
All combined the difference
    P(DFK (·) (1` ) = 1) − P(Df (·) (1` ) = 1) ≥ ε(`) − q(`)/2`
not being negligible implies that FK is not a pseudo-random function,
which contradicts the assumption, so ΠPRF is CPA secure.
Katz/Lindell (1st ed.), pp 90–93
55 / 230
Pseudo-random permutation
F : {0, 1}n × {0, 1}n → {0, 1}n efficient, keyed, length preserving
key input output |input|=|output|
FK is a pseudo-random permutation if
I for every key K, there is a 1-to-1 relationship for input and output
I FK and FK^−1 can be calculated with polynomial-time algorithms
I there is no polynomial-time distinguisher that can distinguish FK
(with randomly picked K) from a random permutation.
Note: Any pseudo-random permutation is also a pseudo-random function. A random function f
looks to any distinguisher just like a random permutation until it finds a collision x ≠ y with
f (x) = f (y). The probability for finding one in polynomial time is negligible (“birthday problem”).

A strong pseudo-random permutation remains indistinguishable even if
the distinguisher has oracle access to the inverse.
Definition: F is a strong pseudo-random permutation if for all
polynomial-time distinguishers D there exists a negligible function negl
such that
    |P(DFK (·),FK^−1 (·) (1n ) = 1) − P(Df (·),f^−1 (·) (1n ) = 1)| ≤ negl(n)

where K ∈R {0, 1}n is chosen uniformly at random, and f is chosen


uniformly at random from the set of permutations on n-bit strings.
56 / 230
Probability of collision / Birthday problem
With 23 random people in a room, there is a 0.507 chance that two share a birthday. Surprised?

We throw b balls into n bins, selecting each bin uniformly at random.


With what probability do at least two balls end up in the same bin?
(Plots: collision probability as a function of the number of balls thrown into 10^40
bins, on linear and logarithmic scales, with upper and lower bounds.)

Remember: for large n the collision probability
I is near 1 for b ≫ √n
I is near 0 for b ≪ √n, growing roughly proportional to b²/n

Expected number of balls thrown before first collision: √(πn/2) (for n → ∞)
Approximation formulas: http://cseweb.ucsd.edu/~mihir/cse207/w-birthday.pdf
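
A quick numerical check in Python, using the standard approximation
1 − exp(−b(b−1)/2n) (see the link above):

import math

def collision_prob(b, n):
    # probability that b balls thrown into n bins produce at least one collision
    return 1.0 - math.exp(-b * (b - 1) / (2 * n))

print(collision_prob(23, 365))           # ≈ 0.5: the birthday "paradox"
print(collision_prob(2**32, 2**64))      # ≈ 0.39: 2^32 random 64-bit values
print(math.sqrt(math.pi * 2**64 / 2))    # ≈ 5.4e9: expected throws until a collision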
57 / 230
Iterating a random function
f : {1, . . . , n} → {1, . . . , n}        n^n such functions, pick one at random
Functional graph: vertices {1, . . . , n}, directed edges (i, f (i))

Several components, each a directed cycle and trees attached to it.

Some expected values for n → ∞, random u ∈R {1, . . . , n}, where
f^t(u) (u) = f^(t(u)+c(u)·i) (u) for all i ∈ N and t(u), c(u) are minimal:
I tail length E(t(u)) = √(πn/8)
I cycle length E(c(u)) = √(πn/8)
I rho-length E(t(u) + c(u)) = √(πn/2)
I predecessors E(|{v | f^i (v) = u ∧ i > 0}|) = √(πn/8)
I edges of component containing u: 2n/3
If f is a random permutation: no trees, expected cycle length (n + 1)/2
Menezes/van Oorschot/Vanstone, §2.1.6. Knuth: TAOCP, §1.3.3, exercise 17.
Flajolet/Odlyzko: Random mapping statistics, EUROCRYPT’89, LNCS 434.
58 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

59 / 230
Block ciphers
Practical, efficient algorithms that try to implement a pseudo-random
permutation E (and its inverse D) are called “block ciphers”:
E : {0, 1}k × {0, 1}n → {0, 1}n
D : {0, 1}k × {0, 1}n → {0, 1}n
with DK (EK (M )) = M for all K ∈ {0, 1}k , M ∈ {0, 1}n .
Alphabet size: 2n , size of key space: 2k

Examples: AES, Camellia: k, n = 128 bit; DES, PRESENT: n = 64 bit


Implementation strategies:
I Confusion – complex relationship between key and ciphertext
I Diffusion – remove statistical links between plaintext and ciphertext
I Prevent adaptive chosen-plaintext attacks, including differential and
linear cryptanalysis
I Product cipher: iterate many rounds of a weaker permutation
I Feistel structure, substitution/permutation network, key-dependent
s-boxes, mix incompatible groups, transpositions, linear
transformations, arithmetic operations, non-linear substitutions, . . .
60 / 230
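As a concrete illustration of the E_K/D_K interface with k = n = 128 bits, a short sketch (it assumes the third-party pycryptodome package provides the AES implementation):

  from Crypto.Cipher import AES          # pycryptodome (assumed installed)

  K = bytes(range(16))                   # k = 128-bit key
  M = b"16-byte message!"                # n = 128-bit block

  C = AES.new(K, AES.MODE_ECB).encrypt(M)            # C = E_K(M)
  assert AES.new(K, AES.MODE_ECB).decrypt(C) == M    # D_K(E_K(M)) = M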
Feistel structure I

Problem: Build a pseudo-random permutation EK : {0,1}^n ↔ {0,1}^n
(invertible) using pseudo-random functions fK,i : {0,1}^{n/2} → {0,1}^{n/2}
(non-invertible) as building blocks.
Solution: Split the plaintext block M (n bits) into two halves L and R
(n/2 bits each):
M = L0 kR0
Then apply the non-invertible function fK in each round i alternatingly to
one of these halves, and XOR the result onto the other half, respectively:

Li = Li−1 ⊕ fK,i (Ri−1 ) and Ri = Ri−1 for odd i


Ri = Ri−1 ⊕ fK,i (Li−1 ) and Li = Li−1 for even i

After applying rounds i = 1, . . . , r, concatenate the two halves to form


the ciphertext block C:

EK (M ) = C = Lr kRr

61 / 230
Feistel structure II

r = 3 rounds:

L0 R0

⊕ fK,1

L1 R1

fK,2 ⊕

L2 R2

⊕ fK,3

L3 R3

62 / 230
Feistel structure III

Decryption:

L0 R0

⊕ fK,1

L1 R1

fK,2 ⊕

L2 R2

⊕ fK,3

L3 R3

63 / 230
Feistel structure IV

Decryption works backwards (i = r, . . . , 1), undoing round after round,


starting from the ciphertext:

Li−1 = Li ⊕ fK,i (Ri ) and Ri−1 = Ri for odd i


Ri−1 = Ri ⊕ fK,i (Li ) and Li−1 = Li for even i

This works because the Feistel structure is arranged such that during
decryption of round i, the input value for fK,i is known, as it formed half
of the output bits of round i during encryption.

Luby–Rackoff result
If f is a pseudo-random function, then r = 3 Feistel rounds build a
pseudo-random permutation and r = 4 rounds build a strong
pseudo-random permutation.
M. Luby, C. Rackoff: How to construct pseudorandom permutations from pseudorandom functions.
CRYPTO’85, LNCS 218, http://www.springerlink.com/content/27t7330g746q2168/

64 / 230
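The Feistel round rules and their inversion can be sketched in a few lines of Python (a toy illustration only; the round function f_{K,i} is an arbitrary SHA-256-based stand-in, not a recommended PRF):

  import hashlib

  def f(k, i, half):                     # round function f_{K,i}, output |half| bytes
      return hashlib.sha256(k + bytes([i]) + half).digest()[:len(half)]

  def xor(a, b):
      return bytes(x ^ y for x, y in zip(a, b))

  def feistel(k, block, rounds, decrypt=False):
      L, R = block[:len(block)//2], block[len(block)//2:]
      order = range(rounds, 0, -1) if decrypt else range(1, rounds + 1)
      for i in order:
          if i % 2 == 1:                 # odd round:  L <- L xor f_{K,i}(R)
              L = xor(L, f(k, i, R))
          else:                          # even round: R <- R xor f_{K,i}(L)
              R = xor(R, f(k, i, L))
      return L + R

  M = b"attack at dawn!!"
  C = feistel(b"key", M, rounds=4)
  assert feistel(b"key", C, rounds=4, decrypt=True) == M

Decryption simply applies the same round operations in reverse order, exactly as described above.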
Data Encryption Standard (DES)

In 1977, the US government standardized a block cipher for unclassified


data, based on a proposal by an IBM team led by Horst Feistel.
DES has a block size of 64 bits and a key size of 56 bits. The relatively
short key size and its limited protection against brute-force key searches
immediately triggered criticism, but this did not prevent DES from
becoming the most commonly used cipher for banking networks and
numerous other applications for more than 25 years.
DES uses a 16-round Feistel structure. Its round function f is much
simpler than a good pseudo-random function, but the number of
iterations increases the complexity of the resulting permutation
sufficiently.
DES was designed for hardware implementation such that the same
circuit can be used with only minor modification for encryption and
decryption. It is not particularly efficient in software.
http://csrc.nist.gov/publications/fips/fips46-3/fips46-3.pdf

65 / 230
The round function f expands the 32-bit
input to 48 bits, XORs this with a 48-bit
subkey, and applies eight carefully designed
6-bit to 4-bit substitution tables
(“s-boxes”). The expansion function E
makes sure that each sbox shares one input
bit with its left and one with its right
neighbour.

66 / 230
The key schedule of DES
breaks the key into two 28-bit
halves, which are left shifted
by two bits in most rounds
(only one bit in round
1,2,9,16) before 48 bits are
selected as the subkey for
each round.

67 / 230
Strengthening DES
Two techniques have been widely used to extend the short DES key size:

DESX 2 × 64 + 56 = 184 bit keys:


DESXK1 ,K2 ,K3 (M ) = K1 ⊕ DESK2 (M ⊕ K3 )

Triple DES (TDES) 3 × 56 = 168-bit keys:

    TDES_K(M) = DES_K3(DES^{-1}_K2(DES_K1(M)))
    TDES^{-1}_K(C) = DES^{-1}_K1(DES_K2(DES^{-1}_K3(C)))

Where key size is a concern, K1 = K3 is used ⇒ 112 bit key. With


K1 = K2 = K3 , the TDES construction is backwards compatible to DES.

Double DES would be vulnerable to a meet-in-the-middle attack that
requires only 2^57 iterations and 2^57 blocks of storage space: the known
M is encrypted with 2^56 different keys, the known C is decrypted with
2^56 keys and a collision among the stored results leads to K1 and K2.
Neither extension fixes the small alphabet size of 2^64.
68 / 230
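The meet-in-the-middle attack on double encryption is easy to demonstrate if the key space is artificially restricted (a sketch assuming pycryptodome; here only 16 key bits are unknown, so the tables stay tiny):

  from Crypto.Cipher import AES          # pycryptodome (assumed installed)

  def E(k16, block):                     # AES key with only 16 unknown bits (toy key space)
      return AES.new(k16.to_bytes(2, "big") + bytes(14), AES.MODE_ECB).encrypt(block)

  def D(k16, block):
      return AES.new(k16.to_bytes(2, "big") + bytes(14), AES.MODE_ECB).decrypt(block)

  K1, K2 = 12345, 54321                  # secret keys of the double encryption
  M = b"one known block!"
  C = E(K2, E(K1, M))                    # known plaintext/ciphertext pair

  # about 2 * 2^16 cipher operations and 2^16 stored blocks instead of 2^32 trials:
  table = {E(k1, M): k1 for k1 in range(2**16)}       # encrypt M under all k1
  for k2 in range(2**16):                             # decrypt C under all k2
      if D(k2, C) in table:
          print("candidate key pair:", table[D(k2, C)], k2)
          break

In practice a second known plaintext/ciphertext pair is used to eliminate accidental collisions among the candidates.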
Advanced Encryption Standard (AES)
In November 2001, the US government published the new Advanced
Encryption Standard (AES), the official DES successor with 128-bit block
size and either 128, 192 or 256 bit key length. It adopted the “Rijndael”
cipher designed by Joan Daemen and Vincent Rijmen, which offers
additional block/key size combinations.
Each of the 9–13 rounds of this substitution-permutation cipher involves:
I an 8-bit s-box applied to each of the 16 input bytes
I permutation of the byte positions
I column mix, where each of the four 4-byte vectors is multiplied with
a 4 × 4 matrix in F_{2^8}
I XOR with round subkey
The first round is preceded by another XOR with a subkey, the last round
lacks the column-mix step.
Software implementations usually combine the first three steps per byte
into 16 8-bit → 32-bit table lookups.
http://csrc.nist.gov/encryption/aes/
http://www.iaik.tu-graz.ac.at/research/krypto/AES/
Recent CPUs with AES hardware support: Intel/AMD x86 AES-NI instructions, VIA PadLock.
69 / 230
AES round

Illustration by John Savard, http://www.quadibloc.com/crypto/co040401.htm

70 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

71 / 230
Electronic Code Book (ECB) I
ECB is the simplest mode of operation for block ciphers (DES, AES).
The message M is cut into m n-bit blocks:

M1 kM2 k . . . kMm = M kpadding

Then the block cipher EK is applied to each n-bit block individually:

Ci = EK (Mi ) i = 1, . . . , m
C = C1 kC2 k . . . kCm

M1 M2 Mm

EK EK ··· EK

C1 C2 Cm
72 / 230
Electronic Code Book (ECB) II
Warning:
Like any deterministic encryption scheme,
Electronic Code Book (ECB) mode is not CPA secure.

Therefore, repeated plaintext messages (or blocks) can be recognised by


the eavesdropper as repeated ciphertext. If there are only few possible
messages, an eavesdropper might quickly learn the corresponding
ciphertext.

Another problem:
Plaintext block values are often not uniformly distributed, for example in
ASCII encoded English text, some bits have almost fixed values.
As a result, not the entire input alphabet of the block cipher is utilised,
which simplifies for an eavesdropper building and using a value table of
EK .
http://csrc.nist.gov/publications/nistpubs/800-38a/sp800-38a.pdf
73 / 230
Electronic Code Book (ECB) III
Plain-text bitmap:

DES-ECB encrypted:

74 / 230
Randomized encryption
Any CPA secure encryption scheme must be randomized, meaning that
the encryption algorithm has access to an r-bit random value that is not
predictable to the adversary:
Enc : {0, 1}k ×{0, 1}r ×{0, 1}l → {0, 1}m
Dec : {0, 1}k ×{0, 1}m → {0, 1}l

receives in addition to the k-bit key and l-bit plaintext also an r-bit
random value, which it uses to ensure that repeated encryption of the
same plaintext is unlikely to result in the same m-bit ciphertext.
With randomized encryption, the ciphertext will be longer than the plaintext: m > l, for example
m = r + l.
Given a fixed-length pseudo-random function F , we could encrypt a variable-length message
M kpad(M ) = M1 kM2 k . . . kMn by applying ΠPRF to its individual blocks Mi , and the result
will still be CPA secure:
EncK (M ) = (R1 , EK (R1 ) ⊕ M1 , R2 , EK (R2 ) ⊕ M2 , . . . Rn , EK (Rn ) ⊕ Mn )
But this doubles the message length!

Several efficient “modes of operation” have been standardized for use


with blockciphers to provide CPA-secure encryption schemes for
arbitrary-length messages.
75 / 230
Cipher Block Chaining (CBC) I
The Cipher Block Chaining mode is one way of constructing a
CPA-secure randomized encryption scheme from a block cipher EK .
1 Pad the message M and split it into m n-bit blocks, to match the
alphabet of the block cipher used:
M1 kM2 k . . . kMm = M kpadding
2 Generate a random, unpredictable n-bit initial vector (IV) C0 .
3 Starting with C0 , XOR the previous ciphertext block into the
plaintext block before applying the block cipher:
Ci = EK (Mi ⊕ Ci−1 ) for 0 < i ≤ m
4 Output the (m + 1) × n-bit cipher text
C = C0 kC1 k . . . kCm
(which starts with the random initial vector)

Mi ⊕ EK Ci
76 / 230
Cipher Block Chaining (CBC) II

M1 M2 Mm

⊕ ⊕ ⊕

RND EK EK ··· EK

C0 C1 C2 Cm
initial vector

The input of the block cipher EK is now uniformly distributed.
Expect a repetition of block cipher input after around √(2^n) = 2^{n/2} blocks
have been encrypted with the same key K, where n is the block size in
bits (→ birthday paradox). Change K well before that.

77 / 230
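The CBC equations translate directly into code. A sketch on top of AES-ECB as the raw block cipher E_K/D_K (pycryptodome assumed; padding is omitted, so |M| must be a multiple of the block size):

  import os
  from Crypto.Cipher import AES          # pycryptodome (assumed installed)

  def xor(a, b):
      return bytes(x ^ y for x, y in zip(a, b))

  def cbc_encrypt(K, M, n=16):
      E = AES.new(K, AES.MODE_ECB).encrypt
      C = [os.urandom(n)]                            # C_0 = random initial vector
      for i in range(0, len(M), n):
          C.append(E(xor(M[i:i+n], C[-1])))          # C_i = E_K(M_i xor C_{i-1})
      return b"".join(C)

  def cbc_decrypt(K, C, n=16):
      D = AES.new(K, AES.MODE_ECB).decrypt
      blocks = [C[i:i+n] for i in range(0, len(C), n)]
      return b"".join(xor(D(blocks[i]), blocks[i-1]) for i in range(1, len(blocks)))

  K = os.urandom(16)
  M = b"a multiple of the block size...."            # 32 bytes
  assert cbc_decrypt(K, cbc_encrypt(K, M)) == M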
Plain-text bitmap:

DES-CBC encrypted:

78 / 230
Cipher Feedback Mode (CFB)

Ci = Mi ⊕ EK (Ci−1 )

EK

Mi ⊕ Ci

As in CBC, C0 is a randomly selected, unpredictable initial vector, the


entropy of which will propagate through the entire ciphertext.
This variant has three advantages over CBC that can help to reduce
latency:
I The blockcipher step needed to derive Ci can be performed before
Mi is known.
I Incoming plaintext bits can be encrypted and output immediately;
no need to wait until another n-bit block is full.
I No padding of last block needed.
79 / 230
Output Feedback Mode (OFB)
Output Feedback Mode is a stream cipher seeded by the initial vector:
1 Split the message into m blocks (blocks M1 , . . . , Mm−1 each n-bit
long, Mm may be shorter, no padding required):

M1 kM2 k . . . kMm = M

2 Generate a unique n-bit initial vector (IV) C0 .


3 Start with R0 = C0 , then iterate

Ri = EK (Ri−1 )
EK Ri
Ci = Mi ⊕ Ri

for 0 < i ≤ m. From Rm use only the leftmost bits needed for Mm .
4 Output the cipher text C = C0 kC1 k . . . kCm
Again, the key K should be replaced before in the order of 2^{n/2} n-bit blocks have been generated.
Unlike with CBC or CFB, the IV does not have to be unpredictable or random (it can be a
counter), but it must be very unlikely that the same IV is ever used again or appears as another
value Ri while the same key K is still used.
80 / 230
Counter Mode (CTR)
This mode is also a stream cipher. It obtains the pseudo-random bit
stream by encrypting an easy to generate sequence of mutually different
blocks T1 , T2 , . . . , Tm , such as the block counter i plus some offset O,
encoded as an n-bit binary value:
Ci = Mi ⊕ EK(Ti),   Ti = ⟨O + i⟩,   for 0 < i ≤ m
Choose O such that probability of reusing any Ti under the same K is
negligible. Send offset O as initial vector C0 = ⟨O⟩.
Notation ⟨i⟩ here means “n-bit binary representation of integer i”, where n is the block length of EK.

Advantages:
I allows fast random access
I both encryption and decryption can be parallelized
I low latency
I no padding required
I no risk of short cycles
Today, Counter Mode is generally preferred over CBC, CFB, and OFB.
Alternatively, the Ti can also be generated by a maximum-length linear-feedback shift register
(replacing the operation O + i in Z2n with O(x) · xi in F2n to avoid slow carry bits).
81 / 230
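Counter mode fits in a few lines; encryption and decryption are the same operation (a sketch with AES via pycryptodome; ⟨·⟩ is realised as a 128-bit big-endian encoding, and the offset O is kept below 2^64 so the counter cannot overflow):

  import os
  from Crypto.Cipher import AES          # pycryptodome (assumed installed)

  def ctr_crypt(K, O, M, n=16):
      E = AES.new(K, AES.MODE_ECB).encrypt
      out = bytearray()
      for i in range(0, len(M), n):
          pad = E((O + i // n + 1).to_bytes(n, "big"))          # E_K(<O + i>)
          out += bytes(a ^ b for a, b in zip(M[i:i+n], pad))    # last block may be short
      return bytes(out)

  K = os.urandom(16)
  O = int.from_bytes(os.urandom(8), "big")                      # offset, sent as C_0 = <O>
  M = b"any length works, no padding needed"
  assert ctr_crypt(K, O, ctr_crypt(K, O, M)) == M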
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

82 / 230
Security against chosen-ciphertext attacks (CCA)
Private-key encryption scheme Π = (Gen, Enc, Dec), M = {0, 1}m , security parameter `.

Experiment/game PrivK^cca_{A,Π}(ℓ):
[Game diagram: the challenger picks b ∈_R {0,1} and K ← Gen(1^ℓ); adversary A has oracle access to EncK and DecK (M^i ↦ C^i, C^i ↦ M^i); A submits M_0, M_1 and receives C ← EncK(M_b); A may continue to query the oracles, except for DecK(C); finally A outputs b′.]

Setup:
I handling of `, b, K as before
Rules for the interaction:
1 The adversary A is given oracle access to EncK and DecK :
A outputs M 1 , gets EncK (M 1 ), outputs C 2 , gets DecK (C 2 ), . . .
m
2 The adversary A outputs a pair of messages: M0 , M1 ∈ {0, 1} .

3 The challenger computes C ← EncK (Mb ) and returns C to A

4 The adversary A continues to have oracle access to EncK and DecK


but is not allowed to ask for DecK (C).
Finally, A outputs b0 . If b0 = b then A has succeeded ⇒ PrivKcca
A,Π (`) = 1
83 / 230
Malleability
We call an encryption scheme (Gen, Enc, Dec) malleable if an adversary
can modify the ciphertext C in a way that causes a predictable/useful
modification to the plaintext M .
Example: stream ciphers allow adversary to XOR the plaintext M with
arbitrary value X:

Sender : C = EncK (M ) = (R, FK (R) ⊕ M )


Adversary : C 0 = (R, (FK (R) ⊕ M ) ⊕ X)
Recipient : M 0 = DecK (C 0 ) = FK (R) ⊕ ((FK (R) ⊕ M ) ⊕ X)
=M ⊕X

Malleable encryption schemes are usually not CCA secure.


CBC, OFB, and CTR are all malleable and not CCA secure.

Malleability is not necessarily a bad thing. If carefully used, it can be an essential building block to
privacy-preserving technologies such as digital cash or anonymous electronic voting schemes.
Homomorphic encryption schemes are malleable by design, providing anyone not knowing the key a
means to transform the ciphertext of M into a valid encryption of f (M ) for some restricted class
of transforms f .
84 / 230
Message authentication code (MAC)

A message authentication code is a tuple of probabilistic


polynomial-time algorithms (Gen, Mac, Vrfy) and sets K, M such that
I the key generation algorithm Gen receives a security parameter `
and outputs a key K ← Gen(1` ), with K ∈ K, key length |K| ≥ `;
I the tag-generation algorithm Mac maps a key K and a message
M ∈ M = {0, 1}∗ to a tag T ← MacK (M );
I the verification algorithm Vrfy maps a key K, a message M and a
tag T to an output bit b := VrfyK (M, T ) ∈ {0, 1}, with b = 1
meaning the tag is “valid” and b = 0 meaning it is “invalid”.
I for all `, K ← Gen(1` ), and M ∈ {0, 1}m :
VrfyK (M, MacK (M )) = 1.

85 / 230
MAC security definition: existential unforgeability
Message authentication code Π = (Gen, Mac, Vrfy), M = {0, 1}∗ , security parameter `.

Experiment/game Mac-forgeA,Π (`):


1` K ← Gen(1` ) 1`
M 1, M 2, . . . , M t
T i ← MacK (M i )
T t, . . . , T 2, T 1 A
b := VrfyK (M, T ) adversary
b M, T
M 6∈{M 1 ,M 2 ,...,M t }
1 challenger generates random key K ← Gen(1` )
2 adversary A is given oracle access to MacK (·); let
Q = {M 1 , . . . , M t } denote the set of queries that A asks the oracle
3 adversary outputs (M, T )
4 the experiment outputs 1 if VrfyK (M, T ) = 1 and M 6∈ Q

Definition: A message authentication code Π = (Gen, Mac, Vrfy) is


existentially unforgeable under an adaptive chosen-message attack
(“secure”) if for all probabilistic polynomial-time adversaries A there
exists a negligible function negl such that
P(Mac-forgeA,Π (`) = 1) ≤ negl(`)
86 / 230
MACs versus security protocols
MACs prevent adversaries forging new messages. But adversaries can still
1 replay messages seen previously (“pay £1000”, old CCTV image)

2 drop or delay messages (“smartcard revoked”)

3 reorder a sequence of messages

4 redirect messages to different recipients

A security protocol is a higher-level mechanism that can be built using


MACs, to prevent such manipulations. This usually involves including
into each message additional data before calculating the MAC, such as
I nonces
• message sequence counters
• message timestamps and expiry times
• random challenge from the recipient
• MAC of the previous message
I identification of source, destination, purpose, protocol version
I “heartbeat” (regular message to confirm sequence number)
Security protocols also need to define unambiguous syntax for such
message fields, delimiting them securely from untrusted payload data.
87 / 230
Stream authentication
Alice and Bob want to exchange a sequence of messages M1 , M2 , . . .
They want to verify not just each message individually, but also the
integrity of the entire sequence received so far.
One possibility: Alice and Bob exchange a private key K and then send

A→B: (M1 , T1 ) with T1 = MacK (M1 , 0)


B→A: (M2 , T2 ) with T2 = MacK (M2 , T1 )
A→B: (M3 , T3 ) with T3 = MacK (M3 , T2 )
..
.
B→A: (M2i , T2i ) with T2i = MacK (M2i , T2i−1 )
A→B: (M2i+1 , T2i+1 ) with T2i+1 = MacK (M2i+1 , T2i )
..
.

Mallory can still delay messages or replay old ones. Including in addition unique transmission
timestamps in the messages (in at least M1 and M2 ) allows the recipient to verify their
“freshness” (using a secure, accurate local clock).
88 / 230
MAC using a pseudo-random function

Let F be a pseudo-random function.


I Gen: on input 1` choose K ∈R {0, 1}` randomly
I Mac: read K ∈ {0, 1}` and M ∈ {0, 1}m ,
then output T := FK (M ) ∈ {0, 1}n
I Vrfy: read K ∈ {0, 1}` , M ∈ {0, 1}m , T ∈ {0, 1}n ,
then output 1 iff T = FK (M ).

If F is a pseudo-random function, then (Gen, Mac, Vrfy) is existentially


unforgeable under an adaptive chosen message attack.

89 / 230
MAC using a block cipher: CBC-MAC
Blockcipher E : {0, 1}` × {0, 1}m → {0, 1}m

M1 M2 Mn

⊕ ⊕

EK EK ··· EK

CBC-MACEK (M )

Similar to CBC: IV = 0m , last ciphertext block serves as tag.


Provides existential unforgeability, but only for fixed message length n:
Adversary asks oracle for T 1 := CBC-MACEK (M 1 ) = EK (M 1 ) and then
presents M = M 1 k(T 1 ⊕ M 1 ) and T := CBC-MACEK (M ) =
EK ((M 1 ⊕ T 1 ) ⊕ EK (M 1 )) = EK ((M 1 ⊕ T 1 ) ⊕ T 1 ) = EK (M 1 ) = T 1 .
90 / 230
Variable-length MAC using a block cipher: ECBC-MAC
Blockcipher E : {0, 1}` × {0, 1}m → {0, 1}m

M1 M2 Mn

⊕ ⊕

EK1 EK1 ··· EK1

Padding: M k10p
p = m − (|M | mod m) − 1
EK2
Disadvantages:
I up to two additional
applications of block cipher
I need to rekey block cipher ECBC-MACEK1 ,K2 (M )
I added block if m divides |M |
91 / 230
Variable-length MAC using a block cipher: CMAC
Blockcipher E : {0, 1}` × {0, 1}m → {0, 1}m (typically AES: m = 128)

Derive subkeys K1 , K2 ∈ {0, 1}m from key K ∈ {0, 1}` :


I K0 := EK (0)
I if msb(K0) = 0 then K1 := (K0 ≪ 1) else K1 := (K0 ≪ 1) ⊕ J
I if msb(K1) = 0 then K2 := (K1 ≪ 1) else K2 := (K1 ≪ 1) ⊕ J
This merely clocks a linear-feedback shift register twice, or equivalently multiplies a value in F_{2^m}
twice with x. J is a fixed constant (generator polynomial), ≪ is a left shift.

CMAC algorithm:
M1 kM2 k . . . kMn := M
r := |Mn |
if r = m then Mn := K1 ⊕ Mn
else Mn := K2 ⊕ (Mn k10m−r−1 )
return CBC-MACK (M1 kM2 k . . . kMn )
Provides existential unforgeability, without the disadvantages of ECBC.
NIST SP 800-38B, RFC 4493
92 / 230
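The subkey derivation and tag computation can be sketched directly from the algorithm above and checked against a library implementation (pycryptodome assumed; J = 0x87 for m = 128):

  from Crypto.Cipher import AES          # pycryptodome (assumed installed)
  from Crypto.Hash import CMAC as RefCMAC

  def xor(a, b):
      return bytes(x ^ y for x, y in zip(a, b))

  def dbl(block):                        # multiply by x in F_{2^128}: left shift, conditionally xor J
      i = int.from_bytes(block, "big") << 1
      if i >> 128:
          i = (i & (2**128 - 1)) ^ 0x87
      return i.to_bytes(16, "big")

  def cmac(K, M, n=16):
      E = AES.new(K, AES.MODE_ECB).encrypt
      K1 = dbl(E(bytes(n)))              # K1 derived from E_K(0)
      K2 = dbl(K1)
      if M and len(M) % n == 0:          # last block full: xor with K1
          M = M[:-n] + xor(M[-n:], K1)
      else:                              # last block partial or empty: pad 10...0, xor with K2
          r = len(M) % n
          last, M = M[len(M)-r:], M[:len(M)-r]
          M = M + xor(last + b"\x80" + bytes(n - len(last) - 1), K2)
      T = bytes(n)                       # CBC-MAC with zero IV
      for i in range(0, len(M), n):
          T = E(xor(T, M[i:i+n]))
      return T

  K = bytes(16)
  assert cmac(K, b"hello CMAC") == RefCMAC.new(K, msg=b"hello CMAC", ciphermod=AES).digest()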
Birthday attack against CBC-MAC, ECBC-MAC, CMAC
Let E be an m-bit block cipher, used to build MACK with m-bit tags.
Birthday/collision attack:

I Make t ≈ 2^{m/2} oracle queries for T^i := MACK(⟨i⟩‖R^i‖⟨0⟩) with
  R^i ∈_R {0,1}^m, 1 ≤ i ≤ t.
  Here ⟨i⟩ ∈ {0,1}^m is the m-bit binary integer notation for i.
I Look for a collision T^i = T^j with i ≠ j
I Ask oracle for T′ := MACK(⟨i⟩‖R^i‖⟨1⟩)
I Present M := ⟨j⟩‖R^j‖⟨1⟩ and T := T′ = MACK(M)
The same intermediate value C2 occurs while calculating the MAC of
⟨i⟩‖R^i‖⟨0⟩, ⟨j⟩‖R^j‖⟨0⟩, ⟨i⟩‖R^i‖⟨1⟩, ⟨j⟩‖R^j‖⟨1⟩.
[Diagram: three-block CBC-MAC over ⟨i⟩, R^i, ⟨0⟩/⟨1⟩ with intermediate values C1, C2 and output MACK.]
Possible workaround:
Truncate MAC result to less than m bits,
such that adversary cannot easily spot collisions in C2 from C3.
Solution: big enough m.
93 / 230
A one-time MAC (Carter–Wegman)
The following MAC scheme is very fast and unconditionally secure, but
only if the key is used to secure only a single message.
Let F be a large finite field (e.g. Z_{2^128+51} or GF(2^128)).
I Pick a random key pair K = (K1, K2) ∈ F²
I Split padded message P into blocks P1, . . . , Pm ∈ F
I Evaluate the following polynomial over F to obtain the MAC:

    OT-MAC_{K1,K2}(P) = K1^{m+1} + Pm·K1^m + · · · + P2·K1² + P1·K1 + K2

Converted into a computationally secure many-time MAC:
I Pseudo-random function/permutation EK : F → F
I Pick per-message random value R ∈ F
I CW-MAC_{K1,K2}(P) = (R, K1^{m+1} + Pm·K1^m + · · · + P2·K1² + P1·K1 + EK2(R))
M. Wegman and L. Carter. New hash functions and their use in authentication and set equality.
Journal of Computer and System Sciences, 22:265279, 1981.
94 / 230
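A sketch of the one-time MAC over the prime field Z_p with p = 2^128 + 51, evaluating the polynomial with Horner's rule (an illustration of the formula above; block encoding details are simplified):

  import os

  p = 2**128 + 51                        # the finite field F = Z_p

  def ot_mac(K1, K2, blocks):
      # K1^{m+1} + P_m*K1^m + ... + P_2*K1^2 + P_1*K1 + K2  (mod p)
      acc = 1                            # leading coefficient of K1^{m+1}
      for P in reversed(blocks):         # P_m, P_{m-1}, ..., P_1
          acc = (acc * K1 + P) % p
      return (acc * K1 + K2) % p

  K1, K2 = [int.from_bytes(os.urandom(16), "big") % p for _ in range(2)]
  blocks = [int.from_bytes(b, "big") for b in (b"message block 1.", b"message block 2.")]
  tag = ot_mac(K1, K2, blocks)           # use (K1, K2) for a single message only!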
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

95 / 230
Security against chosen-ciphertext attacks (CCA)
Private-key encryption scheme Π = (Gen, Enc, Dec), M = {0, 1}m , security parameter `.

Experiment/game PrivK^cca_{A,Π}(ℓ):
[Game diagram: the challenger picks b ∈_R {0,1} and K ← Gen(1^ℓ); adversary A has oracle access to EncK and DecK (M^i ↦ C^i, C^i ↦ M^i); A submits M_0, M_1 and receives C ← EncK(M_b); A may continue to query the oracles, except for DecK(C); finally A outputs b′.]

Setup:
I handling of `, b, K as before
Rules for the interaction:
1 The adversary A is given oracle access to EncK and DecK :
A outputs M 1 , gets EncK (M 1 ), outputs C 2 , gets DecK (C 2 ), . . .
m
2 The adversary A outputs a pair of messages: M0 , M1 ∈ {0, 1} .

3 The challenger computes C ← EncK (Mb ) and returns C to A

4 The adversary A continues to have oracle access to EncK and DecK


but is not allowed to ask for DecK (C).
Finally, A outputs b0 . If b0 = b then A has succeeded ⇒ PrivKcca
A,Π (`) = 1
← 83 / 230
Ciphertext integrity
Private-key encryption scheme Π = (Gen, Enc, Dec), Dec can output error: ⊥
Experiment/game CI_{A,Π}(ℓ):
[Game diagram: the challenger generates K ← Gen(1^ℓ); adversary A queries the EncK oracle with M^1, M^2, . . . , M^t and receives C^1, . . . , C^t; A outputs C ∉ {C^1, C^2, . . . , C^t}; the challenger sets b := 0 if DecK(C) = ⊥ and b := 1 if DecK(C) ≠ ⊥.]

1 challenger generates random key K ← Gen(1` )


2 adversary A is given oracle access to EncK (·); let Q = {C 1 , . . . , C t }
denote the set of query answers that A got from the oracle
3 adversary outputs C
4 the experiment outputs 1 if DecK (C) 6= ⊥ and C 6∈ Q
Definition: An encryption scheme Π = (Gen, Enc, Dec) provides
ciphertext integrity if for all probabilistic polynomial-time adversaries A
there exists a negligible function negl such that
P(CIA,Π (`) = 1) ≤ negl(`)
96 / 230
Authenticated encryption
Definition: An encryption scheme Π = (Gen, Enc, Dec) provides
authenticated encryption if it provides both CPA security and ciphertext
integrity.
Such an encryption scheme will then also be CCA secure.
Example:
Private-key encryption scheme ΠE = (GenE , Enc, Dec)
Message authentication code ΠM = (GenM , Mac, Vrfy)

Encryption scheme Π0 = (Gen0 , Enc0 , Dec0 ):


1 Gen′(1^ℓ) := (KE, KM) with KE ← GenE(1^ℓ) and KM ← GenM(1^ℓ)
2 Enc′_{(KE,KM)}(M) := (C, T) with C ← EncKE(M) and T ← MacKM(C)
3 Dec′ on input of (KE, KM) and (C, T) first check if
  VrfyKM(C, T) = 1. If yes, output DecKE(C), if no output ⊥.
If ΠE is a CPA-secure private-key encryption scheme and ΠM is a secure
message authentication code with unique tags, then Π0 is a CCA-secure
private-key encryption scheme.
A message authentication code has unique tags, if for every K and every M there exists a unique
value T , such that VrfyK (M, T ) = 1. 97 / 230
Combining encryption and message authentication
Warning: Not every way of combining a CPA-secure encryption scheme
(to achieve privacy) and a secure message authentication code (to
prevent forgery) will necessarily provide CPA security:

Encrypt-and-authenticate: (EncKE (M ), MacKM (M ))


Unlikely to be CPA secure: MAC may leak information about M .

Authenticate-then-encrypt: EncKE (M kMacKM (M ))


May not be CPA secure: the recipient first decrypts the received
message with DecKE , then parses the result into M and MacKM (M ) and
finally tries to verify the latter. A malleable encryption scheme, combined
with a parser that reports syntax errors, may reveal information about M .

Encrypt-then-authenticate: (EncKE (M ), MacKM (EncKE (M )))


Secure: provides both CCA security and existential unforgeability.
If the recipient does not even attempt to decrypt M unless the MAC has been verified successfully,
this method can also prevent some side-channel attacks.

Note: CCA security alone does not imply existential unforgeability.


98 / 230
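A sketch of the encrypt-then-authenticate composition, instantiated here (as one possible choice, not a prescribed one) with AES-CTR for Enc and HMAC-SHA-256 for Mac, using pycryptodome for AES:

  import os, hmac, hashlib
  from Crypto.Cipher import AES          # pycryptodome (assumed installed)

  def enc_then_mac(KE, KM, M):
      nonce = os.urandom(8)
      C = nonce + AES.new(KE, AES.MODE_CTR, nonce=nonce).encrypt(M)
      T = hmac.new(KM, C, hashlib.sha256).digest()        # MAC over the ciphertext
      return C, T

  def dec_then_verify(KE, KM, C, T):
      if not hmac.compare_digest(hmac.new(KM, C, hashlib.sha256).digest(), T):
          return None                                     # reject before decrypting
      return AES.new(KE, AES.MODE_CTR, nonce=C[:8]).decrypt(C[8:])

  KE, KM = os.urandom(16), os.urandom(16)                 # two independent keys
  C, T = enc_then_mac(KE, KM, b"attack at dawn")
  assert dec_then_verify(KE, KM, C, T) == b"attack at dawn"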
Padding oracle
TLS record protocol:
Recipient steps: CBC decryption, then checks and removes padding,
finally checks MAC.
Padding: append n times byte n (1 ≤ n ≤ 16)
Padding syntax error and MAC failure (used to be) distinguished in error
messages.

C0 = IV C1 C2 C3

DK DK DK

⊕ ⊕ ⊕

M1 M2 M3 kpad
99 / 230
Padding oracle (cont’d)
Attacker has C0 , . . . , C3 and tries to get M2 :

I truncate ciphertext after C2


I a = actual last byte of M2,
  g = attacker’s guess of a
  (try all g ∈ {0, . . . , 255})
I XOR the last byte of C1 with g ⊕ 0x01
I last byte of M2 is now a ⊕ g ⊕ 0x01
I g = a: padding correct ⇒ MAC failed error
  g ≠ a: padding syntax error (high prob.)
[Diagram: truncated ciphertext C0 = IV, C1, C2; CBC decryption of C1, C2 with the modified C1, yielding M1, M2.]

Then try 0x02 0x02 and so on.

Serge Vaudenay: Security flaws induced by CBC padding, EUROCRYPT 2002

100 / 230
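The guess loop for the last plaintext byte can be sketched as follows (a toy sketch: padding_oracle(iv, ct) stands for the victim's distinguishable error behaviour and is an assumed interface, not part of TLS):

  def recover_last_byte(C1, C2, padding_oracle):
      # Recover the last byte a of M2, where M2 = D_K(C2) xor C1.
      for g in range(256):                               # guess g for a
          C1_mod = C1[:-1] + bytes([C1[-1] ^ g ^ 0x01])
          if padding_oracle(C1_mod, C2):                  # padding valid => last byte was 0x01
              return g                                    # a xor g xor 0x01 = 0x01, so a = g
      raise ValueError("no guess produced valid padding")

The remaining bytes follow the same pattern: once a is known, force the last byte to 0x02 and guess the second-to-last byte so that the padding 0x02 0x02 becomes valid, and so on. (Occasional false positives from longer accidental paddings are ignored in this sketch.)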
Galois Counter Mode (GCM)
CBC and CBC-MAC used together require different keys, resulting in two
encryptions per block of data.
Galois Counter Mode is a more efficient authenticated encryption
technique that requires only a single encryption, plus one XOR ⊕ and
one multiplication ⊗, per block of data:

Ci = Mi ⊕ EK (O + i)
Gi = (Gi−1 ⊕ Ci ) ⊗ H, G0 = A ⊗ H, H = EK (0)

GMACEK (A, C) = (Gn ⊕ (len(A)k len(C))) ⊗ H ⊕ EK (O)

A is authenticated, but not encrypted (e.g., message header).


The multiplication ⊗ is over the Galois field F_{2^128}: block bits are
interpreted as coefficients of binary polynomials of degree 127, and the
result is reduced modulo x^128 + x^7 + x^2 + x + 1.
This is like 128-bit modular integer multiplication, but without carry bits,
and therefore faster in hardware.
http://csrc.nist.gov/publications/nistpubs/800-38D/SP-800-38D.pdf
101 / 230
[Diagram: counter-mode encryption Ci = Mi ⊕ EK(O + i) for i = 1, . . . , n, followed by the GHASH chain: A, C1, . . . , Cn and len(A)k len(C) are successively XORed in and multiplied by EK(0); the final product is XORed with EK(O) to give GMACEK(A, C).]
102 / 230
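The ⊗ operation can be sketched as carry-less polynomial multiplication followed by reduction (a simplified sketch that treats blocks as big integers and ignores GCM's reflected bit-order convention, so it is not byte-compatible with the NIST specification):

  def gf128_mul(a, b):
      # multiply two binary polynomials of degree < 128, reduce mod x^128 + x^7 + x^2 + x + 1
      product = 0
      while b:                                   # carry-less ("XOR") multiplication
          if b & 1:
              product ^= a
          a <<= 1
          b >>= 1
      poly = (1 << 128) | 0b10000111             # x^128 + x^7 + x^2 + x + 1
      for i in range(product.bit_length() - 1, 127, -1):
          if (product >> i) & 1:                 # reduce: xor a shifted copy of the modulus
              product ^= poly << (i - 128)
      return product

  # one GHASH step from the slide:  G_i = (G_{i-1} xor C_i) * H
  # G = gf128_mul(G ^ C_i, H)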
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

103 / 230
Hash functions
A hash function h : {0, 1}∗ → {0, 1}` efficiently maps arbitrary-length
input strings onto fixed-length “hash values” such that the output is
uniformly distributed in practice.
Typical applications of hash functions:
I hash table: data structure for fast t = O(1) table lookup; storage
address of a record containing value x is determined by h(x).
I Bloom filter: data structure for fast probabilistic set membership test
I fast probabilistic string comparison (record deduplication, diff, rsync)
I Rabin–Karp algorithm: substring search with rolling hash
Closely related: checksums (CRC, Fletcher, Adler-32, etc.)

A good hash function h is one that minimizes the chances of a collision


of the form h(x) = h(y) with x 6= y.
But constructing collisions is not difficult for normal hash functions and
checksums, e.g. to modify a file without affecting its checksum.
Algorithmic complexity attack: craft program input to deliberately trigger worst-case runtime
(denial of service). Example: deliberately fill a server’s hash table with colliding entries.
104 / 230
Secure hash functions
A secure, collision-resistant `-bit hash function h : {0, 1}∗ → {0, 1}` is
designed to make it infeasible for an adversary who knows the
implementation of h to find any collision
h(x) = h(y) with x 6= y

Examples for applications of secure hash functions:


I message digest for efficient calculation of digital signatures
I fast message-authentication codes (HMAC)
I tamper-resistant checksum of files
$ sha1sum security?-slides.tex
2c1331909a8b457df5c65216d6ee1efb2893903f security1-slides.tex
50878bcf67115e5b6dcc866aa0282c570786ba5b security2-slides.tex
I git commit identifiers
I P2P file sharing identifiers
I key derivation functions
I password verification
I hash chains (e.g., Bitcoin, timestamping services)
I commitment protocols
105 / 230
Secure hash functions: standards
I MD5: ` = 128 (Rivest, 1991)
insecure, collisions were found in 1996/2004, collisions used in
real-world attacks (Flame, 2012) → avoid (still ok for HMAC)
http://www.ietf.org/rfc/rfc1321.txt

I SHA-1: ` = 160 (NSA, 1995)


widely used today (e.g., git), but 2^69-step algorithm to find collisions
found in 2005 → being phased out (still ok for HMAC)
I SHA-2: ` = 224, 256, 384, or 512
close relative of SHA-1, therefore long-term collision-resistance
questionable, very widely used standard
FIPS 180-3 US government secure hash standard,
http://csrc.nist.gov/publications/fips/

I SHA-3: Keccak wins 5-year NIST contest in October 2012


no length-extension attack, arbitrary-length output,
can also operate as PRNG, very different from SHA-1/2.
(other finalists: BLAKE, Grøstl, JH, Skein)
http://csrc.nist.gov/groups/ST/hash/sha-3/
http://keccak.noekeon.org/
106 / 230
Collision resistance – a formal definition
Hash function
A hash function is a pair of probabilistic polynomial-time (PPT)
algorithms (Gen, H) where
I Gen reads a security parameter 1n and outputs a key s.
I H reads key s and input string x ∈ {0, 1}∗ and outputs
Hs (x) ∈ {0, 1}`(n) (where n is a security parameter implied by s)

Formally define collision resistance using the following game:


1 Challenger generates a key s = Gen(1^n)
2 Challenger passes s to adversary A
3 A replies with x, x′
4 A has found a collision iff Hs(x) = Hs(x′) and x ≠ x′

A hash function (Gen, H) is collision resistant if for all PPT adversaries


A there is a negligible function negl such that
P(A found a collision) ≤ negl(n)
A fixed-length compression function is only defined on x ∈ {0,1}^{ℓ′(n)} with ℓ′(n) > ℓ(n).
107 / 230
Unkeyed hash functions

Commonly used collision-resistant hash functions (SHA-256, etc.) do not


use a key s. They are fixed functions of the form h : {0, 1}∗ → {0, 1}` .
Why do we need s in the security definition?
Any fixed function h where the size of the domain (set of possible input
values) is greater than the range (set of possible output values) will have
collisions x, x0 . There always exists a constant-time adversary A that just
outputs these hard-wired values x, x0 .
Therefore, a complexity-theoretic security definition must depend on a
key s (and associated security parameter 1n ). Then H becomes a recipe
for defining ever new collision-resistant fixed functions Hs .
So in practice, s is a publicly known fixed constant, embedded in the
secure hash function h.
Also, without any security parameter n, we could not use the notion of a negligible function.

108 / 230
Weaker properties implied by collision resistance
Second-preimage resistance
For a given s and input value x, it is infeasible for any polynomial-time
adversary to find x0 with Hs (x0 ) = Hs (x) (except with negligible
probability).

If there existed a PPT adversary A that can break the second-preimage


resistance of Hs , then A can also break its collision resistance.
Therefore, collision resistance implies second-preimage resistance.

Preimage resistance
For a given s and output value y, it is infeasible for any polynomial-time
adversary to find x0 with Hs (x0 ) = y (except with negligible probability).

If there existed a PPT adversary A that can break the pre-image


resistance of Hs , then A can also break its second-preimage resistance
(with high probability). Therefore, either collision resistance or
second-preimage resistance imply preimage resistance.
How? Give y = Hs (x) to A and hope for output x0 6= x
Note: collision resistance does not prevent Hs from leaking information about x (→ CPA). 109 / 230
Merkle–Damgård construction

Wanted: variable-length hash function (Gen, H).


Given: (Gen, C), a fixed-length hash function with
C : {0,1}^{2n} → {0,1}^n (“compression function”)
Input of H: key s, string x ∈ {0,1}^L with length L < 2^n
1 Pad x to length divisible by n by appending “0” bits, then split the
  result into B = ⌈L/n⌉ blocks of length n each:
      x‖0^{n⌈L/n⌉−L} = x_1‖x_2‖x_3‖ . . . ‖x_{B−1}‖x_B
2 Append a final block x_{B+1} = ⟨L⟩, which contains the n-bit binary
  representation of input length L = |x|.
3 Set z_0 := 0^n (initial vector, IV)
4 Compute z_i := C_s(z_{i−1}‖x_i) for i = 1, . . . , B + 1
5 Output H_s(x) := z_{B+1}

110 / 230
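A toy Merkle–Damgård construction following the five steps above, at byte granularity (the compression function is a truncated SHA-256 call, purely a stand-in for illustration):

  import hashlib

  N = 16                                         # n-bit chaining value / block size, in bytes

  def Cs(z, x):
      # toy compression function C_s : {0,1}^{2n} -> {0,1}^n (illustration only)
      return hashlib.sha256(z + x).digest()[:N]

  def md_hash(x):
      L = len(x)
      x += bytes(-L % N)                         # step 1: pad with "0" bytes to a multiple of n
      blocks = [x[i:i+N] for i in range(0, len(x), N)]
      blocks.append((8 * L).to_bytes(N, "big"))  # step 2: final block <L> with the input bit length
      z = bytes(N)                               # step 3: z_0 = 0^n
      for b in blocks:
          z = Cs(z, b)                           # step 4: z_i = C_s(z_{i-1} || x_i)
      return z                                   # step 5: H_s(x) = z_{B+1}

  print(md_hash(b"hello").hex())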
x‖0^{n⌈L/n⌉−L} = x_1‖x_2‖x_3‖ . . . ‖x_{B−1}‖x_B

[Diagram: blocks x_1, x_2, . . . , x_B, ⟨L⟩ fed through C_s, chaining values z_0 = 0^n, z_1, . . . , z_{B+1} = H_s(x)]

x ≠ x′

x′‖0^{n⌈L′/n⌉−L′} = x′_1‖x′_2‖x′_3‖ . . . ‖x′_{B′−1}‖x′_{B′}

[Diagram: the same chain for x′, with chaining values z′_0 = 0^n, z′_1, . . . , z′_{B′+1} = H_s(x′)]
111 / 230
Merkle–Damgård construction – security proof
If the fixed-length compression function C is collision resistant, so will be
the variable-length hash function H resulting from the Merkle–Damgård
construction.
Proof outline:
Assume Cs is collision resistant, but H is not, because some PPT
adversary A outputs x 6= x0 with Hs (x) = Hs (x0 ).
Let x1 , . . . , xB be the n-bit blocks of padded L-bit input x, and
x01 , . . . , x0B 0 be those of L0 -bit input x0 , and xB+1 = hLi, x0B 0 +1 = hL0 i.
Case L ≠ L′: Then x_{B+1} ≠ x′_{B′+1} but H_s(x) = z_{B+1} =
           C_s(z_B‖x_{B+1}) = C_s(z′_{B′}‖x′_{B′+1}) = z′_{B′+1} = H_s(x′), which
           is a collision in C_s.
Case L = L′: Now B = B′. Let i ∈ {1, . . . , B + 1} be the largest index
           where z_{i−1}‖x_i ≠ z′_{i−1}‖x′_i. (Such i exists as due to
           |x| = |x′| and x ≠ x′ there will be at least one 1 ≤ j ≤ B
           with x_j ≠ x′_j.) Then z_k = z′_k for all k ∈ {i, . . . , B + 1}
           and z_i = C_s(z_{i−1}‖x_i) = C_s(z′_{i−1}‖x′_i) = z′_i is a collision in
           C_s.
So Cs was not collision resistant, invalidating the assumption.
112 / 230
Compression function from block ciphers
Davies–Meyer construction
One possible technique for obtaining a collision-resistant compression
function C is to use a block cipher E : {0, 1}` × {0, 1}n → {0, 1}n in the
following way:
C(K, M ) = EK (M ) ⊕ M

M EK ⊕ C(K, M )

or in the notation of slide 110 (with K = xi and M = zi−1 ):


C(zi−1 kxi ) = Exi (zi−1 ) ⊕ zi−1

However, the security proof for this construction requires E to be an


ideal cipher, a keyed random permutation. It is not sufficient for E to
merely be a strong pseudo-random permutation.
Warning: use only block ciphers that have specifically been designed to be used this way. Other
block ciphers (e.g., DES) may have properties that can make them unsuitable here (e.g., related
key attacks, block size too small).
113 / 230
SHA-1 structure
Merkle–Damgård construction, block length n = 512 bits.

Compression function: One round:


I Input = 160 bits = five 32-bit registers A–E
I each block = 16 32-bit words W0, . . . , W15
I LFSR extends that sequence to 80 words: W16, . . . , W79
I 80 rounds, each fed one Wi
I Round constant Ki and non-linear function Fi change every 20 rounds.
I four 32-bit additions (mod 2^32) and two 32-bit rotations (<<<5, <<<30)
  per round, 2–5 32-bit Boolean operations for F.
I finally: 32-bit add round 0 input to round 79 output (Davies–Meyer)
[One-round diagram: registers A, B, C, D, E with F, <<<5, <<<30, Wt, Kt; illustration: commons.wikimedia.org, CC SA-BY]
114 / 230
Random oracle model

Many applications of secure hash functions have no security proof that


relies only on the collision resistance of the function used.
The known security proofs require instead a much stronger assumption,
the strongest possible assumption one can make about a hash function:

Random oracle
I A random oracle H is a device that accepts arbitrary length strings
X ∈ {0, 1}∗ and consistently outputs for each a value
H(X) ∈ {0, 1}` which it chooses uniformly at random.
I Once it has chosen an H(X) for X, it will always output that same
answer for X consistently.
I Parties can privately query the random oracle (nobody else learns
what anyone queries), but everyone gets the same answer if they
query the same value.
I No party can infer anything about H(X) other than by querying X.

115 / 230
Ideal cipher model
A random-oracle equivalent can be defined for block ciphers:

Ideal cipher
Each key K ∈ {0, 1}` defines a random permutation EK , chosen
uniformly at random out of all (2^n)! permutations. All parties have oracle
access to both EK(X) and EK^{-1}(X) for any (K, X). No party can infer
any information about EK(X) (or EK^{-1}(X)) without querying its value
for (K, X).

We have encountered random functions and random permutations


before, as a tool for defining pseudo-random functions/permutations.
Random oracles and ideal ciphers are different:
If a security proof is made “in the random oracle model”, then a hash
function is replaced by a random oracle or a block cipher is replaced by
an ideal cipher.
In other words, the security proof makes much stronger assumptions
about these components: they are not just indistinguishable from random
functions/permutations by any polynomial-time distinguisher, they are
actually assumed to be random functions/permutations. 116 / 230
Davies–Meyer construction – security proof

C(K, X) = EK (X) ⊕ X
If E is modeled as an ideal cipher, then C is a collision-resistant hash
function. Any attacker A making q < 2`/2 oracle queries to E finds a
collision with probability not higher than q 2 /2` . (negligible)
Proof: Attacker A tries to find (K, X), (K 0 , X 0 ) with
EK (X) ⊕ X = EK 0 (X 0 ) ⊕ X 0 . We assume that, before outputting
(K, X), (K 0 , X 0 ), A has previously made queries to learn EK (X) and
EK′(X′). We also assume (wlog) A never makes redundant queries, so
having learnt Y = EK(X), A will not query EK^{-1}(Y) and vice versa.
The i-th query (Ki, Xi) to E only reveals

    ci = C(Ki, Xi) = EKi(Xi) ⊕ Xi.

A query to E^{-1} instead would only reveal EKi^{-1}(Yi) = Xi and therefore
ci = C(Ki, Xi) = Yi ⊕ EKi^{-1}(Yi).
ci = Ci (Ki , Xi ) = Yi ⊕ EK i
(Yi ).

A needs to find ci = cj with i > j.


117 / 230
For some fixed pair i, j with i > j, what is the probability of ci = cj ?
A collision at query i can only occur as one of these two query results:
I EKi(Xi) = cj ⊕ Xi
I EKi^{-1}(Yi) = cj ⊕ Yi
Each query will reveal a new uniformly distributed `-bit value, except that
it may be constrained by (at most) i − 1 previous query results (since
EKi must remain a permutation).
Therefore, the ideal cipher E will answer query i by uniformly choosing a
value out of at least 2` − (i − 1) possible values.
Therefore, each of the above two possibilities for reaching ci = cj can
happen with probability no higher than 1/(2` − (i − 1)).
With i ≤ q < 2^{ℓ/2} and ℓ > 1, we have

    P(ci = cj) ≤ 1/(2^ℓ − (i − 1)) ≤ 1/(2^ℓ − 2^{ℓ/2}) ≤ 2/2^ℓ

There are (q choose 2) < q²/2 pairs j < i ≤ q, so the collision probability after q
queries cannot be more than (2/2^ℓ) · (q²/2) = q²/2^ℓ.
118 / 230
Random oracle model – controversy
Security proofs that replace the use of a hash function with a query to a
random oracle (or a block cipher with an ideal cipher) remain controversial.
Cons
I Real hash algorithms are publicly known. Anyone can query them
privately as often as they want, and look for shortcuts.
I No good justification to believe that proofs in the random oracle model
say anything about the security of a scheme when implemented with
practical hash functions (or pseudo-random functions/permutations).
I No good criteria known to decide whether a practical hash function is
“good enough” to instantiate a random oracle.
Pros
I A random-oracle model proof is better than no proof at all.
I Many efficient schemes (especially for public-key crypto) only have
random-oracle proofs.
I No history of successful real-world attacks against schemes with
random-oracle security proofs.
I If such a scheme were attacked successfully, it should still be fixable by
using a better hash function.
119 / 230
Sponge functions
Another way to construct a secure hash function H(M ) = Z:

http://sponge.noekeon.org/

(r + c)-bit internal state, XOR r-bit input blocks at a time, stir with
pseudo-random permutation f , output r-bit output blocks at a time.
Versatile: secure hash function (variable input length) and stream cipher
(variable output length)
Advantage over Merkle–Damgård: internal state > output, flexibility.
120 / 230
Duplex construction

http://sponge.noekeon.org/

A variant of the sponge construction, proposed to provide


I authenticated encryption (basic idea: σi = Ci = Mi ⊕ Zi−1 )
I reseedable pseudo-random bit sequence generator
(for post-processing and expanding physical random sources)
G. Bertoni, J. Daemen, et al.: Duplexing the sponge: single-pass authenticated encryption and
other applications. SAC 2011. http://dx.doi.org/10.1007/978-3-642-28496-0_19
http://sponge.noekeon.org/SpongeDuplex.pdf
121 / 230
SHA-3

Latest NIST secure hash algorithm


I Sponge function with b = r + c = 1600 = 5 × 5 × 64 bits of state
I Standardized (SHA-2 compatible) output sizes:
` ∈ {224, 256, 384, 512} bits
I Internal capacity: c = 2`
I Input block size: r = b − 2` ∈ {1152, 1088, 832, 576} bits
I Padding: append 10∗ 1 to extend input to next multiple of r
NIST also defined two related extendable-output functions (XOFs),
SHAKE128 and SHAKE256, which accept arbitrary-length input and can
produce arbitrary-length output. PRBG with 128 or 256-bit security.

SHA-3 standard: permutation-based hash and extendable-output functions. August 2015.


http://dx.doi.org/10.6028/NIST.FIPS.202

122 / 230
Probability of collision / Birthday problem
With 23 random people in a room, there is a 0.507 chance that two share a birthday. Surprised?

We throw b balls into n bins, selecting each bin uniformly at random.


With what probability do at least two balls end up in the same bin?
[Two plots: collision probability over the number of balls thrown into 10^40 bins, on a linear scale (left) and a logarithmic scale (right), each showing an upper and a lower bound.]

Remember: for large n the collision probability
I is near 1 for b ≫ √n
I is near 0 for b ≪ √n, growing roughly proportional to b²/n
Expected number of balls thrown before first collision: √(πn/2) (for n → ∞)
Approximation formulas: http://cseweb.ucsd.edu/~mihir/cse207/w-birthday.pdf
← 57 / 230
“Birthday attacks”
If a hash function outputs `-bit words, an attacker needs to try only
2^{ℓ/2} different input values, before there is a better than 50% chance of
finding a collision.
Computational security
Attacks requiring 2^128 steps considered infeasible =⇒ use hash function
that outputs ` = 256 bits (e.g., SHA-256). If only second pre-image
resistance is a concern, shorter ` = 128-bit may be acceptable.
Finding useful collisions
An attacker needs to generate a large number of plausible input
plaintexts to find a practically useful collision. For English plain text,
synonym substitution is one possibility for generating these:
A: Mallory is a {good,hardworking} and {honest,loyal} {employee,worker}
B: Mallory is a {lazy,difficult} and {lying,malicious} {employee,worker}
Both A and B can be phrased in 23 variants each =⇒ 26 pairs of phrases.
With a 64-bit hash over an entire letter, we need only 11 such
sentences for a good chance to find a collision in 2^34 steps.
123 / 230
Low-memory collision search
A normal search for an `-bit collision uses O(2`/2 ) memory and time.
Algorithm for finding a collision with O(1) memory and O(2^{ℓ/2}) time:

Input: H : {0,1}* → {0,1}^ℓ
Output: x ≠ x′ with H(x) = H(x′)

x_0 ← {0,1}^{ℓ+1}
x′ := x := x_0
i := 0
loop
  i := i + 1
  x := H(x)        // x = H^i(x_0)
  x′ := H(H(x′))   // x′ = H^{2i}(x_0)
until x = x′
x′ := x, x := x_0
for j = 1, 2, . . . , i
  if H(x) = H(x′) return (x, x′)
  x := H(x)        // x = H^j(x_0)
  x′ := H(x′)      // x′ = H^{i+j}(x_0)

Basic idea:
I Tortoise x goes at most once round the cycle, hare x′ at least once
I loop 1: ends when x′ overtakes x for the first time
  ⇒ x′ now i steps ahead of x ⇒ i is now an integer multiple of the cycle length
I loop 2: x back at start, x′ is i steps ahead, same speed ⇒ meet at cycle entry point
Wikipedia: Cycle detection
124 / 230
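The same algorithm in runnable form, with the hash truncated to 32 bits so that a collision is found within seconds (a demonstration sketch):

  import hashlib, os

  def H(x):                              # toy 32-bit hash: truncated SHA-256
      return hashlib.sha256(x).digest()[:4]

  def find_collision():
      x0 = os.urandom(5)                 # start value longer than the hash output
      x = xp = x0
      i = 0
      while True:                        # loop 1: tortoise x, hare xp
          i += 1
          x = H(x)                       # x  = H^i(x0)
          xp = H(H(xp))                  # xp = H^2i(x0)
          if x == xp:
              break
      xp, x = x, x0                      # loop 2: tortoise restarts from x0
      for _ in range(i):
          if H(x) == H(xp):
              return x, xp               # x != xp, but H(x) == H(xp)
          x, xp = H(x), H(xp)

  a, b = find_collision()
  assert a != b and H(a) == H(b)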
Constructing meaningful collisions
Tortoise-hare algorithm gives no direct control over content of x, x0 .
Solution:
Define a text generator function g : {0, 1}` → {0, 1}∗ , e.g.
g(0000) = Mallory is a good and honest employee
g(0001) = Mallory is a lazy and lying employee
g(0010) = Mallory is a good and honest worker
g(0011) = Mallory is a lazy and lying worker
g(0100) = Mallory is a good and loyal employee
g(0101) = Mallory is a lazy and malicious employee
···
g(1111) = Mallory is a difficult and malicious worker

Then apply the tortoise-hare algorithm to H(x) = h(g(x)), if h is the


hash function for which a meaningful collision is required.
With probability 1/2 the resulting x, x′ (h(g(x)) = h(g(x′))) will differ in
the last bit ⇒ collision between two texts with different meanings.
125 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

126 / 230
Hash and MAC
A secure hash function can be combined with a fixed-length MAC to
provide a variable-length MAC Mack (H(m)). More formally:
Let Π = (Mac, Vrfy) be a MAC for messages of length `(n) and let
ΠH = (GenH , H) be a hash function with output length `(n). Then
define variable-length MAC Π0 = (Gen0 , Mac0 , Vrfy0 ) as:
I Gen0 : Read security parameter 1n , choose uniform k ∈ {0, 1}n , run
s := GenH (1n ) and return (k, s).
I Mac0 : read key (k, s) and message m ∈ {0, 1}∗ , return tag
Mack (Hs (m)).
I Vrfy0 : read key (k, s), message m ∈ {0, 1}∗ , tag t, return
Vrfyk (Hs (m), t).
If Π offers existential unforgeability and ΠH is collision resistant, then Π0
will offer existential unforgeability.
Proof outline: If an adversary used Mac0 to get tags on a set Q of messages,
and then can produce a valid tag for m∗ 6∈ Q, then there are two cases:
I ∃m ∈ Q with Hs (m) = Hs (m∗ ) ⇒ Hs not collision resistant
I ∀m ∈ Q : Hs (m) 6= Hs (m∗ ) ⇒ Mac failed existential unforgeability
127 / 230
Hash-based message authentication code
Initial idea: hash a message M prefixed with a key K to get

MACK (M ) = h(KkM )

This construct is secure in the random oracle model (where h is a


random function). Is is also generally considered secure with fixed-length
m-bit messages M ∈ {0, 1}m or with sponge-function based hash
algorithm h, such as SHA-3.
Danger: If h uses the Merkle–Damgård construction, an adversary can
call the compression function again on the MAC to add more blocks to
M , and obtain the MAC of a longer M 0 without knowing the key!
To prevent such a message-extension attack, variants like

MACK (M ) = h(h(KkM ))

or
MACK (M ) = h(Kkh(M ))
could be used to terminate the iteration of the compression function in a
way that the adversary cannot continue. ⇒ HMAC
128 / 230
HMAC
HMAC is a standard technique widely used to form a
message-authentication code using a Merkle–Damgård-style secure hash
function h, such as MD5, SHA-1 or SHA-256:
HMACK (x) = h(K ⊕ opadkh(K ⊕ ipadkx))

Fixed padding values ipad, opad extend the key to the input size of the
compression function, to permit precomputation of its first iteration.

x‖padding(n + |x|) = x_1‖x_2‖x_3‖ . . . ‖x_{B−1}‖x_B
[Diagram: inner hash: compression-function chain C_s over K ⊕ ipad, x_1, . . . , x_B, starting from 0^n;
 outer hash: chain over K ⊕ opad and the padded inner result (padding(2n)), starting from 0^n, giving HMACK(x).]
http://www.ietf.org/rfc/rfc2104.txt
129 / 230
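The two-pass construction can be reproduced in a few lines and checked against Python's built-in hmac module (SHA-256 with its 64-byte block size; ipad = 0x36…, opad = 0x5c…):

  import hashlib, hmac

  def my_hmac_sha256(K, x):
      if len(K) > 64:                            # long keys are hashed first
          K = hashlib.sha256(K).digest()
      K = K.ljust(64, b"\x00")                   # extend key to the block size
      inner = hashlib.sha256(bytes(b ^ 0x36 for b in K) + x).digest()      # h(K xor ipad || x)
      return hashlib.sha256(bytes(b ^ 0x5c for b in K) + inner).digest()   # h(K xor opad || inner)

  K, x = b"secret key", b"message to authenticate"
  assert my_hmac_sha256(K, x) == hmac.new(K, x, hashlib.sha256).digest()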
Secure commitment
Proof of prior knowledge
You have today an idea that you write down in message M . You do not
want to publish M yet, but you want to be able to prove later that you
knew M already today. Initial idea: you publish h(M ) today.
Danger: if the entropy of M is small (e.g., M is a simple choice, a PIN,
etc.), there is a high risk that your adversary can invert the
collision-resistant function h successfully via brute-force search.
Solution:
I Pick (initially) secret N ∈ {0, 1}128 uniformly at random.
I Publish h(N, M ) (as well as h and |N |).
I When the time comes to reveal M , also reveal N .

You can also commit yourself to message M , without yet revealing its
content, by publishing h(N, M ).
Applications: online auctions with sealed bids, online games where
several parties need to move simultaneously, etc.
Tuple (N, M ) means any form of unambiguous concatenation, e.g. N kM if length |N | is agreed.
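A minimal Python sketch of such a commitment, using SHA-256 and a 128-bit nonce N (the function names are illustrative; since |N | is fixed and agreed, plain concatenation N ‖M is unambiguous):

import hashlib, hmac, os

def commit(message: bytes) -> tuple[bytes, bytes]:
    nonce = os.urandom(16)                    # secret N, |N| = 128 bits
    c = hashlib.sha256(nonce + message).digest()
    return c, nonce                           # publish c now, keep (nonce, message) secret

def reveal_ok(c: bytes, nonce: bytes, message: bytes) -> bool:
    return hmac.compare_digest(c, hashlib.sha256(nonce + message).digest())

c, n = commit(b'my sealed auction bid: 42')
assert reveal_ok(c, n, b'my sealed auction bid: 42')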
130 / 230
Merkle tree
Problem: Untrusted file store, small trusted memory. Solution: hash tree.
Leaves contain hash values of files F0 , . . . , Fk−1 . Each inner node
contains the hash of its children. Only root h0 (and number k of files)
needs to be stored securely.
Advantages of tree (over naive alternative h0 = h(F0 , . . . , Fk−1 )):
I Update of a file Fi requires only O(log k) recalculations of hash
values along path from h(Fi ) to root (not rereading every file).
I Verification of a file requires only reading O(log k) values in all
direct children of nodes in path to root (not rereading every node).
h0 = h(h1 , h2 )

h1 = h(h3 , h4 ) h2 = h(h5 , h6 )

h3 = h(h7 , h8 ) h4 = h(h9 , h10 ) h5 = h(h11 , h12 ) h6 = h(h13 , h14 )

h7 = h8 = h9 = h10 = h11 = h12 = h13 = h14 =


h(F0 ) h(F1 ) h(F2 ) h(F3 ) h(F4 ) h(F5 ) h(F6 ) h(F7 )
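A minimal Python sketch of computing the root hash h0 with SHA-256 (an odd number of nodes is handled here by duplicating the last one; real designs differ in such details):

import hashlib

def h(*parts: bytes) -> bytes:
    return hashlib.sha256(b''.join(parts)).digest()

def merkle_root(files: list[bytes]) -> bytes:
    level = [h(f) for f in files]             # leaves: hashes of the file contents
    while len(level) > 1:
        if len(level) % 2:
            level.append(level[-1])           # pad an odd level by repeating the last node
        level = [h(level[i], level[i + 1])    # inner node = hash of its two children
                 for i in range(0, len(level), 2)]
    return level[0]                           # h0: the only value that needs trusted storage

root = merkle_root([b'F0', b'F1', b'F2', b'F3', b'F4', b'F5', b'F6', b'F7'])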
131 / 230
One-time passwords from a hash chain
Generate hash chain: (h is preimage resistant, with ASCII output)
R0 ← random
R1 := h(R0 )
..
.
Rn−1 := h(Rn−2 )
Rn := h(Rn−1 )

Equivalently: Ri := h(h(h(. . . h(R0 ) . . .))) = h^i (R0 ), applying h i times (0 < i ≤ n)

Store last chain value H := Rn on the host server. Give the remaining
list Rn−1 , Rn−2 , . . . , R0 as one-time passwords to the user.
When the user enters password Ri , check whether h(Ri ) = H. If they match:
I Update H := Ri on host
I grant access to user
Leslie Lamport: Password authentication with insecure communication. CACM 24(11)770–772,
1981. http://doi.acm.org/10.1145/358790.358797
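A minimal Python sketch of chain generation and host-side verification (SHA-256 as h; illustrative only):

import hashlib, os

def h(x: bytes) -> bytes:
    return hashlib.sha256(x).digest()

def generate_chain(n: int) -> list[bytes]:
    r = [os.urandom(32)]                      # R0 <- random
    for _ in range(n):
        r.append(h(r[-1]))                    # R_i := h(R_{i-1})
    return r                                  # host stores r[n]; user keeps r[n-1], ..., r[0]

chain = generate_chain(1000)
H = chain[-1]                                 # value stored on the host

def login(password: bytes) -> bool:
    global H
    if h(password) == H:
        H = password                          # update the stored value to R_i
        return True
    return False

assert login(chain[-2]) and login(chain[-3])  # user presents R_{n-1}, then R_{n-2}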
132 / 230
Broadcast stream authentication
Alice sends to a group of recipients a long stream of messages
M1 , M2 , . . . , Mn . They want to verify Alice’s signature on each packet
immediately upon arrival, but it is too expensive to sign each message.
Alice calculates
C1 = h(C2 , M1 )
C2 = h(C3 , M2 )
C3 = h(C4 , M3 )
···
Cn−2 = h(Cn−1 , Mn−2 )
Cn−1 = h(Cn , Mn−1 )
Cn = h(0, Mn )
and then broadcasts the stream
C1 , Sign(C1 ), (C2 , M1 ), (C3 , M2 ), . . . , (0, Mn ).
Only the first check value is signed, all other packets are bound together
in a hash chain that is linked to that single signature.
Problem: Alice needs to know Mn before she can start to broadcast C1 . Solution: TESLA
133 / 230
Timed Efficient Stream Loss-tolerant Authentication

TESLA uses a hash chain to authenticate broadcast data, without any


need for a digital signature for each message.
Timed broadcast of data sequence M1 , M2 , . . . , Mn :
I t0 : Sign(R0 ), R0 where R0 = h(R1 )
I t1 : (MacR2 (M1 ), M1 , R1 ) where R1 = h(R2 )
I t2 : (MacR3 (M2 ), M2 , R2 ) where R2 = h(R3 )
I t3 : (MacR4 (M3 ), M3 , R3 ) where R3 = h(R4 )
I t4 : (MacR5 (M4 ), M4 , R4 ) where R4 = h(R5 )
I ...
Each Ri is revealed at a pre-agreed time ti . The MAC for Mi can only
be verified after ti+1 when key Ri+1 is revealed.
By the time the MAC key is revealed, everyone has already received the
MAC, therefore the key can no longer be used to spoof the message.

134 / 230
Hash chains, block chains, time-stamping services
Clients continuously produce transactions Mi (e.g., money transfers).
Block-chain time-stamping service: receives client transactions Mi ,
may order them by dependency, validates them (payment covered by
funds?), batches them into groups
G1 = (M1 , M2 , M3 )
G2 = (M4 , M5 , M6 , M7 )
G3 = (M8 , M9 )
...

and then publishes the hash chain (with timestamps ti )

B1 = (G1 , t1 , 0)
B2 = (G2 , t2 , h(B1 ))
B3 = (G3 , t3 , h(B2 ))
...
Bi = (Gi , ti , h(Bi−1 ))
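A toy Python sketch of such a chain (the JSON serialization and field names are invented for illustration; real systems define exact encodings):

import hashlib, json, time

def block_hash(block: dict) -> str:
    return hashlib.sha256(json.dumps(block, sort_keys=True).encode()).hexdigest()

def append_block(chain: list, group: list) -> None:
    prev = block_hash(chain[-1]) if chain else '0'    # h(B_{i-1}), or 0 for B_1
    chain.append({'G': group, 't': time.time(), 'prev': prev})

chain = []
append_block(chain, ['M1', 'M2', 'M3'])
append_block(chain, ['M4', 'M5', 'M6', 'M7'])
append_block(chain, ['M8', 'M9'])
assert chain[2]['prev'] == block_hash(chain[1])       # clients re-verify the links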
135 / 230
New blocks are broadcast to and archived by clients. Clients can
I verify that ti−1 ≤ ti ≤ now
I verify h(Bi−1 )
I frequently compare latest h(Bi ) with other clients
to ensure consensus that
I each client sees the same serialization order of the same set of
validated transactions
I every client receives the exact same block-chain data
I nobody can later rewrite the transaction history
The Bitcoin crypto currency is based on a decentralized block-chain:
I accounts identified by single-use public keys
I each transaction signed with the payer’s private key
I new blocks broadcast by “miners”, who are allowed to mint
themselves new currency as incentive for operating the service
I issuing rate of new currency is limited by requirement for miners to
solve cryptographic puzzle (adjust a field in each block such that
h(Bi ) has a required number of leading zeros, currently ≈ 68 bits)
https://blockchain.info/ https://en.bitcoin.it/
136 / 230
Key derivation functions
A secret key K should only ever be used for one single purpose, to prevent one
application of K being abused as an oracle for compromising another one.
Any cryptographic system may involve numerous applications for keys (for encryption
systems, message integrity schemes, etc.)
A key derivation function (KDF) extends a single multi-purpose key K (which may
have been manually configured by a user, or may have been the result of a
key-agreement protocol) into k single-purpose keys K1 , K2 , . . . , Kk .
Requirements:
I Use a one-way function, such that compromise of one derived key Ki does not
also compromise the master key K or any other derived keys Kj (j 6= i).
I Use an entropy-preserving function, i.e. H(Ki ) ≈ min{H(K), |Ki |}
I Include a unique application identifier A (e.g., descriptive text string, product
name, domain name, serial number), to minimize the risk that someone else
accidentally uses the same derived keys for another purpose.
Secure hash functions work well for this purpose, especially those with arbitrary-length
output (e.g., SHA-3). Split their output bit sequence into the keys needed:
K1 kK2 k . . . kKk = h(A, K)
Hash functions with fixed output-length (e.g., SHA-256) may have to be called
multiple times, with an integer counter:
K1 kK2 = h(A, K, h1i), K3 kK4 = h(A, K, h2i), . . .
ISO/IEC 11770-6
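A Python sketch of the fixed-output-length variant, deriving k subkeys from a master key K with SHA-256 and a counter (the application-identifier string is an invented example):

import hashlib

def derive_keys(master: bytes, app_id: bytes, k: int, keylen: int = 16) -> list:
    out = b''
    counter = 1
    while len(out) < k * keylen:
        out += hashlib.sha256(app_id + master + counter.to_bytes(4, 'big')).digest()
        counter += 1
    return [out[i * keylen:(i + 1) * keylen] for i in range(k)]

k_enc, k_mac = derive_keys(b'\x00' * 32, b'example.com payment-app v1 encrypt+authenticate', 2)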
137 / 230
Password-based key derivation
Human-selected secrets (PINs, passwords, pass-phrases) usually have much
lower entropy than the > 80 bits desired for cryptographic keys.
Typical password search list: “dictionary of 64k words, 4k suffixes, 64 prefixes and 4 alteration
rules for a total of 2^38 passwords” http://ophcrack.sourceforge.net/tables.php
Machine-generated random strings encoded for keyboard entry (hexadecimal,
base64, etc.) still lack the full 8 bits per byte entropy of a random binary string
(e.g. only < 96 graphical characters per byte from keyboard).
Workarounds:
I Preferably generate keys with a true random bit generator.
I Ask user to enter a text string longer than the key size.
I Avoid or normalize visually similar characters: 0OQ/1lI/A4/Z2/S5/VU/nu
I Use a secure hash function to condense the passphrase to key length.
I Use a deliberately slow hash function, e.g. iterate C times.
I Use a per-user random salt value S to personalize the hash function
against pre-computed dictionary attacks.
Stored random string where practical, otherwise e.g. user name.

PBKDF2 iterates HMAC C times for each output block.
Typical values: S ∈ {0, 1}128 , 10^3 < C < 10^7
Recommendation for password-based key derivation. NIST SP 800-132, December 2010.
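Python's standard library provides PBKDF2 directly; a usage sketch with purely illustrative parameter choices:

import hashlib, os

salt = os.urandom(16)                         # per-user random salt S
key = hashlib.pbkdf2_hmac('sha256',           # PRF: HMAC-SHA256
                          b'correct horse battery staple',   # passphrase P
                          salt,
                          100_000,            # iteration count C
                          dklen=32)           # derived key length in bytes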
138 / 230
Password storage

Servers that authenticate users by password need to store some


information to verify that password.
Avoid saving a user’s password P as plaintext. Save the output of a
secure hash function h(P ) instead, to help protect the passwords after
theft of the database. Verify a password by comparing its hash against
that in the database record.
Better: hinder dictionary attacks by adding a random salt value S and by
iterating the hash function C times to make it computationally more
expensive. The database record then stores

(S, hC (P, S))

or similar.
Standard password-based key derivation functions, such as PBKDF2 or
Argon2, can also be used to verify passwords.
Argon2 is deliberately designed to be memory intensive to discourage fast ASIC implementations.

139 / 230
Inverting unsalted password hashes: time–memory trade-off
Target: invert h(p), where p ∈ P is a password from an assumed finite set
P of passwords (e.g., h = MD5, |P | = 95^8 ≈ 2^53 8-char ASCII strings)
Idea: define “reduction” function r : {0, 1}128 → P , then iterate h(r(·))
For example: convert input from base-2 to base-96 number, output first 8 “digits” as printable
ASCII characters, interpret DEL as string terminator.

x0 −r→ p1 −h→ x1 −r→ p2 −h→ · · · −h→ xn−1 −r→ pn −h→ xn ⇒ L[xn ] := x0
(build m such chains, each from a fresh random start value x0 )

Precompute(h, r, m, n):
    for j := 1 to m
        x0 ∈R {0, 1}128
        for i := 1 to n
            pi := r(xi−1 )
            xi := h(pi )
        store L[xn ] := x0
    return L

invert(h, r, L, x):
    y := x
    while L[y] = not found
        y := h(r(y))
    p := r(L[y])
    while h(p) ≠ x
        p := r(h(p))
    return p

Trade-off: time n ≈ |P |^(1/2), memory m ≈ |P |^(1/2)
Problem: Once mn ≫ √|P | there are many collisions, the x0 → xn
chains merge, loop and overlap, covering P very inefficiently.
M.E. Hellman: A cryptanalytic time–memory trade-off. IEEE Trans. Information Theory,
July 1980. https://dx.doi.org/10.1109/TIT.1980.1056220
140 / 230
Inverting unsalted password hashes: “rainbow tables”
Target: invert h(p), where p ∈ P is a password from an assumed finite set
P of passwords (e.g., h = MD5, |P | = 95^8 ≈ 2^53 8-char ASCII strings)
Idea: define a “rainbow” of n reduction functions ri : {0, 1}128 → P ,
then iterate h(ri (·)) to avoid loops. (For example: ri (x) := r(h(x‖⟨i⟩)).)

x0 −r1→ p1 −h→ x1 −r2→ p2 −h→ · · · −h→ xn−1 −rn→ pn −h→ xn ⇒ L[xn ] := x0
(build m such chains, each from a fresh random start value x0 )

Precompute(h, r, m, n):
    for j := 1 to m
        x0 ∈R {0, 1}128
        for i := 1 to n
            pi := ri (xi−1 )
            xi := h(pi )
        store L[xn ] := x0
    return L

invert(h, r, n, L, x):
    for k := n downto 1
        xk−1 := x
        for i := k to n
            pi := ri (xi−1 )
            xi := h(pi )
        if L[xn ] exists
            p1 := r1 (L[xn ])
            for j := 1 to n
                if h(pj ) = x
                    return pj
                pj+1 := rj+1 (h(pj ))

Trade-off: time n ≈ |P |^(1/3), memory m ≈ |P |^(2/3)

Philippe Oechslin: Making a faster cryptanalytic time–memory trade-off.
CRYPTO 2003. https://dx.doi.org/10.1007/978-3-540-45146-4_36
141 / 230
Other applications of secure hash functions

I deduplication – quickly identify in a large collection of files


duplicates, without having to compare all pairs of files, just compare
the hash of each file's content.
I file identification – in a peer-to-peer filesharing network or cluster
file system, identify each file by the hash of its content.
I distributed version control systems (git, mercurial, etc.) – name each
revision via a hash tree of all files in that revision, along with the
hash of the parent revision(s). This way, each revision name securely
identifies not only the full content, but its full revision history.

142 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

143 / 230
Key distribution problem

In a group of n participants, there are n(n − 1)/2 pairs who might want
to communicate at some point, requiring O(n2 ) private keys to be
exchanged securely in advance.
This quickly gets impractical if n ≫ 2 and if participants regularly join
and leave the group.

P8 P1 P2 P8 P1 P2

P7 P3 P7 TTP P3

P6 P5 P4 P6 P5 P4

Alternative 1: introduce an intermediary “trusted third party”

144 / 230
Trusted third party – key distribution centre

Needham–Schroeder protocol
Communal trusted server S shares key KP S with each participant P .
1 A informs S that it wants to communicate with B.
2 S generates KAB and replies to A with
EncKAS (B, KAB , EncKBS (A, KAB ))
Enc is a symmetric authenticated-encryption scheme

3 A checks name of B, stores KAB , and forwards the “ticket”


EncKBS (A, KAB ) to B
4 B also checks name of A and stores KAB .
5 A and B now share KAB and communicate via EncKAB /DecKAB .

[Diagram: A sends its request (1) to S, S replies (2) to A, and A forwards the ticket (3) to B.]

145 / 230
Kerberos
An extension of the Needham–Schroeder protocol is now widely used in
corporate computer networks between desktop computers and servers, in
the form of Kerberos and Microsoft’s Active Directory. KAS is generated
from A’s password (hash function).
Extensions include:
I timestamps and nonces to prevent replay attacks
I a “ticket-granting ticket” is issued and cached at the start of a
session, replacing the password for a limited time, allowing the
password to be instantly wiped from memory again.
I a pre-authentication step ensures that S does not reply with
anything encrypted under KAS unless the sender has demonstrated
knowledge of KAS , to hinder offline password guessing.
I mechanisms for forwarding and renewing tickets
I support for a federation of administrative domains (“realms”)
Problem: ticket message enables eavesdropper off-line dictionary attack.
146 / 230
Key distribution problem: other options
Alternative 2: hardware security modules + conditional access
1 A trusted third party generates a global key K and embeds it
securely in tamper-resistant hardware tokens (e.g., smartcard)
2 Every participant receives such a token, which also knows the
identity of its owner and that of any groups they might belong to.
3 Each token offers its holder authenticated encryption operations
EncK (·) and DecK (A, ·).
4 Each encrypted message EncK (A, M ) contains the name of the
intended recipient A (or the name of a group to which A belongs).
5 A’s smartcard will only decrypt messages addressed this way to A.
Commonly used for “broadcast encryption”, e.g. pay-TV, navigation satellites.

Alternative 3: Public-key cryptography


I Find an encryption scheme where separate keys can be used for
encryption and decryption.
I Publish the encryption key: the “public key”
I Keep the decryption key: the “secret key”
Some form of trusted third party is usually still required to certify the correctness of the published
public keys, but it is no longer directly involved in establishing a secure connection.
147 / 230
Public-key encryption

A public-key encryption scheme is a tuple of PPT algorithms


(Gen, Enc, Dec) such that
I the key generation algorithm Gen receives a security parameter `
and outputs a pair of keys (PK , SK ) ← Gen(1` ), with key lengths
|PK | ≥ `, |SK | ≥ `;
I the encryption algorithm Enc maps a public key PK and a
plaintext message M ∈ M to a ciphertext message
C ← EncPK (M );
I the decryption algorithm Dec maps a secret key SK and a
ciphertext C to a plaintext message M := DecSK (C), or outputs ⊥;
I for all `, (PK , SK ) ← Gen(1` ): DecSK (EncPK (M )) = M .

In practice, the message space M may depend on PK .

In some practical schemes, the condition DecSK (EncPK (M )) = M may fail with negligible
probability.

148 / 230
Security against chosen-plaintext attacks (CPA)
Public-key encryption scheme Π = (Gen, Enc, Dec)

Experiment/game PubKcpa A,Π (`):

[Diagram: the challenger runs (PK , SK ) ← Gen(1` ) and picks b ∈R {0, 1}; the
adversary A receives 1` and PK , sends M0 , M1 , receives C ← EncPK (Mb ), and
finally outputs its guess b′.]

Setup:
1 The challenger generates a bit b ∈R {0, 1} and a key pair
(PK , SK ) ← Gen(1` ).
`
2 The adversary A is given input 1
Rules for the interaction:
1 The adversary A is given the public key PK
m
2 The adversary A outputs a pair of messages: M0 , M1 ∈ {0, 1} .
3 The challenger computes C ← EncPK (Mb ) and returns C to A

Finally, A outputs b′ . If b′ = b then A has succeeded ⇒ PubKcpa A,Π (`) = 1
Note that unlike in PrivKcpa we do not need to provide A with any oracle access:
here A has access to the encryption key PK and can evaluate EncPK (·) itself.
149 / 230
Security against chosen-ciphertext attacks (CCA)
Public-key encryption scheme Π = (Gen, Enc, Dec)

Experiment/game PubKcca A,Π (`):


[Diagram: the challenger runs (PK , SK ) ← Gen(1` ) and picks b ∈R {0, 1}; the
adversary A receives 1` and PK , submits C 1 , C 2 , . . . , C t and obtains each
M i ← DecSK (C i ), then sends M0 , M1 and receives C ← EncPK (Mb ); afterwards A
may submit further queries C t+1 , C t+2 , . . . ≠ C and finally outputs its guess b′.]
Setup:
I handling of `, b, PK , SK as before
Rules for the interaction:
1 The adversary A is given PK and oracle access to DecSK :
A outputs C 1 , gets DecSK (C 1 ), outputs C 2 , gets DecSK (C 2 ), . . .
m
2 The adversary A outputs a pair of messages: M0 , M1 ∈ {0, 1} .

3 The challenger computes C ← EncPK (Mb ) and returns C to A


4 The adversary A continues to have oracle access to DecSK
but is not allowed to ask for DecSK (C).
Finally, A outputs b′ . If b′ = b then A has succeeded ⇒ PubKcca A,Π (`) = 1
150 / 230
Security against chosen-plaintext attacks (cont’d)
Definition: A public-key encryption scheme Π has indistinguishable
encryptions under a chosen-plaintext attack (“is CPA-secure”) if for all
probabilistic, polynomial-time adversaries A there exists a negligible
function negl, such that
P(PubKcpa A,Π (`) = 1) ≤ 1/2 + negl(`)

Definition: A public-key encryption scheme Π has indistinguishable


encryptions under a chosen-ciphertext attack (“is CCA-secure”) if for all
probabilistic, polynomial-time adversaries A there exists a negligible
function negl, such that
P(PubKcca A,Π (`) = 1) ≤ 1/2 + negl(`)

What about ciphertext integrity / authenticated encryption?


Since the adversary has access to the public encryption key PK , there is
no useful equivalent notion of authenticated encryption for a public-key
encryption scheme.
151 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

152 / 230
Number theory: integers, divisibility, primes, gcd
Set of integers: Z := {. . . , −2, −1, 0, 1, 2, . . .} a, b ∈ Z
If there exists c ∈ Z such that ac = b, we say “a divides b” or “a | b”.
I if 0 < a then a is a “divisor” of b
I if 1 < a < b then a is a “factor” of b
I if a does not divide b, we write “a ∤ b”
If integer p > 1 has no factors (only 1 and p as divisors), it is “prime”,
otherwise it is “composite”. Primes: 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, . . .
I every integer n > 1 has a unique prime factorization n = ∏i pi^ei ,
with primes pi and positive integers ei
The greatest common divisor gcd(a, b) is the largest c with c | a and c | b.
I examples: gcd(18, 12) = 6, gcd(15, 9) = 3, gcd(15, 8) = 1
I if gcd(a, b) = 1 we say a and b are “relatively prime”
I gcd(a, b) = gcd(b, a), gcd(a, 0) = a
I gcd(a, b) = gcd(a, b − a)
I if c|ab and gcd(a, c) = 1 then c|b
I if a|n and b|n and gcd(a, b) = 1 then ab|n
153 / 230
Integer division with remainder
For every integer a and positive integer b there exist unique integers q
and r with a = qb + r and 0 ≤ r < b.
The modulo operator performs integer division and outputs the
remainder:
a mod b = r ⇒ 0 ≤ r < b ∧ ∃q ∈ Z : a − qb = r

Examples: 7 mod 5 = 2, −1 mod 10 = 9


If
a mod n = b mod n
we say that “a and b are congruent modulo n”, and also write
a ≡ b (mod n)
This implies n|(a − b). Being congruent modulo n is an equivalence
relationship:
I reflexive: a ≡ a (mod n)
I symmetric: a ≡ b (mod n) ⇒ b ≡ a (mod n)
I transitive: a ≡ b (mod n) ∧ b ≡ c (mod n) ⇒ a ≡ c (mod n)
154 / 230
Modular arithmetic
Addition, subtraction, and multiplication work the same under
congruence modulo n:
If a ≡ a0 (mod n) and b ≡ b0 (mod n) then
a + b ≡ a0 + b0 (mod n)
a − b ≡ a0 − b0 (mod n)
0 0
ab ≡ a b (mod n)
Associative, commutative and distributive laws also work the same:
a(b + c) ≡ ab + ac ≡ ca + ba (mod n)
When evaluating an expression that is reduced modulo n in the end, we
can also reduce any intermediate results. Example:
  
(a − bc) mod n = ((a mod n) − ((b mod n)(c mod n) mod n)) mod n

Reduction modulo n limits intermediate values to


Zn := {0, 1, 2, . . . , n − 1},
the “set of integers modulo n”.
Staying within Zn helps to limit register sizes and can speed up computation. 155 / 230
Euclid’s algorithm
gcd(21, 15) = gcd(15, 21 mod 15) = gcd(15, 6)
            = gcd(6, 15 mod 6) = gcd(6, 3) = 3
            = −2 × 21 + 3 × 15

156 / 230
Euclid’s algorithm

Euclidean algorithm: (WLOG a ≥ b > 0, since gcd(a, b) = gcd(b, a))


gcd(a, b) =
    b,                  if b | a
    gcd(b, a mod b),    otherwise

For all positive integers a, b, there exist integers x and y such that
gcd(a, b) = ax + by.
Euclid’s extended algorithm also provides x and y: (WLOG a ≥ b > 0)

egcd(a, b) = (gcd(a, b), x, y) :=
    (b, 0, 1),          if b | a
    (d, y, x − yq),     otherwise, with (d, x, y) := egcd(b, r),
                        where a = qb + r, 0 ≤ r < b

157 / 230
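A direct Python transcription of this egcd recursion (a sketch, assuming a ≥ b > 0); it reproduces the gcd(21, 15) example from slide 156:

def egcd(a: int, b: int) -> tuple:
    """Return (gcd(a, b), x, y) such that a*x + b*y = gcd(a, b)."""
    q, r = divmod(a, b)                       # a = q*b + r,  0 <= r < b
    if r == 0:                                # b | a
        return (b, 0, 1)
    d, x, y = egcd(b, r)
    return (d, y, x - y * q)

assert egcd(21, 15) == (3, -2, 3)             # 3 = -2*21 + 3*15, as computed above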
Groups
A group (G, •) is a set G and an operator • : G × G → G that have
closure: a • b ∈ G for all a, b ∈ G
associativity: a • (b • c) = (a • b) • c for all a, b, c ∈ G
neutral element: there exists an e ∈ G such that for all a ∈ G:
a•e=e•a=a
inverse element: for each a ∈ G there exists some b ∈ G such that
a•b=b•a=e
If a • b = b • a for all a, b ∈ G, the group is called commutative (or abelian).
Examples of abelian groups:
I (Z, +), (R, +), (R \ {0}, ·)
I (Zn , +) – set of integers modulo n with addition a + b := (a + b) mod n
I ({0, 1}n , ⊕) where a1 a2 . . . an ⊕ b1 b2 . . . bn = c1 c2 . . . cn with
(ai + bi ) mod 2 = ci (for all 1 ≤ i ≤ n, ai , bi , ci ∈ {0, 1}) “bit-wise XOR”

If there is no inverse element for each element, (G, •) is a monoid instead.


Examples of monoids:
I (Z, ·) – set of integers under multiplication
I ({0, 1}∗ , ||) – set of variable-length bit strings under concatenation
158 / 230
Permutations and groups
Permutation groups
A set P of permutations over a finite set S forms a group under
concatenation if
I closure: for any pair of permutations g, h : S ↔ S in P their
concatenation g ◦ h : x 7→ g(h(x)) is also in P .
I neutral element: the identity function x 7→ x is in P
I inverse element: for each permutation g ∈ P , the inverse
permutation g −1 is also in P .
Note that function composition is associative: f ◦ (g ◦ h) = (f ◦ g) ◦ h
The set of all permutations of a set S forms a permutation group called the “symmetric group” on
S. Non-trivial symmetric groups (|S| > 1) are not abelian.

Each group is isomorphic to a permutation group


Given a group (G, •), map each g ∈ G to a function fg : x 7→ x • g.
Since g −1 ∈ G, fg is a permutation, and the set of all fg for g ∈ G forms
a permutation group isomorphic to G. (“Cayley’s theorem”)
Encryption schemes are permutations.
Which groups can be used to form encryption schemes?
159 / 230
Subgroups
(H, •) is a subgroup of (G, •) if
I H is a subset of G (H ⊂ G)
I the operator • on H is the same as on G
I (H, •) is a group, that is
• for all a, b ∈ H we have a • b ∈ H
• each element of H has an inverse element in H
• the neutral element of (G, •) is also in H.

Examples of subgroups
I (nZ, +) with nZ := {ni|i ∈ Z} = {. . . , −2n, −n, 0, n, 2n, . . .}
– the set of integer multiples of n is a subgroup of (Z, +)
I (R+ , ·) – the set of positive real numbers is a subgroup of (R \ {0}, ·)
I (Q, +) is a subgroup of (R, +), which is a subgroup of (C, +)
I (Q \ {0}, ·) is a subgroup of (R \ {0}, ·), etc.
I ({0, 2, 4, 6}, +) is a subgroup of (Z8 , +)
160 / 230
Notations used with groups

When the definition of the group operator is clear from the context, it is
often customary to use the symbols of the normal arithmetic addition or
multiplication operators (“+”, “×”, “·”, “ab”) for the group operation.
There are two commonly used alternative notations:
“Additive” group: think of group operator as a kind of “+”
I write 0 for the neutral element and −g for the inverse of g ∈ G.
I write g · i := g • g • · · · • g (i times, for g ∈ G, i ∈ Z)

“Multiplicative” group: think of group operator as a kind of “×”


I write 1 for the neutral element and g −1 for the inverse of g ∈ G.
I write g^i := g • g • · · · • g (i times, for g ∈ G, i ∈ Z)

161 / 230
Rings
A ring (R, , ) is a set R and two operators  : R × R → R and
 : R × R → R such that
I (R, ) is an abelian group
I (R, ) is a monoid
I a  (b  c) = (a  b)  (a  c) and (a  b)  c = (a  c)  (b  c)
(distributive law)
If also a  b = b  a, then we have a commutative ring.
Examples for rings:
I (Z[x], +, ·), where
Z[x] := { an x^n + · · · + a1 x + a0 | ai ∈ Z, n ≥ 0 }
is the set of polynomials with variable x and coefficients from Z
– commutative
I Zn [x] – the set of polynomials with coefficients from Zn
I (Rn×n , +, ·) – n × n matrices over R – not commutative
162 / 230
Fields
A field (F, , ) is a set F and two associative and commutative
operators  : F × F → F and  : F × F → F such that
I (F, ) is an abelian group with neutral element 0F
I (F, ) is a commutative monoid with neutral element 1F ≠ 0F
I (F \ {0F }, ) is also an abelian group (with neutral element 1F )
I a  (b  c) = (a  b)  (a  c) and (a  b)  c = (a  c)  (b  c)
(distributive law)
In other words: a field is a commutative ring where each element except
for the neutral element of the addition has a multiplicative inverse.
Field means: division works, linear algebra works, solving equations, etc.
Examples for (infinitely large) fields: (Q, +, ·), (R, +, ·), (C, +, ·)
For cryptographic applications, we are interested in finite fields, where we
can pick elements uniformly at random. The order of a field is the
number of elements it contains.
If we have 1 + 1 + · · · + 1 = 0 (i times) in a ring or field, then we call the smallest such i its characteristic.
163 / 230
Ring Zn
Set of integers modulo n is Zn := {0, 1, . . . , n − 1}
When we refer to (Zn , +) or (Zn , ·), we apply after each addition or
multiplication a reduction modulo n. (No need to write out “mod n”
each time.)
We add/subtract the integer multiple of n needed to get the result back into Zn .

(Zn , +) is an abelian group:


I neutral element of addition is 0
I the inverse element of a ∈ Zn is n − a ≡ −a (mod n)
(Zn , ·) is a monoid:
I neutral element of multiplication is 1
(Zn , +, ·), with its “mod n” operators, is a ring, which means
commutative, associative and distributive law works just like over Z.
From now on, when we refer to Zn , we usually imply that we work with
the commutative ring (Zn , +, ·).
Examples in Z5 : 4 + 3 = 2, 4 · 2 = 3, 42 = 1
164 / 230
Division in Zn
In ring Zn , element a has a multiplicative inverse a−1 (with aa−1 = 1) if
and only if gcd(n, a) = 1.
In this case, the extended Euclidian algorithm gives us
nx + ay = 1
and since nx = 0 in Zn for all x, we have ay = 1.
Therefore y = a−1 is the inverse needed for dividing by a.

I We call the set of all elements in Zn that have a multiplicative


inverse the “multiplicative group” of Zn :
Z∗n := {a ∈ Zn | gcd(n, a) = 1}
I If p is prime, then (Z∗p , ·) with
Z∗p = {1, . . . , p − 1}
is a group, and (Zp , +, ·) is a (finite) field, that is every element
except 0 has a multiplicative inverse.
Example: Multiplicative inverses of Z∗7 :
1 · 1 = 1, 2 · 4 = 1, 3 · 5 = 1, 4 · 2 = 1, 5 · 3 = 1, 6 · 6 = 1
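A Python sketch of computing a^(−1) in Zn this way (repeating the egcd sketch from slide 157 so the snippet is self-contained; Python ≥ 3.8 also offers pow(a, -1, n)):

def egcd(a: int, b: int) -> tuple:            # extended Euclidean algorithm, as on slide 157
    q, r = divmod(a, b)
    if r == 0:
        return (b, 0, 1)
    d, x, y = egcd(b, r)
    return (d, y, x - y * q)

def modinv(a: int, n: int) -> int:
    d, x, y = egcd(n, a)                      # n*x + a*y = d
    if d != 1:
        raise ValueError('a has no multiplicative inverse modulo n')
    return y % n                              # n*x = 0 in Z_n, hence a*y = 1

assert (3 * modinv(3, 7)) % 7 == 1            # 3 * 5 = 1 in Z_7
assert modinv(3, 7) == pow(3, -1, 7)          # built-in equivalent (Python >= 3.8)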
165 / 230
Finite fields (Galois fields)
(Zp , +, ·) is a finite field with p elements, where p is a prime number.
Also written as Fp , or as GF(p), the “Galois field of order p”.
We can also construct finite fields Fpn (or GF(pn )) with pn elements:
Let Fq be a finite field with q elements. Then we can create an extension
field Fqn with q n elements as follows:
I Elements: polynomials over variable x with degree less than n and
coefficients from the finite field Fq
I Modulus: select an irreducible polynomial T (x) ∈ Fq [x] of degree n
T (x) = cn xn + · · · + c2 x2 + c1 x + c0
where ci ∈ Fq for all 0 ≤ i ≤ n. An irreducible polynomial cannot be
factored into two lower-degree polynomials from Fq [x] \ {0, 1}.
I Addition: ⊕ is normal polynomial addition (i.e., pairwise addition of
the coefficients in Fq )
I Multiplication: ⊗ is normal polynomial multiplication, then divide
by T (x) and take the remainder (i.e., multiplication modulo T (x)).
Theorem: any finite field has pn elements (for some prime p, n > 0)
Theorem: all finite fields of the same size are isomorphic
166 / 230
F2n – binary fields (fields of characteristic 2)
F2 is particularly easy to implement in hardware:
I addition = subtraction = XOR gate
I multiplication = AND gate
I division can only be by 1, which merely results in the first operand
Of particular practical interest in modern cryptography are larger finite
extension fields of the form F2n (also written as GF(2n )):
I Polynomials are represented as bit words, each coefficient = 1 bit.
I Addition/subtraction is implemented via bit-wise XOR instruction.
I Multiplication and division of binary polynomials is like binary
integer multiplication and division, but without carry-over bits. This
allows the circuit to be clocked much faster.
Recent Intel/AMD CPUs have added instruction PCLMULQDQ for
64 × 64-bit carry-less multiplication. This helps to implement arithmetic
in F264 or F2128 more efficiently.
167 / 230
F28 example
The finite field F28 consists of the 256 polynomials of the form

c7 x7 + · · · + c2 x2 + c1 x + c0 ci ∈ {0, 1}

each of which can be represented by the byte c7 c6 c5 c4 c3 c2 c1 c0 .


As modulus we chose the irreducible polynomial

T (x) = x8 + x4 + x3 + x + 1 or 1 0001 1011

Example operations:
I (x7 + x5 + x + 1) ⊕ (x7 + x6 + 1) = x6 + x5 + x
or equivalently 1010 0011 ⊕ 1100 0001 = 0110 0010
I (x6 + x4 + 1) ⊗T (x2 + 1) = [(x6 + x4 + 1)(x2 + 1)] mod T (x) =
(x8 + x4 + x2 + 1) mod (x8 + x4 + x3 + x + 1) =
(x8 + x4 + x2 + 1) ⊕ (x8 + x4 + x3 + x + 1) = x3 + x2 + x
or equivalently
0101 0001 ⊗T 0000 0101 = 1 0001 0101 ⊕ 1 0001 1011 = 0000 1110
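A bit-level Python sketch of ⊗T for this representation (with T (x) = 1 0001 1011 = 0x11b, as above):

def gf256_mul(a: int, b: int) -> int:
    c = 0
    for i in range(8):                        # carry-less multiplication
        if (b >> i) & 1:                      # if coefficient b_i = 1 ...
            c ^= a << i                       # ... add a(x) * x^i
    for i in range(14, 7, -1):                # reduce modulo T(x), highest degree first
        if (c >> i) & 1:
            c ^= 0x11b << (i - 8)
    return c

assert gf256_mul(0b01010001, 0b00000101) == 0b00001110    # the example above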
168 / 230
Multiplication and modular reduction in F2n
Let a(x) = Σi=0..n−1 ai x^i and b(x) = Σi=0..n−1 bi x^i with ai , bi ∈ Z2
be polynomials of degree less than n that represent elements of F2n .
Let f (x) = x^n + r(x) be the irreducible modulus.
Algorithm for calculating c(x) = [a(x) · b(x)] mod f (x) = Σi=0..n−1 ci x^i :

Right to left multiply(a, b):
    if a0 = 1 then c := b else c := 0
    for i := 1 to n − 1 do
        if bn−1 = 1 then
            b := b · x − f          ← shift
        else
            b := b · x
        if ai = 1 then
            c := c + b              ← add
    return c

Left to right multiply(a, b):
    c := 0
    for i := n − 1 downto 0 do
        if ai = 1 then
            c := c + b              ← add
        if i = 0 then return c
        if cn−1 = 1 then
            c := c · x − f          ← shift
        else
            c := c · x
The left-to-right method can be accelerated by precomputing Bu = [b · u] mod f for all 2w
polynomials u of degree less than w, and then adding the Bu selected by w bits of a at a time.
169 / 230
Multiplication and modular reduction in F2n
Let a(x) = Σi=0..n−1 ai x^i and b(x) = Σi=0..n−1 bi x^i with ai , bi ∈ Z2
be polynomials of degree less than n that represent elements of F2n .
Let f (x) = x^n + r(x) be the irreducible modulus.
Algorithm for calculating c(x) = [a(x) · b(x)] mod f (x) = Σi=0..n−1 ci x^i :

Right to left multiply(a, b):
    if a0 = 1 then c := b else c := 0
    for i := 1 to n − 1 do
        if bn−1 = 1 then
            b := b ≪ 1 ⊕ r          ← shift
        else
            b := b ≪ 1
        if ai = 1 then
            c := c ⊕ b              ← add
    return c

Left to right multiply(a, b):
    c := 0
    for i := n − 1 downto 0 do
        if ai = 1 then
            c := c ⊕ b              ← add
        if i = 0 then return c
        if cn−1 = 1 then
            c := c ≪ 1 ⊕ r          ← shift
        else
            c := c ≪ 1
The left-to-right method can be accelerated by precomputing Bu = [b · u] mod f for all 2w
polynomials u of degree less than w, and then adding the Bu selected by w bits of a at a time.
169 / 230
Squaring in F2n
In finite fields of characteristic 2, we have
2=0
−1 = 1
a2 = a for all a ∈ Z2
and as a result some expressions become much simpler.
For example: squaring of polynomials
(a1 x + a0 )^2 = (a1 x + a0 )(a1 x + a0 ) = a1^2 x^2 + 2a1 a0 x + a0^2 = a1 x^2 + a0
More generally: if a(x) = Σi=0..n−1 ai x^i ∈ F2n then
[a(x)]^2 = Σi=0..n−1 ai x^(2i) .

170 / 230
Finite groups
Let (G, •) be a group with a finite number of elements |G|.
Practical examples here: (Zn , +), (Z∗
n , ·), (F2n , ⊕), (F2n \ {0}, ⊗)

Terminology:
I The order of a group G is its size |G|
I The order of group element g in G is ordG (g) = min{i > 0 | g^i = 1}.
Related notion: the characteristic of a ring or field is the order of 1 in its
additive group, i.e. the smallest i with 1 + 1 + · · · + 1 = 0 (i times).

Useful facts regarding any element g ∈ G in a group of order m = |G|:


1 g m = 1, g x = g x mod m
2 g x = g x mod ord(g)
3 g x = g y ⇔ x ≡ y (mod ord(g))
4 ord(g) | m “Lagrange’s theorem”
5 if gcd(e, m) = 1 then g 7→ g e is a permutation, and g 7→ g d its
inverse (i.e., g ed = g) if ed mod m = 1
171 / 230
Proofs
0 In any group (G, ·) with a, b, c ∈ G we have ac = bc ⇒ a = b.
Proof: ac = bc ⇒ (ac)c−1 = (bc)c−1 ⇒ a(cc−1 ) = b(cc−1 ) ⇒ a · 1 = b · 1 ⇒ a = b.

1 Let G be an abelian group of order m with elements g1 , . . . , gm . We have


g1 · g2 · · · gm = (gg1 ) · (gg2 ) · · · (ggm )
for arbitrary fixed g ∈ G, because ggi = ggj ⇒ gi = gj (see 0 ), which implies that each
of the (ggi ) is distinct, and since there are only m elements of G, the right-hand side of the
above equation is just a permutation of the left-hand side. Now pull out the g:
g1 · g2 · · · gm = (gg1 ) · (gg2 ) · · · (ggm ) = g^m · g1 · g2 · · · gm ⇒ g^m = 1.
(Not shown here: g^m = 1 also holds for non-commutative groups.)

Also: g m = 1 ⇒ g x = g x · (g m )n = g x−nm = g x mod m for any n ∈ Z.

2 Likewise: i = ord(g) ⇒ g i = 1 ⇒ g x = g x · (g i )n = g x+ni = g x mod i for any n ∈ Z.

3 Let i = ord(g).
“⇐”: x ≡ y (mod i) ⇔ x mod i = y mod i ⇒ g x = g x mod i = g y mod i = g y .
“⇒”: Say g x = g y , then 1 = g x−y = g (x−y) mod i . Since (x − y) mod i < i, but i is the
smallest positive integer with g i = 1, we must have (x − y) mod i = 0. ⇒ x ≡ y (mod i).

4 g m = 1 = g 0 therefore m ≡ 0 (mod ord(g)) from 3 , and so ord(g)|m.

5 (g e )d = g ed = g ed mod m = g 1 = g means that g 7→ g d is indeed the inverse of g 7→ g e if


ed mod m = 1. And since G is finite, the existence of an inverse operation implies that
g 7→ g e is a permutation.

Katz/Lindell (2nd ed.), sections 8.1 and 8.3


172 / 230
Cyclic groups
Let G be a finite (multiplicative) group of order m = |G|.

For g ∈ G consider the set

hgi := {g 0 , g 1 , g 2 , . . .}

Note that |hgi| = ord(g) and hgi = {g 0 , g 1 , g 2 , . . . , g ord(g)−1 }.

Definitions:
I We call g a generator of G if hgi = G.
I We call G cyclic if it has a generator.
Useful facts:
1 Every cyclic group of order m is isomorphic to (Zm , +). (g i ↔ i)
2 hgi is a subgroup of G (subset, a group under the same operator)
3 If |G| is prime, then G is cyclic and all g ∈ G \ {1} are generators.
Recall that ord(g) | |G|. We have ord(g) ∈ {1, |G|} if |G| is prime, which makes g either 1
or a generator.
Katz/Lindell (2nd ed.), section 8.3
173 / 230
How to find a generator?

Let G be a cyclic (multiplicative) group of order m = |G|.


I If m is prime, any non-neutral element is a generator. Done.
But |Z∗p | = p − 1 is not prime (for p > 3)!
I Directly testing whether |hgi| = m is infeasible for crypto-sized m.
I Fast test: if m = ∏i pi^ei is composite, then g ∈ G is a generator if
and only if g^(m/pi) ≠ 1 for all i.
I Sampling a polynomial number of elements of G for the above test
will lead to a generator in polynomial time (of log2 m) with all but
negligible probability.
⇒ Make sure you pick a group of an order with known prime factors.

One possibility for Z∗p (commonly used):


I Chose a “strong prime” p = 2q + 1, where q is also prime
⇒ |Z∗p | = p − 1 = 2q has prime factors 2 and q.

174 / 230
(Zp , +) is a cyclic group
For every prime p every element g ∈ Zp \ {0} is a generator:

Zp = hgi = {g · i mod p | 0 ≤ i ≤ p − 1}

Note that this follows from fact 3 on slide 173: Zp is of order p, which is prime.

Example in Z7 :
(0 · 0, 0 · 1, 0 · 2, 0 · 3, 0 · 4, 0 · 5, 0 · 6, 0 · 7, . . .) = (0, 0, 0, 0, 0, 0, 0, 0, . . .)
(1 · 0, 1 · 1, 1 · 2, 1 · 3, 1 · 4, 1 · 5, 1 · 6, 0 · 7, . . .) = (0, 1, 2, 3, 4, 5, 6, 0, . . .)
(2 · 0, 2 · 1, 2 · 2, 2 · 3, 2 · 4, 2 · 5, 2 · 6, 0 · 7, . . .) = (0, 2, 4, 6, 1, 3, 5, 0, . . .)
(3 · 0, 3 · 1, 3 · 2, 3 · 3, 3 · 4, 3 · 5, 3 · 6, 0 · 7, . . .) = (0, 3, 6, 2, 5, 1, 4, 0, . . .)
(4 · 0, 4 · 1, 4 · 2, 4 · 3, 4 · 4, 4 · 5, 4 · 6, 0 · 7, . . .) = (0, 4, 1, 5, 2, 6, 3, 0, . . .)
(5 · 0, 5 · 1, 5 · 2, 5 · 3, 5 · 4, 5 · 5, 5 · 6, 0 · 7, . . .) = (0, 5, 3, 1, 6, 4, 2, 0, . . .)
(6 · 0, 6 · 1, 6 · 2, 6 · 3, 6 · 4, 6 · 5, 6 · 6, 0 · 7, . . .) = (0, 6, 5, 4, 3, 2, 1, 0, . . .)

I All the non-zero elements of group Z7 with addition mod 7 are generators
I ord(0) = 1, ord(1) = ord(2) = ord(3) = ord(4) = ord(5) = ord(6) = 7

175 / 230
(Z∗p , ·) is a cyclic group
For every prime p there exists a generator g ∈ Z∗p such that
Z∗p = {g i mod p | 0 ≤ i ≤ p − 2}
Note that this does not follow from fact 3 on slide 173: Z∗
p is of order p − 1, which is even (for
p > 3), not prime.
Example in Z∗7 :
(10 , 11 , 12 , 13 , 14 , 15 , 16 , . . .) = (1, 1, 1, 1, 1, 1, 1, . . .)
(20 , 21 , 22 , 23 , 24 , 25 , 26 , . . .) = (1, 2, 4, 1, 2, 4, 1, . . .)
(30 , 31 , 32 , 33 , 34 , 35 , 36 , . . .) = (1, 3, 2, 6, 4, 5, 1, . . .)
(40 , 41 , 42 , 43 , 44 , 45 , 46 , . . .) = (1, 4, 2, 1, 4, 2, 1, . . .)
(50 , 51 , 52 , 53 , 54 , 55 , 56 , . . .) = (1, 5, 4, 6, 2, 3, 1, . . .)
(60 , 61 , 62 , 63 , 64 , 65 , 66 , . . .) = (1, 6, 1, 6, 1, 6, 1, . . .)
Fast generator test (p. 174), using |Z∗7 | = 6 = 2 · 3:
I 3 and 5 are generators of Z∗7 : 3^(6/2) = 6, 3^(6/3) = 2, 5^(6/2) = 6, 5^(6/3) = 4, all ≠ 1.
I 1, 2, 4, 6 generate subgroups of Z∗7 : {1}, {1, 2, 4}, {1, 2, 4}, {1, 6}
I ord(1) = 1, ord(2) = 3, ord(3) = 6, ord(4) = 3, ord(5) = 6, ord(6) = 2
The order of g in Z∗p is the size of the subgroup hgi.
Lagrange’s theorem: ordZ∗p (g) | p − 1 for all g ∈ Z∗p
176 / 230
Fermat’s and Euler’s theorem
Fermat’s little theorem: (1640)
p prime and gcd(a, p) = 1 ⇒ ap−1 mod p = 1
Recall from Lagrange’s theorem: for a ∈ Z∗ ∗
p , ord(a)|(p − 1) since |Zp | = p − 1.

Euler’s phi function:


ϕ(n) = |Z∗n | = |{a ∈ Zn | gcd(n, a) = 1}|
I Example: ϕ(12) = |{1, 5, 7, 11}| = 4
I primes p, q:
ϕ(p) = p − 1
ϕ(pk ) = pk−1 (p − 1)
ϕ(pq) = (p − 1)(q − 1)
I gcd(a, b) = 1 ⇒ ϕ(ab) = ϕ(a)ϕ(b)
Euler’s theorem: (1763)
gcd(a, n) = 1 ⇔ aϕ(n) mod n = 1
I this implies that in Zn : ax = ax mod ϕ(n) for any a ∈ Zn , x ∈ Z
Recall from Lagrange’s theorem: for a ∈ Z∗ ∗
n , ord(a)|ϕ(n) since |Zn | = ϕ(n).
177 / 230
Chinese remainder theorem
Definition: Let (G, •) and (H, ◦) be two groups. A function f : G → H
is an isomorphism from G to H if
I f is a 1-to-1 mapping (bijection)
I f (g1 • g2 ) = f (g1 ) ◦ f (g2 ) for all g1 , g2 ∈ G

Chinese remainder theorem:


For any p, q with gcd(p, q) = 1 and n = pq, the mapping
f : Zn ↔ Zp × Zq f (x) = (x mod p, x mod q)
is an isomorphism, both from Zn to Zp × Zq and from Z∗n to Z∗p × Z∗q .
Inverse: To get back from xp = x mod p and xq = x mod q to x, we first use Euclid’s extended
algorithm to find a, b such that ap + bq = 1, and then x = (xp bq + xq ap) mod n.

Application: arithmetic operations on Zn can instead be done on both


Zp and Zq after this mapping, which may be faster.
Example: n = pq = 3 × 5 = 15
x 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
x mod 3 0 1 2 0 1 2 0 1 2 0 1 2 0 1 2
x mod 5 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4
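A Python sketch of the inverse mapping (using pow(·, -1, ·), available from Python 3.8, to obtain the two reconstruction coefficients):

def crt(xp: int, xq: int, p: int, q: int) -> int:
    a = pow(p, -1, q)                         # a*p = 1 (mod q) and a*p = 0 (mod p)
    b = pow(q, -1, p)                         # b*q = 1 (mod p) and b*q = 0 (mod q)
    return (xp * b * q + xq * a * p) % (p * q)

x = 13                                        # n = 3*5 = 15, f(13) = (1, 3)
assert crt(x % 3, x % 5, 3, 5) == x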
178 / 230
Quadratic residues in (Z∗p , ·)
In Z∗p , the squaring of an element, x 7→ x2 is a 2-to-1 function:

y = x^2 = (−x)^2
Example in Z∗7 : (1^2 , 2^2 , 3^2 , 4^2 , 5^2 , 6^2 ) = (1, 4, 2, 2, 4, 1)

If y is the square of a number in x ∈ Z∗p , that is if y has a square root in


Z∗p , we call y a “quadratic residue”.
Example: Z∗7 has 3 quadratic residues: {1, 2, 4}.
If p is an odd prime: Z∗p has (p − 1)/2 quadratic residues.
Zp would have one more: 0

Euler’s criterion:

c(p−1)/2 mod p = 1 ⇔ c is a quadratic residue in Z∗p

Example in Z7 : (7 − 1)/2 = 3, (13 , 23 , 33 , 43 , 53 , 63 ) = (1, 1, 6, 1, 6, 6)


c(p−1)/2 is also called the Legendre symbol
179 / 230
Taking roots in Z∗p
If xe = c in Zp , then x is the “eth root of c”, or x = c1/e .
Case 1: gcd(e, p − 1) = 1
Find d with de = 1 in Zp−1 (Euclid’s extended), then c1/e = cd in Z∗p .
Proof: (cd )e = cde = cde mod ϕ(p) = cde mod (p−1) = c1 = c.

Case 2: e = 2 (taking square roots)


gcd(2, p − 1) 6= 1 if p odd prime ⇒ Euclid’s extended alg. no help here.

I If p mod 4 = 3 and c ∈ Z∗p is a quadratic residue: √c = c^((p+1)/4)
Proof: [c^((p+1)/4)]^2 = c^((p+1)/2) = c^((p−1)/2) · c = c, since c^((p−1)/2) = 1 for a quadratic residue.

I If p mod 4 = 1 this can also be done efficiently (details omitted).


Application: solve quadratic equations ax^2 + bx + c = 0 in Zp
Solution: x = (−b ± √(b^2 − 4ac)) / (2a)
Algorithms: √(b^2 − 4ac) as above, (2a)^(−1) using Euclid’s extended
Taking roots in Z∗n : If n is composite, then we know how to test whether c^(1/e) exists, and how to
compute it efficiently, only if we know the prime factors of n. Basic idea: apply the Chinese
Remainder Theorem, then apply the above techniques for Zp .
180 / 230
Working in subgroups of Z∗p
How can we construct a cyclic finite group G where all non-neutral
elements are generators?
Recall that Z∗p has q = (p − 1)/2 quadratic residues, exactly half of its
elements.
Quadratic residue: an element that is the square of some other element.

Choose p to be a strong prime, that is where q is also prime.


Let G = {g 2 | g ∈ Z∗p } be the set of quadratic residues of Z∗p . G with
operator “multiplication mod p” is a subgroup of Z∗p , with order |G| = q.
G has prime order |G| = q and ord(g)|q for all g ∈ G (Lagrange’s theorem):
⇒ ord(g) ∈ {1, q} ⇒ ord(g) = q for all g > 1 ⇒ for all g ∈ G \ {1} hgi = G.

If p is a strong prime, then each quadratic residue in Z∗p other than 1 is a


generator of the subgroup of quadratic residues of Z∗p .

Generate group(1` ):
    p ∈R {(` + 1)-bit strong primes}
    q := (p − 1)/2
    x ∈R Z∗p \ {−1, 1}
    g := x^2 mod p
    return p, q, g

Example: p = 11, q = 5
    g ∈ {2^2 , 3^2 , 4^2 , 5^2 } = {4, 9, 5, 3}
    h4i = {4^0 , 4^1 , 4^2 , 4^3 , 4^4 } = {1, 4, 5, 9, 3}
    h9i = {9^0 , 9^1 , 9^2 , 9^3 , 9^4 } = {1, 9, 4, 3, 5}
    h5i = {5^0 , 5^1 , 5^2 , 5^3 , 5^4 } = {1, 5, 3, 4, 9}
    h3i = {3^0 , 3^1 , 3^2 , 3^3 , 3^4 } = {1, 3, 9, 5, 4}
181 / 230
Modular exponentiation
In cyclic group (G, •) (e.g., G = Z∗p ):
How do we calculate g e efficiently? (g ∈ G, e ∈ N)
Naive algorithm: g^e = g • g • · · · • g (e times)
Far too slow for crypto-size e (e.g., e ≈ 2^256 )!

Square-and-multiply algorithm:
Binary representation: e = Σi=0..n ei · 2^i , n = ⌊log2 e⌋, ei = ⌊e/2^i ⌋ mod 2
Computation:
    g^(2^0) := g,   g^(2^i) := (g^(2^(i−1)))^2
    g^e := ∏i=0..n (g^(2^i))^(ei)

RtoL square and mult(g, e):
    a := g
    b := 1
    for i := 0 to n do
        if ⌊e/2^i ⌋ mod 2 = 1 then
            b := b • a          ← multiply
        a := a • a              ← square
    return b
182 / 230
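A Python sketch of the right-to-left square-and-multiply method above for (Z∗p , ·); in practice the built-in pow(g, e, p) should be used:

def modexp(g: int, e: int, p: int) -> int:
    a, b = g, 1
    while e > 0:
        if e & 1:                             # current bit e_i = 1
            b = (b * a) % p                   # multiply
        a = (a * a) % p                       # square: a becomes g^(2^(i+1))
        e >>= 1
    return b

assert modexp(3, 5, 7) == pow(3, 5, 7)        # 3^5 = 243 = 5 in Z_7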
Safer square-and-multiply algorithms
Basic square-and-multiply algorithms are vulnerable to side-channel
attacks (e.g., power-line or electromagnetic analysis of unintended
microcontroller signal emissions). If an eavesdropper can recognize the
function-call sequence
square, multiply, square, square, square, multiply, square, multiply, . . .
then that suggests for LtoR square and mult: e = 10011 . . ..
There are often faster algorithms for squaring than just multiplying a group element with itself.

LtoR square and mult(g, e):
    a := 1
    for i := n downto 0 do
        a := a^2
        if ei = 1 then
            a := a • g
    return a

Montgomery Ladder(g, e):            ; assuming en = 1
    a[0] := g
    a[1] := g^2
    for i := n − 1 downto 0 do
        a[¬ei ] := a[0] • a[1]
        a[ ei ] := a[ei ]^2
    return a[0]

These variants are slower (more multiplications), but branch free.


Dummy write operations like a[0] := a[0] may still be recognizable.
183 / 230
Safer square-and-multiply algorithms
Basic square-and-multiply algorithms are vulnerable to side-channel
attacks (e.g., power-line or electromagnetic analysis of unintended
microcontroller signal emissions). If an eavesdropper can recognize the
function-call sequence
square, multiply, square, square, square, multiply, square, multiply, . . .
then that suggests for LtoR square and mult: e = 10011 . . ..
There are often faster algorithms for squaring than just multiplying a group element with itself.

Square and mult always(g, e):
    a[0] := 1
    for i := n downto 0 do
        a[0] := a[0]^2
        a[1] := a[0] • g
        a[0] := a[ei ]
    return a[0]

Montgomery Ladder(g, e):            ; assuming en = 1
    a[0] := g
    a[1] := g^2
    for i := n − 1 downto 0 do
        a[¬ei ] := a[0] • a[1]
        a[ ei ] := a[ei ]^2
    return a[0]

These variants are slower (more multiplications), but branch free.


Dummy write operations like a[0] := a[0] may still be recognizable.
183 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

184 / 230
Discrete logarithm problem
Let (G, •) be a given cyclic group of order q = |G| with given generator g
(G = {g 0 , g 1 , . . . , g q−1 }). The “discrete logarithm problem (DLP)” is
finding for a given y ∈ G the number x ∈ Zq such that
g^x = g • g • · · · • g = y (x times)



If (G, •) is clear from context, we can write x = logg y. For any x′ with g^x′ = y, we have
x = x′ mod q. Discrete logarithms behave similar to normal logarithms: logg 1 = 0 (if 1 is the
neutral element of G), logg h^r = (r · logg h) mod q, and logg h1 h2 = (logg h1 + logg h2 ) mod q.

For cryptographic applications, we require groups with


I a probabilistic polynomial-time group-generation algorithm G(1` )
that outputs a description of G with dlog2 |G|e = `;
I a description that defines how each element of G is represented
uniquely as a bit pattern;
I efficient (polynomial time) algorithms for •, for picking an element
of G uniformly at random, and for testing whether a bit pattern
represents an element of G;
185 / 230
Hard discrete logarithm problems

The discrete logarithm experiment DLogG,A (`):


1 Run G(1` ) to obtain (G, q, g), where G is a cyclic group of order q
(2`−1 < q ≤ 2` ) and g is a generator of G
2 Choose uniform h ∈ G.
3 Give (G, q, g, h) to A, which outputs x ∈ Zq
4 Return 1 if g x = h, otherwise return 0

We say “the discrete-logarithm problem is hard relative to G” if for all


probabilistic polynomial-time algorithms A there exists a negligible
function negl, such that P(DLogG,A (`) = 1) ≤ negl(`).

186 / 230
Diffie–Hellman problems
Let (G, •) be a cyclic group of order q = |G| with generator g
(G = {g 0 , g 1 , . . . , g q−1 }). Given elements h1 , h2 ∈ G , define
DH(h1 , h2 ) := g^(logg h1 · logg h2 )
that is if g^x1 = h1 and g^x2 = h2 , then DH(h1 , h2 ) = g^(x1 ·x2 ) = h2^x1 = h1^x2 .
These two problems are related to the discrete logarithm problem:
I Computational Diffie–Hellman (CDH) problem: the adversary is
given uniformly chosen h1 , h2 ∈ G and has to output DH(h1 , h2 ).
The problem is hard if for all PPT A we have P(A(G, q, g, g x , g y ) = g xy ) ≤ negl(`).

I Decision Diffie–Hellman (DDH) problem: the adversary is given


h1 , h2 ∈ G chosen uniformly at random, plus another value h0 ∈ G,
which is either equal to DH(h1 , h2 ), or was chosen uniformly at
random, and has to decide which of the two cases applies.
The problem is hard if for all PPT A and uniform x, y, z ∈ G we have
|P(A(G, q, g, g x , g y , g z ) = 1) − P(A(G, q, g, g x , g y , g xy ) = 1)| ≤ negl(`).

If the discrete-logarithm problem is not hard for G, then neither will be


the CDH problem, and if the latter is not hard, neither will be the DDH
problem.
187 / 230
Diffie–Hellman key exchange
How can two parties who have no prior shared secret and no secure channel
to exchange one achieve message confidentiality?
Select a cyclic group G of order q and a generator g ∈ G, which can be
made public and fixed system wide. A generates x and B generates y,
both chosen uniformly at random out of {1, . . . , q − 1}. Then they
exchange two messages:

A→B: gx
B→A: gy

Now both can form (g x )y = (g y )x = g xy and use a hash h(g xy ) as a


shared private key (e.g. with an authenticated encryption scheme).
The eavesdropper faces the computational Diffie–Hellman problem of
determining g xy from g x , g y and g.
The DH key exchange is secure against a passive eavesdropper, but not against middleperson
attacks, where g x and g y are replaced by the attacker with other values.

W. Diffie, M.E. Hellman: New Directions in Cryptography. IEEE IT-22(6), 1976-11, pp 644–654.
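A toy Python sketch of one run of the exchange (the group below is a tiny stand-in so the numbers stay readable; real deployments use standardized groups of at least 2048 bits, or elliptic curves):

import hashlib, secrets

# Toy parameters: p = 2q + 1 with q prime; g = 4 has prime order q in Z*_23.
p, q, g = 23, 11, 4

x = secrets.randbelow(q - 1) + 1        # A's ephemeral secret from {1, ..., q-1}
y = secrets.randbelow(q - 1) + 1        # B's ephemeral secret

msg_A_to_B = pow(g, x, p)               # A -> B : g^x
msg_B_to_A = pow(g, y, p)               # B -> A : g^y

k_A = pow(msg_B_to_A, x, p)             # A computes (g^y)^x
k_B = pow(msg_A_to_B, y, p)             # B computes (g^x)^y
assert k_A == k_B                       # both now hold g^(xy)

session_key = hashlib.sha256(k_A.to_bytes(2, 'big')).digest()   # h(g^(xy))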

188 / 230
Discrete logarithm algorithms
Several generic algorithms are known for solving the discrete logarithm
problem for any cyclic group G of order q:
I Trivial brute-force algorithm: try all g^i, time |⟨g⟩| = ord(g) ≤ q.
I Pohlig–Hellman algorithm: if q is not prime, and has a known (or
easy to determine) factorization, then this algorithm reduces the
discrete-logarithm problem for G to discrete-logarithm problems for
prime-order subgroups of G.
⇒ the difficulty of finding the discrete logarithm in a group of order
q is no greater than that of finding it in a group of order q 0 , where q 0
is the largest prime factor dividing q.
I Shanks’ baby-step/giant-step algorithm: requires
  O(√q · polylog(q)) time and O(√q) memory.
I Pollard’s rho algorithm: requires O(√q · polylog(q)) time and
O(1) memory.
⇒ choose G to have a prime order q, and make q large enough such
that no adversary can be expected to execute √q steps (e.g. q ≫ 2^200).
189 / 230
Baby-step/giant-step algorithm
Given generator g ∈ G (|G| = q) and y ∈ G, find x ∈ Zq with g x = y.
I Powers of g form a cycle 1 = g 0 , g 1 , g 2 , . . . , g q−2 , g q−1 , g q = 1, and
y = g x sits on this cycle.
I Go around cycle in “giant steps” of n = ⌊√q⌋:

g^0, g^n, g^2n, . . . , g^(⌈q/n⌉·n)

Store all values encountered in a lookup table L[g kn ] := k.


Memory: √q, runtime: √q (times log. lookup-table insertion)
I Go around cycle in “baby steps”, starting at y

y · g^1, y · g^2, . . . , y · g^n

until we find one of these values in the table L: L[y · g i ] = k.



Runtime: √q (times log. table lookup)
I Now we know y · g i = g kn , therefore y = g kn−i and can return
x := (kn − i) mod q = logg y.
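The algorithm translates almost line by line into Python; the group below (p = 1019, q = 509, g = 4) is a toy stand-in chosen only for illustration:

import math

def bsgs(g, y, q, p):
    # Find x with g^x = y (mod p), where g has prime order q in Z*_p.
    n = math.isqrt(q) + 1                 # giant-step width n ~ sqrt(q)
    L = {}                                # giant steps: L[g^(kn)] = k
    gn = pow(g, n, p)
    val = 1
    for k in range(n + 1):
        L.setdefault(val, k)
        val = (val * gn) % p
    val = y % p                           # baby steps: y * g^i for i = 1 .. n
    for i in range(1, n + 1):
        val = (val * g) % p
        if val in L:
            return (L[val] * n - i) % q   # y * g^i = g^(kn)  =>  x = kn - i
    return None

p, q, g = 1019, 509, 4                    # p = 2q + 1, g of prime order q
x = 123
assert bsgs(g, pow(g, x, p), q, p) == x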
190 / 230
Discrete logarithm algorithms for Z∗p
The Index Calculus Algorithm computes discrete logarithms in the cyclic
group Z∗p . Unlike the generic algorithms, it has sub-exponential runtime

2^O(√(log p · log log p))

Therefore, the bit-length of the prime p in the cyclic group Z∗p has to be much
longer than that of a symmetric key of equivalent attack cost. In contrast, the
bit-length of the order q of the subgroup used merely has to be doubled.
There are groups believed to be not vulnerable to the Index Calculus Algorithm, obtained by
defining a group operator over points of an elliptic curve (EC) with coordinates in Zp or Fpn .

Equivalent key lengths: (NIST)


private key   RSA: factoring    DL in Z*p:   DL in Z*p:   DL in EC:
length        modulus n = pq    modulus p    order q      order q
80 bits       1024 bits         1024 bits    160 bits     160 bits
112 bits      2048 bits         2048 bits    224 bits     224 bits
128 bits      3072 bits         3072 bits    256 bits     256 bits
192 bits      7680 bits         7680 bits    384 bits     384 bits
256 bits      15360 bits        15360 bits   512 bits     512 bits
191 / 230
Schnorr groups – working in subgroups of Z∗p
Schnorr group: cyclic subgroup G = ⟨g⟩ ⊂ Z∗p with prime order
q = |G| = (p − 1)/r, where (p, q, g) are generated with:
1 Choose primes p ≫ q with p = qr + 1 for r ∈ N
2 Choose 1 < h < p with h^r mod p ≠ 1
3 Use g := h^r mod p as generator for G = ⟨g⟩ = {h^r mod p | h ∈ Z∗p}

Advantages:
I Select bit-length of p and q independently, based on respective
security requirements (e.g. 128-bit security: 3072-bit p, 256-bit q)
Difficulty of the discrete logarithm problem over G ⊆ Z∗p with order q = |G| depends on both
p (subexponentially) and q (exponentially).
I Some operations faster than if log2 q ≈ log2 p.
Square-and-multiply exponentiation g x mod p (with x < q) run-time ∼ log2 x < log2 q.
I Prime order q has several advantages:
• simple choice of generator (pick any element ≠ 1)
• G has no (non-trivial) subgroups ⇒ no small subgroup confinement
attacks
• a composite q with small prime factors can make the Decision Diffie–Hellman
problem easy to solve (Exercise 28)
Compare with slide 181 where r = 2.
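A toy Python sketch of the generation procedure (helper names are made up; the trial-division primality test and the tiny q are only for illustration, real parameters are hundreds of digits long and need Miller–Rabin tests):

import secrets

def is_prime(n):                          # trial division -- fine for toy sizes only
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def gen_schnorr_group(q):
    # Step 1: for the given prime q, find an even r such that p = qr + 1 is prime.
    assert is_prime(q)
    r = 2
    while not is_prime(q * r + 1):
        r += 2
    p = q * r + 1
    # Steps 2-3: pick 1 < h < p with g := h^r mod p != 1; g then generates G.
    while True:
        h = secrets.randbelow(p - 2) + 2
        g = pow(h, r, p)
        if g != 1:
            return p, q, r, g

p, q, r, g = gen_schnorr_group(1009)
assert pow(g, q, p) == 1 and g != 1       # subgroup membership check (see next slide)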
192 / 230
Schnorr groups (proofs)
Let p = rq + 1 with p, q prime and G = {h^r mod p | h ∈ Z∗p}. Then

1 G is a subgroup of Z∗p.
  Proof: G is closed under multiplication, as for all x, y ∈ G we have
  x^r y^r mod p = (xy)^r mod p = (xy mod p)^r mod p ∈ G as (xy mod p) ∈ Z∗p.
  In addition, G includes the neutral element 1^r = 1.
  For each h^r, it also includes the inverse element (h^−1)^r mod p.

2 G has q = (p − 1)/r elements.
  Proof: The idea is to show that the function f_r : Z∗p → G with f_r(x) = x^r mod p is an
  r-to-1 function, and then since |Z∗p| = p − 1 this will show that |G| = q = (p − 1)/r.
  Let g be a generator of Z∗p such that {g^0, g^1, . . . , g^(p−2)} = Z∗p. Under what condition for
  i, j is (g^i)^r ≡ (g^j)^r (mod p)? (g^i)^r ≡ (g^j)^r (mod p) ⇔ ir ≡ jr (mod p − 1) ⇔
  (p − 1)|(ir − jr) ⇔ rq|(ir − jr) ⇔ q|(i − j).
  For any fixed j ∈ {0, . . . , p − 2} = Z_(p−1), what values of i ∈ Z_(p−1) fulfill the condition
  q|(i − j), and how many such values i are there? For each j, there are exactly the r
  different values i ∈ {j, j + q, j + 2q, . . . , j + (r − 1)q} in Z_(p−1), as j + rq ≡ j
  (mod p − 1). This makes f_r an r-to-1 function.

3 For any h ∈ Z∗p, h^r is either 1 or a generator of G.
  Proof: h^r ∈ G (by definition) and |G| prime ⇒ ord_G(h^r) ∈ {1, |G|} (Lagrange).

4 h ∈ G ⇔ h ∈ Z∗p ∧ h^q mod p = 1. (Useful security check!)
  Proof: Let h = g^i with ⟨g⟩ = Z∗p and 0 ≤ i < p − 1. Then
  h^q mod p = 1 ⇔ g^(iq) mod p = 1 ⇔ iq mod (p − 1) = 0 ⇔ rq|iq ⇔ r|i.
Katz/Lindell (2nd ed.), section 8.3.3
193 / 230
Elliptic curves – the Weierstrass equation
An elliptic curve E over a field K is defined by the Weierstrass equation

y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6

with coefficients a1, a2, a3, a4, a6 ∈ K such that ∆(a1, a2, a3, a4, a6) ≠ 0.


The discriminant ∆ of E is defined as ∆ = −d_2^2 d_8 − 8 d_4^3 − 27 d_6^2 + 9 d_2 d_4 d_6 with d_2 = a_1^2 + 4a_2,
d_4 = 2a_4 + a_1 a_3, d_6 = a_3^2 + 4a_6, and d_8 = a_1^2 a_6 + 4 a_2 a_6 − a_1 a_3 a_4 + a_2 a_3^2 − a_4^2.
If ∆ ≠ 0 then the curve is smooth, i.e. it has no points with more than one tangent.

If L is any extension field of K, then the set of (“L-rational”) points on


curve E is defined as

E(L) = {(x, y) ∈ L × L : y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6 } ∪ {O}

The additional element O is called the “point at infinity”.


It will act as the neutral element when we define a group structure over E(L).

Elliptic curves were originally studied over the fields C, R, and Q, in the
context of elliptic integrals. In cryptography, they are used instead over
finite fields, in particular K = L = Zp as well as K = L = F2n .
194 / 230
Elliptic curves – simplified Weierstrass equations
An elliptic curve defined over a field K by the Weierstrass equation

y 2 + a1 xy + a3 y = x3 + a2 x2 + a4 x + a6

can be turned into an equivalent (isomorphic) one by changing variables:

(x, y) ↦ ( (x − 3a_1^2 − 12a_2)/36 , (y − 3a_1 x)/216 − (a_1^3 + 4a_1 a_2 − 12a_3)/24 )

This simplifies the curve equation significantly:

y 2 = x3 + ax + b

where a, b ∈ K and ∆ = −16(4a^3 + 27b^2) ≠ 0.


However, due to the divisions by 36 = 2^2·3^2, 216 = 2^3·3^3 and 24 = 2^3·3 in
the above change of variables, this trick does not work (would lead to
division by zero) if the characteristic of K is 2 or 3 (i.e., if 1 + 1 = 0 or
1 + 1 + 1 = 0).
195 / 230
Simplified Weierstrass equations if 1 + 1 = 0
If K has characteristic 2, two other changes of variable can be used to
simplify the Weierstrass equation:
I If a_1 ≠ 0 then
    (x, y) ↦ ( a_1^2 x + a_3/a_1 , a_1^3 y + (a_1^2 a_4 + a_3^2)/a_1^3 )
leads to the “non-supersingular” curve

y 2 + xy = x3 + ax2 + b
with discriminant ∆ = b ≠ 0.
I If a1 = 0 then
(x, y) 7→ (x + a2 , y)
leads to the “supersingular” curve

y 2 + cy = x3 + ax + b

with a, b, c ∈ K and ∆ = c^4 ≠ 0.
Similar tricks exist for K with characteristic 3, but such K are not commonly used in cryptography.
196 / 230
Elliptic-curve group operation

[Plots: an elliptic curve over R (a = −1, b = 1) and an elliptic curve over Z11
(a = −1, b = 1), showing two curve points P1, P2, the third line/curve
intersection P3, and the resulting sum P1 + P2.]

Elliptic curves over R or Zp (p > 3) are sets of 2-D coordinates (x, y) with
    y^2 = x^3 + ax + b    where 4a^3 + 27b^2 ≠ 0
plus one additional “point at infinity” O.
Group operation P1 + P2 : draw line through curve points P1 , P2 , intersect with
curve to get third point P3 , then negate the y coordinate of P3 to get P1 + P2 .
Neutral element: O – intersects any vertical line. Inverse: −(x, y) = (x, −y)
Curve compression: for any given x, encoding y requires only one bit
197 / 230
Elliptic-curve group operation over E(Zp )
E(Zp ) = {(x, y) | x, y ∈ Zp and y 2 ≡ x3 + ax + b (mod p)} ∪ {O}
where p > 3 is prime, parameters a, b ∈ Zp with 4a^3 + 27b^2 ≢ 0 (mod p).
I Neutral element: P + O = O + P = P for all P ∈ E(Zp )
I Negation: if P = (x, y) then −P = (x, −y) since P − P = O; −O = O
I Addition: for P1 = (x1, y1), P2 = (x2, y2), P1, P2 ≠ O, x1 ≠ x2:
      m = (y2 − y1)/(x2 − x1)                        line slope
      y = m · (x − x1) + y1                          line equation
      (m · (x − x1) + y1)^2 = x^3 + ax + b           intersections
      x3 = m^2 − x1 − x2                             third-point solution
      y3 = m · (x3 − x1) + y1
      (x1, y1) + (x2, y2) = (m^2 − x1 − x2, m · (x1 − x3) − y1)   (all of this mod p)
  If x1 = x2 but y1 ≠ y2 then P1 = −P2 and P1 + P2 = O.
I Doubling:
  If P1 = P2 and y1 = 0 then P1 + P2 = 2P1 = O.
  If P1 = P2 and y1 ≠ 0 then add using tangent m = (3x1^2 + a)/(2y1).
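These formulas fit into a few lines of Python (affine coordinates, with O represented as None; the toy curve a = −1, b = 1 over Z11 matches the earlier plots):

def ec_add(P1, P2, a, p):
    # Add two points on y^2 = x^3 + ax + b over Z_p (the point at infinity is None).
    if P1 is None:
        return P2
    if P2 is None:
        return P1
    (x1, y1), (x2, y2) = P1, P2
    if x1 == x2 and (y1 + y2) % p == 0:
        return None                                       # P + (-P) = O, incl. 2P = O if y = 0
    if P1 == P2:
        m = (3 * x1 * x1 + a) * pow(2 * y1, -1, p) % p    # tangent slope (doubling)
    else:
        m = (y2 - y1) * pow(x2 - x1, -1, p) % p           # chord slope (addition)
    x3 = (m * m - x1 - x2) % p
    y3 = (m * (x1 - x3) - y1) % p
    return (x3, y3)

a, b, p = -1, 1, 11              # b is only needed to check that points lie on the curve
print(ec_add((0, 1), (3, 5), a, p))   # addition -> (0, 10)
print(ec_add((0, 1), (0, 1), a, p))   # doubling -> (3, 6)
acc = None                            # x*G by repeated addition: here 10*G = O,
for _ in range(10):                   # since this curve group has 10 elements
    acc = ec_add(acc, (0, 1), a, p)
assert acc is None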
198 / 230
Non-supersingular curve group operation over E(F2n )
E(F2n ) = {(x, y) | x, y ∈ F2n and y 2 + xy = x3 + ax2 + b} ∪ {O}
where parameters a, b ∈ F2n with b ≠ 0.
I Neutral element: P + O = O + P = P for all P ∈ E(F2n )
I Negation: if P = (x, y) then −P = (x, x + y); −O = O
I Addition: for P1 = (x1, y1), P2 = (x2, y2), P1, P2 ≠ O, x1 ≠ x2:
      m = (y1 + y2)/(x1 + x2)
      x3 = m^2 + m + x1 + x2 + a
      y3 = m · (x1 + x3) + x3 + y1
      (x1, y1) + (x2, y2) = (x3, y3)

If x1 = x2 but y1 ≠ y2 then P1 = −P2 and P1 + P2 = O.


I Doubling (i.e. P1 = P2 = P):
  If y1 = 0 then P = −P, i.e. P1 + P2 = 2P = O.
  If y1 ≠ 0 then P ≠ −P and add using tangent m = x1 + y1/x1:
      x3 = m^2 + m + a = x1^2 + b/x1^2    and    y3 = x1^2 + m·x3 + x3
199 / 230
Projective coordinates for points on E(F2n )
When points P1 and P2 are represented as “affine” coordinates (x, y) in
curve equation y 2 + xy = x3 + ax2 + b, the point addition and doubling
operations involve expensive field divisions.
Several other projective 3D coordinate systems (X, Y, Z) have been
proposed that make the group operation cheaper, by avoiding division:
I Standard projective coordinates: (x, y) = (X/Z, Y /Z)
Curve: Y 2 Z + XY Z = X 3 + aX 2 Z + bZ 3 ,
O = (0, 1, 0) and −(X, Y, Z) = (X, X + Y, Z).

I Jacobian projective coordinates: (x, y) = (X/Z 2 , Y /Z 3 )


Curve: Y 2 + XY Z = X 3 + aX 2 Z 2 + bZ 6 ,
O = (1, 1, 0) and −(X, Y, Z) = (X, X + Y, Z).

I López-Dahab (LD) projective coordinates: (x, y) = (X/Z, Y /Z 2 )


Curve: Y 2 + XY Z = X 3 Z + aX 2 Z 2 + bZ 4 ,
O = (1, 0, 0) and −(X, Y, Z) = (X, X + Y, Z).
For Z = 1 projective and affine coordinates are identical, i.e. (x, y) = (X, Y, 1).
Equivalent projective coordinate systems also exist for E(Zp ): standard (x, y) = (X/Z, Y /Z),
Jacobian (x, y) = (X/Z 2 , Y /Z 3 ) and Chudnovsky (like Jacobian, but also store Z 2 and Z 3 ).
These all have slightly different performance trade-offs regarding the number of field additions,
multiplications and division required for point add and double operations.
200 / 230
Elliptic-curve groups with prime order
How large are elliptic curves over Zp ?
Equation y 2 = f (x) has two solutions if f (x) is a quadratic residue, and
one solution if f (x) = 0. Half of the elements in Z∗p are quadratic
residues, so expect around 2 · (p − 1)/2 + 1 = p points on the curve.
Hasse bound: p + 1 − 2√p ≤ |E(Zp, a, b)| ≤ p + 1 + 2√p
Actual group order: approximately uniformly spread over Hasse bound.
Elliptic curves became usable for cryptography with the invention of
efficient algorithms for counting the exact number of points on them.
E.g. Schoof’s algorithm for E(Zp ) and Satoh’s algorithm for E(F2n ).

Generate a cyclic elliptic-curve group (p, q, a, b, G) with:


1 Choose uniform n-bit prime p
2 Choose a, b ∈ Zp with 4a^3 + 27b^2 ≢ 0 (mod p), determine
q = |E(Zp , a, b)|, repeat until q is an n-bit prime
3 Choose G ∈ E(Zp , a, b) \ {O} as generator
Easy to find a point G = (x, y) on the curve: pick uniform x ∈ Zp until
f(x) is a quadratic residue or 0, then set y = √f(x).
201 / 230
Elliptic-curve discrete-logarithm problem
The elliptic-curve operation is traditionally written as an additive group,
so the “exponentiation” of the elliptic-curve discrete-logarithm problem
(ECDLP) becomes multiplication:
x · G = G + G + · · · + G      (x times),    x ∈ Zq

So the square-and-multiply algorithm becomes double-and-add, and Diffie–Hellman becomes


DH(x · G, y · G) = xy · G for x, y ∈ Z∗q.

Many curve parameters and cyclic subgroups for which ECDLP is


believed to be hard have been proposed or standardised.
Example: NIST P-256
p = 0xffffffff00000001000000000000000000000000ffffffffffffffffffffffff
q = 0xffffffff00000000ffffffffffffffffbce6faada7179e84f3b9cac2fc632551
a = −3
b = 0x5ac635d8aa3a93e7b3ebbd55769886bc651d06b0cc53b0f63bce3c3e27d2604b
G = (0x6b17d1f2e12c4247f8bce6e563a440f277037d812deb33a0f4a13945d898c296,
0x4fe342e2fe1a7f9b8ee7eb4a7c0f9e162bce33576b315ececbb6406837bf51f5)

Note: p = 2^256 − 2^224 + 2^192 + 2^96 − 1 and q ≈ 2^256 − 2^224 + 2^192 here are generalized resp. pseudo-
Mersenne primes, for fast mod calculation on 32-bit CPUs and good use of the 256-bit space.
202 / 230
Commonly used standard curves
NIST FIPS 186-4 has standardized five such elliptic curves over integer field
(Zp ) coordinates: P-192, P-224, P-256, P-384, P-521.
Also: five random curves of the form y 2 + xy = x3 + x2 + b over binary field
(F2n ) coordinates: B-163, B-233, B-283, B-409, B-571. The number of
points on these curves is twice the order of the base point G (“cofactor 2”).
And there are five Koblitz curves of the form y 2 + xy = x3 + ax2 + 1
(a ∈ {0, 1}, with cofactors 4 or 2, resp.), also over F2n :
K-163, K-233, K-283, K-409, K-571. (Koblitz: a, b ∈ {0, 1} ⇒ faster.)
Some mistrust the NIST parameters for potentially having been carefully selected by the NSA, to
embed a vulnerability. http://safecurves.cr.yp.to/rigid.html
Brainpool (RFC 5639): seven similar curves over Zp , chosen by the German government.
The Standards for Efficient Cryptography Group SEC 2 specification lists eight curves over Zp
(secp{192,224,256}{k,r}1, secp{384,521}r1) and 12 over F2n (sect163k1,. . . ,sect571r1).
(Vers. 2.0 dropped smaller secp{112,128,160}r{1,2}, secp160k1 and sect{113,131}r{1,2}.)
The numbers indicate the bit length of one coordinate, i.e. roughly twice the
equivalent symmetric-key strength.
ANSI X9.62 (and SEC 1) define a compact binary syntax for curve points.
Curve25519 was proposed by Daniel J. Bernstein in 2005 and has since
become a highly popular P-256 alternative due to faster implementation, better
resiliency against some implementation vulnerabilities (e.g., timing attacks),
lack of patents and worries about NSA backdoors.
203 / 230
ElGamal encryption scheme
The DH key exchange requires two messages. This can be eliminated if
everyone publishes their g x as a public key in a sort of phonebook.
Assume ((G, ·), q, g) are fixed for all participants.
A chooses secret key x ∈ Z∗q and publishes g x ∈ G as her public key.
B generates for each message a new nonce y ∈ Z∗q and then sends
B→A: g y , (g x )y · M
where M ∈ G is the message that B sends to A in this asymmetric
encryption scheme. Then A calculates
[(g x )y · M ] · [(g y )q−x ] = M
to decrypt M .
In practice, this scheme is rarely used because of the difficulty of fitting
M into G. Instead, B only sends g y . Then both parties calculate
K = h(g xy ) and use that as the private session key for an efficient
blockcipher-based authenticated encryption scheme that protects the
confidentiality and integrity of the bulk of the message M:
B→A: g y , EncK (M )
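A toy Python sketch of the group-element version (tiny stand-in parameters; the hybrid variant used in practice would replace the factor (g^x)^y · M by the symmetric ciphertext Enc_h(g^xy)(M)):

import secrets

p, q, g = 23, 11, 4                     # toy Schnorr group: p = 2q + 1, g of order q

x = secrets.randbelow(q - 1) + 1        # A's secret key
pub = pow(g, x, p)                      # A's public key g^x (from the "phonebook")

M = pow(g, 7, p)                        # plaintext must be a group element here
y = secrets.randbelow(q - 1) + 1        # B's fresh nonce, new for every message
C1 = pow(g, y, p)                       # B -> A : g^y
C2 = pow(pub, y, p) * M % p             #          (g^x)^y * M

assert C2 * pow(C1, q - x, p) % p == M  # A decrypts: [(g^x)^y * M] * [(g^y)^(q-x)] = M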
204 / 230
Number theory: easy and difficult problems
Easy:
I given integer n, i and x ∈ Z∗n : calculate x−1 ∈ Z∗n or xi ∈ Z∗n
I given prime p and polynomial f (x) ∈ Zp [x]:
find x ∈ Zp with f (x) = 0
runtime grows linearly with the degree of the polynomial

Difficult:
I given safe prime p, generator g ∈ Z∗p (or large subgroup):
• given value a ∈ Z∗p : find x such that a = g x .
→ Discrete Logarithm Problem
• given values g x , g y ∈ Z∗p : find g xy .
→ Computational Diffie–Hellman Problem
• given values g x , g y , z ∈ Z∗p : tell whether z = g xy .
→ Decision Diffie–Hellman Problem
I given a random n = p · q, where p and q are `-bit primes (` ≥ 1024):
• find integers p and q such that n = p · q in N
→ Factoring Problem
• given a polynomial f (x) of degree > 1:
find x ∈ Zn such that f (x) = 0 in Zn
205 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

206 / 230
“Textbook” RSA encryption
Key generation
I Choose random prime numbers p and q (each ≈ 1024 bits long)
I n := pq (≈ 2048 bits = key length) ϕ(n) = (p − 1)(q − 1)
I pick integer values e, d such that: ed mod ϕ(n) = 1
I public key PK := (n, e)
I secret key SK := (n, d)
Encryption
I input plaintext M ∈ Z∗n , public key (n, e)
I C := M e mod n
Decryption
I input ciphertext C ∈ Z∗n , secret key (n, d)
I M := C d mod n
In Zn: (M^e)^d = M^(ed) = M^(ed mod ϕ(n)) = M^1 = M.
Common implementation tricks to speed up computation:
I Choose small e with low Hamming weight (e.g., 3, 17, 2^16 + 1) for faster modular encryption
I Preserve factors of n in SK = (p, q, d), decryption in both Zp and Zq , use Chinese
remainder theorem to recover result in Zn .
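A toy Python version with the classic p = 61, q = 53 example (illustration only; pow(e, −1, phi) for modular inverses needs Python ≥ 3.8):

p, q = 61, 53                      # toy primes; real keys use ~1024-bit p and q
n = p * q                          # 3233
phi = (p - 1) * (q - 1)            # 3120
e = 17                             # public exponent with gcd(e, phi) = 1
d = pow(e, -1, phi)                # 2753, so that e*d mod phi(n) = 1

M = 65                             # plaintext in Z*_n
C = pow(M, e, n)                   # encryption: C = M^e mod n = 2790
assert pow(C, d, n) == M           # decryption: C^d mod n = M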
207 / 230
“Textbook” RSA is not secure
There are significant security problems with a naive application of the
basic “textbook” RSA encryption function C := P e mod n:

I deterministic encryption: cannot be CPA secure


I malleability:
• adversary intercepts C and replaces it with C 0 := X e · C
• recipient decrypts M 0 = DecSK (C 0 ) = X · M mod n
I chosen-ciphertext attack recovers plaintext:
• adversary intercepts C and replaces it with C 0 := Re · C mod n
• decryption oracle provides M 0 = DecSK (C 0 ) = R · M mod n
• adversary recovers M = M 0 · R−1 mod n
I Small value of M (e.g., 128-bit AES key), small exponent e = 3:
  • if M^e < n then C = M^e mod n = M^e and then M = ∛C can be
    calculated efficiently in Z (no modular arithmetic!)
I many other attacks exist . . .
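The small-M, e = 3 attack, for instance, needs nothing more than an integer cube root; the 2048-bit modulus below is an arbitrary stand-in, since the attack never looks at the factors of n:

import secrets

def icbrt(x):
    # integer cube root by binary search
    lo, hi = 0, 1 << (x.bit_length() // 3 + 2)
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if mid ** 3 <= x:
            lo = mid
        else:
            hi = mid - 1
    return lo

e = 3
n = 2 ** 2048 - 1                  # stand-in modulus (attack is independent of its factors)
M = secrets.randbits(128)          # small plaintext, e.g. an AES session key
C = pow(M, e, n)                   # M^3 < n, so no modular reduction ever happened
assert icbrt(C) == M               # adversary recovers M without any secret key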
208 / 230
Trapdoor permutations
A trapdoor permutation is a tuple of polynomial-time algorithms
(Gen, F, F −1 ) such that
I the key generation algorithm Gen receives a security parameter `
and outputs a pair of keys (PK , SK ) ← Gen(1` ), with key lengths
|PK | ≥ `, |SK | ≥ `;
I the sampling function F maps a public key PK and a value x ∈ X
to a value y := FPK (x) ∈ X ;
I the inverting function F^−1 maps a secret key SK and a value
  y ∈ X to a value x := F^−1_SK(y) ∈ X;
I for all `, (PK, SK) ← Gen(1`), x ∈ X: F^−1_SK(F_PK(x)) = x.

In practice, the domain X may depend on PK .

This looks almost like the definition of a public-key encryption scheme,


the difference being
I F is deterministic;
I the associated security definition.
209 / 230
Secure trapdoor permutations
Trapdoor permutation: Π = (Gen, F, F −1 )

Experiment/game TDInv_A,Π(`):

  challenger: (PK, SK) ← Gen(1`), x ∈R X, y := F_PK(x)
  challenger sends (PK, y) to adversary A, which returns x′

1 The challenger generates a key pair (PK , SK ) ← Gen(1` ) and a


random value x ∈R X from the domain of FPK .
2 The adversary A is given inputs PK and y := FPK (x).
3 Finally, A outputs x0 .

If x0 = x then A has succeeded: TDInvA,Π (`) = 1.

A trapdoor permutation Π is secure if for all probabilistic polynomial time


adversaries A the probability of success P(TDInvA,Π (`) = 1) is negligible.
While the definition of a trapdoor permutation resembles that of a public-key encryption scheme,
its security definition does not provide the adversary any control over the input (plaintext).
210 / 230
Public-key encryption scheme from trapdoor permutation
Trapdoor permutation: ΠTD = (GenTD , F, F −1 ) with FPK : X ↔ X
Authentic. encrypt. scheme: ΠAE = (GenAE , Enc, Dec), key space K
Secure hash function h : X → K

We define the public-key encryption scheme Π′ = (Gen′, Enc′, Dec′):

I Gen′: output key pair (PK, SK) ← Gen_TD(1`)
I Enc′: on input of plaintext message M, generate random x ∈R X,
  y = F_PK(x), K = h(x), C ← Enc_K(M), output ciphertext (y, C);
I Dec′: on input of a ciphertext (y, C), recover
  K = h(F^−1_SK(y)), output Dec_K(C)

Encrypted message: F (x), Ench(x) (M )

The trapdoor permutation is only used to communicate a “session key” h(x), the actual message
is protected by a symmetric authenticated encryption scheme. The adversary A in the PubK^cca_A,Π′
game has no influence over the input of F.

If hash function h is replaced with a “random oracle” (something that


just picks a random output value for each input from X ), the resulting
public-key encryption scheme Π′ is CCA secure.
211 / 230
Using RSA as a CCA-secure encryption scheme
Solution 1: use only as trapdoor function to build encryption scheme

I Pick random value x ∈ Z∗n


I Ciphertext is (xe mod n, Ench(x) (M )), where Enc is from an
authenticated encryption scheme

Solution 2: Optimal Asymmetric Encryption Padding

Make M (with zero padding) the left half,


and a random string R the right half, of the
input of a two-round Feistel cipher, using a
secure hash function as the round function.

Interpret the result (X, Y) as an integer M′.
Then calculate C := M′^e mod n.
(OAEP diagram: Wikipedia/Ozga)

PKCS #1 v2.0
212 / 230
Practical pitfalls with implementing RSA

I low entropy of random-number generator seed when generating p


and q (e.g. in embedded devices):
• take public RSA moduli n1 and n2 from two devices
• test gcd(n1, n2) ?= 1 ⇒ if not, n1 and n2 share the prime factor gcd(n1, n2)
• February 2012 experiments: worked for many public HTTPS keys
Lenstra et al.: Public keys, CRYPTO 2012
Heninger et al.: Mining your Ps and Qs, USENIX Security 2012.

213 / 230
1 Historic ciphers
2 Perfect secrecy
3 Semantic security
4 Block ciphers
5 Modes of operation
6 Message authenticity
7 Authenticated encryption
8 Secure hash functions
9 Secure hash applications
10 Key distribution problem
11 Number theory and group theory
12 Discrete logarithm problem
13 RSA trapdoor permutation
14 Digital signatures

214 / 230
One-time signatures

A simple digital signature scheme can be built using a one-way function h


(e.g., secure hash function):
Secret key: 2n random bit strings Ri,j (i ∈ {0, 1}, 1 ≤ j ≤ n)
Public key: 2n bit strings h(Ri,j )
Signature: (Rb1 ,1 , Rb2 ,2 , . . . , Rbn ,n ), where h(M ) = b1 b2 . . . bn
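A Python sketch of this construction (Lamport's one-time signature) with h = SHA-256; note that each key pair must only ever sign a single message:

import hashlib, secrets

def H(data):
    return hashlib.sha256(data).digest()

n = 256                                              # we sign the n-bit hash h(M)
R = [[secrets.token_bytes(32) for _ in range(n)]     # secret key: 2n random strings R[i][j]
     for _ in range(2)]
PK = [[H(R[i][j]) for j in range(n)]                 # public key: their hashes h(R[i][j])
      for i in range(2)]

def bits(msg):                                       # b1 b2 ... bn = h(M)
    d = H(msg)
    return [(d[j // 8] >> (7 - j % 8)) & 1 for j in range(n)]

def sign(msg):                                       # reveal one preimage per hash bit
    return [R[b][j] for j, b in enumerate(bits(msg))]

def verify(msg, sig):
    return all(H(sig[j]) == PK[b][j] for j, b in enumerate(bits(msg)))

sig = sign(b'attack at dawn')
assert verify(b'attack at dawn', sig)
assert not verify(b'attack at dusk', sig)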

215 / 230
RSA signatures
Basic idea: n = pq, ed ≡ 1 (mod φ(n)), PK = (e, n), SK = (d, n)

S = Sign_SK(M) := M^d mod n
Vrfy_PK(M, S) := (S^e mod n ?= M)

This “textbook” RSA signature, where adversary has free choice of


message M ∈ Z∗n, is completely insecure (no existential unforgeability):
I No-message attack: pick any S and present (M, S) with
M := S e mod n to challenger
I Choose message M , factor it into M = M1 M2 , query oracle for
signatures S1 ≡ M1d , S2 ≡ M2d , present (M, S1 S2 mod n)
I If M and e are small (e.g., e = 3, M < 2^256 SHA-256 hash,
  ⌈log2 n⌉ = 2048), then M^e < n and S = ᵉ√M may be an integer
Solution: RSA with full-domain hashing (RSA-FDH, PKCS #1 v2.1).
Use a collision-resistant H : {0, 1}∗ → Z∗n and S := [H(M )]d mod n.
There is also RSA-PSS which adds a “salt” value for randomization.

216 / 230
Schnorr identification scheme
Cyclic group G of prime order q with generator g, DLOG hard
Secret key: x ∈R Z∗q , public key: y = g x
Prover P picks k ∈R Z∗q , Verifier V picks r ∈R Z∗q
P →V : I := g k
V →P : r
P →V : s := rx + k mod q
Verifier checks
    g^s · y^(−r) ?= I
Works because:
    g^s · y^(−r) = g^(rx+k) · (g^x)^(−r) = g^(rx+k−rx) = g^k = I
As secure as DLOG: an attacker who can find s1 , s2 for two challenges
r1 , r2 with same I could also calculate any discrete logarithm:
    g^(s1) · y^(−r1) = I = g^(s2) · y^(−r2)
    g^(s1−s2) = y^(r1−r2) = g^(x(r1−r2))
    log_g y = (s1 − s2)(r1 − r2)^(−1) mod q
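A Python sketch of one protocol run over a toy group (p = 1019, q = 509, g = 4; illustrative sizes only):

import secrets

p, q, g = 1019, 509, 4                  # toy group: p = 2q + 1, g of prime order q

x = secrets.randbelow(q - 1) + 1        # prover's secret key
y = pow(g, x, p)                        # prover's public key

k = secrets.randbelow(q - 1) + 1        # prover's fresh commitment nonce
I = pow(g, k, p)                        # P -> V : I = g^k
r = secrets.randbelow(q - 1) + 1        # V -> P : random challenge r
s = (r * x + k) % q                     # P -> V : s = r*x + k mod q

# Verifier checks g^s * y^(-r) = I  (y^(-r) computed as y^(q-r), since y has order q)
assert pow(g, s, p) * pow(y, (q - r) % q, p) % p == I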
217 / 230
Digital Signature Algorithm (DSA)
Let (G, q, g) be system-wide choices of a cyclic group G of order q with
generator g. In addition, we need two functions H : {0, 1}∗ → Zq and
F : G → Zq where H must be collision resistant.
Both H and F are random oracles in security proofs, but common F not even preimage resistant.
Key generation: uniform secret key x ∈ Zq , then public key y := g x ∈ G.
Signing: On input of a secret key x ∈ Zq and a message m ∈ {0, 1}∗ , first
choose (for each message!) uniformly at random k ∈ Z∗q and set r := F (g k ).
Then solve the linear equation
k · s − x · r ≡ H(m) (mod q) (1)
for s := k−1 · (H(m) + xr) mod q. If r = 0 or s = 0, restart with a fresh k,
otherwise output Signx (m) ← (r, s).
Verification: On input of public key y, message m, and signature (r, s), verify
equation (1) after both sides have been turned into exponents of g:
    g^(ks) / g^(xr) = g^(H(m))                                      (2)
    (g^k)^s = g^(H(m)) · y^r                                        (3)
    g^k = g^(H(m)·s^(−1)) · y^(r·s^(−1))                            (4)
    ⇒ actually verify:  r ?= F( g^(H(m)·s^(−1)) · y^(r·s^(−1)) )    (5)
218 / 230
DSA variants
ElGamal signature scheme
The DSA idea was originally proposed by ElGamal with G = Z∗p ,
ord(g) = q = p − 1 and F (x) = x.
Unless the p and g are chosen more carefully, ElGamal signatures can be vulnerable to forgery:
D. Bleichenbacher: Generating ElGamal signatures without knowing the secret key.
EUROCRYPT ’96. http://www.springerlink.com/link.asp?id=xbwmv0b564gwlq7a

NIST DSA
In 1993, the US government standardized the Digital Signature
Algorithm, a modification of the ElGamal signature scheme where
I G is a prime-order subgroup of Z∗p
I prime number p (1024 bits), prime number q (160 bits) divides p − 1
I g = h(p−1)/q mod p, with 1 < h < p − 1 so that g > 1 (e.g., h = 2)
I H is SHA-1
I F (x) = x mod q
Generate key: random 0 < x < q, y := g x mod p.
Signature (r, s) := Signx (m): random 0 < k < q,
r := (g k mod p) mod q, s := (k −1 (H(m) + x · r)) mod q
Later versions of the DSA standard FIPS 186 added larger values for (p, q, g), as well as ECDSA,
where G is one of several elliptic-curve groups over Zp or F2n and F ((x, y)) = x mod q.
219 / 230
Elliptic-Curve Digital Signature Algorithm (ECDSA)
System-wide domain parameters:
order q of finite field Fq , a representation of Fq , curve parameters a and b, base
point P = (xP , yP ) ∈ E(Fq ), prime order n of P , cofactor h = |E(Fq )|/n.
ECDSA KeyGen: secret key d ∈R Z∗n , public key Q := dP

ECDSA Sign(m, d):


select k ∈R Z∗n
(x1 , y1 ) := kP , convert x1 to integer x̄1
r := x̄1 mod n, if r = 0 restart with new k
e := H(m)
s := k−1 (e + dr) mod n, if s = 0 restart
return (r, s)

ECDSA Vrfy(m, r, s, Q):


reject unless r, s ∈ Z∗n
e := H(m)
w := s−1 mod n, u1 := ew mod n, u2 := rw mod n
X := u1 P + u2 Q, reject if X = O
v := x̄1 mod n where x̄1 is x1 of X converted to integer
accept signature if v = r, otherwise reject signature
220 / 230
Proper generation of k is important

DSA fails catastrophically if the adversary can ever guess k:

s ≡ k −1 · (H(m) + xr) ⇒ x ≡ (k · s − H(m)) · r−1 (mod q)

All that is needed for k to leak is two messages m ≠ m′ signed with the
same k = k′ (easily recognized from r = r′ = F(g^k)):

    s ≡ k^(−1) · (H(m) + xr)              (mod q)
    s′ ≡ k^(−1) · (H(m′) + xr)            (mod q)
    s − s′ ≡ k^(−1) · (H(m) − H(m′))      (mod q)
    k ≡ (H(m) − H(m′)) · (s − s′)^(−1)    (mod q)

Sony used a fixed k in firmware signatures for their PlayStation 3


(fail0verflow, 27th Chaos Communication Conf., Berlin 2010).
Without a good random-bit generator to generate k, use e.g.
k := SHA-3(x ‖ m) mod q (with hash output longer than q).
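The recovery itself is a few lines of modular arithmetic; the sketch below first creates two toy DSA signatures with a reused k (stand-in hash values, tiny parameters) and then recovers both k and x from them:

def recover_key(q, r, s1, s2, h1, h2):
    # two signatures (r, s1), (r, s2) on hashes h1 != h2 made with the same k
    k = (h1 - h2) * pow(s1 - s2, -1, q) % q     # k = (H(m) - H(m'))/(s - s') mod q
    x = (k * s1 - h1) * pow(r, -1, q) % q       # x = (k*s - H(m))/r mod q
    return k, x

p, q, g = 1019, 509, 4                          # toy group, F(.) = (. mod q) as in DSA
x, k = 123, 77                                  # secret key and the reused nonce
r = pow(g, k, p) % q
h1, h2 = 200, 333                               # stand-ins for H(m), H(m')
s1 = pow(k, -1, q) * (h1 + x * r) % q
s2 = pow(k, -1, q) * (h2 + x * r) % q
assert recover_key(q, r, s1, s2, h1, h2) == (k, x)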

222 / 230
Public-key infrastructure I

Public key encryption and signature algorithms allow the establishment of


confidential and authenticated communication links with the owners of
public/secret key pairs.
Public keys still need to be reliably associated with identities of owners.
In the absence of a personal exchange of public keys, this can be
mediated via a trusted third party. Such a certification authority C issues
a digitally signed public key certificate

CertC (A) = (A, PK A , T, L, N, SignSK C (A, PK A , T, L, N ))

in which C confirms that the public key PK A belongs to entity A,


starting at time T and that this confirmation is valid for the time interval
L, and all this has a serial number N and is digitally signed with C’s
secret signing key SK C .
Anyone who knows C’s public key PK C from a trustworthy source can
use it to verify the certificate CertC (A) and obtain a trustworthy copy of
A’s public key PK A this way.

223 / 230
Public-key infrastructure II

We can use the operator • to describe the extraction of A’s public key
PK A from a certificate CertC (A) with the certification authority public
key PK C :

PK_C • Cert_C(A) =  PK_A      if certificate valid
                    failure   otherwise

The • operation involves not only the verification of the certificate


signature, but also the validity time and other restrictions specified in the
signature. For instance, a certificate issued by C might contain a
reference to an online certificate revocation list published by C, which
lists the serial numbers N of all certificates of public keys that might
have become compromised (e.g., the smartcard containing SK A was
stolen or the server storing SK A was broken into) and whose certificates
have not yet expired.

224 / 230
Public-key infrastructure III
Public keys can also be verified via several trusted intermediaries in a
certificate chain:

PK C1 • CertC1 (C2 ) • CertC2 (C3 ) • · · · • CertCn−1 (Cn ) • CertCn (B) = PK B

A has received directly a trustworthy copy of PK C1 (which many


implementations store locally as a certificate CertA (C1 ) to minimise the
number of keys that must be kept in tamper-resistant storage).
Certification authorities could be made part of a hierarchical tree, in
which members of layer n verify the identity of members in layer n − 1
and n + 1. For example layer 1 can be a national CA, layer 2 the
computing services of universities and layer 3 the system administrators
of individual departments.
Practical example: A personally receives PK_C1 from her local system administrator C1, who
confirmed the identity of the university’s computing service C2 in Cert_C1(C2), who confirmed the
national network operator C3, who confirmed the IT department of B’s employer C4, who finally
confirms the identity of B. An online directory service allows A to retrieve all these certificates
(plus related certificate revocation lists) efficiently.
(plus related certificate revocation lists) efficiently.

In today’s Transport Layer Security (TLS) practice (HTTPS, etc.), most private users use their
web-browser or operating-system vendor as their sole trusted source of PK C1 root keys.

225 / 230
Example of an X.509 certificate
$ openssl s_client -showcerts -connect www.cst.cam.ac.uk:443 >cst.crt
depth=2 C = BM, O = QuoVadis Limited, CN = QuoVadis Root CA 2 G3
depth=1 C = BM, O = QuoVadis Limited, CN = QuoVadis Global SSL ICA G3
depth=0 C = GB, ST = Cambridgeshire, L = CAMBRIDGE, O = University of Cambridge, OU = UIS, CN = www.cst.cam.ac.uk
$ openssl x509 -text -noout -in cst.crt
Certificate:
Data:
Version: 3 (0x2)
Serial Number:
03:07:fb:5f:76:c8:1d:8f:7e:e3:5e:d6:ab:71:5d:a5:1a:c0:e6:70
Signature Algorithm: sha256WithRSAEncryption
Issuer: C = BM, O = QuoVadis Limited, CN = QuoVadis Global SSL ICA G3
Validity
Not Before: Sep 13 15:33:05 2017 GMT
Not After : Sep 13 15:43:00 2020 GMT
Subject: C = GB, ST = Cambridgeshire, L = CAMBRIDGE, O = University of Cambridge, OU = UIS, CN = www.cst.cam.ac.uk
Subject Public Key Info:
Public Key Algorithm: rsaEncryption
RSA Public-Key: (2048 bit)
Modulus:
00:c3:cb:9f:77:9d:c8:09:f1:b3:aa:f7:8f:34:e5:
8e:92:45:50:a7:35:ee:09:27:ec:b8:7c:af:07:9d:
f4:30:79:37:2e:81:ad:85:ff:f9:c4:93:ca:27:28:
a0:d1:93:27:9f:0f:b4:c8:1c:55:7b:f9:90:46:c5:
69:56:40:b5:90:d6:34:06:8f:cc:b7:e0:31:3e:2f:
d9:2f:57:13:f5:4f:7d:ef:d8:7e:16:5a:d7:72:14:
e4:0b:13:87:b2:df:2a:0f:30:f1:6d:9a:b6:0b:c8:
e0:87:0f:b5:72:a0:b9:07:2e:48:32:bb:12:dd:d3:
96:21:1c:c9:94:8f:47:1d:9a:3b:35:ec:20:45:38:
06:15:ed:4e:43:80:96:94:90:fc:25:05:88:f6:7b:
cb:27:a9:49:e4:80:20:e7:f1:f0:23:05:e8:91:77:
c3:04:2e:2e:33:ca:76:fc:00:17:a4:93:88:f4:e1:
66:ac:51:56:8d:27:91:2d:d2:5a:e7:01:83:b9:ab:
c1:94:38:54:5a:f4:e9:dc:c5:91:96:b7:17:c6:55:
bf:ea:85:41:d5:0b:d7:09:62:26:01:c9:e1:fe:b3:
57:e0:b8:f9:fc:0b:76:65:6c:43:f0:5f:30:77:38:
17:00:df:6a:27:25:1d:30:9e:19:01:e9:0c:78:cc:
c9:3d
Exponent: 65537 (0x10001)
X509v3 extensions:
X509v3 Basic Constraints:
CA:FALSE
X509v3 Authority Key Identifier:
keyid:B3:12:89:B5:A9:4B:35:BC:15:00:F0:80:E9:D8:78:87:F1:13:7C:76

Authority Information Access:


CA Issuers - URI:http://trust.quovadisglobal.com/qvsslg3.crt
OCSP - URI:http://ocsp.quovadisglobal.com

X509v3 Subject Alternative Name:


DNS:www.cst.cam.ac.uk
X509v3 Certificate Policies:
Policy: 1.3.6.1.4.1.8024.0.2.100.1.1
CPS: http://www.quovadisglobal.com/repository

X509v3 Extended Key Usage:


TLS Web Client Authentication, TLS Web Server Authentication
X509v3 CRL Distribution Points:

Full Name:
URI:http://crl.quovadisglobal.com/qvsslg3.crl

X509v3 Subject Key Identifier:


16:19:61:83:6F:AE:05:98:FA:62:9D:B8:AA:99:4B:C5:0A:04:8B:D9
X509v3 Key Usage: critical
Digital Signature, Key Encipherment
Signature Algorithm: sha256WithRSAEncryption
8a:94:59:8a:70:00:c5:f9:1b:25:f0:04:c7:b8:79:46:6b:0c:
2b:d1:01:1a:9e:83:4a:53:f9:c5:45:38:85:1d:f3:32:8a:f8:
03:a2:bd:6d:f7:e6:5b:97:81:11:7d:c6:9c:d0:78:01:2a:f4:
8a:d1:51:13:b0:a4:72:cc:40:55:8f:e8:bb:b1:ff:f9:66:0a:
2d:fe:9c:69:12:2c:32:45:bc:0b:39:02:ff:68:12:94:ad:04:
a2:4a:f0:5b:c6:cb:6d:d2:38:ad:dd:be:4c:1c:19:d2:69:b3:
eb:98:e7:8e:bf:84:cb:a2:7c:07:47:de:99:d9:79:6e:e0:7a:
01:89:a5:93:de:70:b5:69:56:1f:09:8d:15:04:88:e5:59:86:
65:b3:fb:42:24:db:86:8c:d5:f7:f3:1c:cc:cd:7c:79:d4:32:
4a:70:b7:f8:87:b3:0e:9b:93:ef:99:7f:a4:27:fb:7d:03:93:
[... 16 lines deleted ...]
5c:76:39:6d:51:dc:80:2e:cf:96:90:0c:b8:f1:ed:88:c8:c2:
27:69:fe:0d:b9:ec:48:da:d4:f3:79:77:e1:3a:15:be:03:58:
a6:d1:74:d7:4e:ec:d1:17

$ ls -l /etc/ssl/certs/
Outlook

Modern cryptography is still a young discipline (born in the early 1980s),


but well on its way from a collection of tricks to a discipline with solid
theoretical foundations.
Some important concepts that we did not touch here for time reasons:
I password-authenticated key exchange
I identity-based encryption
I side-channel and fault attacks
I application protocols: electronic voting, digital cash, etc.
I secure multi-party computation
I post-quantum cryptography

226 / 230
Appendix

227 / 230
Some basic discrete mathematics notation
I |A| is the number of elements (size) of the finite set A.
I A1 × A2 × · · · × An is the set of all n-tuples (a1 , a2 , . . . , an ) with
a1 ∈ A1 , a2 ∈ A2 , etc. If all the sets Ai (1 ≤ i ≤ n) are finite:
|A1 × A2 × · · · × An | = |A1 | · |A2 | · · · · · |An |.
I An is the set of all n-tuples (a1 , a2 , . . . , an ) = a1 a2 . . . an with
a1 , a2 , . . . , an ∈ A. If A is finite then |An | = |A|n .
I A^≤n = ⋃_{i=0}^{n} A^i and A^∗ = ⋃_{i=0}^{∞} A^i
I Function f : A → B maps each element of A to an element of B:
a 7→ f (a) or b = f (a) with a ∈ A and b ∈ B.
I A function f : A1 × A2 × · · · × An → B maps each parameter tuple
to an element of B: (a1 , a2 , . . . , an ) 7→ f (a1 , a2 , . . . , an ) or
f (a1 , a2 , . . . , an ) = b.
I A permutation f : A ↔ A maps A onto itself and is invertible:
x = f −1 (f (x)). There are | Perm(A)| = |A|! = 1 · 2 · · · · · |A|
permutations over A.
I B^A is the set of all functions of the form f : A → B. If A and B
are finite, there will be |B^A| = |B|^|A| such functions.
228 / 230
Confidentiality games at a glance

PrivK^eav:    challenger picks b ∈R {0,1}, K ← Gen(1`);
              A sends M0, M1 and receives C ← Enc_K(Mb); A outputs b′.

PrivK^mult:   challenger picks b ∈R {0,1}, K ← Gen(1`);
              A sends two message sequences M0^1,...,M0^t and M1^1,...,M1^t
              and receives C^1,...,C^t with C^i ← Enc_K(Mb^i); A outputs b′.

PrivK^cpa:    challenger picks b ∈R {0,1}, K ← Gen(1`);
              A can query C^i ← Enc_K(M^i) for chosen messages M^1, M^2, ...,
              both before and after sending the challenge pair M0, M1,
              for which it receives C ← Enc_K(Mb); A outputs b′.

229 / 230
Integrity games at a glance

PrivK^cca:    challenger picks b ∈R {0,1}, K ← Gen(1`);
              A can query C^i ← Enc_K(M^i) and M^i ← Dec_K(C^i) for chosen
              messages and ciphertexts, both before and after sending the
              challenge pair M0, M1, for which it receives C ← Enc_K(Mb)
              (later decryption queries must use C^i ≠ C); A outputs b′.

Mac-forge:    challenger: K ← Gen(1`);
              A queries tags T^i ← Mac_K(M^i) for chosen messages M^1,...,M^t,
              then outputs (M, T) with M ∉ {M^1,...,M^t};
              game outputs b := Vrfy_K(M, T).

CI:           challenger: K ← Gen(1`);
              A queries ciphertexts C^i ← Enc_K(M^i) for chosen messages,
              then outputs C ∉ {C^1,...,C^t};
              game outputs b := 1 if Dec_K(C) ≠ ⊥, else b := 0.

230 / 230
