Delta Functions
Delta Functions
044 Notes
1 Introduction
The (Dirac) δ-function is a mathematical representation of the physical concept
of the density associated with a point object.1 Consider charge density, for
example. The charge density ρ(x) associated with a point charge of unit strength
at a point x0 has the property that it vanishes everywhere except when x = x0 .
Its integral over any volume containing the point x0 is one, but its integral over
any volume not containing x0 is zero. We represent this behavior by a symbol
δ(x − x0 ). Summarizing its properties,
δ(x − x0 ) = 0, for x ̸= x0
Z
dx δ(x − x0 ) = 1, if x0 ∈ V
V
Z
dx δ(x − x0 ) = 0, if x0 ∈
/ V. (1)
V
as long as the region of integration V includes the point x0 . This follows because
the δ-function vanishes everywhere except at x0 where f takes the value f (x0 ).
The δ-function is not really a function as mathematicians define “function”.
Among other things, any function of the real numbers that is zero everywhere
except at one point must integrate to zero. Instead, the δ-function is an example
of a generalized function, another category of useful mathematical objects, which
have been studied in depth. We are only going to need very basic properties of
δ-functions, which we will introduce informally in these notes. 2
Physicists like to represent δ-functions as the limit of a sequence of sharply
peaked ordinary functions of unit strength of ever narrower profile and ever
greater peak height. It is also useful to think of δ-functions abstractly as a
rule for doing integrals as eq. (2) suggests. In these notes we first explore some
of the general properties of integrals involving δ-functions without introducing
any explicit representation for the δ-function. Then we introduce some of the
commonly used representations of δ-functions. Although the definition, eq. (1),
refers to any number of dimensions, we need only consider δ-functions of a single
variable.
1 The δ-function was introduced into physics by Paul Dirac in the 1920’s in his formulation
of quantum mechanics.
2 You can read about generalized functions in the classic book by M. J. Lighthill, An
Example
processing.
©R. L. Jaffe MIT 8.044 Notes 5
This is almost, but not quite correct. It is missing an absolute value sign on
g ′ (x) and it failed to pick up the possibility that there may be several values of
x corresponding to a single value of y.
Both of these problems are cured, and the correct result is obtained if we use
δ-functions to define the probability distributions. Here are the steps to obtain
the correct result:
1. Write the identity
Z
dP
p(x) = = dx′ δ(x − x′ )p(x′ ) ,
dx
4 Although it is not part of 8.044 — the Fourier transform of a function f (x) is defined by
1
dxeikx f (x) and the inverse theorem says f (x) = dke−ikx g(k).
R R
g(k) = 2π
©R. L. Jaffe MIT 8.044 Notes 6
where the integration goes over the whole range of values allowed for x.
We can interpret this as an integral overall probabilities subject to the
constraint that the variable x′ takes on the value x.
2. Then define Z
dP
= dx′ δ(y − g(x′ ))p(x′ ) . (11)
dy
This equation has the interpretation of integrating over all probabilities
subject to the constraint that the function g(x′ ) takes on the value y.
3. Evaluate eq. (11) by the usual rules for δ-functions. First find all the zeros
of the argument of δ,
which is the correct result including both the absolute value and the sum over
possible multiple values of x that correspond to a given y.5
An example will illustrate the method.
5 It agrees with the “find the cumulant and then differentiate” approach described in Prof.
Greytak’s notes.
©R. L. Jaffe MIT 8.044 Notes 7
Example
• Step 2: Find zeros of x − cos θ. There are no solutions for |x| > 1,
so dP/dx = 0 for |x| > 1. For |x| < 1, it is clear from the figure
that there are two solutions for θ for every value of x: θ1 = arccos(x)
and θ2 = 2π − arccos(x). (Remember that the range of the arc-cosine
function usually taken to be 0 < θ < π.)
• Step 3: Compute |g ′ (xj )|.
d p
| (x − cos θ)| = | sin θj | = 1 − x2 ,
dθ θj