The fundamental theorem of calculus specifies the relationship between the two central operations of calculus: differentiation and integration.
The first part of the theorem, sometimes called the
first fundamental theorem of calculus, shows that an
indefinite integration[1] can be reversed by a differentiation. The first part is also important because it guarantees the existence of
antiderivatives for
continuous functions.
[2]
The second part, sometimes called the second fundamental theorem of calculus, allows one to compute the definite integral of a function by using any one of its infinitely many antiderivatives. This part of the theorem has invaluable practical applications, because it markedly simplifies the computation of definite integrals.
The first published statement and proof of a restricted version of the fundamental theorem was by James Gregory (1638–1675).[3] Isaac Barrow (1630–1677) proved the first completely general version of the theorem, while Barrow's student Isaac Newton (1643–1727) completed the development of the surrounding mathematical theory. Gottfried Leibniz (1646–1716) systematized the knowledge into a calculus for infinitesimal quantities.
Physical intuition
Intuitively, the theorem simply states that the sum of infinitesimal changes in a quantity over time (or some other quantity) adds up to the net change in the quantity.
In the case of a particle traveling in a straight line, its position, x, is given by x(t), where t is time and x(t) means that x is a function of t. The derivative of this function is equal to the infinitesimal change in quantity, dx, per infinitesimal change in time, dt (of course, the derivative itself is dependent on time). This change in displacement per change in time is the velocity v of the particle. In Leibniz's notation:

    v = dx/dt.
Rearranging this equation, it follows that:

    dx = v dt.
By the logic above, a change in x (or Δx) is the sum of the infinitesimal changes dx. It is also equal to the sum of the infinitesimal products of the derivative and time. This infinite summation is integration; hence, the integration operation allows the recovery of the original function from its derivative. It can be concluded that this operation works in reverse; the result of the integral can be differentiated to recover the original function.
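To make the summation idea concrete, here is a minimal numerical sketch in Python (an added illustration, not part of the theorem; it assumes the example position function x(t) = t², so v(t) = 2t): summing the products v(t)·Δt over many small time steps approximates the net change in x.

    # Illustrative sketch (assumed example): x(t) = t**2, so v(t) = dx/dt = 2*t.
    # Summing the products v(t) * dt over small steps approximates the net change in x.
    def v(t):
        return 2.0 * t

    n = 10_000                      # number of small time steps on [0, 1]
    dt = 1.0 / n
    total = sum(v(i * dt) * dt for i in range(n))

    print(total)                    # approximately 1.0
    print(1.0**2 - 0.0**2)          # exact net change x(1) - x(0) = 1.0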
Geometric intuition
The area shaded in red stripes can be computed as h times ƒ(x). Alternatively, if the function A(x) were known, it could be estimated as A(x + h) − A(x). These two values are approximately equal, particularly for small h.
For a continuous function y = ƒ(x) whose graph is plotted as a curve, each value of x has a corresponding area function A(x), representing the area beneath the curve between 0 and x. The function A(x) may not be known, but it is given that it represents the area under the curve.
The area under the curve between x and x + h could be computed by finding the area between 0 and x + h, then subtracting the area between 0 and x. In other words, the area of this “sliver” would be A(x + h) − A(x).
There is another way to estimate the area of this same sliver. h is multiplied by ƒ(x) to find the area of a rectangle that is approximately the same size as this sliver. It is intuitive that the approximation improves as h grows smaller.
At this point, it is true that A(x + h) − A(x) is approximately equal to ƒ(x)·h. In other words, ƒ(x)·h ≈ A(x + h) − A(x), with this approximation becoming an equality as h approaches 0 in the limit.
When both sides of the equation are divided by h:

    ƒ(x) ≈ (A(x + h) − A(x)) / h.

As h approaches 0, it can be seen that the right-hand side of this equation is simply the derivative A′(x) of the area function A(x). The left-hand side of the equation simply remains ƒ(x), since no h is present.
It can thus be shown, in an informal way, that ƒ(x) = A′(x). That is, the derivative of the area function A(x) is the original function ƒ(x); or, the area function is simply an antiderivative of the original function.
Computing the derivative of a function and “finding the area” under its curve are "opposite" operations. This is the crux of the Fundamental Theorem of Calculus. Most of the theorem's proof is devoted to showing that the area function A(x) exists in the first place.
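The same intuition can be checked numerically. The following short Python sketch (an added illustration; the function ƒ(x) = x² is an assumed example) approximates the area function A(x) by a fine Riemann sum and shows that the difference quotient (A(x + h) − A(x))/h approaches ƒ(x) as h shrinks.

    # Illustrative sketch: A(x) is approximated by a Riemann sum, and the
    # difference quotient (A(x + h) - A(x)) / h is compared with f(x).
    def f(x):
        return x * x                # assumed example function

    def area(x, n=100_000):
        # Approximate area under f between 0 and x (left Riemann sum).
        dx = x / n
        return sum(f(i * dx) * dx for i in range(n))

    x = 1.5
    for h in (0.1, 0.01, 0.001):
        print(h, (area(x + h) - area(x)) / h)   # tends toward f(1.5) = 2.25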
Formal statements
There are two parts to the Fundamental Theorem of Calculus. Loosely put, the first part deals with the derivative of an antiderivative, and the second part deals with the relationship between antiderivatives and definite integrals.
First part
This part of the theorem is sometimes referred to as the First Fundamental Theorem of Calculus.[4]
A real-valued function F is defined on a closed interval [a, b] by setting, for all x in [a, b],

    F(x) = ∫_a^x ƒ(t) dt,

where ƒ is a real-valued function continuous on [a, b]. Then F is continuous on [a, b], differentiable on the open interval (a, b), and

    F′(x) = ƒ(x)

for all x in (a, b).
For a given f(t), define the function F(x) as

    F(x) = ∫_a^x f(t) dt.

For any two numbers x1 and x1 + Δx in [a, b], we have

    F(x1) = ∫_a^x1 f(t) dt

and

    F(x1 + Δx) = ∫_a^(x1 + Δx) f(t) dt.

Subtracting the two equations gives

    F(x1 + Δx) − F(x1) = ∫_a^(x1 + Δx) f(t) dt − ∫_a^x1 f(t) dt.     (1)
It can be shown that

    ∫_a^(x1 + Δx) f(t) dt = ∫_a^x1 f(t) dt + ∫_x1^(x1 + Δx) f(t) dt.

This makes clear that the two integrals over adjacent intervals can be added (see the properties of integrals). Substituting the above into (1) results in

    F(x1 + Δx) − F(x1) = ∫_x1^(x1 + Δx) f(t) dt.     (2)
According to the mean value theorem for integration, there exists a c in [x1, x1 + Δx] such that

    ∫_x1^(x1 + Δx) f(t) dt = f(c) Δx.

Substituting the above into (2) we get

    F(x1 + Δx) − F(x1) = f(c) Δx.
Dividing both sides by Δx gives

    (F(x1 + Δx) − F(x1)) / Δx = f(c).

Notice that the expression on the left side of the equation is Newton's difference quotient for F at x1.
Take the limit as Δx → 0 on both sides of the equation:

    lim_(Δx→0) (F(x1 + Δx) − F(x1)) / Δx = lim_(Δx→0) f(c).

The expression on the left side of the equation is the definition of the derivative of F at x1, so

    F′(x1) = lim_(Δx→0) f(c).     (3)
To find the other limit, we will use the squeeze theorem. The number c is in the interval [x1, x1 + Δx], so x1 ≤ c ≤ x1 + Δx. Also,

    lim_(Δx→0) x1 = x1

and

    lim_(Δx→0) (x1 + Δx) = x1.

Therefore, according to the squeeze theorem,

    lim_(Δx→0) c = x1.

Substituting into (3), we get

    F′(x1) = lim_(c→x1) f(c).
The function f is continuous at x1, so the limit can be taken inside the function. Therefore, we get

    F′(x1) = f(x1),

which completes the proof. (Leithold et al., 1996)
Corollary
The fundamental theorem is often employed to compute the definite integral of a function ƒ for which an antiderivative F is known. Specifically, if ƒ is a real-valued continuous function on [a, b], and F is an antiderivative of ƒ in [a, b], then

    ∫_a^b ƒ(x) dx = F(b) − F(a).
The corollary assumes continuity on the whole interval. This result is strengthened slightly in the following theorem.
Let

    F(x) = ∫_a^x ƒ(t) dt,

with ƒ continuous on [a, b]. If g is an antiderivative of ƒ, then g and F have the same derivative, by the first part of the theorem. It follows that there is a number c such that F(x) = g(x) + c for all x in [a, b]. Letting x = a,

    F(a) = ∫_a^a ƒ(t) dt = 0 = g(a) + c,

which means c = −g(a). In other words, F(x) = g(x) − g(a), and so

    ∫_a^b ƒ(x) dx = F(b) = g(b) − g(a).
Second part
This part is sometimes referred to as the Second Fundamental Theorem of Calculus[5] or the Newton–Leibniz Axiom.
Let ƒ be a real-valued function defined on a closed interval [a, b] that admits an antiderivative F on [a, b]. That is, ƒ and F are functions such that for all x in [a, b],

    ƒ(x) = F′(x).

If ƒ is integrable on [a, b], then

    ∫_a^b ƒ(x) dx = F(b) − F(a).
Because it does not assume that ƒ is continuous, the Second Part is slightly stronger than the Corollary.
When an antiderivative F exists, there are infinitely many antiderivatives for ƒ, obtained by adding an arbitrary constant to F (which is lost in differentiation). Also, by the first part of the theorem, antiderivatives of ƒ always exist when ƒ is continuous.
This is a limit proof by Riemann sums. Let ƒ be (Riemann) integrable on the interval [a, b], and let ƒ admit an antiderivative F on [a, b]. Begin with the quantity F(b) − F(a). Let there be numbers x1, ..., xn such that

    a = x0 < x1 < x2 < ... < xn−1 < xn = b.

It follows that

    F(b) − F(a) = F(xn) − F(x0).
Now, we add each F(xi) along with its additive inverse, so that the resulting quantity is equal:

    F(b) − F(a) = F(xn) + [−F(xn−1) + F(xn−1)] + ... + [−F(x1) + F(x1)] − F(x0).

The above quantity can be written as the following sum:

    F(b) − F(a) = Σ_(i=1)^n [F(xi) − F(xi−1)].     (1)
Next we will employ the mean value theorem. Stated briefly: let F be continuous on the closed interval [a, b] and differentiable on the open interval (a, b). Then there exists some c in (a, b) such that

    F′(c) = (F(b) − F(a)) / (b − a).

It follows that

    F′(c)(b − a) = F(b) − F(a).
The function F is differentiable on the interval [a, b]; therefore, it is also differentiable and continuous on each subinterval [xi−1, xi]. According to the mean value theorem (above), for each i there exists a ci in (xi−1, xi) such that

    F(xi) − F(xi−1) = F′(ci)(xi − xi−1).

Substituting the above into (1), we get

    F(b) − F(a) = Σ_(i=1)^n F′(ci)(xi − xi−1).
The assumption implies F′(ci) = f(ci). Also, xi − xi−1 can be expressed as Δxi of partition i, so that

    F(b) − F(a) = Σ_(i=1)^n f(ci) Δxi.     (2)
A converging sequence of Riemann sums. The numbers in the upper right are the areas of the grey rectangles. They converge to the integral of the function.
Notice that we are describing the area of a rectangle, with the width times the height, and we are adding the areas together. Each rectangle, by virtue of the mean value theorem, describes an approximation of the curve section it is drawn over. Also notice that Δxi need not be the same for all values of i, or in other words that the width of the rectangles can differ. What we have to do is approximate the curve with n rectangles. Now, as the size of the partitions gets smaller and n increases, resulting in more partitions to cover the space, we get closer and closer to the actual area under the curve.
By taking the limit of the expression as the norm of the partitions approaches zero, we arrive at the Riemann integral. We know that this limit exists because ƒ was assumed to be integrable. That is, we take the limit as the largest of the partitions approaches zero in size, so that all other partitions are smaller and the number of partitions approaches infinity.
So, we take the limit on both sides of (2). This gives us

    lim_(||Δ||→0) [F(b) − F(a)] = lim_(||Δ||→0) Σ_(i=1)^n f(ci) Δxi.
Neither F(b) nor F(a) is dependent on ||Δ||, so the limit on the left side remains F(b) − F(a).
The expression on the right side of the equation defines the integral over ƒ from a to b. Therefore, we obtain

    F(b) − F(a) = ∫_a^b ƒ(x) dx,

which completes the proof.
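The limiting process in this proof can be observed numerically. The sketch below (an added illustration; ƒ(x) = cos x with antiderivative F(x) = sin x is an assumed example) shows Riemann sums over ever finer partitions approaching F(b) − F(a).

    import math

    # Illustrative sketch: Riemann sums of f over [a, b] approach F(b) - F(a)
    # for the assumed example f(x) = cos(x), F(x) = sin(x).
    f = math.cos
    a, b = 0.0, math.pi / 2

    for n in (10, 100, 1_000, 10_000):
        dx = (b - a) / n
        riemann = sum(f(a + i * dx) * dx for i in range(n))   # left Riemann sum
        print(n, riemann)                                     # approaches 1.0

    print(math.sin(b) - math.sin(a))                          # F(b) - F(a) = 1.0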
It almost looks like the first part of the theorem follows directly from the second, because the equation

    ∫_a^x ƒ(t) dt = g(x) − g(a),

where g is an antiderivative of ƒ, implies that

    F(x) = ∫_a^x ƒ(t) dt

has the same derivative as g, and therefore F′ = ƒ. This argument only works if we already know that ƒ has an antiderivative, and the only way we know that all continuous functions have antiderivatives is by the first part of the Fundamental Theorem.[6] For example, if ƒ(x) = e^(−x²), then ƒ has an antiderivative, namely

    g(x) = ∫_0^x e^(−t²) dt,

and there is no simpler expression for this function. It is therefore important not to interpret the second part of the theorem as the definition of the integral. Indeed, there are many functions that are integrable but lack antiderivatives that can be written as an elementary function. Conversely, many functions that have antiderivatives are not Riemann integrable (see Volterra's function).
Examples
As an example, suppose the following is to be calculated:

    ∫_2^5 x dx.

Here, ƒ(x) = x and we can use F(x) = x²/2 as the antiderivative. Therefore:

    ∫_2^5 x dx = F(5) − F(2) = 25/2 − 4/2 = 21/2 = 10.5.

Or, more generally, suppose that

    d/dx ∫_0^x t³ dt

is to be calculated. Here, ƒ(t) = t³ and F(t) = t⁴/4 can be used as the antiderivative. Therefore:

    ∫_0^x t³ dt = F(x) − F(0) = x⁴/4.

Or, equivalently,

    d/dx ∫_0^x t³ dt = ƒ(x) = x³.
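For readers who want to check such computations mechanically, here is a brief Python sketch (an added illustration, using the first example above, ƒ(x) = x on [2, 5]): it evaluates the antiderivative at the endpoints and compares the result with a fine numerical approximation of the integral.

    # Illustrative check of the example integral of f(x) = x over [2, 5].
    def f(x):
        return x

    def F(x):
        return x * x / 2.0          # an antiderivative of f

    a, b, n = 2.0, 5.0, 1_000_000
    dx = (b - a) / n
    numeric = sum(f(a + i * dx) * dx for i in range(n))   # Riemann approximation

    print(F(b) - F(a))              # 10.5, by the fundamental theorem
    print(numeric)                  # close to 10.5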
Generalizations
We do not need to assume continuity of ƒ on the whole interval. Part I of the theorem then says: if ƒ is any Lebesgue integrable function on [a, b] and x0 is a number in [a, b] such that ƒ is continuous at x0, then

    F(x) = ∫_a^x ƒ(t) dt

is differentiable for x = x0 with F′(x0) = ƒ(x0). We can relax the conditions on ƒ still further and suppose that it is merely locally integrable. In that case, we can conclude that the function F is differentiable almost everywhere and F′(x) = ƒ(x) almost everywhere. On the real line this statement is equivalent to Lebesgue's differentiation theorem. These results remain true for the Henstock–Kurzweil integral, which allows a larger class of integrable functions (Bartle 2001, Thm. 4.11).
In higher dimensions Lebesgue's differentiation theorem generalizes the Fundamental theorem of calculus by stating that for almost every x, the average value of a function ƒ over a ball of radius r centered at x will tend to ƒ(x) as r tends to 0.
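A one-dimensional numerical sketch of this statement (an added illustration; ƒ(x) = eˣ and the point x = 0.7 are assumed examples): the average of ƒ over the interval [x − r, x + r] approaches ƒ(x) as r tends to 0.

    import math

    # Illustrative sketch: the average of f over [x - r, x + r] tends to f(x).
    f = math.exp                    # assumed example function
    x = 0.7

    def average(r, n=10_000):
        # Average value of f over the interval of radius r centered at x (midpoint rule).
        dx = 2.0 * r / n
        return sum(f(x - r + (i + 0.5) * dx) for i in range(n)) / n

    for r in (1.0, 0.1, 0.01, 0.001):
        print(r, average(r))        # approaches f(0.7)

    print(f(x))                     # the limiting value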
Part II of the theorem is true for any Lebesgue integrable function ƒ which has an antiderivative F (not all integrable functions do, though). In other words, if a real function F on [a, b] admits a derivative ƒ(x) at every point x of [a, b] and if this derivative ƒ is Lebesgue integrable on [a, b], then

    F(b) − F(a) = ∫_a^b ƒ(t) dt     (Rudin 1987, Thm. 7.21).
This result may fail for continuous functions F that admit a derivative ƒ(x) at almost every point x, as the example of the Cantor function shows. But the result remains true if F is absolutely continuous: in that case, F admits a derivative ƒ(x) at almost every point x and, as in the formula above, F(b) − F(a) is equal to the integral of ƒ on [a, b].
The conditions of this theorem may again be relaxed by considering the integrals involved as Henstock–Kurzweil integrals. Specifically, if a continuous function F(x) admits a derivative ƒ(x) at all but countably many points, then ƒ(x) is Henstock–Kurzweil integrable and F(b) − F(a) is equal to the integral of ƒ on [a, b]. The difference here is that the integrability of ƒ does not need to be assumed (Bartle 2001, Thm. 4.7).
The version of Taylor's theorem which expresses the error term as an integral can be seen as a generalization of the Fundamental Theorem.
There is a version of the theorem for complex functions: suppose U is an open set in C and ƒ : U → C is a function which has a holomorphic antiderivative F on U. Then for every curve γ : [a, b] → U, the curve integral can be computed as

    ∫_γ ƒ(z) dz = F(γ(b)) − F(γ(a)).
The fundamental theorem can be generalized to curve and surface integrals in higher dimensions and on manifolds.
One of the most powerful statements in this direction is Stokes' theorem: Let M be an oriented piecewise smooth manifold of dimension n and let ω be an (n−1)-form that is a compactly supported differential form on M of class C¹. If ∂M denotes the boundary of M with its induced orientation, then

    ∫_M dω = ∫_∂M ω.
Here d is the exterior derivative, which is defined using the manifold structure only.
The theorem is often used in situations where M is an embedded oriented submanifold of some bigger manifold on which the form ω is defined.
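As a concrete low-dimensional instance, Stokes' theorem on a plane region reduces to Green's theorem. The Python sketch below (an added illustration; the region, the unit square, and the form ω = P dx + Q dy with assumed P(x, y) = −y² and Q(x, y) = xy are examples chosen for illustration) compares the boundary integral of ω with the integral of dω over the interior; both come out to 1.5.

    # Illustrative check of Stokes' theorem in the plane (Green's theorem) on the
    # unit square, with omega = P dx + Q dy, P = -y**2, Q = x*y.
    # Then d(omega) corresponds to (dQ/dx - dP/dy) dx dy = 3*y dx dy.
    def P(x, y):
        return -y * y

    def Q(x, y):
        return x * y

    n = 400
    h = 1.0 / n

    # Integral of (dQ/dx - dP/dy) = 3*y over the square (midpoint rule).
    interior = sum(3.0 * ((j + 0.5) * h) * h * h
                   for i in range(n) for j in range(n))

    # Line integral of P dx + Q dy counterclockwise around the boundary.
    boundary = 0.0
    for k in range(n):
        t = (k + 0.5) * h
        boundary += P(t, 0.0) * h            # bottom edge: dx = +h, dy = 0
        boundary += Q(1.0, t) * h            # right edge:  dx = 0,  dy = +h
        boundary += P(1.0 - t, 1.0) * (-h)   # top edge:    dx = -h, dy = 0
        boundary += Q(0.0, 1.0 - t) * (-h)   # left edge:   dx = 0,  dy = -h

    print(interior, boundary)                # both approximately 1.5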
Notes
- ^ More exactly, the theorem deals with definite integration with variable upper limit and arbitrarily selected lower limit. This particular kind of definite integration allows us to compute one of the infinitely many antiderivatives of a function (except for those which do not have a zero). Hence, it is almost equivalent to indefinite integration, defined by most authors as an operation which yields any one of the possible antiderivatives of a function, including those without a zero.
- ^ Spivak, Michael (1980), Calculus (2nd ed.), Houston, Texas: Publish or Perish Inc.
- ^ See, e.g., Marlow Anderson, Victor J. Katz, Robin J. Wilson, Sherlock Holmes in Babylon and Other Tales of Mathematical History, Mathematical Association of America, 2004, p. 114.
- ^ Apostol 1967, §5.1
- ^ Apostol 1967, §5.3
- ^ Spivak, Michael (1980), Calculus (2nd ed.), Houston, Texas: Publish or Perish Inc.
References
- Apostol, Tom M. (1967), Calculus, Vol. 1: One-Variable Calculus with an Introduction to Linear Algebra (2nd ed.), New York: John Wiley & Sons, ISBN 978-0-471-00005-1.
- Bartle, Robert (2001), A Modern Theory of Integration, AMS, ISBN 0821808451.
- Larson, Ron; Edwards, Bruce H.; Heyd, David E. (2002), Calculus of a single variable (7th ed.), Boston: Houghton Mifflin Company.
- Leithold, L. (1996), The calculus of a single variable (6th ed.), New York: HarperCollins College Publishers.
- Malet, A. (1989), Studies on James Gregorie (1638–1675), PhD Thesis, Princeton.
- Rudin, Walter (1987), Real and Complex Analysis (3rd ed.), New York: McGraw-Hill Book Co.
- Stewart, J. (2003), "Fundamental Theorem of Calculus", Calculus: early transcendentals, Belmont, California: Thomson/Brooks/Cole.
- Turnbull, H. W., ed. (1939), The James Gregory Tercentenary Memorial Volume, London.
- Spivak, Michael (1980), Calculus (2nd ed.), Houston, Texas: Publish or Perish Inc.