Chain Rule Definition: What Is It?

In summary: It states that if a function has continuous partial derivatives, then it can be written as a composition of functions, where each function depends on only one of the variables. In summary, the chain rule is an elementary rule of calculus that allows for the calculation of derivatives and integrals of composite functions. It states that the rate of change of a function composed of other functions is equal to the product of the rates of change of its individual components. This rule is important in various mathematical concepts, such as the theorem of implicit functions and integration by substitution.
  • #1
19,480
10,167
Definition/Summary

The chain rule is an elementary rule of calculus, but it can be understood without any knowledge of calculus:

If [itex]a[/itex] depends on [itex]b[/itex], and [itex]b[/itex] depends on [itex]c[/itex], then the rate at which a changes with respect to b times the rate at which b changes with respect to c equals the rate at which a changes with respect to c.

In other words: small change in a per small change in b times small change in b per small change in c equals small change in a per small change in c.

In symbols, derivatives may be treated as ordinary fractions: two copies of [itex]db[/itex] may be canceled in [itex]da/db\ db/dc = da/dc[/itex] (if you don't understand calculus, then read "d" as meaning "a small change in").

For example, if pressure depends on length, and length depends on temperature, then the speed at which pressure changes when you change the temperature equals the speed at which pressure changes when you change the length times the speed at which length changes when you change the temperature.

Equations

[tex]\frac{da}{dc}\ =\ \frac{da}{db}\,\frac{db}{dc}[/tex]

[tex](a\circ b)' = (a'\circ b)b'\text{, ie }(a(b(c)))' = a'(b(c))b'(c)[/tex]

Partial derivative version (if [itex]a[/itex] depends on [itex]b_1,\cdots b_n[/itex], and [itex]b_1,\cdots b_n[/itex] depend only on [itex]c[/itex]):

[tex]\frac{da}{dc}\ =\ \frac{\partial a}{\partial b_1}\frac{db_1}{dc}\ +\ \cdots \frac{\partial a}{\partial b_n}\frac{db_n}{dc}\ =\ (\mathbf{\nabla_b}\,a)\cdot \frac{d\mathbf{b}}{dc}[/tex]

Partial derivative version (if [itex]a[/itex] depends on [itex]b_1,\cdots b_n[/itex], and [itex]b_1,\cdots b_n[/itex] depend on [itex]c_1,\cdots c_m[/itex]):

[tex]\frac{\partial a}{\partial c_i}\ =\ \sum_{j\ =\ 1}^n\frac{\partial a}{\partial b_j} \frac{\partial b_j}{\partial c_i} \text{, for }i\ =\ 1,\cdots m [/tex]

This is the same as the saying that the gradient vector of the composite function is the matrix product of the gradient vector and the Jacobian matrix:

[tex](\mathbf{\nabla_c}\,a)^T\ =\ \frac{\partial a}{\partial (b_1,\cdots b_n)}\frac{\partial (b_1\cdots b_n)}{\partial (c_1,\cdots c_m)}\ =\ (\mathbf{\nabla_b}\,a)^T\frac{\partial (b_1,\cdots b_n)}{\partial (c_1,\cdots c_m)}[/tex]

Similarly, for a vector [itex](a_1,\cdots a_k)[/itex], the Jacobian matrix of the composite function is the matrix product of the two individual Jacobians:

[tex]\frac{\partial (a_1,\cdots a_k)}{\partial (c_1,\cdots c_m)}\ =\ \frac{\partial (a_1,\cdots a_k)}{\partial (b_1,\cdots b_n)}\frac{\partial (b_1,\cdots b_n)}{\partial (c_1,\cdots c_m)}[/tex]

Extended explanation

Integration by substitution:

Substitution (used in evaluating integrals, or in solving differential equations) involves applying the chain rule to replace a "d" term (a differential).

We may write [tex]db\ =\ \frac{db}{dc}\,dc\text{ .}[/tex]

For example, if [itex]b = c^2[/itex], then [itex]db/dc\ =\ 2c\text{, or }db\ =\ 2c\,dc[/itex], and so [itex]\int 2c\,\sin (c^2)\,dc\ =\ \,\int\,\sin b\,db[/itex][itex]\ =\ -\cos b\,+\,\text{constant}\ =\ -\cos c^2\,+\,\text{constant}[/itex]

* This entry is from our old Library feature. If you know who wrote it, please let us know so we can attribute a writer. Thanks!
 
Mathematics news on Phys.org
  • #2
The theorem of implicit functions is one of the most important theorems in this context.
 

Related to Chain Rule Definition: What Is It?

1. What is the chain rule in mathematics?

The chain rule is a rule in calculus that allows you to find the derivative of a composite function, which is a function that is made up of two or more other functions. It is used to find the rate of change of a dependent variable with respect to an independent variable.

2. How do you apply the chain rule?

To apply the chain rule, you first need to identify the composite function and its component functions. Then, you need to take the derivative of the outer function, substitute in the inner function, and multiply it by the derivative of the inner function. This will give you the derivative of the composite function.

3. What is the purpose of the chain rule?

The chain rule is an essential tool in calculus that allows us to find the derivative of complicated functions. It is particularly useful when dealing with functions that involve nested functions, such as trigonometric, logarithmic, or exponential functions.

4. Can you give an example of the chain rule in action?

Sure, let's say we have the function f(x) = (3x + 5)^2. To find the derivative of this function, we can use the chain rule by first identifying the outer function as (3x + 5)^2 and the inner function as 3x + 5. The derivative of the outer function is 2(3x + 5), and the derivative of the inner function is 3. Therefore, the derivative of f(x) is (2(3x + 5))(3) = 6(3x + 5).

5. How does the chain rule relate to other differentiation rules?

The chain rule is a fundamental rule in calculus and is closely related to other differentiation rules. It is essentially a combination of the power rule and the product rule. Additionally, the chain rule is also related to the quotient rule and the inverse function rule, as these can also be used to find the derivative of composite functions.

Back
Top