3.15 Solving Recurrence Relations
Note: This section is work in progress
Recurrence relations are often used to model the cost of recursive functions. For example, the standard Mergesort takes a list of size , splits it in half, performs Mergesort on each half, and finally merges the two sublists in steps. The cost for this can be modeled as
In other words, the cost of the algorithm on input of size is two times the cost for input of size (due to the two recursive calls to Mergesort) plus (the time to merge the sublists together again).
There are many approaches to solving recurrence relations, and we briefly consider three here. The first is an estimation technique: Guess the upper and lower bounds for the recurrence, use induction to prove the bounds, and tighten as required. The second approach is to expand the recurrence to convert it to a summation and then use summation techniques. The third approach is to take advantage of already proven theorems when the recurrence is of a suitable form. In particular, typical divide-and-conquer algorithms such as Mergesort yield recurrences of a form that fits a pattern for which we have a ready solution.
3.15.1 Estimating Upper and Lower Bounds
The first approach to solving recurrences is to guess the answer and then attempt to prove it correct. If a correct upper or lower bound estimate is given, an easy induction proof will verify this fact. If the proof is successful, then try to tighten the bound. If the induction proof fails, then loosen the bound and try again. Once the upper and lower bounds match, you are finished. This is a useful technique when you are only looking for asymptotic complexities. When seeking a precise closed-form solution (i.e., you seek the constants for the expression), this method will probably be too much work.
Example: Mergesort
Use the guessing technique to find the asymptotic bounds for Mergesort, whose running time is described by the equation
We begin by guessing that this recurrence has an upper bound in . To be more precise, assume that
We prove this guess is correct by induction. In this proof, we assume that is a power of two, to make the calculations easy. For the base case, . For the induction step, we need to show that implies that for . The induction hypothesis is
It follows that
which is what we wanted to prove. Thus, is in .
Is a good estimate? In the next-to-last step we went from to the much larger . This suggests that is a high estimate. If we guess something smaller, such as for some constant , it should be clear that this cannot work because and there is no room for the extra cost to join the two pieces together. Thus, the true cost must be somewhere between and .
Let us now try . For the base case, the definition of the recurrence sets . Assume (induction hypothesis) that . Then,
which is what we seek to prove. In similar fashion, we can prove that is in . Thus, is also .
Example: Factorial function
We know that the factorial function grows exponentially. How does it compare to ? To ? Do they all grow βequally fastβ (in an asymptotic sense)? We can begin by looking at a few initial terms.
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | |
---|---|---|---|---|---|---|---|---|---|
2 | 4 | 8 | 16 | 32 | 64 | 128 | 256 | 512 | |
1 | 2 | 6 | 24 | 120 | 720 | 5040 | 40320 | 362880 | |
1 | 4 | 9 | 256 | 3125 | 46656 | 823543 | 16777216 | 387420489 |
We can also look at these functions in terms of their recurrences.
At this point, our intuition should be telling us pretty clearly the relative growth rates of these three functions. But how do we prove formally which grows the fastest? And how do we decide if the differences are significant in an asymptotic sense, or just constant factor differences?
We can use logarithms to help us get an idea about the relative growth rates of these functions. Clearly, . Equally clearly, . We can easily see from this that is , that is, grows asymptotically faster than .
How does fit into this? We can again take advantage of logarithms. Obviously , so we know that is . But what about a lower bound for the factorial function? Consider the following.
Therefore
In other words, is in . Thus, .
Note that this does not mean that . Because , it follows that but . The log function often works as a βflattenerβ when dealing with asymptotics. That is, whenever is in we know that is in . But knowing that does not necessarily mean that .
Example: Fibonacci sequence
What is the growth rate of the Fibonacci sequence? We define the Fibonacci sequence as for ; .
In this case it is useful to compare the ratio of to . The following table shows the first few values.
1 | 2 | 3 | 4 | 5 | 6 | 7 | |
---|---|---|---|---|---|---|---|
1 | 2 | 3 | 5 | 8 | 13 | 21 | |
1 | 2 | 1.5 | 1.666 | 1.625 | 1.615 | 1.619 |
If we continue for more terms, the ratio appears to converge on a value slightly greater then 1.618. Assuming really does converge to a fixed value as grows, we can determine what that value must be.
for some value . This follows from the fact that . We divide by to make the second term go away, and we also get something useful in the first term. Remember that the goal of such manipulations is to give us an equation that relates to something without recursive calls.
For large , we also observe that:
as gets big. This comes from multiplying by and rearranging.
If exists, then . Using the quadratic equation, the only solution greater than one is
This expression also has the name . What does this say about the growth rate of the Fibonacci sequence? It is exponential, with . More precisely, converges to
3.15.2 Expanding Recurrences
Estimating bounds is effective if you only need an approximation to the answer. More precise techniques are required to find an exact solution. One approach is called expanding the recurrence. In this method, the smaller terms on the right side of the equation are in turn replaced by their definition. This is the expanding step. These terms are again expanded, and so on, until a full series with no recurrence results. This yields a summation, and techniques for solving summations can then be used.
Example: Building a heap
Our next example models the cost of the algorithm to build a heap. You should recall that to build a heap, we first heapify the two subheaps, then push down the root to its proper position. The cost is:
Let us find a closed form solution for this recurrence. We can expand the recurrence a few times to see that
We can deduce from this expansion that this recurrence is equivalent to following summation and its derivation:
3.15.3 Divide-and-Conquer Recurrences
The third approach to solving recurrences is to take advantage of known theorems that provide the solution for classes of recurrences. Of particular practical use is a theorem that gives the answer for a class known as divide-and-conquer recurrences. These have the form
where , , , and are constants. In general, this recurrence describes a problem of size divided into subproblems of size , while is the amount of work necessary to combine the partial solutions. Mergesort is an example of a divide and conquer algorithm, and its recurrence fits this form. So does binary search. We use the method of expanding recurrences to derive the general solution for any divide and conquer recurrence, assuming that .
Here is a more visual presentation of this same derivation.
So, we are left with this result:
At this point, it is useful to note that
This gives us
The summation part of this equation is a geometric series whose sum depends on the ratio . There are three cases.
. From Equation (4) of Module summation,
Thus,
. Because , we know that . From the definition of logarithms it follows immediately that . Also note that since we defined , then . Thus,
Because , we have
. From Equation (5) of Module summation,
Thus,
We can summarize the above derivation as the following theorem, sometimes referred to as the Master Theorem.
Theorem: The Master Theorem
The Master Theorem: For any recurrence relation of the form , the following relationships hold.
This theorem may be applied whenever appropriate, rather than re-deriving the solution for the recurrence.
Example: Using the Master Theorem
Apply the Master Theorem to solve
Because , , , and , we find that . Applying case (3) of the theorem, .
Example: Master Theorem for Mergesort
Use the Master Theorem to solve the recurrence relation for Mergesort:
Because , , , and , we find that . Applying case (2) of the theorem, .
3.15.4 Average-Case Analysis of Quicksort
In Module Quicksort, we determined that the average-case analysis of Quicksort had the following recurrence:
The term is an upper bound on the findpivot and partition steps. This equation comes from assuming that the partitioning element is equally likely to occur in any position . It can be simplified by observing that the two recurrence terms and are equivalent, because one simply counts up from to while the other counts down from to . This yields
This form is known as a recurrence with full history. The key to solving such a recurrence is to cancel out the summation terms. The shifting method for summations provides a way to do this. Multiply both sides by and subtract the result from the formula for :
Subtracting from both sides yields:
At this point, we have eliminated the summation and can now use our normal methods for solving recurrences to get a closed-form solution. Note that , so we can simplify the result. Expanding the recurrence, we get
for , the Harmonic Series. From Equation (10) of Module summation, , so the final solution is .