3.10 Multiple Parameters
Sometimes the proper analysis for an algorithm requires multiple parameters to describe the cost. To illustrate the concept, consider an algorithm to compute the rank ordering for counts of all pixel values in a picture. Pictures are often represented by a two-dimensional array, and a pixel is one cell in the array. The value of a pixel is either the code value for the color, or a value for the intensity of the picture at that pixel. Assume that each pixel can take any integer value in the range 0 to . The problem is to find the number of pixels of each color value and then sort the color values with respect to the number of times each value appears in the picture. Assume that the picture is a rectangle with pixels. A pseudocode algorithm to solve the problem follows.
// Initialize the counts:
for i = 0 to C-1:
= 0
count[i] // Increment the pixel value count for each of the pixels:
for i = 0 to P-1:
= count[value(i)]+1
count[value(i)] // Sort the pixel value counts:
sort(count)
In this example, count
is an array of size
C
that stores the number of pixels for each color value.
Function value(i)
returns the color value for pixel
.
The time for the first for
loop (which initializes
count
) is based on the number of colors,
.
The time for the second loop (which determines the number of pixels with
each color) is
.
The time for the final line, the call to sort
, depends on
the cost of the sorting algorithm used. We will assume that the sorting
algorithm has cost
if
items are sorted, thus yielding
as the total algorithm cost.
Is this a good representation for the cost of this algorithm? What is actually being sorted? It is not the pixels, but rather the colors. What if is much smaller than ? Then the estimate of is pessimistic, because much fewer than items are being sorted. Instead, we should use as our analysis variable for steps that look at each pixel, and as our analysis variable for steps that look at colors. Then we get for the initialization loop, for the pixel count loop, and for the sorting operation. This yields a total cost of .
Why can we not simply use the value of for input size and say that the cost of the algorithm is ? Because, is typically much less than . For example, a picture might have 1000 1000 pixels and a range of 256 possible colors. So, is one million, which is much larger than . But, if is smaller, or larger (even if it is still less than ), then can become the larger quantity. Thus, neither variable should be ignored.