For as long as I can remember, I've been fascinated by the issue of understanding.
As long as I don't ask myself to define my terms too finely, I can convince myself that I sort of understand how we understand. Surely, I tell myself, understanding is a process of reducing surface complexity without losing the information in the depths. Identifying minimal-surface-area packing schemes for information. Something like that. And I even think I know some of the techniques we use when we attempt to understand something.
There's abstraction, there's seeing parallels, there's organization. In particular, we discover organization in - or impose organization on - the data: We rank and file the raw phenomena, we identify the properties and impose a taxonomy. This month's column is about an effort to taxonomize algorithms for parallel processing.
Imposing a Property Taxonomy

Two years ago computer scientists from several universities organized a workshop in Santa Fe, New Mexico, to discuss the possibilities of developing a taxonomy of parallel algorithms. What the organizers came away with from the workshop was not a taxonomy but an increased respect for the size of the taxonomic task.
A neat array of pigeonholes in which to put parallel algorithms, the workshoppers concluded, was an edifice that the field of computer science was not yet ready to erect. First some spadework would have to be done. There were at least these preliminary problems:

1. Identifying the properties or attributes of algorithms that actually had something to do with parallelism.

2. Untangling the relationships among problem, algorithm, and architecture in parallel processing.

3. Clearing away some of the inconsistencies in terminology that arose from the fact that parallel processing was really several different paradigms.
Some workshops produce proceedings. (And some publish the proceedings before the workshop has taken place; these documents should perhaps be called precedings. But I digress.) This workshop brought forth, after a gestational hiatus, an interesting book, The Characteristics of Parallel Algorithms, Leah H. Jamieson, Dennis B. Gannon, and Robert J. Douglass, editors; The MIT Press, Cambridge, Mass., 1987. The book is not the patchwork proceedings of a workshop, but a well-organized building plan for a taxonomy of parallel algorithms.
The book consists of three sections: The first attempts to identify characteristics of algorithms relevant to parallel processing, the second examines the characteristics of concurrency from the point of view of applications, and the third looks at how a taxonomy of parallel algorithms would affect the organization of software engineering tools for the programmer. I'll discuss here some of what the authors attempt in the first section.
The book opens at a paradigmatic level. In the past this column has generously given out several definitions for the word paradigm, and will not hold back on offering another: For present purposes let's let the word paradigm - or better, the expression the paradigmatic challenge - refer to the mapping of problems onto algorithms and architectures. That's a little ethereal, but the book is not all theory. It digs into the analysis of parallel algorithms, the characterization of parallel algorithms and architectures, and the benchmarking of parallel architectures.
Philip Nelson and Lawrence Snyder of the University of Washington open the book with a discussion of the importance of the paradigmatic perspective for getting some understanding of parallel processing. They say "In the case of parallel computation... paradigms generally encapsulate information about useful communication patterns." In other words, the paradigm selected specifies the useful patterns of communication, and efficient communication is precisely the problem in parallel processing architecture.
Three Paradigms

Nelson and Snyder consider three broad programming paradigms: divide-and-conquer, compute-aggregate-broadcast (CAB), and the systolic and pipelined paradigm.
Divide-and-conquer was probably first cogently described as a programming paradigm by Robert Floyd in his Turing Award lecture, "The Paradigms of Programming." (You can read Floyd's description in ACM Turing Award Lectures: The First Twenty Years: 1966-1985, ACM Press/Addison-Wesley, 1987.) The idea is that the problem is divided up into two or more smaller problems to be solved independently, and the solutions are combined. If the smaller problems are just smaller versions of the original problem, the paradigm is recursion. Nelson and Snyder shed new light on divide-and-conquer as a parallel processing paradigm and attempt to identify just how it differs from other parallel processing paradigms.
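The shape of the paradigm can be sketched in a few lines. Here is a minimal divide-and-conquer illustration of my own (not an example from the book): summing a list by splitting it in half, solving each half independently, and combining the results. The two halves could be handed to separate processors; here they run sequentially for clarity.

```python
def dc_sum(xs):
    # Base case: a trivially small problem is solved directly.
    if len(xs) <= 1:
        return xs[0] if xs else 0
    mid = len(xs) // 2
    left = dc_sum(xs[:mid])    # a smaller version of the same problem...
    right = dc_sum(xs[mid:])   # ...solved independently of the other half
    return left + right        # combine the sub-solutions

print(dc_sum(list(range(10))))  # 45
```

Because the subproblems are smaller versions of the original, this is also an instance of recursion, just as Floyd's description suggests.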
One conclusion they draw is that the divide-and-conquer paradigm is often (although not always) characterized by a binary n-cube communication structure. (Of course, given a predisposition to see communication structure as the problem in parallel processing, it is not surprising that it is communication structures they pull out as the distinguishing marks of individual parallel processing approaches. I'm not suggesting that Nelson and Snyder are wrong in this, but just pointing out one example of how your paradigm conditions the questions you think to ask.)
Algorithms in the CAB paradigm have three phases: a compute phase; an aggregate phase, in which local values are combined from processes into (fewer) global values; and a broadcast phase, which returns global values to the processes. The authors look at several such algorithms, but I was unable to find a general conclusion that they drew about the paradigm. Maybe I just missed it.
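The three phases can be made concrete with a toy CAB round, sketched sequentially; the names and the choice of sum-as-aggregation are my own illustration, not the book's.

```python
def cab_round(local_data):
    # Compute phase: each "process" works on its own chunk of data.
    local_values = [sum(chunk) for chunk in local_data]
    # Aggregate phase: many local values combine into (here, one) global value.
    global_value = sum(local_values)
    # Broadcast phase: the global value returns to every process.
    return [global_value for _ in local_data]

print(cab_round([[1, 2], [3, 4], [5]]))  # [15, 15, 15]
```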
Nelson and Snyder lump systolic and pipelined approaches into one paradigm, characterized by "the decomposition of the problem into subcomputations that are assigned to dedicated processes with the data 'flowing' through the processes, visiting all or an appropriate subset of processes to complete the computation for that input." One example algorithm is the band matrix multiplication algorithm of Kung and Leiserson. The "systolic and pipelined paradigm" may sound more like an architecture than a paradigm. Indeed, parallel paradigms are tied tightly to architectures as well as to algorithms, and sometimes you wonder which is the cart and which is the horse. As we'll see momentarily, some of the book's contributors directly attack the problem of the relationships among paradigms, algorithms, and architectures in parallel processing.
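A sequential sketch conveys the flavor of that decomposition: each stage stands in for a dedicated process, and every input visits each stage in order. The particular stages (scale, offset, clamp) are invented for illustration.

```python
def run_pipeline(stages, inputs):
    results = []
    for x in inputs:
        for stage in stages:  # the datum "flows" through each dedicated stage
            x = stage(x)
        results.append(x)
    return results

stages = [lambda x: x * 2,       # stage 1: scale
          lambda x: x + 1,       # stage 2: offset
          lambda x: min(x, 10)]  # stage 3: clamp

print(run_pipeline(stages, [1, 3, 7]))  # [3, 7, 10]
```

In a real pipelined machine the stages would run concurrently, each working on a different input at the same moment; the sequential loop above only shows the path each datum takes.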
The distinguishing characteristic of the systolic and pipelined paradigm seems to be the idea of flow, a (there you go jumping ahead of me again) communication property. The authors present a rigorous test of the "flow property," which, unfortunately, does not always identify putatively systolic algorithms as having the flow property. Nevertheless, it appears to test an interesting property common to many such algorithms.
The flow test works like this: You select an arbitrary communication edge and "radioactively tag" a single communication across that edge. As the computation progresses... let all values computed with one or more radioactive values become radioactive. Then define the "contaminated region" as the set of processes and communication channels touched by some radioactive value. An algorithm passes the flow test if the edge over which the initial value was transmitted is on the boundary of the region; otherwise it fails the test.
The idea is that the tagged value flows out to define the contaminated region.
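The flow test can be simulated on a toy communication graph. The encoding below - processes as numbered nodes, channels as directed edges, contamination spreading along reachable edges - is my own simplification, not the authors' formal definition.

```python
def flow_test(edges, tagged_edge):
    src, dst = tagged_edge
    contaminated = {dst}  # the radioactively tagged value arrives at dst
    changed = True
    while changed:        # spread contamination along outgoing channels
        changed = False
        for a, b in edges:
            if a in contaminated and b not in contaminated:
                contaminated.add(b)
                changed = True
    # Pass: the tagged edge lies on the boundary of the contaminated
    # region, i.e. its source process is never itself contaminated.
    return src not in contaminated

# A linear pipeline 0 -> 1 -> 2 -> 3: contamination flows only forward.
pipeline = [(0, 1), (1, 2), (2, 3)]
print(flow_test(pipeline, (1, 2)))  # True

# A cycle lets contamination flow back to the source, so the test fails.
cycle = [(0, 1), (1, 2), (2, 0)]
print(flow_test(cycle, (0, 1)))     # False
```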
Mapping Algorithms to Architectures

It seems that if we could neatly map algorithms to architectures, we could reduce the dimensionality of the paradigmatic challenge. Because we had defined the word paradigm to mean the mapping of problems to algorithms and architectures, the challenge then would be one of mapping the problem to the combined algorithm/architecture rather than one of solving a three-body problem. In fact, Leah Jamieson, a Purdue University computer scientist and the book's lead editor, talks about just this: how to map parallel algorithms to parallel architectures.
Jamieson introduces an interesting model, involving the algorithm life cycle and virtual algorithms. And she identifies the characteristics of parallel algorithms that she thinks are worthy of attention in taking a paradigmatic approach to parallel algorithms:
But the performance of an algorithm depends on the architecture on which it is implemented.

Benchmarking Parallel Architectures

If, as Nelson and Snyder maintain, communication is the parallel processing problem, then the analysis of parallel algorithms requires some means of measuring communication performance in parallel implementations - both existing implementations and those implementations we'd build if we knew that the payoffs in performance would justify the costs. But the communication performance of a parallel algorithm will depend more (and in more complex ways) on the machine architecture on which it is realized than is the case for sequential algorithms on sequential machines.
Steven Levitan of the University of Massachusetts says that there exists no general theory of communication complexity, meaning that we can't predict performance in parallel processing. We can observe performance in real implementations, but we can't predict performance in architectures not yet built. We can't model; we must experiment. That's why the literature on parallel processing contains so many experimental case studies: "We implemented the following algorithm on the following architecture and obtained the following performance speedup over the baseline performance of the best-known sequential algorithm on a single-processor version of the same architecture. The experimental subject was then sacrificed and its cortex weighed." Excuse me; a momentary rat-lab relapse.
Communication complexity combines components of algorithm complexity and architecture complexity. Levitan wants to separate the warp from the woof to characterize the complexity component attributable to the architecture so that he can factor it out. To do this he needs to develop a metric for an architecture's communication complexity, and that's what he sets out to do in his chapter.
He begins by presenting several candidate metrics, including:
He tests the metrics on some typical algorithms and gets mixed results. Some algorithm/architecture combinations are bandwidth limited, some diameter limited, and so on. Some perform in ways not well predicted by any of the metrics.

What he does next is interesting: He runs the algorithms themselves as metrics, taking care not to include the case in which an algorithm is used both as the measuring device and the object measured, and finds the algorithms uniformly better average predictors of machine performance than any of the abstract metrics. He concludes that the best metric for the problem is a good suite of benchmarks, covering fundamental parallel processing tasks, and he presents one.
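To give a feel for what such abstract metrics look like, here are the standard diameter and bisection-width formulas for two common interconnection networks, a ring and a binary n-cube. The formulas are textbook facts; their use here as stand-ins for Levitan's candidate metrics is my own illustration.

```python
import math

def ring_diameter(n):
    # Longest shortest path between any two of n processors on a ring.
    return n // 2

def ring_bisection(n):
    # Cutting a ring in half always severs exactly two links.
    return 2

def hypercube_diameter(n):
    # n must be a power of two; the diameter is the number of dimensions.
    return int(math.log2(n))

def hypercube_bisection(n):
    # Half the processors, each with one link crossing the cut.
    return n // 2

for n in (16, 64):
    print(n, ring_diameter(n), hypercube_diameter(n), hypercube_bisection(n))
```

Even this toy comparison shows why an algorithm's communication pattern matters: an algorithm limited by diameter and one limited by bandwidth will rank these two networks very differently.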
His suite of benchmarks consists of these tasks:
The Analysis of Paradigms

Can we, Nelson and Snyder ask (or seem to me to be asking, because it's a question very much on my mind as I read their chapter), develop a computer science discipline that deals with the analysis of paradigms in a way that is analogous to the analysis of algorithms? They think so and suggest that such a discipline would teach techniques such as contraction analysis, which enables you to solve a problem for the paradigm and have the solution work for all algorithms that follow the paradigm. Here's Jamieson, talking about algorithms, and in the process arguing that paradigms can be identified by shared characteristics of algorithms, and therefore can be analyzed:
The rationale behind the "characteristics-based" approaches is that many algorithms do possess an identifiable structure. For example, at the data dependency level, many algorithms share similar communications patterns. At the process level, algorithms based on the same paradigm - that is, divide-and-conquer - may exhibit similar communications requirements. Algorithms that operate on similar data structures may lend themselves to execution on similar architectures. In the area of digital signal processing, it is possible to identify a set of canonical algorithm structures (that is, second order section, FFT, autocorrelation, convolution); algorithms that can be expressed in terms of these structures can be constructed from a set of basic building blocks (e.g., multiplications, complex multiplications, butterflies, sums-of-products, address arithmetic). A concise description of the algorithm in terms of a basic set of features allows selection of an appropriate machine configuration and can facilitate the mapping process by relating the characteristics of the current algorithm to known layout patterns.
The paradigms examined in the book are mostly familiar. Nelson and Snyder acknowledge that those they discuss do not provide the parallel programmer with a full toolkit of paradigms to guide him or her in developing new algorithms. But they conclude their chapter of the book by saying, "We expect more paradigms to be discovered." But that's an issue of a different context.
The Context of Discovery

Just fifty years ago the philosopher of science Hans Reichenbach drew a distinction between what he called the context of discovery and the context of justification in science. The former deals with the origins of theories; the latter takes theories as existing artifacts and investigates their properties.
The fact that Kepler arrived at his views on the structure of the solar system from a consideration of parallels with the Holy Trinity, Reichenbach maintained, was a fact in the domain of history or possibly of psychology, but definitely not in the domain of science or the philosophy of science. It belonged to the context of discovery, and science (actually he used the word epistemology) is concerned only with the context of justification.
As difficult as it may be to develop a rigorous computer science course on the analysis of paradigms, it's at least a challenge within the context of justification. Much less likely to be rigorously codified in the near future is the issue of how we discover a paradigm, map the paradigm to the problem, see the applicability, find the new rules. This ability resides in the context of discovery, the domain of serendipity and sudden insight. It's the art of the science.
It's easy to convince yourself that there are no rules for discovering rules, that seeing the connections is all luck or unanalyzable genius, that the art of the science is outside science, that this art is really artlessness. It's convenient to say that perhaps the Holy Trinity is as reasonable a place as any to look for inspiration in developing programming paradigms.
But this is a mistake. The context of discovery is accessible to the study of psychology, and although there may be some debate about the extent to which psychology is a science, it is at least a discipline with rules. As George Polya says, "There are rules and rules."
Polya has written the definitive book on discovery in problem solving (Mathematical Discovery, two volumes, John Wiley & Sons, 1965). In this extraordinary work Polya discusses very broad paradigms for problem solving and sets down the ten commandments for nurturing serendipity.
"Solving problems," Polya says, "is a practical art, like swimming, or skiing, or playing the piano: you can learn it only by imitation and practice." Polya presents in his book several examples for imitation, and discusses how to discover the pattern in example solutions, so that you can apply it to similar problems. He presents several such useful patterns, and by pattern he means nothing more or less than paradigm. He identifies several mathematical paradigms, but also such broad problem solving paradigms as problem reduction (two kinds) and guess-and-test.
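Guess-and-test is simple enough to sketch in a few lines; this tiny example is my illustration, not Polya's: finding an integer square root by generating guesses and testing each one.

```python
def guess_and_test_sqrt(n):
    for guess in range(n + 1):   # generate a candidate guess
        if guess * guess == n:   # test it against the problem's condition
            return guess
    return None                  # no guess passed the test

print(guess_and_test_sqrt(1369))  # 37
```

The paradigm's power lies not in any one guess but in the systematic generate-and-check loop, which is why it transfers so readily from arithmetic puzzles to much larger search problems.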
But Polya goes beyond merely identifying problem-solving paradigms; he also enters the context of discovery and gives rules for discovering solutions to problems. These are rules of the sort found in practical arts: general guidelines that make intuitive sense and seem to work well. They include, somewhat rephrased here, the following rules of problem solving. (If they seem too obvious to mention, I suggest you examine your own methods the next time you attempt to solve a problem; perhaps you're ignoring the "obvious"!)
If it happens that you find yourself designing the paradigm requirement for a computer science department curriculum for the 1990s, I suggest that you consider putting Mathematical Discovery on the reading list.