The Harvest Object Cache

Making Internet information services scale better

Peter B. Danzig

Peter is a professor of computer science at the University of Southern California. He can be reached at danzig@usc.edu. The software system described here is the result of a collaboration between Anawat Chankhunthod, Chuck Neerdaels, and Peter Danzig of USC, and Michael Schwartz and Duane Wessels of the University of Colorado-Boulder.

As traffic on the Internet continues to grow and new kinds of information (such as audio and video data) are transmitted across it, some of the design limitations behind popular Internet applications become increasingly apparent. If you've recently tried to reach a popular Web site such as Lycos or Yahoo, for instance, you've perhaps encountered delays. Likewise, if your Internet provider has added lots of new users lately, you've likely found the system unusably slow at peak times during the day.

Although some of these problems are strictly local, others result from nonlocal design limitations. Internet information services such as FTP, Gopher, and the World Wide Web have evolved so rapidly that their designers and implementors postponed performance and scalability in favor of functionality and easy deployment. These popular services have been designed with little regard for efficient use of network bandwidth. As an example, they lack caching support in their core protocols.

There are a variety of approaches for addressing Net-related performance problems. The Harvest cache, for example, is a hierarchical object cache designed to make Internet information systems scale better. It has been in use for two years at about 100 sites on the Net and it can function both as a proxy-cache and as an httpd accelerator. As an httpd accelerator, the cache works in conjunction with existing HTTP daemons (Web server software) to increase throughput dramatically. This can be accomplished in a mostly transparent manner.

In this article, I'll present measurements that show that the Harvest cache achieves an order-of-magnitude performance improvement over other proxy caches, such as the cache used in the CERN 3.0 server software. Our results demonstrate that HTTP is not an inherently slow protocol, but rather that many popular implementations have ignored the sage advice to make the common case fast.

The Harvest cache is designed to support a highly concurrent stream of requests with minimal queueing for operating-system-level resources. This is achieved by use of implementation techniques such as nonblocking I/O, application-level threading, and virtual memory management.

The Harvest cache runs under several operating systems, including SunOS, Solaris, DEC OSF/1, HP/UX, SGI, Linux, and IBM AIX. Binary and source distributions for the cache are available from http://excalibur.usc.edu. General information about the Harvest system, including the user's manual, is available from http://harvest.cs.colorado.edu. A commercial version should be available at press time.

Origins of the Harvest Cache

Hierarchical caching distributes server load away from server hot spots raised by globally popular information objects, reduces access latency, and protects the network from erroneous clients. High performance is particularly important for higher levels in the cache hierarchy, which may experience heavy service-request rates. The Harvest cache allows individual caches to be interconnected hierarchically in a way that mirrors the topology of an internetwork, resulting in additional efficiency increases.

In a hierarchical cache, misses at one level are passed to caches located at higher levels, as illustrated in Figure 1. In addition to the parent-child relationships, the cache supports a notion of "siblings" (that is, caches at the same level in the hierarchy) provided to distribute cache server load. Each cache in the hierarchy independently decides whether to fetch the reference from the object's home site or from its parent or sibling caches, using a simple resolution protocol that works as follows.

If the URL contains any of a configurable list of substrings, then the object is fetched directly from the object's home, rather than through the cache hierarchy. This feature is used to force the cache to resolve noncacheable ("cgi-bin") URLs and local URLs directly from the object's home. Similarly, if the domain name of a URL matches a configurable list of substrings, then the object is resolved through the particular parent bound to that domain. Otherwise, when a cache receives a request for a URL that misses, it performs a remote procedure call (RPC) to all of its siblings and parents, appropriate for the particular URL checking if the URL hits any sibling or parent. The cache retrieves the object from the site with the lowest-measured latency.

Additionally, a cache option can be enabled that tricks the referenced URL's home site into implementing the resolution protocol. When this option is enabled, the cache sends a "Hit" message to the UDP echo port of the object's home machine. When the object's home echoes this message, the cache treats it like a hit generated by a remote cache that had the object. This option allows the cache to retrieve the object from the home site if it happens to be closer than any of the sibling or parent caches.

A cache resolves a reference through the first sibling, parent, or home site to return a UDP "Hit" packet, or through the first parent to return a UDP "Miss" message if all caches miss and the home's UDP "Hit" packet fails to arrive within two seconds. However, the cache will not wait for a home machine to time out; it will begin transmitting as soon as all of the parent and sibling caches have responded. The resolution protocol's goal is for a cache to resolve an object through the source (cache or home) that can provide it most efficiently. This protocol is really a heuristic, as fast response to a ping indicates low latency. We plan to evolve to a metric that combines both response and available bandwidth. Hierarchies as deep as three caches add little noticeable access latency. The only case where the cache adds noticeable latency is when one of its parents fails, but the child cache has not yet detected it. In this case, references to this object are delayed by two seconds, which is the length of the parent-to-child-cache timeout. As the hierarchy deepens, the root caches become responsible for more clients. To keep root cache servers from becoming overloaded, we recommend that the hierarchy terminate at the first place in the regional or backbone network where bandwidth is plentiful.

Our trace-driven simulation study of Internet traffic in 1993 showed that hierarchical caching of FTP files could eliminate half of all file transfers over the Internet's WAN links. Other studies seem to arrive at different conclusions. For example, both "Long-term Caching Strategies for Very Large Distributed File Systems," by Rafael Alonso and Matthew Blaze (Proceedings of the USENIX Summer Conference, June 1991), and "Multi-level Caching in Distributed File Systems, Or Your Cache Ain't Nuthin' but Trash," by D. Muntz and P. Honeyman (Proceedings of the USENIX Winter Conference, January 1992), show that hierarchical caches can, at best, achieve 20 percent hit rates and cut server workload in half. We believe the different conclusions reached by these studies is a result of examining different kinds of workloads. Our study traced wide-area FTP traffic from a switch near the NSFNET backbone. In contrast, the other studies analyzed LAN workstation file-system traffic. Because LAN files rarely change over, say, a five-day period, the other studies found little value in hierarchical caching over flat-file caches at each workstation.

In contrast to workstation file systems, applications such as FTP, WWW, and Gopher facilitate read-only sharing of autonomously owned and rapidly evolving object spaces. We found that over half of NSFNET FTP traffic is due to sharing of read-only objects, and since Internet topology tends to be organized hierarchically, that hierarchical caching can yield a 50 percent hit rate and can reduce server load dramatically. Claffy and Braun reported similar statistics for Web traffic, which has displaced FTP traffic as the largest component of Internet packets.

The Harvest Object Cache

Figure 1: Hierarchical cache arrangement.