ADVANCED 80386 MEMORY MANAGEMENT

Paging is the 80386's answer to the memory management challenges for today's multitasking operating systems

Neal Margulis

Neal is an applications engineer for Intel Corp. and can be reached at 2625 Walsh Ave., SC4-40, Santa Clara, CA 95051.


Memory management is a challenge for multitasking operating systems. To meet this challenge, the Intel 80386 architecture adds a method for managing memory called "paging" to the segmentation features of the 80286. Paging can increase the efficiency of virtual memory multitasking operating systems that run 8086, 80286, and 80386 microprocessor software. This article explains how paging increases performance for multitasking operating systems and why paging is a requirement for multitasking 8086 and 32-bit 80386 microprocessor applications. To make use of the information in this article, you should have a basic knowledge of protected mode and segmentation on the 80286 or 80386 microprocessor.

Both the 8086 and the 80286 address memory with a linear address. For systems that use these processors, or the 386 CPU without paging, the linear address is equivalent to the physical address. Address translation on the 386 CPU is shown in Figure 1. Notice that the paging unit comes after the linear address calculation. The paging unit translates the linear address produced by the segmentation unit into the physical address that goes out on the bus, which allows paging to be performed by the operating system without affecting applications in any way.

The Logical Basis

A task's logical address space consists of one or more segments. The 80286 allows segments of up to 64K, and the 386 microprocessor allows segments up to 4 gigabytes long. As any experienced programmer knows, segments are visible to the application programmer, although less so on the 386 because of their much larger maximum size.

Microsoft's OS/2 currently uses segments as the basis for its virtual memory management. The 80286's maximum segment size of 64K makes segment-based physical memory allocation possible. With the 386 microprocessor, in which segments can be up to 4 gigabytes, allocating memory on such a large basis is not practical.

Segmentation-based memory management has additional shortcomings. When variable-sized segments are used for physical memory allocation, for example, memory fragmentation often occurs. Fragmentation occurs when the free memory in a system consists of discontinuous small sections. When the operating system then needs to load a large segment, it must perform a costly rearranging of memory. To overcome fragmentation, some segmentation-based schemes allocate the maximum segment size when any size segment is loaded. Although this overcomes the fragmentation problem, it wastes memory. Clearly a new method is needed that overcomes these inefficiencies and works with 32-bit code. That new method for virtual memory management is paging.

Paging

While paging is enabled, the processor translates a linear address to a physical address with the aid of page tables. As in mainframe computers, the page tables are arranged in a two-level hierarchy, as shown in Figure 2. The page table directory base, held in control register CR3, points to the page table directory. The directory is one page long and contains entries for 1,024 page tables. Page tables are also one page long, and the entries in a page table describe 1,024 pages; each page is 4K in size. Because a page table directory base is associated with each task, tasks can optionally have their own page table directory.

The processor uses the upper 10 bits of the linear address as an index into the directory. Each directory entry holds 20 bits of addressing information, which contain the address of a page table. The processor uses these 20 bits and the middle 10 bits of the linear address to form the page table address. The address contents of the page table entry and the lower 12 bits of the linear address form the 32-bit physical address.
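The field extraction described above can be sketched in C. This is a simplified model of the translation arithmetic, not the processor's hardware; the function names are my own:

```c
#include <stdint.h>

/* Split a 32-bit linear address the way the 386 paging unit does:
   the upper 10 bits index the page directory, the middle 10 bits
   index the page table, and the low 12 bits are the offset within
   the 4K page. */
static uint32_t dir_index(uint32_t linear)   { return linear >> 22; }
static uint32_t table_index(uint32_t linear) { return (linear >> 12) & 0x3FF; }
static uint32_t page_offset(uint32_t linear) { return linear & 0xFFF; }

/* Combine the 20 address bits of a page table entry with the
   page offset to form the 32-bit physical address. */
static uint32_t physical(uint32_t pte, uint32_t linear)
{
    return (pte & 0xFFFFF000u) | page_offset(linear);
}
```

For the linear address 00402123H, for example, the directory index is 1, the page table index is 2, and the offset is 123H.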

The Translation Lookaside Buffer (TLB)

Paging information is stored in the on-chip TLB and in memory. If the processor had to access the page tables in memory each time a reference was made, performance would suffer. To save the overhead of the page table lookups, the processor automatically caches mapping information for 32 recently used pages in an on-chip translation lookaside buffer. The TLB's 32 entries each cover 4K, providing total coverage of 128K of memory addresses. The TLB is flushed by changing the value of CR3, which is commonly accomplished by a privileged MOV instruction or a task switch.

As shown in Figure 3, only when the processor does not find the mapping information for a page in the TLB does it perform a page table lookup from information stored in memory. To improve hit rates, the TLB is four-way set associative, meaning each translation can be stored in one of four locations in the TLB. The result is that for typical systems, 97-99 percent of the address references are TLB hits, requiring no memory references to translate. When a TLB miss occurs, the processor replaces an older TLB entry with the new translation, which is likely to be used again soon. This replacement, called TLB miss processing, is performed entirely in hardware.
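A four-way set-associative lookup can be modeled roughly as follows. This is a software sketch only: the real TLB's set-selection wiring and replacement policy are hardware details not shown here, and `tlb_fill` stands in for the processor's automatic miss processing:

```c
#include <stdint.h>

#define TLB_SETS 8   /* 32 entries, four-way set associative */
#define TLB_WAYS 4

struct tlb_entry { uint32_t vpn; uint32_t pfn; int valid; };
static struct tlb_entry tlb[TLB_SETS][TLB_WAYS];

/* Look up the physical frame for a virtual page number (the linear
   address shifted right 12 bits).  The set is chosen from the low
   bits of the VPN; any of the four ways may hold the translation. */
static int tlb_lookup(uint32_t vpn, uint32_t *pfn)
{
    struct tlb_entry *set = tlb[vpn % TLB_SETS];
    for (int way = 0; way < TLB_WAYS; way++) {
        if (set[way].valid && set[way].vpn == vpn) {
            *pfn = set[way].pfn;
            return 1;               /* TLB hit: no memory reference */
        }
    }
    return 0;                       /* TLB miss: walk the page tables */
}

/* Stand-in for hardware miss processing: record a translation. */
static void tlb_fill(uint32_t vpn, uint32_t pfn, int way)
{
    struct tlb_entry *e = &tlb[vpn % TLB_SETS][way];
    e->vpn = vpn; e->pfn = pfn; e->valid = 1;
}
```

Because a translation may sit in any of the four ways of its set, two pages that map to the same set need not evict each other, which is what raises the hit rate over a direct-mapped design.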

Using Paging for Virtual Memory Management

Virtual memory allows large or multiple programs to be executed as if they were entirely in memory, even though portions remain on disk. In the case of a large program that has 20 Mbytes of data, for example, and a computer that has only 2 Mbytes of memory, the operating system can load and run the program. An operating system that uses demand paging can multitask more applications in less physical memory than an operating system that uses segmentation for memory allocation. The information for efficiently accomplishing memory management lies within the page directory entries and page table entries. In Figure 4, the lower 12 bits of each of these entries contain several control bits that the operating system uses to keep track of which pages are in memory, which pages are on disk, which page should be swapped out in favor of a new page, and whether a swapped-out page must be written back to disk or can simply be discarded.

If set to 1, the P (present) bit indicates that the entry is present in memory. If the P bit is 0, any attempt to access this page will cause a page fault (exception 14) prior to the memory access. When a page fault occurs, the processor passes control to the interrupt 14 handler, part of the operating system that must read the needed page into memory and return execution to the program. The handler reads the contents of CR2 to decide which page is required. If there is no more room in physical memory to load in another page, the handler must decide which presently loaded page should be discarded. Although the operating system cannot be sure which pages will not be needed in the future, it can make a very good guess based on the least recently used pages. The A (accessed) bit and the bits reserved for operating-system use determine which pages have not been used recently. The processor's hardware automatically sets the A bit to 1 whenever the processor accesses the page; the bit can only be cleared by software. By periodically clearing the A bits, the operating system can keep track of pages not recently used. The A bit, combined with managing the operating system reserved bits, allows an accurate "least recently used" algorithm to be implemented for page management.
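The aging scheme described above can be sketched in C. This is one possible implementation, assuming a 2-bit age counter kept in two of the OS-reserved bits (bits 9 through 11 of each entry are available to system software); the constants match the 386 entry layout but the sweep policy is my own:

```c
#include <stdint.h>

#define PTE_P  0x001u   /* present */
#define PTE_A  0x020u   /* accessed: set by hardware on any reference */
#define PTE_OS 0x600u   /* bits 9-10, reserved for operating system use */

/* Periodic sweep: pages that were touched since the last sweep have
   their age reset and their A bit cleared; untouched pages have their
   age bumped.  The page with the highest age is the least recently
   used candidate for replacement. */
static void age_sweep(uint32_t *ptes, int n)
{
    for (int i = 0; i < n; i++) {
        if (!(ptes[i] & PTE_P))
            continue;                     /* not in memory: skip */
        if (ptes[i] & PTE_A) {
            ptes[i] &= ~(PTE_A | PTE_OS); /* touched: reset age */
        } else {
            uint32_t age = (ptes[i] & PTE_OS) >> 9;
            if (age < 3) age++;           /* saturating 2-bit counter */
            ptes[i] = (ptes[i] & ~PTE_OS) | (age << 9);
        }
    }
}
```

Because the hardware sets A on every reference but never clears it, the operating system's periodic clearing is what turns the single bit into a usable recency history.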

More Details

Once the operating system determines the page that will be discarded from RAM, it must then decide if the page needs to be written back to disk. The D (dirty) bit indicates if the page has been written to. If the D bit is set, the operating system knows that the page must be written back to disk. If the D bit is not set, the copy of the page currently on disk is the most recent version. The U/S (user/supervisor) and R/W (read/write) bits are described in the page-based protection section later in this article.

The method of swapping pages in and out of memory when needed is called demand paging. Unix System V for the 386 microprocessor has always offered this feature, and in September 1988, Phar Lap Software announced that 386/VMM, which runs on top of 386/DOS-Extender, will also support demand paging. With Phar Lap's development tools and a compiler, such as the 80386 High C Compiler from Metaware, users can develop large applications that take advantage of demand paging.

In addition to virtual memory management, paging has another useful feature: It can be used to do a simple remapping of memory, a feature used in some DOS control programs. Programs such as Compaq's CEMM, Quarterdeck's QEMM-386, and Qualitas's 386-to-the-MAX use this remapping ability to implement various features. Applications that use the LIM (Lotus, Intel, Microsoft) specification for accessing expanded memory, for instance, have their memory accesses remapped in software to fast extended memory, eliminating the need for a special memory board's external mapping hardware. This type of program also allows extended memory to be mapped into the DOS-accessible 512K to 640K range on 386 microprocessor-based PCs that have 512K of memory on the motherboard. In addition, it allows relocation of terminate-and-stay-resident utilities outside of DOS's 640K.

Protection

The 80386 provides many protection mechanisms that operating systems can selectively employ to fit their needs. Segmentation provides the basis for much of the task-based protection and multilevel protection schemes. Level 3 of segmentation-based protection corresponds to user level for paging, and levels 0, 1, and 2 correspond to supervisor level.

Paging has a separate protection mechanism that is sufficient for most operating systems. Paging protects supervisor memory and allows for write-protecting of user pages. The U/S and R/W bits are found in each page directory entry and page table entry. Their presence in both levels allows more selective control over the access to page groups and individual pages. The operating system can allow user programs to have read-only, read and write, or no access to a given page or page group. If a memory access violates the page protection attributes, such as user-level code writing to a read-only page, an exception 14 will be generated.
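The combination of the two levels can be sketched in C, assuming the more restrictive of the directory and table entry settings governs a user-level access (the function names are my own; note that this models user accesses only, since 386 supervisor code is not restricted by these bits):

```c
#include <stdint.h>

#define PG_RW 0x2u   /* 1 = writable at user level */
#define PG_US 0x4u   /* 1 = accessible at user level */

/* A user-level access to a page is governed by both its page
   directory entry and its page table entry: both must grant the
   access, so the more restrictive setting wins. */
static int user_can_read(uint32_t pde, uint32_t pte)
{
    return (pde & PG_US) && (pte & PG_US);
}

static int user_can_write(uint32_t pde, uint32_t pte)
{
    return user_can_read(pde, pte) && (pde & PG_RW) && (pte & PG_RW);
}
```

Clearing R/W in a single directory entry, for example, write-protects an entire 4-Mbyte group of pages at once, regardless of the individual page table entries beneath it.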

Exception 14 is used for reporting page access violations and page faults. To distinguish the cause of an exception 14, the operating system examines a 16-bit error code that is pushed on the stack when the exception occurs. From the error code and the faulting linear address, stored in CR2, the operating system can correctly handle the fault and resume execution.
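Decoding the error code can be sketched in C (the accessor names are my own; the bit assignments are those of the 386 page-fault error code):

```c
#include <stdint.h>

/* Decode the page-fault error code pushed on the handler's stack.
   Bit 0: 0 = page not present, 1 = page-level protection violation.
   Bit 1: 0 = the faulting access was a read, 1 = it was a write.
   Bit 2: 0 = the processor was in supervisor mode, 1 = user mode. */
static int fault_is_protection(uint16_t err) { return err & 1; }
static int fault_was_write(uint16_t err)     { return (err >> 1) & 1; }
static int fault_from_user(uint16_t err)     { return (err >> 2) & 1; }
```

A handler would service an error code with bit 0 clear by loading the missing page, and treat one with bit 0 set as a protection violation to report to (or terminate) the offending task.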

Virtual 8086 Environment

For running existing code, one of the biggest advantages of the 386 microprocessor is its support for multitasking DOS applications. Microsoft's current 80286-based OS/2 does not allow multitasking of DOS applications, but the 386 can multitask DOS applications with virtual 86 mode. IGC's VM/386, Microsoft's Windows/386, Quarterdeck's DESQview, and products from several other software vendors offer multitasking of DOS applications as a major feature. Other products, like MERGE and VPIX, use virtual 86 mode for running DOS applications under Unix.

The 386 can execute 8086 applications in both real mode and virtual 86 mode. Virtual 86 mode allows the execution of 8086 applications while still allowing the use of paging. The main difference between real and protected mode is how the segment selectors are interpreted. When the processor is executing in virtual 86 mode, the segment registers are used as in real mode: the contents of the segment register are shifted left 4 bits and added to the offset to form the linear address. When running in protected mode, the operating system marks which tasks are protected mode applications and which are 8086 applications; the 8086 applications require their segments to be interpreted as in real mode.
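The real-mode address formation used in virtual 86 mode is simple enough to show directly (the function name is my own):

```c
#include <stdint.h>

/* In virtual 86 mode, as in real mode, the 16-bit segment value is
   shifted left 4 bits and added to the 16-bit offset, producing a
   linear address within (roughly) the lowest megabyte. */
static uint32_t v86_linear(uint16_t seg, uint16_t off)
{
    return ((uint32_t)seg << 4) + off;
}
```

For example, the familiar text-display address B800:0000 yields the linear address B8000H, which the paging unit is then free to map to any physical frame.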

Paging is crucial to multitasking 8086 applications. With paging, the applications can be executed anywhere in memory, not just in the lower 1 Mbyte. Without this ability, only 640K of applications would be multitaskable. The paging hardware allows the 20-bit linear address produced by a virtual 86 mode program to be divided into up to 256 pages. Each of these pages can be relocated anywhere in physical memory. The operating system is thus able to treat memory for virtual 86 applications as it does memory for 32-bit applications.

Because CR3, the page directory base register, is different for each task, each task can use a different mapping scheme to map pages to different physical locations. Of course, entries can appear in both tasks' page tables to allow sharing of operating system code between applications.

Summary

Paging is the 80386's answer to the challenges of memory management for today's multitasking operating systems. Whether a task's logical address space consists of one segment or many, an operating system can subdivide the linear address space into pages. To an operating system, pages are more convenient units than are segments for allocation because pages are all the same size, which prevents common problems that occur while using segments for memory allocation. Additionally, page-based swapping is better tuned to disk drives than segment-based swapping.

Paging is the only mechanism for virtual memory management of 32-bit applications (in which segments can be up to 4 gigabytes in length) and for virtual 8086 applications, which all exist in the lowest Mbyte of the linear address space. The fast on-chip TLB and hardware TLB miss processing allow the 386 microprocessor to perform these advanced memory management techniques without reducing the application's processing power.

Enabling and Disabling Paging on the 80386

When the 80386 microprocessor is brought out of reset, it first executes in real mode. To use paging, the processor must be executing in protected mode. The steps for entering and exiting protected mode are described in the DDJ article "80386 Protected Mode Initialization" by Neal Margulis (October 1988). To enable paging, follow these steps:

    1. Set up the page directory and the page tables in memory with the desired values.

    2. Load CR3, the page directory base, with the base address of the page directory. Loading CR3 also invalidates any information stored in the TLB.

    3. Execute a MOV CR0, EAX instruction in which bit 31 is set to 1 and the other bits are unchanged. This can be accomplished with the sequence MOV EAX, CR0; OR EAX, 80000000H; MOV CR0, EAX. It is possible to enable paging at the same time protected mode is entered. The instruction sequence in which the transition to paging occurs must have its linear address mapped to its physical address.

    4. Flush the instruction prefetch queue by performing a JMP $+2 instruction.

Once paging is turned on, all linear addresses are paged to the correct physical address. The address translation information is then automatically cached into the TLB each time the hardware performs a page translation from the tables in memory. Should you change any of the page table information in memory, or decide to use a different set of page tables, you must perform a MOV CR3, xxxxxxxx to invalidate the current TLB entries so that the new paging information will be used.
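Step 1 above can be sketched in C. This builds the simplest useful mapping, an identity map of the first 4 Mbytes, which satisfies the requirement that the transition code's linear addresses equal its physical addresses (the function name and the single-table layout are my own choices; in a real system both tables must be 4K aligned and reside at known physical addresses):

```c
#include <stdint.h>

#define PG_P  0x1u   /* present */
#define PG_RW 0x2u   /* writable */

/* Build one page table that identity-maps the first 4 Mbytes
   (1,024 pages of 4K each), and a page directory whose first entry
   points at that table.  All other directory entries are marked
   not present. */
static void identity_map(uint32_t dir[1024], uint32_t table[1024])
{
    for (int i = 0; i < 1024; i++) {
        table[i] = ((uint32_t)i << 12) | PG_P | PG_RW;  /* page i -> frame i */
        dir[i] = 0;                                     /* not present */
    }
    dir[0] = ((uint32_t)(uintptr_t)table & 0xFFFFF000u) | PG_P | PG_RW;
}
```

With these tables in place, loading CR3 with the physical address of `dir` and then setting bit 31 of CR0 enables paging without changing where any code or data appears to live.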

When disabling paging you must do the following:

    1. Locate the instructions that perform the transition on a page whose linear address is the same as its physical address. This prevents an unpredictable instruction prefetch from occurring between the change in paging status and the next instruction.

    2. Execute a MOV CR0, EAX instruction in which bit 31 is forced to 0. This can be accomplished with the sequence MOV EAX, CR0; AND EAX, 7FFFFFFFH; MOV CR0, EAX.

    3. Execute a MOV CR3, EAX to invalidate the TLB entries.

After paging is disabled, the linear address and physical address are the same. --N. M.


Copyright © 1989, Dr. Dobb's Journal