ROUTING ALGORITHMS FOR INTERNETWORKING

Routing metrics are applied in a cumulative fashion, so that the cost of a particular hop is equal to the sum of all applicable metrics. Figure 1 is a logical diagram of an internet that indicates these costs. Each circle represents a router, and the two arrowed lines between each pair of circles represents a subnetwork or link connecting the two routers; the numbers on the lines indicate the current hop cost in each direction.

The Least-cost Algorithm

The hop costs are used as input to the path-calculation routine. Each router maintains an information base containing the topology and hop costs of each hop in the internet. This information is used to perform what is referred to as a "least-cost" routing algorithm, which can be simply stated as:

Given a network of nodes connected by bidirectional links, where each link has a cost associated with it in each direction, define the cost of a path between two nodes as the sum of the costs of the links traversed. For each pair of nodes, find the path with the least cost.

The algorithm used in OSPF, IS-IS, and Integrated IS-IS was originally proposed by Dijkstra. It enables each router to find the least-cost route to every other router. Dijkstra's algorithm can be stated thus: Find the shortest paths from a given source node to all other nodes by developing the paths in order of increasing path length. The algorithm proceeds in stages. By the kth stage, the shortest paths to the k nodes closest to (least cost away from) the source node have been determined; these nodes are in a set M. At the (k + 1) stage, the node not in M that has the shortest path from the source is added to M. As nodes are added to M, their path from the source is defined. The algorithm is formally described in Example 1. Table 1 and Figure 2 show the results of applying this algorithm to Figure 1, using s = 1. Note that at each step, the path to each node plus the total cost of that path is generated. After the final iteration, the least-cost path to each node and the cost of that path has been developed. The same procedure can be used with node 2 as source node, and so on.

Example 1: Dijkstra's algorithm.

  Define:
    N =set of node in the network
    s =source node
    M =set of nodes so far incorporated by the algorithm
    d[ij] =hop cost from node i to node j; d[ii] =O, and d[ij]= infinity
           if the nodes are not directly connected
    D[n] =cost of the least-cost path from node s to node n that is
          currently known to the algorithm

  The algorithm has three steps; steps 2 and 3 are repeated until M=N:

  1. Initialize:
     M = {s} (set of nodes incorporated is only the source node)
     D[n] =d[sn] for n not equal s (initial path costs to neighboring nodes
           are simply the hop costs)

  2. Find the neighboring node not in M that has the least-cost path from
     node s and incorporate that node into M:
                                 min
     Find w epsi s M such that D[w]=          D[j]
                                 j epsi s M
     Add w to M

  3. Update least-cost paths:
     D[n]=min[D[n], D[w]+d[w,n]]for all nepsiM
     If the latter term is the minimum, the path from s to n is now the
     path from s to w, concatenated with the hop from w to n.

Table 1: Least-cut routing calculation.

  Iteration  M    D[2] Path  D[3]  Path  D[4]  Path   D[5]      Path    D[6]     Path
  -------------------------------------------------------------------------

  1         {1}      2  1-2   5     1-3     1   1-4  infinity  --     infinity     --
  2       {1,4}      2  1-2   4   1-4-3     1   1-4    2      1-4-5   infinity     --
  3     {1,2,4}      2  1-2   4   1-4-3     1   1-4    2      1-4-5   infinity     --
  4   {1,2,4,5}      2  1-2   3  1-4-5-3    1   1-4    2      1-4-5      4     1-4-5-6
  5  {1,2,3,4,5}     2  1-2   3  1-4-5-3    1   1-4    2      1-4-5      4     1-4-5-6
  6 {1,2,3,4,5,6     2  1-2   3  1-4-5-3    1   1-4    2      1-4-5      4     1-4-5-6

Dijkstra's algorithm is known to converge under static conditions of topology and hop costs. If the hop costs change over time, the algorithm will attempt to catch up with these changes. However, if the hop cost depends on traffic, which in turn depends on the routes chosen, then a feedback condition exists, and instabilities may result.

The Routing Protocol

The validity of the algorithm will, of course, depend on the validity of the information used as input. The routing information may change over time. A router or subnetwork failure can alter the topology, and some of the costs, especially delay, are variable. Thus, some sort of information-exchange discipline is needed to govern the frequency with which routers exchange routing information. This is the purpose of all three routing protocols, and all three use essentially the same strategy for distributing information. The general strategy has the following elements:

1. Each router generates and maintains information about its local environment, consisting of:

The identity of each of its router neighbors and the cost to reach each neighbor, using some or all of the routing metrics. Thus, there must be some type of discovery function by which each router learns of its neighbors. This is part of the protocol, consisting of discovery-type PDUs.
The identity of each ES directly reachable from this router.

2. The information generated in step #1 is referred to as "link-status" information. Each router distributes this information to all peer routers, using the routing protocol. The information to be distributed is contained in link-status PDUs (LSPs).

3. Based on steps #1 and #2, each router can maintain the following information:

The topology, consisting of a set of routers and the links connecting them.
The cost of each link in each direction.
A list of ESs that are neighbors of each router.

This information is sufficient to make a routing decision for each datagram.

Since the point of distributing LSPs is to enable each router to build up a picture of the topology of its area or domain, how can a router know to whom and how to deliver the LSPs? There appears to be a circular dilemma: A router needs to know the topology in order to directly address its LSPs to each other router, yet the router learns the topology from LSPs. The way out of this dilemma is flooding: The LSP is sent by the source router to every one of its neighbor routers. At each router, an incoming LSP is retransmitted on all outgoing links except for the link that it arrived from. Eventually, all routers will receive a copy of the LSP.

Flooding solves the delivery problem, but raises some new problems:

1. How is the flooding stopped? That is, how do we prevent the indefinite retransmission of an LSP by every router that receives it?
2. How do we prevent obsolescence? That is, if a router fails, how will other routers know that the last LSP that they received from the failed router is obsolete?
3. How do we ensure currency? Since LSPs are being transmitted in a connectionless fashion, an older LSP may arrive after a younger LSP from the same source.

Let us examine each of these problems in turn. First, flooding is easily controlled since each router stores a copy of each LSP that it receives. Each LSP includes a sequence number so that two different LSPs from the same source can be distinguished. When a router receives an LSP from a given source, it checks to see whether it already has a copy of that particular LSP (same source, same sequence number). If so, it does not retransmit the incoming LSP. The result is that each LSP traverses each link in the topology only once.

Next, consider the problem of obsolescence. Each LSP includes a parameter called "remaining lifetime." This parameter is set to some default maximum value by the source router. The parameter is decremented each time it is retransmitted by a router. Once a router stores an LSP in its database, it periodically decrements the lifetime parameter until it reaches 0. If the sequence number of an LSP reaches 0 (no new LSP from the same source arrives in time), the LSP is purged from the database.

Finally, a router can determine which of two LSPs from the same source is the more recent on the basis of the sequence number. When a router is initialized, it issues its first LSP with a sequence number of 1 and increments the sequence number for each subsequent LSP it creates. With a 32-bit sequence number, it is unlikely that all of the sequence numbers will be used up. However, if a router reaches the maximum sequence number (2{32}-1), the routing function at the router is disabled for a period sufficient to ensure that all of the prior LSPs issued by this router have expired. The router then begins again with a sequence number of 1.

Hierarchical Routing

The routing discipline can actually be more complex than that just described. All of the routing protocols we have been describing are designed to work in a multilevel, hierarchical routing environment. We will describe the environment designed for OSI configurations; the one for TCP/IP configurations is similar.

In an OSI configuration, an internet can be divided into a number of routing domains. A domain is a large-scale portion of an internet, generally organized along geographical or organizational lines. For example, all of the local area networks at a site, such as a military base or campus, could be linked by ISs to form a routing domain. This complex might be linked through a wide area network to other routing domains. Domains can be further subdivided into areas. This hierarchical approach has a number of advantages:

It minimizes the amount of information exchanged by ISs, thus simplifying the operation of ISs at all levels.

It allows different routing optimizations within each level of the hierarchy.

It protects the entire routing environment from inaccurate information generated by any intermediate system.

It constructs "firewalls" between different portions (areas, domains) to provide access control and other mechanisms to protect and secure the environment.

It simplifies routing protocol evolution, since ISs at one level need not know the protocol or topology at other levels.

Four levels of routing can be defined:

Level-0 routing: routing of traffic between ESs and ISs on the same sub-network.
Level-1 routing: routing of traffic between ISs within the same area.
Level-2 routing: routing of traffic between different areas within the same routing domain.
Level-3 routing: routing of traffic between different domains.

Level-0 routing is covered by routing protocols that operate between ESs and ISs; these tend to be quite simple. Levels 1 and 2 are covered by the IS-IS routing protocols discussed in this article. Currently, there is no standard for level-3 routing. At this level, the gross topology will generally be rather simple, and static routing based on manual configuration will usually suffice.

The information required by an IS to perform the routing function depends on its role. For topology information, a level-1 IS only needs to know the existence of the other level-1 ISs in its area and at least one level-2 IS in its area, plus the way in which these ISs are interconnected. Similarly, a level-2 IS needs only know the identity of the other level-2 ISs in its routing domain and how they are interconnected. In either case, we can abstract the topology into a graph consisting of nodes connected by edges. Each node is an IS, and each edge is either a point-to-point link or a subnetwork.

The second type of information needed by the IS is ES reachability. In the case of a level-1 IS, it needs to know, for each ES in its area, the identity of the subnetwork that contains that ES. In the case of a level-2 IS, it needs to know, for each ES, the area that contains that ES and a level-2 IS in that area.

Finally, for either level-1 or level-2 routing, each hop must be assigned a cost in each direction. In the case of level-1 routing, a hop is a subnetwork or point-to-point link connecting ISs. For level-2 routing, a hop is a point-to-point link between level-2 ISs.