Distributed Applications and NeXT's PDO

PDO brings distributed objects to non-NextStep environments

To make an object distributed (and, therefore, available to other applications), a server program must first "vend" (make publicly available) the object. Example 1(a) shows a simple application that shares a central stock-price data feed. The NXConnection class provides other, more commonly used methods than run that allow the waiting to take place asynchronously. Example 1(a) instantiates a price server, then registers that server with the network name server stockPriceServer. The last line loops to wait for remote messages. You must include this code in each application that will participate in Distributed Objects.

To use an object that has been vended, a client looks up the desired server object and stores a handle to it in a local NXProxy object. Example 1(b) gets a handle and stores it in the variable theServer. If this code returns a non-nil value to theServer, the client may then refer to the stock-price server on the server machine as if it were implemented in the client, with only a few exceptions; see Example 1(c). This transparency is at the heart of Distributed Objects.

Passing Objects

Perhaps the most important data type that clients and servers can pass to each other is the id. In the example application, the server explicitly vends and the client looks up only one serving object. After that, either the client or the server may pass ids of objects that each wishes to implicitly vend as arguments or return values.

As long as the client is prepared to handle remote messages by way of some form of NXConnection-run message, and as long as the client vends an object to the server in this way, the server may then use objects in the client. Thus, the client and server switch roles. More commonly, the original server can make additional objects available that are useful to the client, without additional setup code overhead.

For example, suppose the stock price server needs to return more attributes than just the price of the stock. A good way to do this is to have the server return a Stock object that the client can then query for the stock attributes. The client code might look like Example 2. Executing the first line implicitly vends a Stock object from the server, which is then accessible by way of the id mystock. Each printf statement invokes the remote stock object in the server, even though the client refers to myStock just like any local object.

Multithreaded Servers

In Example 1(a), the last line of the program, [myConnection run], never returns--it just loops while waiting for incoming remote messages. However, in most applications, a server must do more than simply service remote messages. For example, a real stock-price server also might update a database from a real-time data feed. Multiple threading is the mechanism that allows a server to continue with other tasks while it also waits for messages to objects it has vended. The DO system makes multithreading very easy for programs built with the Application Kit, using the NXConnection method runFromAppkit.

Although in most applications, you can use PDO objects in exactly the same way as DO objects, you can't currently write multithreaded PDO servers. This is because there are no tools for threads in the HP operating system. For example, the DO code from the server shown previously might be enhanced to look like that in Example 3. The runFromAppkit method creates a new thread whose sole purpose is to loop, waiting for remote method invocations. The runFromAppkit method also is aware that the Application Kit isn't thread-safe, so it waits to dispatch remote methods until your application is between Application Kit events. If your server doesn't use the Application Kit and requires finer-grained parallelism, other methods let you create threads that dispatch remote methods without waiting for the Application Kit. These methods are documented in the NextStep General Reference book.

Avoiding Pitfalls

If your application is simple (like the example shown here), you'll find that making your application distributed is fairly transparent. However, if you're building a more complex, robust application, you must be aware of a few more issues.

One is that the semantics of returning self are different. In Objective C, it's common to return the id self to indicate success of a method. While this has reasonable performance for local objects, returning self to a remote caller actually vends the object to which self refers, with all the overhead involved. Unused object vending is not excessively expensive, but for maximum efficiency, objects should return a more appropriate type than self.

For example, to indicate success or failure, an object should return a scalar type such as YES or NO, instead of self or nil. If the server doesn't need to return a status at all, it can return void, and the method call can use the oneway keyword. This results in a very fast one-way asynchronous call--meaning that the caller doesn't even have to wait for the remote method to finish.

Another issue in distributed applications is how to handle a failure in the network or in remote machines. You must make sure that cooperating programs deal gracefully with the failure of their clients or servers. The exact action an application should take depends on the nature of the cooperating programs, but DO provides a reasonably simple mechanism that allows programs to notice the loss of a cooperating program. To be notified of the loss of a cooperating program, an object must request notification and implement a senderIsInvalid: method. When the object is asynchronously notified by way of this method, it must determine which remote objects have become inaccessible, and decide what to do about it.

Another pitfall in making an application distributed is that a few data types can't be passed and returned transparently. These types are unions, void pointers, and pointers inside structures other than ids and char *s. The basic problem with these types is that, in general, the compiler can't know the size of the data being referenced, so it can't pass the data to a remote program in a meaningful way. Another problem is that the computer on which the remote object is running might deal with the data differently (for example, it might use different byte ordering). The result is that it's not possible to pass data types whose layout can't be known.

Presently, there are at least two ways to deal with this limitation. One is to type-cast pointers, which works around the void pointer problem, as long as the pointer is to a nonrecursive structure. Another approach is to enclose complex structures into an object and transmit that object. However, if you find yourself often transmitting objects around, you might consider redesigning or repartitioning your application to lessen network traffic.

To transmit object copies instead of vending them, use the new bycopy Objective C keyword in the parameter list. Be sure to conform to the NXTransport protocol, which requires that you write three simple methods: encodeRemotelyFor:freeAfterEncoding:isBycopy:, encodeUsing:, and decodeUsing:. The first of these is actually implemented in the Object class. You typically override it with a simple two-line method that uses the isByCopy parameter to decide whether to send a copy of the object. If a copy is to be sent, the other two methods cooperate to send the data necessary to create a copy of the object at its new location. The encodeUsing: method runs locally and packs up the unique data of the object. On the remote computer, the decodeUsing: method unpacks the object to instantiate a copy.

With regard to strings, a nontransparency arises in the current version of DO, which manages the memory for storing pointers to chars (strings) differently than it does for pointers to other data types. Normally, pointer data is automatically freed when the server returns. However, in the current DO, the server must explicitly free strings when it has finished with them. If your server code doesn't free strings, the memory for those strings is lost. In a future version, DO will manage the memory for strings just as it currently does for other data types.

Pitfalls in Scalability

Performance, deadlock avoidance, and transaction management are issues that you don't need to worry about in smaller or simpler DO applications. However, in large distributed applications, these issues can become very important. With a larger network that has complex needs, latent problems in your existing DO applications may then become apparent. This isn't completely bad news, because once problems are apparent, you have a better chance of fixing them.

Dealing with these advanced issues is beyond the scope of this article. However, it behooves you to consider the inherent complexity involved in writing distributed applications before you begin work on a large-scale distributed application. To deal with deadlock, you must carefully think through the behavior of cooperating and competing servers to make sure they can never mutually rely on the same resources at the same time in order to make progress. Likewise, to deal with managing atomic transactions, you should consider using a two-phase commit protocol.

If you plan to put compute-intensive tasks on a server, keep an eye on scaling issues. For example, no current PDO server has the aggregate computing power of 500, or even ten Pentium-based NextStep machines. Therefore, if you might eventually decentralize your application, you shouldn't plan to saturate a single central server. Rather, consider distributing compute-intensive tasks across multiple server machines.The trade-off, of course, is that it can be more difficult and time consuming to correctly implement your computation for parallel processing.

If you decide to distribute a task across several computers, keep in mind that the network has a finite bandwidth that can be saturated by a few high-performance machines sending remote messages extensively. Design your application to take advantage of the facility in DO for moving objects from one machine to another. This can reduce the amount of remote messaging that might otherwise occur.

Conclusion

DO and PDO offer excellent tools for developing client/server applications. Their design also provides the flexibility to expand applications as NextStep and PDO become available on more platforms.

References

Andleigh, Prabhat and Michael Gretzinger. Distributed Object-Oriented Data-Systems Design. Englewood Cliffs, NJ: Prentice Hall, 1992. ISBN 0-13-174913-7.

Elmasri, Ramez and Shamkant B. Navathe. Fundamentals of Database Systems. Menlo Park, CA: Benjamin/Cummings, 1994. ISBN 0-8053-1748-1.

NeXT Computer Inc. NEXTSTEP 3.2 General Reference, Volume II. Reading, MA: Addison-Wesley, 1992. ISBN 0-201-62221-1.

NeXT Computer Inc. NEXTSTEP 3.2 Release Notes. NeXT Computer, 1993.

NeXT Computer Inc. Object-Oriented Programming and the Objective C Language. Reading, MA: Addison Wesley, 1993.

NeXT Computer Inc. Portable Distributed Objects 1.0 Release Notes. NeXT Computer, 1993.

Example 1: (a) Server code for an application that provides a stock-price data feed; (b) client code that connects to the stock-price server; (c) client code that accesses data from the server.

(a)  id myServer = [ [PriceServer alloc] init];
     id myConnection = [NXConnection
                            registerRoot: myServer
                            withName: "stockPriceServer"];
     // the following statement does not return
     [myConnection run];

(b)  id theServer = [NXConnection connectToName:"stockPriceServer"];

(c)  printf("IBM is currently at %d\n", [theServer priceFor:"IBM"]);

Example 2: Client code that gets an object from the server and sends messages to that object.

id myStock = [theServer stockFor:"IBM"];
struct tm today = gmtime();
printf("IBM is currently at %d\n", [myStock priceAtTime:today]);
printf("IBM's last dividend was %d\n", [myStock dividend]);

Example 3: Server code that uses multiple threads.

id myServer = [[PriceServer alloc] init];
id myConnection = [NXConnection
                         registerRoot: myServer
                         withName: "stockPriceServer"];

// the following line creates a new thread that waits
[myConnection runFromAppkit];

// Code to receive the data feed goes here
// and is executed in the original thread...