STRUCTURED PROGRAMMING

Action at a Distance

I've sketched this out in Figure 1. Your application is Here, and the actual database-manager code is There. The gap between them is crossed by some connection or another; as I said before, it doesn't matter what the nature of that connection actually is. SQL commands are issued by your application, cross the gap, and are interpreted by the database manager's SQL-command interface. These commands typically set up a database query of some sort, which the database manager executes. The results of the query, usually rows from a table, are sent back over the gap to your application.

Keep in mind that SQL is an API. It's an interface language, not something that compiles to independent native code files. The database manager simply has a module that interprets SQL statements as they come in and responds appropriately. The database manager often has its own proprietary command interface, or it may support still other command interfaces like Xbase.

Little Action Here, Big Action There

That's how most people envision client-server database management operating, and that's the way it has usually been done, up until now. The client application has generally been some simple user-interface front end without a great deal of power or intelligence; hardly more than a smart terminal.

You can take it further than that. This past November, Microsoft introduced Access, its first home-grown database manager, at Comdex/Fall. It has an interesting feature: In addition to being a fully relational database manager with its own internal command language (a dialect of Basic), it can also act as a SQL front end. I've drawn this one out in Figure 2.

Access can send SQL commands to a remote database manager on a big system, and bring home a subset of the big system's database as one or more query results. The Access user can then work with the local tables (retrieved through SQL commands) by using Access's own language and macros.

This all resolves to the opportunity to do all pertinent data management on the desktop, while leaving the big system to do whatever data management must be done on all company data. If a company has local offices in 14 states, each state office can have its own 486 with a copy of Access to do its local data management at home, close to the people who do the work and need the results. The big machine back at CHQ still contains all the data from all local offices, and can do company-wide queries and reports as needed. Someone (I forget who) coined the term "rightsizing" to cover systems like this.

(I'm not implying, by the way, that Microsoft Access is the first or the only database manager to be able to do this. It's only the first in my experience. There are probably others, and there will be more in the near future.)

Learning SQL

I think it's a good idea to learn SQL even if you aren't in a shop that has to bridge the gap between the big systems and the important systems. The gap between client and server can be as small as the gap across a function call. In such a situation, using a SQL database manager is pretty much the same thing as linking in a database library like the Paradox Engine or B-Tree Filer. That's what I've been doing, and the process has been delightful.

I've been using a product called Ocelot2 (whose full name is Ocelot2--the SQL!, including the exclamation point) from Ocelot Computer Services in Edmonton, Alberta. The same box gives you a Windows DLL and a DOS linkable library. It's not what I call cheap ($700.00), but SQL products are generally expensive, and it's less expensive than many I've seen.

One good thing about SQL's being a strong standard is that numerous books have been written on it, nearly all of which are better than Ocelot's somewhat lame documentation. Any sizable bookstore will offer you a number of books on SQL, and most are at least readable. (One to avoid at all costs, however, is SQL Structured Query Language, by Dr. Carolyn Hirsch and Dr. Jack L. Hirsch, published by Windcrest/McGraw-Hill. This has the dubious honor of being one of the worst computer books I've read, and cements my conviction that one should never buy computer books by people who insist on putting "Dr." in front of their names--especially when neither doctorate has anything to do with computer science.) I learned SQL by skimming a few books and then just hacking around interactively with the Ocelot2 SQL back end, through a simple "SQL terminal" application provided with the Ocelot2 product. The terminal simply allows you to type in a SQL command and then transfer it to the back-end database manager. Any responses from the database manager are displayed for examination.

It was a lot of fun. The Ocelot2 product is solid and fast, and I do recommend it. The documentation should be rewritten and reprinted by the time you read this.

The Structure of a SQL Command

SQL commands have a relatively simple underlying structure that is awesomely cluttered with qualifiers. Its data-management power is in the qualifiers--but its advantage for learning is that you can shovel away the qualifiers and see the bones of the language in the form of simple (if not necessarily useful) commands.

SQL commands typically begin with a SQL reserved word, followed by a string of qualifiers, and terminated by a semicolon. Unlike Pascal, SQL's semicolons are terminators, not separators. Each statement must be terminated by a semicolon, regardless of that statement's position in a sequence. Some statements may have other statements embedded within them--hence the "structured" in Structured Query Language. The language is not case sensitive, but standard practice is to place all SQL identifiers and reserved words in upper case.

Listing One (page 126) is a series of SQL CREATE commands that I used to create a database. If you tuned in last month, you'll recall a three-table database of contact names, locations, and phone numbers shown in last month's Figure 3. Listing One is the SQL code it took to create that database through Ocelot2.

The first statement creates the database, which in SQL is a named umbrella covering all of the database's diverse components. This umbrella is called a catalog, and it contains information summarizing the current state of the tables and indexes comprising the database. The CREATE TABLESPACE and CREATE INDEXSPACE commands direct SQL to store the database's various tables in a single file called CONTABLE.TBL, and all its indexes in a single file called CONINDEX.IND. This reduces file clutter somewhat, though it may also reduce performance with larger files.

The CREATE TABLE statements define the individual tables and their component fields. If a table has a primary key, the primary-key field is marked PRIMARY KEY. If a table contains a foreign key (that is, another table's primary key) to link it to its parent table, that foreign key is marked by the REFERENCES qualifier, followed by the name of the parent table. The REFERENCES qualifier assumes that the name of the primary-key field in the parent table is the same as the name of the foreign-key field in the child table. That is, if a field ConID references the table ConBase, the field it references in ConBase must also be named ConID.

Most of the field definitions should be self-explanatory. Mostly what they do is name a field, give it a type, and then specify how large it is. NOT NULL means that SQL must disallow a record update that leaves a NOT NULL field empty.

I created an index for ConBase to speed queries, but indexes are optional and the database will work well (if slowly) without them.
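To make the CREATE TABLE qualifiers concrete without Listing One in front of you, here is a minimal sketch in Python. It uses the standard library's sqlite3 module as a stand-in back end; it is not Ocelot2, and the SQL dialect differs in places. ConBase and the fields ConID, LName, FName, Specialty, and Tag come from the text; the ConPhone table, its PhoneID and Phone fields, the index name, and all the field sizes are assumptions made up for illustration. Note also that SQLite resolves a bare REFERENCES ConBase to the parent's primary key rather than matching field names.

```python
import sqlite3

# SQLite (in memory) stands in for the SQL back end; this is not Ocelot2.
conn = sqlite3.connect(":memory:")
# SQLite only enforces REFERENCES when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# ConPhone, PhoneID, Phone, ConBaseByName, and the sizes are hypothetical.
conn.executescript("""
CREATE TABLE ConBase (
    ConID     INTEGER PRIMARY KEY,
    LName     CHAR(20) NOT NULL,
    FName     CHAR(20),
    Specialty CHAR(20),
    Tag       CHAR(1)
);
CREATE TABLE ConPhone (
    PhoneID INTEGER PRIMARY KEY,
    ConID   INTEGER NOT NULL REFERENCES ConBase,
    Phone   CHAR(20)
);
CREATE INDEX ConBaseByName ON ConBase (LName);
""")


def try_insert(sql, params):
    """Return True if the row was accepted, False if a constraint rejected it."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(sql, params)
        return True
    except sqlite3.IntegrityError:
        return False


# A complete parent row is accepted.
ok_parent = try_insert("INSERT INTO ConBase (ConID, LName) VALUES (?, ?)", (1, "Smith"))
# A row that leaves the NOT NULL field empty is refused.
null_name = try_insert("INSERT INTO ConBase (ConID, LName) VALUES (?, ?)", (2, None))
# A child row pointing at an existing parent is accepted.
ok_child = try_insert("INSERT INTO ConPhone (ConID, Phone) VALUES (?, ?)", (1, "555-0100"))
# A child row whose foreign key matches no parent is refused.
orphan = try_insert("INSERT INTO ConPhone (ConID, Phone) VALUES (?, ?)", (99, "555-0199"))
```

The two refused inserts show the qualifiers doing their job: the back end, not the application, enforces NOT NULL and the parent-child link.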

Asking Questions

Ocelot2 has a nice feature that is evidently nonstandard SQL: It can import a properly structured ASCII comma-delimited text file into a SQL table. This allowed me to suck in a file I had exported from Paradox 3.5 as ASCII, and immediately begin work with a 500-record database. That sure beat typing in beaucoup lines of demo data!
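If your back end lacks an import feature of this kind, the same job can be done from the client side. The sketch below is one way to do it in Python with the standard csv and sqlite3 modules; SQLite again stands in for the SQL back end, and the two sample rows and the field layout are invented for illustration rather than taken from the Paradox export.

```python
import csv
import io
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in back end, not Ocelot2
conn.execute(
    "CREATE TABLE ConBase ("
    "ConID INTEGER PRIMARY KEY, LName TEXT NOT NULL, "
    "FName TEXT, Specialty TEXT, Tag TEXT)"
)

# Made-up sample of a comma-delimited ASCII export; a real program
# would open the exported file instead of this in-memory string.
exported = io.StringIO(
    '1,"Smith","Pat","Editor","A"\n'
    '2,"Jones","Lee","Writer","B"\n'
)

# csv.reader yields every field as text, so convert the key explicitly.
rows = [(int(r[0]), *r[1:]) for r in csv.reader(exported)]

with conn:  # one transaction for the whole import
    conn.executemany("INSERT INTO ConBase VALUES (?, ?, ?, ?, ?)", rows)

count = conn.execute("SELECT COUNT(*) FROM ConBase").fetchone()[0]
```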

But once you've created a database and gotten data into it somehow, the interesting stuff becomes possible. The SELECT statement is the most-used one in all SQL, and through it you create subset tables from existing tables, according to the qualifiers you place after the SELECT reserved word.

SELECT statements read easily until they get heavily nested. Here's a simple one that selects all records from ConBase with the string "Editor" in the Specialty field:

  SELECT * FROM ConBase
    WHERE Specialty='Editor';

The SELECT * clause means "select all fields." You could also have written SELECT FName, LName and gotten only the first-name and last-name fields.

A host of logical qualifiers is available so that you can pin things down any way you want. You could get very choosy, like this:

  SELECT * FROM ConBase
    WHERE (Specialty='Editor' OR
           Specialty='Writer')
    AND (Tag='A');
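To watch these queries actually run, here is a small sketch in Python that feeds them to the standard library's sqlite3 module. SQLite is only a stand-in for the back end, the four sample rows are invented, and the ORDER BY clause is an addition of mine to make the row order predictable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in back end, not Ocelot2
conn.execute(
    "CREATE TABLE ConBase ("
    "ConID INTEGER PRIMARY KEY, LName TEXT, FName TEXT, "
    "Specialty TEXT, Tag TEXT)"
)
# Invented demo data.
conn.executemany(
    "INSERT INTO ConBase VALUES (?, ?, ?, ?, ?)",
    [
        (1, "Smith", "Pat", "Editor", "A"),
        (2, "Jones", "Lee", "Writer", "B"),
        (3, "Brown", "Kim", "Editor", "B"),
        (4, "Green", "Sam", "Artist", "A"),
    ],
)

# All fields of every editor.
editors = conn.execute(
    "SELECT * FROM ConBase WHERE Specialty='Editor' ORDER BY ConID"
).fetchall()

# Only the first-name and last-name fields.
names = conn.execute(
    "SELECT FName, LName FROM ConBase WHERE Specialty='Editor' ORDER BY ConID"
).fetchall()

# The choosier query: editors or writers, but only those tagged 'A'.
choosy = conn.execute(
    "SELECT * FROM ConBase "
    "WHERE (Specialty='Editor' OR Specialty='Writer') AND (Tag='A') "
    "ORDER BY ConID"
).fetchall()
```

With this data, the first two queries return two rows each, and the choosy one narrows the result to the single editor tagged 'A'.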

Bringing Back the Bacon

Once you've successfully executed a query in the server, you've got to get the results table home somehow. When the server executes a query, it retains the result table internally. The server does not automatically squirt the whole result table back over the link to the client. The client has to ask for it and fetch it back over the link, one row at a time.

A SQL cursor is an invisible pointer to one row of a table. When a table is created and a cursor is defined for it, the cursor initially points to the first row in the table. You can use the FETCH command to position the cursor to some row in the table and then retrieve that row, one field at a time. FETCH can position the cursor to the next or the prior row, to the first or last row in the table, or to a row specified by row number -- either the absolute row number or some number relative to the current cursor position.

EXEC SQL FETCH NEXT MyCursor INTO :ln :fn :tg :cl :sp :tx;

Here, the named cursor MyCursor is moved to the next row in the results table, and brings back the row into six host variables named ln, fn, tg, cl, sp, and tx. The host variables are separated by their prefixed colons.
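The same row-at-a-time pattern shows up in call-level interfaces that skip embedded SQL entirely. As a rough analogue, and not a rendering of Ocelot2's mechanism, here is a sketch using Python's sqlite3 module. Its cursor object only moves forward, so fetchone() plays the part of FETCH NEXT, and ordinary Python variables play the part of the host variables. The table contents are invented.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in back end, not Ocelot2
conn.execute("CREATE TABLE ConBase (LName TEXT, FName TEXT, Tag TEXT)")
conn.executemany(
    "INSERT INTO ConBase VALUES (?, ?, ?)",
    [("Smith", "Pat", "A"), ("Brown", "Kim", "B"), ("Jones", "Lee", "B")],
)

# Executing the query hands back a cursor over the result table;
# no rows are copied into the program until it asks for them.
my_cursor = conn.execute("SELECT LName, FName, Tag FROM ConBase ORDER BY LName")

fetched = []
while True:
    row = my_cursor.fetchone()  # advance to the next row and bring it back
    if row is None:             # no rows left
        break
    ln, fn, tg = row            # unpack into the "host variables"
    fetched.append(f"{fn} {ln} [{tg}]")
```

The loop ends when fetchone() returns None, which is how this interface reports that the cursor has run off the end of the result table.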

FETCH is only available as an embedded command; that is, a user cannot interactively type a FETCH command from a terminal. This is the reason for the EXEC SQL immediately before FETCH. EXEC SQL indicates that what follows is a SQL statement and not just another host-language (Pascal, Basic, Cobol, and the like) statement. Combining SQL statements and host-language statements is still an ugly business and could be made a lot easier (as I'll emphasize a little later) by the host-language compiler vendors.

The Boundaries of Standardization

There's a whole lot more to SQL than I've shown on this quick tour. There are additional reserved words for updating and deleting databases, for restructuring them, and for handling issues like concurrent access and security.

On the whole, the SQL standard is remarkably strong, stronger than Xbase (which has lots of loose ends) with only a few, probably necessary exceptions. Embedded SQL (at least through Ocelot2) requires that the source file be precompiled by a host-language specific SQL precompiler, which takes standard SQL statements and translates them into host-language statements that implement the SQL statements in the host language. Other SQL vendors may handle embedded SQL in a different fashion; the standard says little or nothing about the process of creating an embedded SQL application.

And because SQL is platform independent, there are going to be some bumps when you finally have to hook the logical to the physical somehow and store SQL databases in DOS files. Ocelot2's mechanisms make sense to me, but they are not identical to those used by other SQL vendors.

Statements Held in Common

If you're working on a data-management application in Pascal, you're wasting serious time trying to implement the data manager proper from scratch. You might as well smelt your own steel and hammer out a replacement fender in a blacksmith's forge. Auto parts are widely available and don't cost that much, and neither do the multitude of database engines and development tools of various kinds.

The truly ugly question has been turning up more and more often: Do I need to work in Pascal at all? The current generation of high-end database managers is lightning-fast and contains everything you need to create a whole application, and every time I've done it, I've accomplished in days what in Pascal would have taken weeks. I know that a lot of people I've spoken with have been forced by productivity squeezes to abandon traditional languages entirely and work in "database languages" instead.

Over time this could hurt the Pascal industry. One way out has become pretty obvious to me: Borland (and all Pascal language vendors) should think very hard about agreeing on an embedded SQL standard, and incorporating that standard into the language, just as x86 assembly language has now been incorporated into Turbo Pascal through BASM. The data-manager executable itself doesn't necessarily have to be included with the Pascal product, but the full embedded SQL syntax should be understood by the Pascal compiler, and the means by which SQL statements may be passed to the data manager should be standardized as well. Then the programmer could choose from among many third-party, back-end SQL data managers, all of which would respond to identical SQL commands generated from within a Pascal application.

Like it or not, we're going to have to start shaving the time it takes to produce useful things in traditional languages, and one way to do this is to start using standardized parts. SQL is one such standardized part. It should be a lot easier to make use of the SQL standard from Pascal. The SQL people have done pretty much what they can. The ball is now in the Pascal vendors' (and specifically Borland's) court.

Products Mentioned

Ocelot2--the SQL! Ocelot Computer Services Inc. Suite 1104, Royal Trust Tower Edmonton Center Edmonton, AB T5J 2Z2 Canada 403-421-4187 $695.00