PROGRAMMING PARADIGMS

Natural Language

Michael Swaine

I got dem I.O. blues. That's I.O., not I/O. I/O is input/output and Io is a satellite and "I owe" is motivation for working, but I.O. is an ever-more-common syndrome in this Age of Information: Information Overload. I suffer from it; maybe you do, too.

If you do, you know that we, the afflicted, get little sympathy, since everybody thinks that we do this to ourselves. And maybe we do.

Info Gets Routed

Still, a lot of us who never went to either library school or trucking school are wondering why we spend so much of our time moving information from one location to another, figuring out where to stash the latest load of information, tracking down information that got lost in the stacks or in the information warehouse or out on the information highway. I'm sure that as we build more capable agents to move our information down that information highway they'll just get lost or stoned, make unscheduled stops, pick up hitchhikers, unionize, and strike.

I had somehow, naively, imagined that the Age of Information would bring about a different kind of work, a more intellectual labor; that soldiers would turn into video-game players, longshoremen into poets, and ditchdiggers into satellite engineers. Instead, it seems that the heavy lifting has just rolled onto our LANs and floated into our wetware. It's still manual labor: manual labor in the head.

And not all in the head, either. Let me tell you about my magazines.

I subscribe to, at last count, umpteen magazines, and the shelves on which they reside fill the long wall of my office, floor to ceiling, plus one wall of the spare room. Prior to their entombment, they hang on the 50 bars of a large magazine rack in the living room, the overflow spilling onto end tables and, occasionally, the floor, running ahead of my ability to keep up, like corpses in the plague years.

But I read them all.

Understand, I'm a professional wordsmith and feel free to use the word "read" in all its nuances simultaneously, from the somewhat superficial skim that I give to that Software License Agreement just before I rip open the package, to the impressive thoroughness that Zelda, our 11-week-old Labrador retriever, brings to her scrutiny of any periodical that happens to spill off that end table.

That said, I repeat: I read them all, or at least all that Zelda doesn't get to first. That's why I feel justified in occasionally opening the locks and letting some of this flood of information run off before it soaks into the water table of my library stacks, if you'll forgive a sloppy metaphor that may become more literal than I'd like if I don't get the office roof patched.

But I digress. This month, then, the locks are open.

Connections Get Mooted

I've learned to watch for the word "amid" in stories in the Wall Street Journal.

It's one of those weasel words that fake profundity by allowing the writer to seem to be saying more than he or she is actually prepared to say. The Wall Street Journal doesn't have a lock on the use of "amid," but the word does seem to crop up awfully often in economic reporting, for what you may agree, when you see what I'm driving at, are obvious reasons. The context is typically something like this:

Stocks tumbled as skittish investors bailed out of utility and financial stocks amid fears that interest rates are headed higher. [Wall Street Journal, November 4, 1993]

Notice that the sentence does not explicitly state any connection between the fall of these particular stocks and the fears of unspecified persons regarding interest rates, except that they occupied more or less the same time frame; that is, they coincided. The strong implication, though, is that there is a causal connection; otherwise what's the point of drawing attention to the coincidence?

But you can be sure that the writer has no credible evidence for a causal connection; otherwise, why not state it? "Amid" seems almost always to be a signal that the writer is about to indulge in guesswork. The guesswork may be eminently plausible, but that only makes it that much easier for readers to overlook the fact that it's just guesswork.

My advice is: Watch out for those "amid"s.

What in the ever-lovin' blue-eyed world, you inquire, does any of this have to do with the price of debuggers?

Trust me, I reassure, wading deeper into it.

There are more legitimate ways for writers to suggest a connection between ideas without actually stating the connection. One is juxtaposition: placing the ideas next to each other and letting the reader figure out the connection, if any.

Harper's magazine does this extremely well in its "Harper's Index" feature. In case you haven't seen the feature or any of the half-executed imitations of it, it consists of a list of factoids, like this:

Chances that an unemployed European has not worked in more than a year: 1 in 2.

Chances that an unemployed American has not worked in more than a year: 1 in 9. [Harper's, November 1993]

Harper's editors construct this list carefully so that there are connections between adjacent factoids, but they leave the discovery of the connection--causal, contrastive, ironic--to the reader.

I like "Harper's Index," and it reminds me of why I like the semicolon. I may write an article about the semicolon some day; if I do, it will sound something like this:

There are basically three ways to deal with relationships between ideas in running prose (as opposed to formal structures like "Harper's Index" that convey information in their structure). You can make the relationship explicit, for example, by using conjunctive adverbs like "therefore" or "nevertheless." Or you can leave it for the reader to discover the existence and nature of the relationship, by simply putting each idea in its own sentence. Or you can use a semicolon.

The first two choices, in my opinion, encourage passive reading. In one case the reader is given the relationship; in the other the relationship can easily be overlooked. Only the semicolon has the virtue of making the existence of a relationship between two ideas explicit without hinting what that relationship is. It signals plainly that there's something left unstated. It invites the reader to examine the connection between two ideas. The semicolon engages the reader; it makes prose more interactive.

Gabriel's Horn Gets Tooted

By jing, you persist, this writin' stuff ain't got jellybeans to do with programmin'.

Well dog my cats, I remonstrate, if Dick Gabriel can get away with it, why can't I, huh, once in a while? And get away with it Gabriel did, in the October 1993 installment of his "Critic-at-Large" column in Journal of Object-Oriented Programming. Not only that, he justified it.

Gabriel had just returned from a week-long nature poetry workshop at which he was intrigued to hear poet Gary Snyder exhort would-be poets to "get the science right."

Gabriel has long encouraged scientists and engineers to "get the writing right." Most computer scientists, he complains, are persistent dilettantes. Despite the fact that they spend a quarter to half their professional careers writing, they do not approach it professionally, or seriously, and as a result do not communicate their ideas effectively.

Gabriel gives his list of tips on how to improve your writing, ending with one that his readers may find surprising: Start a writing workshop. Professional writers know that writing workshops are the fastest way possible to get a lot of useful feedback on your writing. They are exceptionally useful to most beginning writers (and more writers than would like to admit it are beginners).

A writer's workshop consists of a few to a couple dozen writers sitting in a circle criticizing each other's writing. There are few rules, but the few are important. Gabriel presents one set of rules, but there are others. Science-fiction writers seem to be especially good at workshops; what is called the "Clarion" model is excellent. In the Clarion model, each participant reads and critiques each other participant's work, while the victim remains silent until all criticisms are heard.

Gabriel suggests that computer scientists get together with colleagues in related fields to hold workshops, critiquing papers before submitting them to conferences. Then this year, he implies, there may not be, as there was last year, a 91 percent rejection rate on OOPSLA submissions.

I think this is a very good idea. Is anybody out there doing it? Let me know.

Plauger's Language Gets Booted

On a higher linguistic plane, P.J. Plauger drew criticism from a computational linguist in the November 1993 issue of C User's Journal.

Plauger had published an article on natural-language processing in the April 1993 issue of the magazine, and an interesting piece it was. Reader M. Boot, a computational linguist by profession, took Plauger to task for the simplistic level of the piece. His criticisms were, roughly:

  1. The author uses the terminology of computational linguistics in the article, but the associated code doesn't live up to the language of the article.
  2. The techniques demonstrated are 19 years out of date.
  3. This is adventure-game linguistics.

Maybe Plauger should read Computational Models of American Speech by M. Margaret Withgott and Francine R. Chen (University of Chicago Press, 1993), which Jon Erickson reviewed in this journal in October 1993.

Maybe a lot of us should.

Boot's beefs don't mean that the C User's Journal piece wasn't interesting and useful. In fact, Plauger, no dummy, found it interesting; he claims that his readers did; and I admit that I did, too. So I, for one, am happy to judge it a good article for its intended audience, but what about that audience? Are we all ignorant?

In this one area, yeah, I suspect that we are. This is only a guess (maybe I should work in an "amid"), but I suspect that the distance between academic and commercial work in computational linguistics is greater than the corresponding gap in a lot of other areas of computer science.

If true, doesn't that suggest an opportunity? Isn't it possible that computational linguistics could be a fruitful area for the kitchen-table software entrepreneur?

Granted, if the distance between academic and commercial work in computational linguistics is greater than the corresponding gap in a lot of other areas of computer science, it may be because computational linguistics is a lot harder than a lot of other areas of computer science. But Fermat's Last Theorem was hard, and its cracking last year just demonstrates that hard problems can often be broken down into smaller, more manageable problems.

Maybe there are small advances to be made in computational linguistics that are open to the kitchen-table programmer. And, not to be overlooked, maybe these advances could become successful commercial products. Many natural-language applications do not require a complete model of the English language.
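To see why a complete model of English isn't always necessary, consider keyword spotting, which is roughly the "adventure-game linguistics" Boot derides, yet is enough for plenty of command-and-control applications. Here is a toy sketch (the function and examples are mine, not from Plauger's article):

```c
/* Toy keyword-spotting "parser": a command is recognized if
   every keyword appears somewhere in the input. No grammar,
   no complete model of English -- and naive about word
   boundaries ("on" matches inside "front") -- but adequate
   for small command vocabularies. */
#include <stdio.h>
#include <string.h>

/* Return 1 if every keyword occurs as a substring of input,
   0 otherwise. */
int matches(const char *input, const char *keywords[], int nkeys)
{
    int i;
    for (i = 0; i < nkeys; i++)
        if (strstr(input, keywords[i]) == NULL)
            return 0;
    return 1;
}
```

With keywords "light" and "on", this accepts both "turn the light on" and "switch on the hall light" while rejecting "open the door"; word order, articles, and everything else in the sentence are simply ignored.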

Computational linguistics is an area of interest to me, but I'm sure M. Boot would judge me to also be 19 years out of date. If any DDJ readers are doing interesting work in this area, and are willing to talk about it, I'd love to hear from you.

Negroponte Gets Hooted

In the November 1993 issue of New Media, editor-in-chief David Bunnell ridiculed the idea that there is a convergence happening in the area of multimedia, and passed along the intelligence that the word "convergence" was invented by Nicholas Negroponte as a marketing gimmick for his MIT Media Lab.

Did Southern Pacific Railroad and U.S. Rubber merge to create the auto industry, he asked, or G.E. team up with the Royal Shakespeare Company to launch the movie industry?

Historically, new industries are created and dominated by new companies, and Bunnell predicted that the multimedia heroes will be new companies, still in the garage today.

Ah, you say, but the new industry of multimedia depends on content, and the big companies are buying up all the content. But Bunnell also questions the notion of repurposing existing content.

Nicholas Negroponte had his own say on the issue of repurposing in the November 1993 issue of Wired. At least that's what I think he was talking about. I honestly believe that Negroponte consciously tries to write like Marshall McLuhan. I'd better let him speak for himself:

Modern multimedia must include the automatic transcoding from one medium into another, or the translation of a single representation into many media. Books that read themselves when you are dozing off, or movies that explain themselves with text are good examples.

I don't know about you, but I'm encouraged. I've been writing this column all along so that it would read itself if you fell asleep.

Issues Get Disputed

Also in that November issue of Wired, which is the first monthly issue, are an interview with Alvin Toffler (touching on such Toffleresque predictions as the breakup of China, a Constitutional crisis in the USA, a global revolt of the rich, and niche wars with personal nukes) and a whole slew of what Wired calls idées fortes and any other magazine would call "viewpoints."

One of these idées mused on the issue of the viability of copyright out on the information highway. Another was billed as being about digital archaeology, and darned if that wasn't a fair description. Can we assume that we are leaving a readable record behind as we generate all this electronic data? Anyone who can read German can read the first book ever printed, but I can't read my Osborne 1 disks. What will information archaeologists of the future make of our era, and on the basis of what data?

I cite these idées as evidence that discussion of the social implications of technological change is alive in computer publications. But Wired is a special case, and not actually written by or for the agents of that change.

There are magazines that are written by and for, et cetera. This one, for example. And I observe with pride that the issues of several programmer's magazines that I have before me do indeed touch on these social issues.

Here's the October/November issue of PC Techniques, in which editor-in-chief Jeff Duntemann debates encryption legislation, drug policy, and crime control with a reader. Here's November's Windows Tech Journal, in which Zack Urlocker talks about copyright law. And as we know, Jon's editorials often delve into the social consequences of technological change and of governmental reaction (or lack thereof) to that change.

Two thoughts about this: 1. It's important, because ignorance is power, placed in the hands of others. What you don't know can hurt you and what others don't understand can, too. 2. The best such discussions tend to be among the most technically knowledgeable. It's encouraging that the technical community is thinking about these things, and it's a laugh in the face of the common view that engineers and technologists don't consider the consequences of their work.

Which brings us back to writing, since ideas poorly expressed are not well understood. And it brings a chance for writer/editor/programmer P.J. Plauger to redeem himself.

In his "State of the Art" column in the November 1993 issue of Embedded Systems Programming, Plauger talks about the "other" interfaces of embedded systems. Most products are designed to be easy for the daily user, he says. But there are also the rare reconfiguration uses that may crop up monthly or yearly, and these typically sport interfaces and documentation that are all but unusable to anyone but a trained technician. Bad. As he puts it:

Any interface you provide that gets only occasional use had better do lots of prompting. Favor menu-style choices over open-ended command sets that must be memorized or looked up in a manual. Provide at least brief hints about what each option actually means.

In other words, consider your audience. The ultimate practical advice for writers and software developers.
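Plauger's prescription for occasional-use interfaces can be sketched in a few lines of C. Everything here, the option names and hints included, is hypothetical and mine, not from his column; the point is simply the shape he recommends: menu-style choices, each with a brief hint, so nothing must be memorized or looked up in a manual.

```c
/* Sketch of a prompting, menu-style interface for a rarely
   used reconfiguration mode. All option names and hints are
   invented for illustration. */
#include <stdio.h>

struct option {
    char key;           /* keypress that selects the option   */
    const char *label;  /* what the choice is                 */
    const char *hint;   /* brief hint about what it means     */
};

static const struct option config_menu[] = {
    { 'b', "Baud rate", "Speed of the serial link (e.g. 9600)" },
    { 'p', "Parity",    "Error-check bit: none, even, or odd"  },
    { 'q', "Quit",      "Leave configuration mode"             },
};

#define NOPTIONS (sizeof config_menu / sizeof config_menu[0])

/* Print every choice with its hint, so the occasional user
   never has to remember a command set. */
void show_menu(FILE *out)
{
    size_t i;
    fprintf(out, "Configuration:\n");
    for (i = 0; i < NOPTIONS; i++)
        fprintf(out, "  %c) %-10s - %s\n",
                config_menu[i].key,
                config_menu[i].label,
                config_menu[i].hint);
    fprintf(out, "Choose an option: ");
}

/* Map a keypress to a menu index, or -1 if it isn't offered. */
int lookup_option(char key)
{
    size_t i;
    for (i = 0; i < NOPTIONS; i++)
        if (config_menu[i].key == key)
            return (int)i;
    return -1;
}
```

The daily-use interface of the same product might accept terse commands; the rarely visited configuration mode, by contrast, re-teaches itself on every entry.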


Copyright © 1994, Dr. Dobb's Journal