IF is not the same as if, the variable X is not x, etc. I argue they should not be. Consistency of coding style among practitioners is a useful and beautiful thing. If some programmers write
Switch, others write
switch, and still others write
SWITCH, then code begins to get messy and too individualized. We're not writing novels; we're writing code. Precision and consistency matter.
The standard story I tell in this case is the Mervin story. Back when I was a fledgling C hacker, Modula-2 had just come out. I wrote probably fewer than 50,000 lines of code in it and then gave it up as fun but ultimately useless. In evolving Pascal into Modula, Niklaus Wirth removed everything that was useful from Pascal, emphasized the annoying strictness of Pascal, and introduced more annoying useless things like mandatory upper-case keywords. But Mervin fell in love with Modula-2 so much that he started writing C code that included a suspicious header file
modula.h. And his C code stopped resembling C and started to look suspiciously like Modula. Sure enough, when we peeked into
modula.h we found dozens of lines of preprocessor macros that effectively introduced a new set of syntactic conventions on the language. Mervin's code was always harder to maintain because the maintainer had to get familiar with the unholy union of syntax that was "his" version of C. Hence the fundamental rule of macros: thou shalt not use macros to redefine the syntax of the language. No one liked Mervin much anyway.
The problem with case-sensitivity for programming is that there is no consistent philosophy. Some argue that all tokens should be case-sensitive. Others argue that keywords and identifiers should follow different case-sensitivity rules. Still others argue that there should be no case-sensitivity at all anywhere.
C is strictly case-sensitive. SQL is case-sensitive for identifiers but not keywords. Fortran historically does not distinguish the case of identifiers, but some dialects respect case in keywords; still, Fortran programmers prefer upper-case keywords just to be on the safe side. There is no rhyme or reason across the community about what proper capitalization should be.
Personally I think it would be a wonderful punishment to inflict on camel-case evangelists to make them all work in Fortran, where
thisFunc and
ThisFunc aren't different names. But I'm not sure allowing interchangeable case is overall a good thing. Keeping code human-readable is far more important than many in the industry seem to realize.
Add implicit declarations (i.e., you don't have to declare a variable before you use it) and you have a recipe for chaos. Rampant implicit declaration is already a recipe for chaos and should be eschewed. Fortran had implicit declaration, but only in a very limited way. If you didn't declare a variable, and its name started with one of the letters
I through
N (the first two letters of the word
integer and the most commonly used ephemeral subscript variables), it was implicitly declared as an integer; otherwise it was implicitly declared as a real number. Fortran is still a strongly- and statically-typed language, so there's a limit to how much of your leg gets blown off when you shoot yourself in the foot that way.
Implicit declaration coupled with dynamic weak typing in more modern languages is a recipe for unproductive programming. If in one case you type
Receiver and in another case you fumble-finger it and type
Reciever, implicit declaration and dynamic typing mean you'll spend forever figuring out why your program is broken: the interpreter will happily accept that symbol and give it whatever default value is appropriate for the type suggested by that context. A statically-typed language requires annoying explicit conversion at each necessary step, but at least the compiler helps you out of this frequent typographical jam by saying, "Um, I've never seen this symbol before in your program, and it's being used in a context that suggests I should have." Very helpful, and very respectful of the programmer's time.
Fortran got it right. It requires you to declare variables that have complex and significant meaning, while avoiding cluttering up the declarations with ephemeral crap whose meaning is clear in the context.
Of course Fortran is guilty of the infamous NASA do-loop bug. That was
DO 500 I=1. 100
C Some important stuff.
500 CONTINUE
The bug is hard to find. It's the punctuation in the first line. Because early Fortran didn't respect white space, the compiler interpreted it not as "perform the operations up until line 500 while iterating variable I from 1 to 100 inclusive," but rather as
DO500I=1.100
which means "implicitly declare the variable DO500I as a real number and assign it the value 1.1." Yikes. According to legend, code like that was lurking in human-rated launch pad routines.
I truly think that K&R were too lazy to add case-insensitivity to their original C parser. Any evidence for this? I ask not to rake you over the coals, but because this is an ongoing question in programming morphology. Since many modern languages take their syntax cues from C, understanding its historical roots is important to knowing what to keep and what to evolve.
The most common reason given for C's case-sensitivity is efficiency at compilation time. A compiler whose tokenizer has to sift through various spellings cannot be made as fast and correct as one that respects case. While that is less concerning to modern programmers, it was an issue for Kernighan and Ritchie, who understandably wanted to minimize compile-run-evaluate cycle time when compilation took many seconds.
But I find no documentary evidence for that; it's simply a reasonable conclusion based on known circumstances. There doesn't seem to be any reason to suppose that's what K and R were thinking.
More likely C is case-sensitive because it didn't occur to its authors to make it anything else. When I learned Fortran it didn't occur to me to write in anything other than upper case because that was the only case available. And so when lower case was introduced, it was for string data only. No one really considered whether we should be writing the language itself in lower case.
And object-oriented programming is very useful. But it can also be the latest incarnation of spaghetti code, if you're not careful. A disease that afflicts a great deal of Java code being written and passed around today. A class hierarchy more than about 4 levels deep is too deep to be kept meaningfully in the programmer's mind, and hopelessly opaque to a newcomer. Java's object model, and its requirement that the object model appear in every program, leads to code that could have been better organized.
Comments in the code are your best friends! That they are, but not as a substitute for clearly-written code.
Maybe for some people
$0 =~ s!.*/!!; is instantly synonymous with "strip the leading directory path elements from the as-invoked command name," but not to me. (At least I
think that's what that does.) If someone wants to get all concise in my face, I can respond with a stream of APL that, in four symbols, can solve a matrix, match an array of regular expressions, change the oil in your car, and give you sound financial advice. Of course finding a keyboard to type it on is the real trick these days, but the point remains that being able to express complexity in fewer and fewer syllables is not always the sustainable thing to do.
There are big differences between computer programming, computer science, and software engineering. The latter requires being profitable and sustainable, not just clever. And so when one is in the business of software engineering, one must write clear code that does what is expected of it, in the language best suited for the task.