The origin and unexpected evolution of the word "mainframe" (righto.com)
160 points by todsacerdoti 3 months ago | 44 comments



Great article!

Another (mid-to-late) data point may be the original NeXT brochure [1], which claims the NeXT to be "a mainframe on two chips". It offers a definition along the lines of a throughput-oriented architecture with peripheral channels (in analogy to the NeXT's two DSP chips) and, in doing so, also marries the concepts of physical size and architecture (there's also some romanticism about uncompromised designs and "ruthless efficiency" involved):

> The NeXT Computer acknowledges that throughput is absolutely key to performance. For that reason, we chose not to use the architecture of any existing desktop computer. The desired performance could be found only in a computer of a different class: the mainframe.

> Having long shed any self-consciousness over such mundane matters as size and expense, mainframes easily dwarf desktop computers in the measure of throughput.

> This is accomplished by a different kind of architecture. Rather than require the attention of the main processor for every task, the mainframe has a legion of separate Input/Output processors, each with a direct channel to memory. It's a scheme that works with ruthless efficiency.

[1] https://lastin.dti.supsi.ch/VET/sys/NeXT/N1000/NeXTcube_Broc...


There's also the Intel iAPX 432 "micro-mainframe" processor (1981) on two chips. (This was supposed to be Intel's flagship processor, but it was a disaster and the 8086 took over instead. The NYT called it "one of the great disaster stories of modern computing".) I think that microprocessor manufacturers had mainframe envy in the 1980s.


Intel also had the 80860 (aka i860) that was hyped up as a "Cray on a chip".


> Based on my research, the earliest computer to use the term "main frame" was the IBM 701 computer (1952)

> This shows that by 1962, "main frame" had semantically shifted to a new word, "mainframe."

> IBM started using "mainframe" as a marketing term in the mid-1980s.

I must conclude it takes the competition 10 years to catch up to IBM, and IBM about 20 years to realize they have competition. Setting a countdown timer for IBM to launch an LLM in 2040.

Thanks for researching and writing this up. It's a brilliant read!


I can kind of see why this should have been. The 1401, which was really intended as a replacement for IBM's punchcard appliances, was widely known as IBM's small mainframe. On the other hand, there are the 701, things like the 7030 (Stretch), and then the ranges of the S/360 and S/370. Considering this inconceivably wide class of machines, stepping in and deciding what's a mainframe and what's not is a losing game from a marketing perspective. So better to keep silent and reap the fruits…


> IBM to launch an LLM in 2040.

What about Watson? How did it generate language? My understanding is that it output language well enough to match what LLMs do today.


Watson was a different flavor of NLP that was turned into a gimmick. The Jeopardy! show it was on was not exactly what it appeared to be.

According to someone who worked on it, it would fail a second grade level reading exam: https://www.nytimes.com/2021/07/16/technology/what-happened-...


My understanding is that it's not even close to the output and capability of modern LLMs.



Combined, proof that we are in the future.


Author here. Anyone have interesting mainframe stories?


A rumour from my mainframe days was that Digital Equipment hired lacemakers from France to show people how they did it. This was wiring up the core memory planes for the DEC-10 (I have one, a folded three-part card), which just barely squeezes into the mainframe class.

The guy who told me this was the Australian engineer sent over to help build the machine and bring it back for UQ. He parked on the quiet side of the Maynard factory, not realising why the other drivers avoided it. Then his car got caught in a snowdrift.

A prior engineer told me about the UUO wire-wrap feature on the instruction-set backplane: you were allowed to write your own higher-level ALU "macros" in the instruction space by wiring patches into this backplane. The DEC-10 had a five-element complex instruction model. Goodness knows what people did in there, but it had a BCD arithmetic model for the six-bit data (a 36-bit word, so six bytes of six bits in BCD mode).

A guy from La Trobe uni told me that on their Burroughs, you edited the kernel inside a permanently resident Emacs-like editor, which recompiled on exit and threw you back in on a bad compile. So it was "safe to run" once it decided your edits were legal.

We tore down our IBM 3030 before junking it, to use the room for a secondhand Cray-1. We kept so many of the water-cooled chip pads (6-inch-square aluminium-bonded grids of chips for the water-cooler pad, about 64 chips per pad) that the recycler reduced his bid price because of all the gold we hoarded back.

The Cray needed two regenerator units to convert Australian 220 V to 110 V for some things, and to 400 Hz for other bits (this higher-frequency AC was some trick they used for power distribution across the main CPU backplane), and we blew one up spectacularly by closing a breaker badly. I've never seen a field engineer leap back so fast. It turned out reusing the IBM raised floor for a Cray didn't save us money: we'd assumed the floor bed for liquid-cooled computers was the same; not so - Cray used a different bend radius for Fluorinert. The Fluorinert recycling tank was clear plastic; we named the Cray "yabby" and hung a plastic lobster in it. This tank literally had a float valve like a toilet cistern.

When the Cray was scrapped, one engineer kept the round tower "love seat" module as a wardrobe for a while. It's the only CPU cabinet I've ever seen that came from the factory with custom cushions.


I heard a story of Seymour Cray doing a demo of one of the machines and it turned out there was a bug in some hardware procedure. While the customers were at lunch, Seymour opened up the machine, redid the wire wrap and had the bug fixed when they returned. (Note that many details are likely inaccurate as this is a 35-year-old memory of a second-hand story.)


I once searched for the origin of the term "Central Processing Unit".

In his report, John von Neumann used the terms "central arithmetical part (CA)" and "central control part (CC)". He did not use any term for the combination of these two parts.

The first reference to "CPU" that I could find is the IBM 704 manual of operation from 1954, which says: "The central processing unit accomplishes all arithmetic and control functions." That is, it clearly defines the CPU as the combination of the two parts described by von Neumann.

In the IBM 704, the CPU was contained in a single cabinet, while in many earlier computers multiple cabinets were used just for what is now called the CPU. In the IBM 704, not only were the peripherals in separate cabinets, but the main memory (magnetic cores) was as well, so the CPU cabinet contained nothing else.

The term "processor" has appeared later at some IBM competitors, who used terms like "central processor" or "data processor" instead of the "central processing unit" used by IBM.

Burroughs might have used "processor" for the first time, in 1957, but I have not seen the original document. Besides Burroughs, "processor" was preferred by Honeywell and Univac.

The first use of "multiprocessing" and "multiprocessor" that I have seen was in 1961, e.g. in this definition by Burroughs: "Multiprocessing is defined here as the sharing of a common memory and all peripheral equipment by two or more processor units."

While "multi-tasking" was coined only in 1966-09 (after IBM PL/I had chosen in 1964-12 the name "task" for what others called "process"), previously the same concept was named "multiprogramming", which was already used in 1959, when describing IBM Stretch. ("multitasking" was an improved term, because you can have multiple tasks executing the same program, while "multiprogramming" incorrectly suggested that the existence of multiple programs is necessary)


You've done a lot of interesting historical research there! I wanted to get into a discussion of "central processing unit", but decided my article was long enough already :-) The term "central processing unit" is unusual since it is a seven-syllable term for a fundamental idea. Even "CPU" is a mouthful. I think that the "central" part is in opposition to systems such as ENIAC or the Harvard Mark I, where processing is spread throughout the system via accumulators that each perform addition. Centralizing the processing was an important innovation by von Neumann.


> Centralizing the processing was an important innovation by von Neumann

a.k.a. the "von Neumann bottleneck" that we're now trying to get rid of.


We will never get rid of the "von Neumann bottleneck", except for a relatively small number of niche applications.

The bottleneck consists in the fact that instead of having a huge number of specialized automata that perform everything that must be done to execute a useful application, you have just a universal automaton together with a big memory, where the universal automaton can perform anything when given an appropriate program.

The use of a shared automaton prevents many actions from being done concurrently, but it also provides a huge economy of logic circuits.

The "von Neumann bottleneck" is alleviated by implementing in a computer as many processor cores as possible at a given technological level, each with its own non-shared cache memory.

However, completely removing the concept of a programmable processor with separate memory would multiply the amount of logic circuitry too much for any imaginable technology.

The idea of mixing computational circuits with the memory cells is feasible only for restricted, well-defined applications, perhaps for something like ML inference, but not for general-purpose applications.


And of course back when everyone had desktop towers, “the CPU” was everything except the monitor and the peripherals.


My mom was a programmer back in the 1950s. First thing in the morning, she would run a memory test. If it failed, she would slide out the failing tray of memory (100 words by 36 bits, and tubes), hand it off to a tech to fix, slide in a fresh tray, and proceed.

She had one CPU she worked on where you could change its instruction set by moving some patch cords.


A tax agency unified all its data from different agencies via X.25 and satellite connections. However, the process was expensive and slower than expected because the files were uncompressed and stored as basic plain-text ASCII/EBCDIC files.

One obvious solution to this problem was to buy an Ethernet network device for the mainframe (which used Token Ring), but that was yet another very expensive IBM product. With that device, we could have simply compressed and uncompressed the files on any standard PC before transferring them to/from the mainframe.

Another obvious solution was to use C to compile a basic compression and decompression tool. However, C wasn’t available—buying it would have been expensive as well!

So, we developed the compression utility twice (for performance comparisons), using COBOL and REXX. These turned out to be two amusing projects, as we had to handle bits in COBOL, a language never intended for this purpose.
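Purely as illustration, here is a minimal sketch in Python (hypothetical; the original COBOL/REXX code isn't shown and the actual algorithm isn't stated) of the kind of byte-oriented run-length coding such a utility might implement:

    # Hypothetical sketch: simple (count, value) run-length coding,
    # the sort of byte twiddling the COBOL/REXX utilities had to do by hand.
    def rle_encode(data: bytes) -> bytes:
        out = bytearray()
        i = 0
        while i < len(data):
            run = 1
            while i + run < len(data) and data[i + run] == data[i] and run < 255:
                run += 1
            out += bytes([run, data[i]])   # emit a (count, value) pair
            i += run
        return bytes(out)

    def rle_decode(data: bytes) -> bytes:
        out = bytearray()
        for count, value in zip(data[::2], data[1::2]):
            out += bytes([value]) * count
        return bytes(out)

    assert rle_decode(rle_encode(b"AAAABBBCCD")) == b"AAAABBBCCD"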


The capture of all thought by IBM at these places was nuts.

Circa 2002, I'm a Unix admin at a government agency. Unix is a nascent platform there, previously only used for terminal services: mostly AIX and HP-UX, with some Digital stuff as well. I created a ruckus when I installed OpenSSH on a server (Telnet was standard). The IBM CE/spy ratted me out to the division director, who summoned me for an ass-chewing.

He turned out to be a good guy and listened to, and ultimately agreed with, my concerns. (He was surprised, as mainframe Telnet has encryption.) Except one: "Son, we don't use freeware around here. We'll buy an SSH solution for your team. Sit tight."

I figured they'd buy the SSH Communications software. Turned out we got IBMSSH, for the low price of $950/CPU for a shared-source license.

I go about getting the bits and install the software… and the CLI is very familiar. I grab the source tarball, and it turns out this product I'd never heard of was developed by replacing the word "Open" with "IBM", to the point that the man page had a sentence that read "IBM a connection".


> C wasn’t available—buying it would have been expensive as well!

On the subject of expensive mainframe software, I once got to do the spit take of "you are paying how much for a lousy FTP client? Per month!" I think it was around $500 per month.

Man, open-source software really has us spoiled.


The youngs have no idea. You used to have to pay for development tools. In high school, I would hand-assemble my 6502 assembly programs by writing them out longhand on graph paper, filling in the hex in the left columns, then typing the hex into the Apple monitor to get the program running. Heaven forbid there was anything wrong with the code, the hand-assembling, or the keyboarding. But I couldn't afford one of the paid assemblers (in retrospect, I should have written one in AppleSoft, but hindsight is 20/20, and I don't know that I was smart enough then to write an assembler anyway). Spending $500–1000 for a compiler in the 90s was typical. Now the kids whine about paying the relatively cheap fee for things like JetBrains IDEs.


Back around 1978 to 1982, I wrote and sold a 6800/6809 disassembler ("Dynamite Disassembler") for, I think, $200 a copy. Sold enough copies to pay for computer equipment during my college days. I think we reduced the price to $100 or maybe $150 when the TRS-80 Color Computer came out with a 6809 powering it.

$200 back in 1980 is about $800 today. Amazing to think anyone would spend that much for a fairly simple tool.


Turbo Pascal was revolutionary when they retailed it at USD 49.99 [1].

[1] https://en.wikipedia.org/wiki/Turbo_Pascal


Diskettes inside the book as I recall.

We were running Oregon Pascal on a PDP-11/44 (later upgraded to a VAX-11/780) that cost thousands. To have access to Pascal for $49 was too good to be true. I kept thinking it had to be deficient somehow, but it wasn't.

The paradigm shift was underway right in front of us.


My CS 101 class in 1989 was all in Pascal; code had to be entered via an IBM terminal and run as a batch job on our school mainframe. There was no interactive feedback, and you had to hike across campus to the basement of a building that housed an enormous chain printer to get the greenbar paper output of your run, to see if it 1) compiled and 2) output the right thing, which the autoscorer checked when you flagged your assignment as complete.

I was lucky in that I had a Tandy 1000SX in my dorm room and I had Turbo Pascal (bought using an educational discount at the school bookstore). A hidden feature of Turbo Pascal was that it supported multiple comment delimiters, including the ones used by IBM Pascal (the assignments were also graded on comment quality). I was able to do all my class work locally, using interactive debugging, and thanks to a guy I met while working at a local computer shop, who was the student IBM rep, I got a file uploader and the phone number of the hidden 2400-baud line it used, so I could upload my code directly, then dial into the interactive terminal number and submit it.

I sort of felt bad for all the other kids in the class for the write/submit/walk/debug loop they endured, but not really.


I think I paid $299 (one time fee) for a license for PKZIP in 1993 for a product I was building. Open source is pretty amazing.


> Another obvious solution was to use C to compile a basic compression and decompression tool. However, C wasn’t available—buying it would have been expensive as well!

I would have thought in the (IBM) mainframe world, PL/I (or PL/S) would have been the obvious choice.


I want to say "thank you" for the careful research examining multiple original sources from a variety of viewpoints.



Ken, surely frames significantly pre-date the computer? They were used in telephone exchanges. The Colossus reproduction at Bletchley is constructed from frames.


I discuss this a bit in footnote 1. Thousands of things were built on frames before computers, of course. However, IBM both built computers from semi-portable cabinets constructed around metal frames and referred to the computer's units as "frames", including the "main frame".

Telephone systems used main distribution frames, but these are unrelated. First, they look nothing like a computer mainframe, being a large rack with cable connections. Second, there isn't a path from the telephony systems to the computer mainframes; if the first mainframes were developed at Bell Labs, for instance, it would be plausible.

As for Colossus, it was built on 90-inch racks, called the J rack, K rack, S rack, C rack, and so forth; see https://www.codesandciphers.org.uk/lorenz/colossus.htm


AT&T had https://en.wikipedia.org/wiki/Number_Five_Crossbar_Switching... in 1947. It used relay-driven "markers" to set up a call that goes through various frames.

So it's entirely possible that somebody from the telephone industry decided to borrow a term of art from it for computing.


Could you kindly write a blog post regarding a recent development in GaN chargers? Many thanks.


Telephone switches introduced the idea of frames. They were the first systems with enough components to require modular organization. Western Electric tended to use 24-inch spacing.

19-inch racks seem to come from railroad interlocking and may have been introduced by Union Switch and Signal, founded by Westinghouse in 1881 and still around as a unit of Ansaldo.


Likewise bus bars, and hence data bus


Fascinating. I read Byte Magazine, among others, back in the late '70s and '80s, and I don't recall ever running across "mainframe" as a synonym for "CPU" or seeing it applied to mini- and micro-computers. I won't dispute the OP's citations from Byte, obviously, but I have to think it wasn't that common.


"I come from the Net - through systems, peoples, and cities - to this place: MAINFRAME."

The word is forever etched in my mind together with this quote.


The term goes back to a time when each component of a data processing system (CPU, storage devices, printers, card readers/punch-card machines/teletypes, and later monitors and communication devices) was housed in a separate structure or frame. The central or main component was the CPU, i.e., the main frame. Note: IBM was required per consent decree to unbundle, i.e., to sell these devices as separately priced products, not as an all-or-nothing system.


something... related?

https://youtu.be/QHnJ9NmK3Pc

(the mainframe song, uncertain of its background)


Loosely related to unexpected word origins, I didn't know until recently that when "gaslighting" someone, you're using an expression tied to literal gaslights from a 1944 movie. I'm likely late to the party on learning this origin.


That was Angela Lansbury's first movie, as well. It's a quite decent film.


Yes; I watched the movie "Gaslight" recently, and the meaning that everyone uses has drifted pretty far from the movie's meaning. But it's probably hopeless to resist a meaning when it becomes trendy.



