Japanese writing system basics

viraptor · on Aug 20, 2016

This was an interesting symbol to choose for an internet explanation. It took me a while to realise that the rectangle is actually the symbol that I'm supposed to see, rather than a missing glyph.

cynix · on Aug 20, 2016

It's not exactly a rectangle though, the two downward strokes extend slightly past the bottom horizontal stroke.

kazinator · on Aug 20, 2016

In handwriting, or calligraphy, it might be rendered like this, in fact:

       2
      --------+
  1 \        /
     \ ---- /
        3

The three strokes not all connected, and the left and right sides tapering downward a bit (exaggerated in the ASCII above).

Image search for "kuchi-no kanji + shodou (calligraphy)":

https://www.google.com/search?q=%E5%8F%A3%E3%81%AE%E6%BC%A2%...

It seems there are two schools on the relationship between stroke 2 or 3: 2 can extend downward past 3, or 3 can underscore 2.

kozhevnikov · on Aug 20, 2016

Wikipedia has stoke order animations for radicals.

https://en.wikipedia.org/wiki/口

viraptor · on Aug 20, 2016

Not on my screen (http://i.imgur.com/korTPAQ.png)

Maybe it is a missing character after all :) (Oh no, I've been fooled by chrome/freetype/something!)

cynix · on Aug 20, 2016

I see, it is indeed showing a missing character symbol on your computer. The actual character is much more square than that, in addition to the strokes I mentioned. (http://i.imgur.com/s6bW7u0.png)

fenomas · on Aug 20, 2016

Yep, that's the missing character glyph - known colloquially as "tofu".

atombender · on Aug 20, 2016

Yes, it is.

bemmu · on Aug 20, 2016

It has the added bonus of being almost correct even if the symbol is missing.

arcticfox · on Aug 20, 2016

Clever.

robzyb · on Aug 20, 2016

I agree. And I'm an Australian in Japan learning Japanese.

(I was expecting it to be a rant about insufficient unicode usage.)

PeCaN · on Aug 20, 2016

Doesn't help that at least for me it gets rendered differently in the title on HN and the actual article.

That said, I expected something Japanese related, given that it's Candy Japan, but you'd think common fonts would have different looking characters for “no glyph” and the kanji for mouth.

kazinator · on Aug 20, 2016

Another problem is that if the character appears in isolation, it's hard to tell whether you are looking at 口　(mouth) or 囗 (the "box radical" kanji, or "kunigamae").

Ha, for me, they look quite different in the font used for editing, but not so much in the rendered comment other than a small difference in size.

kazinator · on Aug 20, 2016

That rectangle denoting a missing glyph is called "tofu" because it looks like it, so there is a connection to Japanese anyway.

It's not clear whether or not that naming for that character originated in Japan.

imron · on Aug 20, 2016

In Chinese 食 is pronounced shí.

As someone who speaks Chinese, I got a chuckle out of reading 'put your favorite snack in your 口 and 食t it!' due to the association in my mind of that character and its Chinese pronunciation, immediately followed by a 't'.

matthewrudy · on Aug 20, 2016

Japanese use of Kanji is usually more similar to Cantonese (Cantonese is a much older language than modern Mandarin)

So in Mandarin you wouldn't find people using 食 as a verb, but in cantonese it is the correct verb.

In Cantonese it's pronounced something like "sekk"

("sihk" in Yale romanisation, but I think if you're not familiar with Yale you'd try and read that "sick" or "sikh")

Edit: "sikh"

binaryabyss · on Aug 20, 2016

Not to mention that Mandarin would use 嘴（里）rather than 口

The pronunciations of kanji correlate to different Chinese eras and areas.

The usage and choice of characters are influenced by Classical Chinese, which was the literary language in Japan until, uhm, recently-ish.

There is a very good recent YouTube video that explains the usage of Chinese characters in Japanese and which I recommend to anyone interested in the subject:

https://www.youtube.com/watch?v=CF3MRMBjd20

I was very impressed by the accuracy and knowledge displayed, as most of what is written and said on the subject is ehhh somewhat disappointing.

matthewrudy · on Aug 27, 2016

I've never learned Japanese, nor spent much time there.

From my brother who studied Japanese for a while, the Kanji was a real blocker for him. So I did wonder what it'd be like, given I already know a good amount of Chinese characters.

I guess this video answers that question:

Even more confusing.

edwinyzh · on Aug 20, 2016

Yes, 食 is a verb in Cantonese and Hakka! Both of them are more close to archaic Chinese than Mandarin.

matthewrudy · on Aug 20, 2016

As I understand Hakka is actually a collection of languages, not necessarily mutually intelligible.

I learned a bit of one dialect from a friend's mum. And the basics were very similar to Cantonese.

Would be interested to learn more dialects once I'm done with Mandarin and Cantonese.

nbouscal · on Aug 20, 2016

Standard Mandarin (especially Beijing dialect) pronunciation of shí is much closer to the American pronunciation of "sure"†, though, so shít would be pronounced more like "shirt". (Sorry to ruin the joke.)

†IPA something like this: /ʃɝ/

muddyrivers · on Aug 20, 2016

A technical correction. In Mandarin, shí is pronounced very close to the sound of "shi" in ship and "shee" in sheep. You are right that it sounds close to "sure" when it is pronounced by people with strong Beijing dialect.

nbouscal · on Aug 20, 2016

Um. Modern Standard Mandarin pronunciation is based on Beijing dialect, and it doesn't take a Beijing accent to pronounce it like "sure", that's totally standard. Beijing dialect does feature more erhua than other dialects, but even 哪儿 is considered standard, not specific to Beijing dialect. There are definitely dialects (mostly southern iirc) that pronounce shi as in "ship", but that's not Standard Mandarin. Every television show I've seen, for example, pronounces it like "sure".

Edit: To add a clear example, look at 事. Standard Mandarin would pronounce it "sure", Beijing dialect goes even further and pronounces it more like "shar".

muddyrivers · on Aug 20, 2016

I would argue 事儿, the combination of two characters, are pronounced like "sure", in most Northern dialects. (Speaking strictly, some are dialects and some are accents. Let's skip this difference here since it doesn't matter to the current topic.)

Not all the people in television shows, including talk shows, speak Standard Mandarin. This is especially true in shows produced in the north. They adopt Beijing accent to certain degrees, which, more or less, leads to the erhua pronunciation.

For Standard Mandarin, listen to 新闻联播，the official news program from CCTV.

I notice that for non-native Chinese speakers whose mother tongue is Indo-European language, it sounds erhua pronunciation is easier for them than Standard Mandarin. If erhua pronunciation is pushed to extreme, it is called 大舌头。

Also erhua pronunciation is usually NOT used in very formal conditions, like presentation, etc. Some people regards erhua pronunciation as vulgar, except for very widely adopted cases.

nbouscal · on Aug 20, 2016

I've listened to 新闻联播, the way they pronounce shi sounds much closer to sure than to ship. It is somewhere in-between, and that vowel is more rhotic in Beijing dialect, but Standard Mandarin definitely still rhotacizes that vowel (especially compared to southern dialects where it isn't rhotacized at all).

imron · on Aug 21, 2016

Yeah I know it's pronounced differently, but as someone who enjoys (sometimes strained) cross-language puns, it didn't stop me from finding it funny :-)

primitivesuave · on Aug 20, 2016

Another interesting aspect of traditional Chinese characters is that complex words are expressed by combining simpler symbols. For example, the Chinese word for computer is 電腦. The first character represents "electricity", and the second character represents "brain". Which is really what a computer is, an electric brain. Similarly, a computer programmer is 程序員, where the three symbols are "rule", "order", and "person" - one who orders rules.

An interesting consequence of this is that you only need to learn around 3000 symbols to read a Chinese newspaper, just like how you can ascertain the meaning of an unfamiliar English word by having knowledge of a small set of Latin/Greek roots.

unfamiliar · on Aug 20, 2016

> you only need to learn around 3000 symbols

This is not a good result. Learning 3000 characters could take years, and even then there would still be some you don't know. To read an English newspaper you only need to learn 26 symbols. Yes you need to learn the words, but you also have to do that for Chinese which is separate from learning the symbols for each word. It's a hugely inefficient way to write.

Furthermore, while combining smaller words to represent a more obscure word is an improvement over introducing a new character (and is a step towards an alphabet), the examples I've heard are very one way. You might look at 程序員 and think "one who orders rules" makes sense for "programmer." But that sequence of characters could just have easily meant any number of other things (lawyer, politician, etc).

ddeck · on Aug 20, 2016

>It's a hugely inefficient way to write.

Agreed. Remindeds me of an anecdote from David Moser:

"I was once at a luncheon with three Ph.D. students in the Chinese Department at Peking University, all native Chinese (one from Hong Kong). I happened to have a cold that day, and was trying to write a brief note to a friend canceling an appointment that day. I found that I couldn't remember how to write the character 嚔, as in da penti 打喷嚔 "to sneeze". I asked my three friends how to write the character, and to my surprise, all three of them simply shrugged in sheepish embarrassment. Not one of them could correctly produce the character. Now, Peking University is usually considered the "Harvard of China". Can you imagine three Ph.D. students in English at Harvard forgetting how to write the English word "sneeze"??"

Regarding the grandparent's point of joining characters, my favorite has always been 火雞 (fire chicken) for turkey.

[1] Why Chinese Is So Damn Hard http://www.pinyin.info/readings/texts/moser.html

bsdetector · on Aug 20, 2016

> my favorite has always been 火雞 (fire chicken) for turkey.

Turkeys are not native to Asia so it had to be added late to the language, which is why it doesn't make much sense.

The problem with Chinese is that when you have a new thing like turkeys you have to either create a new character, which then people have to memorize forever, or you have to string together sort-of related ideas into a compound word that has only some tangential relation to its parts. When you have thousands of words that are only somewhat related to their parts, the parts lose their meaning and become not much more than a really large and complicated alphabet.

Chinese was over-engineered to work great for maybe a few thousand words, but the world keeps getting bigger and bigger and every new thing makes the Chinese language worse.

anabis · on Aug 20, 2016

This has been mitigated to a large degree by phones. Even pre-smartphone, they were used to look up difficult characters.

athenot · on Aug 20, 2016

> To read an English newspaper you only need to learn 26 symbols.

Replace English with any other language using the latin alphabet and this doesn't hold up as there is no indication of meaning. For that matter, there's not a strong indication of pronunciation. I may know English and French but that doesn't help me understand Hungarian or Swedish.

I do not know Chinese but my understanding is those 3000 characters are not only glyphs, they are elements of meaning. Incidentally, that is the same order of magnitude of words one needs to know to read an English paper, with the difference that in English, there is a portion of the meaning that is encoded in grammar, ie. the relation & order of words. That a cognitive overhead we take for granted in our language system.

FabHK · on Aug 20, 2016

That's the point though. To learn Hungarian or Swedish, you'll have to learn the spoken language, and then reading and writing comes essentially for free.

In Chinese, you have to learn the spoken language, and then you have spend just as much time (if not more) to learn the written language.

Many Chinese characters do contain elements of meaning and elements of pronunciation, but that doesn't mean that you don't have to learn them.

"Dilapidated" contains elements of its meaning in the root lapis = stone, but you'd be hard pressed to deduce the meaning from it.

binaryabyss · on Aug 20, 2016

English has 52 characters including both cases. Moreover, the ways in which these are combined to form words are highly inconsistent and idiosyncratic.

Chinese characters are made by combining 214 radicals, and most of the characters are written by combining two of these. Most words are created by combining two such characters. It's not really that much more difficult than remembering how to spell an English word.

程序 means program. 程序员 builds on this, not the individual parts of the constituent words.

Chinese is actually a very logical language. The writing system is arguably more complicated than it needs to be (see Victor Mair and his quixotic crusade for romanization), but it conceals a language with very flexible morphemes and simple grammar. I recently had to translate 照顾着 and 被照顾着 for an app, and I had a hard time coming up with concise and decent translations that mirrored the simple relational antonymity of the Chinese.

xiaoma · on Aug 20, 2016

Characters do give a lot of semantic information than an alphabet and it's definitely easier to learn new words through context if you can see the characters vs just hearing them.

Knowing Chinese characters is closer to knowing not only the alphabet but also understanding most basic English words plus Latin/Greek stems like auto (self), locus/loco (place), -ous (possessing or full of -), etc. From that base it's pretty quick to learn longer words and there are tons of clues to help you remember them.

felixding · on Aug 20, 2016

> It's a hugely inefficient way to write.

Not necessarily true.

For instance, "Not necessarily true" is 18 characters and 2 white spaces, while in Chinese, it could be as simple as "未必".

FabHK · on Aug 20, 2016

I think "hugely inefficient" was meant not so much in terms of space requirement or number of strokes, but rather in terms of cognitive load and time required to master it.

While English might be among the worst alphabetic writing systems (compared to, say, Spanish, which has a wonderful correspondence between what is written and what is spoken), it is certainly a more efficient writing system than Chinese, in that fewer years of school have to be dedicated to just learning to read and write.

Furthermore, as highlighted by someone else, it is not uncommon for writers of Chinese to just completely forget how to write a word.

In English, you might misspell it, but it'll be rare that you can't render it at all.

Great book on the topic of Chinese, debunking several misconceptions, is "The Chinese Language: Fact and Fantasy" by John DeFrancis. http://www.uhpress.hawaii.edu/p-819-9780824810689.aspx

xiaoma · on Aug 20, 2016

What traditional character using place are you speaking of? In Taiwan, programmer is 程式設計師 and in Hong Kong, it's 程式設計員. I had thought 程序 was basically a mainland-only variant.

Also, FWIW, I couldn't read a paper at 3k characters. It's more like 4-5k if the goal is reading rather than slogging through with a dictionary.

cynix · on Aug 20, 2016

Perhaps GP was thinking of the mainland variant, but only had a Traditional Chinese IME available on the computer.

Speaking of mainland-only variants, their official translation of computer is "计算机" (lit. computation machine).

JimDabell · on Aug 20, 2016

Isn't that the same in most languages though? For instance in English "electronic mail" became "e-mail", which became "email". "Web log" became "weblog", which became "blog". "Cellular phone" became "cell phone".

vidarh · on Aug 20, 2016

And indeed in older books and articles it is not unusual to see people writing "electronic brain" to refer to a computer.

A lot of non-English words for computer terms in various languages are translations of outdated English terms for the same that stuck around after the English terms changed.

TillE · on Aug 20, 2016

English tends to sneakily disguise its words with Latin or Greek. So you may not think about the literal meaning of a word like "predict", while in German "vorhersagen" (to say before) is the exact same thing, just more obvious.

_kdhr · on Aug 20, 2016

> Another interesting aspect of traditional Chinese characters is that complex words are expressed by combining simpler symbols. For example, the Chinese word for computer is 電腦. The first character represents "electricity", and the second character represents "brain". Which is really what a computer is, an electric brain.

That's nice, but I think most languages do this. It's not particular to Chinese. The word "computer" comes from "com-" (with/together) and "putare" (to reckon). Or "composition"; to position together. "compile"; to pile together, etc.

ced · on Aug 20, 2016

Right, but when learning English as a second language, they don't teach you "com" and "putare", because the roots are too far away and too diverse (Latin, Germanic) whereas in Chinese, it's often immediately obvious. Well, more often than in other languages. Most of it is still random-looking.

gurkendoktor · on Aug 21, 2016

Well, that's the cool thing about Chinese - it's the Latin/Greek of East Asia :) It's to Japanese/Korean/Vietnamese what Latin/Germanic are to English, so there are less indirections.

mcguire · on Aug 20, 2016

"To pile together?"

anqurvanillapy · on Aug 20, 2016

In Chinese, 'eat' is usually '吃'. We use '吃了 (le)' to say we 'ate' and '在 (zài) 吃' or '吃着 (zhě)' for 'eating'.

It is really interesting that in China people often ask their friends '吃了吗? (Have you eaten?)' rather than '嗨 (Hi)' in daily life. So initially I thought this post was describing something in Chinese w/o my noticing the URL.

geomark · on Aug 20, 2016

Same in Thai. They ask กินข้าวหรือยัง ("Have you eaten yet?", or literally "Have you eaten rice yet?") as a greeting.

dingaling · on Aug 20, 2016

In lowland Scotland one finds 'You'll have had your tea?' which is a semi-polite way by which the enquirer informs the visitor that he'll not be given any tea ( i.e. dinner, food ).

Has entered mainstream as a cliche of Scottish parsimony but does exist in the wild.

geomark · on Aug 20, 2016

Interesting. It's sort of the opposite when Thais ask if you have eaten already because if you answer "not yet" it is common for them to offer you something.

dingaling · on Aug 20, 2016

The Scottish response to 'not yet' is usually something like 'well it seems foolish to have ventured out in that case', obviously delivered in the local dialect.

Though I have extracted one dinner from an Edinburgh man who deploys the phrase, but that was probably because his ( English ) wife had put it on to cook earlier!

dghughes · on Aug 20, 2016

That reminds me that sometimes in my family we'll jokingly say jeet? It comew from the comedian Jeff Foxworthy who said it's redneck for "Did you eat? "

I'm in Canada and not redneck I wonder if jeet is actually used in the US.

throwanem · on Aug 20, 2016

> I wonder if jeet is actually used in the US

Not really. Foxworthy's joke isn't that that is actually a word, which it's not; it's that that is how the phrase "Did you eat?" comes out sounding in the accent of his home, which is a couple states east of mine. (Also apparently in the accent of New Jersey, which suggests to me that this particular trait may be more coastal than specifically Southern.)

dsr_ · on Aug 20, 2016

My father's native New-Jerseyan uses "jeet jet" for "did you eat yet".

The one bit of that I consistently use is "agnishna", for "air conditioner".

fenomas · on Aug 20, 2016

I recently found a list of these for Scottish that were amazing.

E.g.: "space ghettos" (in standard American accent) becomes Scottish "Spice Girls" :D

leafsleep · on Aug 20, 2016

If you want literal, it's "eat rice already" - Thai does not have conjugation

geomark · on Aug 20, 2016

Actually, if you want it literally it would be "eat rice or not yet" (กิน=eat, ข้าว=rice, หรือยัง=or not yet). Like you said, verbs are not conjugated in Thai but instead verb tenses are accomplished with helper words to indicate past, present or future. Sure is a lot more orderly than English verb conjugation with all its irregularities.

mveety · on Aug 20, 2016

That greeting got me a bit when I was just starting to learn to read Chinese (I never attempted to learn to speak it). Learning something so different to the languages I know really made me appreciate the similarities between Swedish and English, let's say, and how that made learning easier.

jackcosgrove · on Aug 20, 2016

I was taught that "Chi le ma?" was an ancient greeting. In old times when food was scarcer, asking whether someone had eaten was another way of asking about someone's general situation.

rosstex · on Aug 20, 2016

How do you respond? "Yes"?

hunvreus · on Aug 20, 2016

Depends on the question.

If the question "你好吗" ("How are you?" although literally translates to "Are you good?"), you can answer with "好" ("Good"). If you're asked "你是老外吗" ("You're a foreigner?") you can answer "是的" ("I am").

So it usually depends on the verb that was used in the question. Although you often can say "对", which is with “是的" the closest Chinese have to "Yes".

In this specific case, you would likely just answer "吃了" ("I ate") or "吃了，你呢？" ("I hate, what about you?").

Keep in mind that this is more or less the equivalent to"How are you doing?" or "What's up?" in the US; it's a greeting and people aren't necessary expecting an actual answer.

eriknstr · on Aug 20, 2016

How about if one hasn't eaten. Should one say so in reply to a stranger asking or would that be rude?

hunvreus · on Aug 20, 2016

You'd probably say something like "还没有", "没有" or "没吃" ("Not yet", "No" or "I haven't eaten"). Again, it's just a greeting, the answer doesn't matter much (unless it's your mom asking you in which case she'll make a big deal of it, but that's a pretty international behavior).

sidedishes · on Aug 20, 2016

Typically just a restatement: 吃了。

euske · on Aug 20, 2016

Yeah yeah, this all makes sense until you see 品, which isn't "a three-mouthed monster" but "goods".

Languages are weird, man.

artichokeheart · on Aug 20, 2016

I wOUld agree with you if it weren't for the wOUnd I have where a bOUlder pOUnded me in the head, so I don't have the cOUrage.

igravious · on Aug 20, 2016

I see what yOU did there.

ju-st · on Aug 20, 2016

I interpret that character as "three boxes" so "goods" is a very intuitive translation :)

throwanem · on Aug 20, 2016

Yeah, I'm not sure what world you have to live in for "three-mouthed monster" to be a more obvious meaning for that glyph than "stacked crates".

But I think I might like to visit.

cheiVia0 · on Aug 20, 2016

I see you haven't gotten to the 4th paragraph of TFA...

"口" = mouth, not crate

"品" is a stack of mouth characters

throwanem · on Aug 20, 2016

I read the whole article, thanks. Do you think it's possible that more than one square or rectangular thing may exist?

comex · on Aug 20, 2016

According to kanjinetworks.com that is, in fact, what the etymology boils down to:

> Square-shaped object (tripled) → high-quality goods spread throughout a protective container (compare 舗 and 販) → quality; counter for goods; (person's) character; grade; value.

http://www.kanjinetworks.com/eng/kanji-dictionary/online-kan...

jimworm · on Aug 20, 2016

Out of the several uses of 品, one is the verb "taste" which makes slightly more sense.

dboreham · on Aug 20, 2016

Whoa! I know that one -- it is "Shinagawa". I know this because my hotel was at that train station in Tokyo on a trip in the early 90's, long before signage in English was common in Japan..

adsofhoads · on Aug 20, 2016

It's actually just the "shina". The "gawa" is 川, river.

Tharkun · on Aug 20, 2016

You need a lot of goods to feed many mouths.

Two or three of something is a pretty common pattern in kanji. Two basically implies "several", while three implies "a fuckton". Two suns, for instance, is "bright". Three suns is a sparkling crystal.

cooper12 · on Aug 20, 2016

You might enjoy (and probably have already seen) "Why Japanese People": https://youtu.be/AqragQq63Js

andreygrehov · on Aug 20, 2016

I liked the beginning and expected the next word after "mouth" to look almost the same, with a subtle change, which would be a logical extension. But it was quite a jump from a simple square (口) to god-knows-what (食).

Also, why 食 is translated as "Eclipse" in Google?

zorceta · on Aug 20, 2016

It's not a very good choice to use "食” to represent "eat" here. In modern Chinese we say "吃", and you can see that "吃" has a square part, which is exactly "口". And "食" also means "food", but "吃" doesn't.

Also, since ancient Chinese people thought eclipse was caused by a "dog in the sky" eating the moon, it's reasonable for them to use "食" to describe it. But modern Chinese almost always say "月食", meaning eclipse of the moon, to distinguish from eclipse of the sun, so Google Translate did this wrong. It should translate it to "eat" or "food" first.

0xfaded · on Aug 20, 2016

Putting the parts together, 吃 would mean the begging mouth. The native Japanese word for eat, kuu, was at least at one time written 喰う which I think we would all agree makes a lot of sense. Both languages underwent different simplifications, I suspect in Chinese it became 吃 and Japanese 食, so in both languages the original logic has been lost.

I'm a westerner who learned Japanese as an adult. I feel its quite unfortunate how much of the meaning was lost with the Chinese simplification. I can mostly make out Taiwanese/Republic of China newspapers, but can see nothing in the simplified characters.

Edit: Yay, I'm wrong, see below. Thank you internet.

Zarel · on Aug 20, 2016

Nah, 喰 is entirely unrelated; it's one of the few characters Japan invented (most are imported and sometimes simplified from Traditional Chinese).

On the other hand, you might recognize 吃 as the 喫 from 喫茶店 (café).

The Chinese simplification was overall a good thing. A lot of the simplifications are from Japanese, even. Like, compare the Traditional 體 with the Simplified 体 (body) - the latter is from Japanese.

prewett · on Aug 20, 2016

The Japanese simplifications were pretty good, but a lot of the Communist ones are aesthetically ruinous. 车东气门 have none of the symmetry that 車東氣門 have. 气 doesn't even have its center of mass over its base of support, although at least in this case it is six strokes less. The Japanese simplifications seem to have kept the artistic flavor.

FabHK · on Aug 20, 2016

Agreed.

Those simplifications appear to have been designed purely for reduction of stroke count (that is, making it faster to write by hand), not for simplification in the sense of making it more simple, logical, and consistent.

(As a matter of fact, that "simplification" introduced further inconsistencies, in that certain radicals were written differently when part of a character, while the traditional writing maintained it. Example: 金 gold is the left part of money, which you can see in the traditional 錢, but not in the simplified 钱. Similarly 言 in traditional 說 vs simplified 说.)

prewett · on Aug 20, 2016

Yeah, and the radicals sure got uglified. I calmed down a bit when I found that apparently a lot of the simplifications where just officializing shortcuts people were already taking. Kind of like spelling "with" as "w/", I'm guessing.

FabHK · on Aug 24, 2016

Prewett - true. But then, why make it "simple" but ugly for _printing_? It's absurd... just keep the complex form in books and reading printed text, and tolerate what people are writing out by hand in cursive. That's distinct anyway. It's as if we'd "simplify" the "-ing" at the end of words to some wiggle with a dot and a loop in printed matter.

rangibaby · on Aug 21, 2016

for those who don't know, this is called 略字 (ryakuji) eg 門 > 门

gurkendoktor · on Aug 21, 2016

To add some anecdata from native speakers (not me), I've noticed many simplifications like 机車 in "Traditional" Taiwanese handwriting, but I've never seen 机车.

gbog · on Aug 20, 2016

That's completely subjective, most characters are not symmetrical anyway, and when they are it's a mistake to draw them symmetrical.

gurkendoktor · on Aug 21, 2016

Characters are mostly displayed on screen nowadays, and most fonts are indeed symmetrical. I don't find Simplified Chinese handwriting ugly at all, but it looks odd on-screen.

qd6pwu4 · on Aug 20, 2016

喰 is not a Chinese character, Japanese created this character...吃 is not a modern simplification for 喰, it has appeared in ancient written Chinese but has a different meaning. Now in modern Chinese 吃 has the same meaning as ancient 喫(eat). 食 has always been in Chinese characters, composed of 人良, which means "things that hold one's life". 食 serves both as a noun and a verb.

Japanese kanji come from Chinese characters, but has become very different too. As a native Chinese, I think there is a clear link between modern simplified Chinese and 'near ancient' Chinese character. Here 'near ancient' means characters used up to 汉(漢, Han) Dynasty. Before Han, Chinese was very different too. There has always been a simplification process and a link.

zorceta · on Aug 20, 2016

You can't simply take a characater apart and glue meanings of the parts together - it's more complicated. Having "乞" part doesn't necessarily mean it has the meaning of begging - it's a "sound", rather than meaning, element, which "provides" the sound of "吃".

And although I don't know the "喰" character, I can tell it didn't become Chinese 吃 and Japanese 食. 食 is more "ancient", where "吃" seems only used so widely in modern time.

Simplification of Chinese characters indeed started many arguments, but the "tranditional" Chinese used in Taiwan has also developed some "simplified" characters.

mveety · on Aug 20, 2016

Yeah the Japanese version of hànzì is much more ancient (Song or Tang dynasties, I think) and have changed comparatively little since then. Also, they simplified some characters in a very different way than how the Chinese did. (I'm probably wrong but: if I remember correctly, Japanese has changed a lot, but the writing system hasn't so the sounds like part of the character isn't always correct or even close. It's a lot like Irish or English in that the language has changed much but the writing hasn't.)

rangibaby · on Aug 21, 2016

I have lived in Japan since I was a teenager and I had a much better time reading in China (was there for a few weeks as a tourist) than speaking.

It was quite funny because staff at restaurants thought I was some kind of weirdo who could point at exactly what he wanted on the menu but couldn't answer basic questions.

They appear to have simplified them predictably in a way that is not impossible to understand if you have a senior HS-level of kanji knowledge.

gleenn · on Aug 20, 2016

It's probably because it's a Japanese website and 食 does make sense to represent 'eat' in Japanese.

zorceta · on Aug 20, 2016

True. And it also makes sense in Chinese despite its "ancient" feel. I just didn't realize it's a Japanese website :/

imron · on Aug 20, 2016

Well, it is called Candy Japan.

chewxy · on Aug 20, 2016

When I was younger and took chinese lessons, there was a push to use more "official" chinese. We'd be told that in official use "吃" is a verb used by ghosts (which is to say, it's impolite to use 吃, and "食" was to be used by humans. So... eh YMMV.

I personally find that elitist though, but then again, in chinese history there was always a distinction between what the commoner spoke (白话文) and what the intellectual elites wrote (文言文) so I'm not surprised that this mentality has continued on

EDIT:

I'd also like to add that 食物 is food, which translates literally to "eat" and "thing". Put together it means "edible thing", i.e. food. The original meaning of 食 still means, "eat", not "food". It's a modern contraction that "食" means food

ttflee · on Aug 20, 2016

According to Kangxi Dictionary (康熙字典), 吃 is equivalent to 喫, which means `to eat`. Its another meaning was `stuttering`.

The ghosts part was not supported.

http://tool.httpcn.com/Html/KangXi/22/PWCQUYAZMEUYILAZKO.sht... http://tool.httpcn.com/Html/KangXi/22/PWCQRNKOCQUYUYAB.shtml

chewxy · on Aug 20, 2016

Agreed. The ghosts bit is apocryphal. Not difficult to see where it comes from though. Mouth + Qi = spiritual nonsense.

prewett · on Aug 20, 2016

I had to look a bit to figure out how you got "qi" from a word pronounced "chi". 契, the right part of the character is pronounced qi4, as is the character 氣, which does have spiritual meanings. 契, however, has no such meaning (according to wiktionary), so I assume this is another one of those Chinese superstitious puns.

Is a preference for 食 a Taiwanese thing? All the mainland Chinese people I've ever heard say 吃.

chewxy · on Aug 20, 2016

There was a linguistic shift to prefer 吃. I would say it probably happened in the Cultural Revolution. In Cantonese 食 is pronounced "sek", and is regularly used - "sek fan" as in "eat rice".

Since HK was quite insulated from the Cultural Revolution, and evidence from older texts that use 食 all the time (喫 was not really used IINM), it would not be amiss to say that the development to prefer 吃 is quite new. Hence in my other post I mentioned that it was political agenda that drove selection of preferred words to use.

addendum: I think there is also a nice narrative in the shift to use 吃 - it was more a "commoner" word, and communism was then about replacing the elite sounding words with simpler words that is common to everyone.

乞 is most commonly used with 乞丐 (begger), but the etymology of the word comes from qi (气) according to zhongwen.com

prewett · on Aug 20, 2016

I learned Japanese first, and it bothered me that 食 wasn't "eat" in Chinese. I'm glad to know to know the history!

zorceta · on Aug 20, 2016

Didn't know that ghost part.. Fun to know :)

I'm just suggesting from modern Chinese's perspective, because, after all, we are modern people. 文言文 (uh I don't know its English translation) is fun to read and learn, but it's like Latin since basically nobody writes it anymore.

chewxy · on Aug 20, 2016

Probably some stupid made up shit to scare kids into using "proper" chinese (for certain definitions of proper as defined by political agendas).

I do find 文言文 to be quite elegant and terse though.

zorceta · on Aug 20, 2016

So a native Chinese might have less knowledge in his language :) thanks for the added part.

FabHK · on Aug 20, 2016

The article is about Japanese (as you can see from the part on inflection).

edwinyzh · on Aug 20, 2016

Both Cantonese and Hakka use "食" as verb since ancient times, no doubt about it.

shasheene · on Aug 20, 2016

If anyone is interested in this kind of etymology, academic Kenneth Henshall has written 'A Guide To Remembering Japanese Characters' which contains the etymologies for the ~2000 general use Kanji. Over the millennia of evolution of the characters, some characters have multiple disputed etymologies which are still unresolved by the academics who study the history.

For the record, 食 is a pictogram of a small amount of food (the "roof" looking thing) stacked on a heavily stylized pictogram of a kind of table or plate (do an image search for 'takatsuki table'). It's claimed the Japanese word for bean 豆, has an older stylization of the same takatsuki table with a little bit of food at the top.

From a learning to read and memorization perspective, most people will probably find doing Look/Cover/Write/Check type drills (either manually or with a spaced repetition flashcard program like Anki) much more effective than using mnemonics based on (sometimes very complex) etymology.

chewxy · on Aug 20, 2016

The etymology of 食 is a bit more complex than that. It's ancient chinese that combined an upside down mouth: 亼 (best approximation) and a bowl of rice. This is the 甲骨文 version: http://imgur.com/NRBoG7F. Cute eh? It looks like someone nomnomnoming a bowl of rice.

EDIT: found another one: http://imgur.com/BsQsNBb

Al-Khwarizmi · on Aug 20, 2016

The method of "Tuttle Learning Chinese Characters" ( https://www.amazon.com/Tuttle-Learning-Chinese-Characters-Re... ) is working very well for me for memorization.

It uses mnemonics, but it only loosely follows real etymologys. It diverges to nonsensical, but memorable stories when this will make things easier to remember. It also has mnemonics to remember tone and pronunciation (for Mandarin Chinese only though).

gizmo686 · on Aug 20, 2016

That is not wrong, but weird to give as a first definition. The word "食" can mean either food or eclipse. In both cases it is read the same way; so I assume this happened because they did not have a symbol for eclipse so decided to use a homophone.

http://jisho.org/search/%E9%A3%9F

cynix · on Aug 20, 2016

I thought the original character for eclipse is 蝕, but people got lazy and started using a more common character with the same pronunciation.

asimjalis · on Aug 20, 2016

Maybe because in the eclipse the moon eats the sun.

userbinator · on Aug 20, 2016

...and in the Eclipse, the Oracle eats the 日.

(https://en.wiktionary.org/wiki/%E5%8F%A3#Etymology and https://en.wiktionary.org/wiki/%E6%97%A5#Etymology if you didn't get the admittedly horrible pun.)

siong1987 · on Aug 20, 2016

you are right: http://dictionary.hantrainerpro.com/chinese-english/translat...

andolanra · on Aug 20, 2016

The link you provided has no information as to whether that's a correct etymology or not, only that 月食 yuèshí is made up of characters 月 yuè 'moon' and 食 shí 'eat'. There are numerous other explanations for why those two characters could be used: for example, perhaps 食 used to have a different meaning which dropped out of common usage, or maybe 月食 used to be written with different characters and people switched to the easier-to-write 食 from a more complicated character, or any of a number of other reasons.

I don't know which of these is true, if any of them is. But your link doesn't have enough etymological information to indicate one way or another, either!

haimez · on Aug 20, 2016

Thus proving that symbolic meaning applied at the glyph level is too cumbersome to be practical because it requires pure memorization of all possible combinations of glyphs and conjugations thereof.

sdrothrock · on Aug 20, 2016

Thus proving that the alphabet is too cumbersome to be practical because it requires the pure memorization of all possible combinations of letters, with the lengths of such combinations becoming longer and longer as words are added to vocabulary.

madeofpalk · on Aug 20, 2016

Eh.

It's a "leaky abstraction". Makes things easier, until it doesn't. Much lik3 "I before E, except after C"

sdrothrock · on Aug 20, 2016

The primary problem with people who don't know Japanese or Chinese arguing against kanji/hanji is that they use very loose comparisons that they think are 1:1.

For example, when you write "inconceivable," you're not regurgitating every single letter in a line from memory. You probably remember the prefix "in," "conceive," and you know "able." You probably also know the common patterns "cei" or "eive" or "con," so the word "inconceivable" really isn't as complex as the initial length makes it look, as long as you know the blocks.

Kanji/hanzi are the same way -- they look complex and inscrutable to the uneducated eye, but they're all made of common building blocks that make it easier to remember them. After all, human memory works roughly the same way all the world around; people wouldn't be able to memorize thousands of 20-stroke character if they were all completely patternless.

The vocabulary utilizing kanji/hanzi works the same way.

Someone could look at "inconceivable" and say "well shit, that doesn't make sense! It's long, you'd have to memorize so many letters, and the letters themselves have so many bits! Plus it has 'in' in it, which makes no sense because 'in' commonly means 'inside of something', and 'con' usually means 'to swindle someone'! This alphabet thing is completely useless."

It's absurd, reductionist, and a bit offensive.

vacri · on Aug 20, 2016

If chinese characters weren't unusually cumbersome, why then do chinese schoolchildren learn a different alphabet first (pinyin), just to assist them in learning chinese characters?

> people wouldn't be able to memorize thousands of 20-stroke character if they were all completely patternless.

Well, people don't. 20 strokes is an unusually high stroke count, and people don't remember thousands of those. Simplified chinese characters were created because traditional characters were too complex and cumbersome for people to remember.

GFK_of_xmaspast · on Aug 20, 2016

> If chinese characters weren't unusually cumbersome, why then do chinese schoolchildren learn a different alphabet first

Also why did Korea and Vietnam abandon them entirely?

prewett · on Aug 20, 2016

Probably because Chinese characters are a poor fit for Korean and Vietnamese grammar/vocabulary. They are a poor fit for Japanese grammar/vocabulary, too, as you can see by the fact that every character has multiple possible sounds depending on which word it is used in. In Chinese, however, the characters very much make sense for the language. Most words are one or two syllables, and correspond characters correspond to both the meanings and the pronunciation. A large number of characters even have a pronunciation hint built in. I, personally, think that Chinese is much easier to read in characters than pinyin, and you certainly won't find any Chinese ever using pinyin for more than a teaching tool. (Especially because nobody except foreigners seems to put tones on the pinyin). The fact that China has kept using them, despite a very pragmatic government that wanted to move the language more phonetic, should say something about their utility.

vidarh · on Aug 20, 2016

> The fact that China has kept using them, despite a very pragmatic government that wanted to move the language more phonetic, should say something about their utility.

Or just inertia. Norway has had a steady stream of language reforms over the last century aimed at bringing the official written language better into sync with a majority of spoken dialects. This is a result of hundreds of years of Danish rule that ended in 1814, followed by the period of national-romanticism in the period up to the subsequent break from Sweden, that led to a lot of desire to make language etc. more uniquely Norwegian.

As a simple example, we inherited parts of Danish counting.

It used to be in some parts of the country that we'd say "fire og tyve" for 24 - literally "four and twenty". This was changed to "tjuefire" (twentyfour) in the early 1950's. Anyone who has learned Norwegian in school since then has learned the new form in school and been marked down for using the old forms etc.

Despite that, and being born to parents who were in primary school when this had just changed and who learned the new forms, I still regularly use the old form.

I never learned it at school, and I occasionally had teachers complain about it. I don't use it consistently, to make matters worse - it's not a conscious choice to use a more conservative style or anything, it's just habit I picked up mostly from my dad, which is persisting in my spoken language now, when I'm 41, despite having changed in a language reform a couple of decades before I was born.

This is a difference where there's no practical benefit at all to the old form - it's longer, and the new form is more consistent with spoken Norwegian overall -, yet more than half a century later the old form still persists out of habit.

In particular, trying to engineer changes to language tends to take a long time even when there's no resistance to the change.

yaowenjiaozi · on Aug 22, 2016

> learn a different alphabet first (pinyin), just to assist them in learning chinese characters?

Is that what they use hanyu pinyin for these days? I've always thought of pinyin as a pronunciation guide for Mandarin, similar to furigana in Japanese.

Margh · on Aug 20, 2016

A child/foreigner can point at "inconceivable" and ask:

'what does "in-con-ceiv-able" mean?'

as opposed to 'what does "..." mean'

A japanese beginner will see 照り焼き and say "uhhhh.. ri..."

compare this with seeing テリヤキ

sdrothrock · on Aug 20, 2016

> 'what does "in-con-ceiv-able" mean?'

Really? You don't think someone with less experience in English would say "what does... inkonsayvaybull mean?" There are plenty of instances where the word is not pronounced the way you think it is.

> A japanese beginner will see 照り焼き and say "uhhhh.. ri..."

A child or beginner would probably be more likely to say "uh, what's that thing with the 日 and the 火, it's something ri something ki." Just because something is a symbol doesn't mean you can't describe it. Children are also very likely to just sketch out a picture of what they remember, even if it's incorrect, and you can usually figure that out.

mveety · on Aug 20, 2016

> Really? You don't think someone with less experience in English would say "what does... inkonsayvaybull mean?" There are plenty of instances where the word is not pronounced the way you think it is.

You're missing his point and English is a crappy example because it's spelling is an unmitigated disaster. For example, if you can read and vocalize the Greek alphabet, you can just ask someone what "νόστιμο φαγητό"* means because you can vocalize it. You only need basic knowledge of the alphabet there. Where as with Chinese/Japanese you need to have a good base of characters to be able to potentially vocalize an unknown character which requires much more work than learning a new alphabet.

(* νόστιμο φαγητό means delicious food)

FabHK · on Aug 20, 2016

Similarly in Spanish, which has great consistency, in that if you see a word written, you pretty much know how to pronounce it, and vice versa. English is pretty bad in that department, but still much better than Chinese.

While many Chinese characters have a phonetic component (in addition to a component related to the meaning), it rarely corresponds exactly to the current pronunciation (in Mandarin).

Furthermore, you can very rarely conjure the right character out of pronunciation and and some aspect of the meaning.

PeCaN · on Aug 20, 2016

Alternatively, it could mean that Google Translate is not very good.

If I had to guess, most Japanese people aren't going to have much trouble disambiguating ‘eat’ from ‘eclipse’. (And as zorceta explains, Chinese uses different hanzi for them anyway.)

shiro · on Aug 20, 2016

The original character for eclipse was "蝕". We use 食 now in Japan because of some simplification. (But the symbolism of moon eating sun is there, as you see 蝕 has 食 in it.)

ekianjo · on Aug 20, 2016

蝕 was still used with this writing in Berserk, though.

rett12 · on Aug 20, 2016

Sometimes I wonder if all these people that criticize or that think that a Latin alphabet can be adapted seamlessly to all languages have tried to study past a beginner level any logographic language.

gizmo686 · on Aug 20, 2016

I have studied Japanese, and still think that a logographic writing system was a mistake. Consider the time and effort it takes for native speakers to become literate.

I also think that the Latin alphabet could be easily used for Japanese, which does not contain any sounds that do not have an obvious equivalent in English, and even if it did, we could always repurpose a character or sequence of characters for that sound (do we really need a 'c').

Having said that, the Japanese phonetic system writes voiced sounds as a modification of their unvoiced counterparts. why can't we all do that.

The biggest risk of using Latin is that simply sharing an alphabet could cause spelling conventions of other languages to bleed in.

rett12 · on Aug 20, 2016

Native speakers seem to do fine. Learning a language while growing up, having the Hiragana as a helper, while all your media is written in Japanese makes everything easier. When they finish school they know enough Japanese to go by. It's obviously different for non-native people.

Also, it's not like you stop learning even after school. For example English has according to the Oxford dictionary 171,476 words in current use excluding inflections, and several technical and regional vocabularies. Does all English university students know these words?

ggreer · on Aug 20, 2016

Logographic systems have some major disadvantages:

• It's possible to know how to say a word, but have no clue how to write it. This phenomenon is called character amnesia, and it affects most native speakers.[1] Phonetic languages allow you to write out a misspelled word, which readers can understand (or autocorrect can fix).

• Likewise, it's possible to know what a symbol means, but have no idea how to pronounce it. This is extra-fun in Japanese, where most kanji have multiple pronunciations.

• Looking up words is harder, as there are no "letters" to sort by. Sorting can be done by stroke count, by radical (four corners or SKIP), or by phonetic spelling (in pinyin or hiragana). Modern technology has made this easier, and some phone apps (like Pleco) can even OCR hanzi. Still, it's far less convenient than phonetic languages.

The only aspect in which logographic systems win is information density. You can fit more words on a single page. This is obvious if you've ever seen Chinese or Japanese copies of works that were originally written in English. The Harry Potter books are crazy thin. Also, Chinese and Japanese tweets can express a paragraph of information.

1. https://en.wikipedia.org/wiki/Character_amnesia

weinzierl · on Aug 20, 2016

> It's possible to know how to say a word, but have no clue how to write it.

> Likewise, it's possible to know what a symbol means, but have no idea how to pronounce it.

As a second language learner of English I can attest that this is not just a problem of languages written in logographic systems:-)

>The only aspect in which logographic systems win is information density.

I vaguely remember a paper that claimed that information density is pretty much constant across languages and writing systems, but I couldn't find it as for now. There is another thread on HN [1] where people compared the size of "Universal Declaration of Human Rights" in different languages. I think this misses the point because it doesn't account for intra-character information density. It'd be much more interesting to render the text into a bitmap and then compare compressed bitmap sizes.

[1] https://news.ycombinator.com/item?id=8236135

ggreer · on Aug 20, 2016

People like to joke about English spelling, but see farther down-thread for examples of how bad things are in logographic systems. Even native-speaking PhDs can forget how to write words like "sneeze" or "toad". It's a failure mode that simply doesn't exist in phonetic languages (even ones as imperfect as English).

Sorry if it wasn't clear, but by "information density" I meant area on a page or screen, not digital bytes. In the thread you linked to, people correctly point out that digital information density depends on encoding and compression schemes matter far more than language.

The paper you're probably thinking of is A Cross-Language Perspective on Speech Information Rate[1][2], which (as the title indicates) studied spoken language, not written. Annoyingly, the study was widely misrepresented in the media. It found that languages with lower information density tended to have higher syllabic rates. That is: Spanish contained less information per syllable than English or Mandarin, but Spanish speakers spoke faster to make up for that. Most media summaries of the paper omitted an important finding: the compensations didn't balance out. Different languages had different information rates. In the study, English had the highest. The runner-up (French) was 10% slower. And Japanese was 30% slower at conveying information.

1. http://ohll.ish-lyon.cnrs.fr/fulltext/pellegrino/Pellegrino_...

2. This blog post has a more accessible summarization of the data: https://www.tofugu.com/japanese/why-do-japanese-people-talk-...

cthalupa · on Aug 20, 2016

>Phonetic languages allow you to write out a misspelled word, which readers can understand (or autocorrect can fix).

You can certainly write things out in kana. When I was more serious about studying Japanese, I knew less than 1000 kanji, but had a vocabulary several times that size, and would at times write out the word I meant in hiragana. And if we're counting autocorrect, your IME is going to take that hiragana and let you find the character.

>• Looking up words is harder, as there are no "letters" to sort by. Sorting can be done by stroke count, by radical (four corners or SKIP), or by phonetic spelling (in pinyin or hiragana). Modern technology has made this easier, and some phone apps (like Pleco) can even OCR hanzi. Still, it's far less convenient than phonetic languages.

Eh, I disagree here. It's harder if you're used to looking things up by the spelling, but once you're fast at looking things up by radical, it's not that difficult. My misguided attempts at slogging through 1Q84 while reading at a, at best, middle school level got me pretty fast at looking up kanji. Not any appreciable difference vs. looking things up in a regular dictionary.

FabHK · on Aug 20, 2016

You cannot write things out in Kana in Chinese. As such, GP's point against logographic writing systems stands, notwithstanding mixed writing systems such as Japanese.

Even without autocorrect, you can write a word in English such that most people would understand. Of course, in a logographic system you'd just write a homophone (which is what people actually do, write a simpler word pronounced the same).

As for looking up, it is in principle easier though. You only need to learn the order of about 26 things, not about 200, and can then run iterative binary search over it, and don't have to switch to stroke count. It is possible, of course.

pmontra · on Aug 20, 2016

Some upper and lower case letters have no clear resemblance, see Aa Rr Gg Nn, so one has to learn 52 symbols. Add other 52 symbols for script, if you have to. Then in the case of English learn how to pronounce or spell words, because in some cases there are no rules (why ocean and not oshean? Because of derivation from Greek, still...)

Anyway, any alphabet is better than Chinese characters.

andrioni · on Aug 20, 2016

>• It's possible to know how to say a word, but have no clue how to write it. This phenomenon is called character amnesia, and it affects most native speakers.[1] Phonetic languages allow you to write out a misspelled word, which readers can understand (or autocorrect can fix). > >• Likewise, it's possible to know what a symbol means, but have no idea how to pronounce it. This is extra-fun in Japanese, where most kanji have multiple pronunciations.

I don't think English is much better in these cases. In fact, the writing can be so divorced from speech that spelling bees are a thing.

ggreer · on Aug 20, 2016

I've had Chinese colleagues who, when asked to write a word they'd just used in a sentence, were simply unable to. At first I thought they were playing a joke on me. But nope, they'd just forgotten the appropriate hanzi, and they couldn't even hazard a guess. It's a totally different failure mode than imperfectly-phonetic languages like English.

w1ntermute · on Aug 20, 2016

From Why Chinese Is So Damn Hard[0]:

> I was once at a luncheon with three Ph.D. students in the Chinese Department at Peking University, all native Chinese (one from Hong Kong). I happened to have a cold that day, and was trying to write a brief note to a friend canceling an appointment that day. I found that I couldn't remember how to write the character 嚔, as in da penti 打喷嚔 "to sneeze". I asked my three friends how to write the character, and to my surprise, all three of them simply shrugged in sheepish embarrassment. Not one of them could correctly produce the character. Now, Peking University is usually considered the "Harvard of China". Can you imagine three Ph.D. students in English at Harvard forgetting how to write the English word "sneeze"?? Yet this state of affairs is by no means uncommon in China. English is simply orders of magnitude easier to write and remember. No matter how low-frequency the word is, or how unorthodox the spelling, the English speaker can always come up with something, simply because there has to be some correspondence between sound and spelling.

0: http://www.pinyin.info/readings/texts/moser.html

andrezsanchez · on Aug 20, 2016

To be fair, you can also "come up with something" in Chinese. Since there aren't all that many sounds, you can write in generic characters for the sound of the word that you can't remember.

fenomas · on Aug 20, 2016

Yep. The analogy I use is, it's a bit like if someone walked up and asked you to draw the logo of this or that company. Even if you've seen the logo a million times, you might not be able to summon up a mental picture of it, or you might remember the rough shape but have no idea how many lines go where.

matthewrudy · on Aug 20, 2016

I've never heard this term "Character Amnesia" but its an analogue to my situation.

I can read and write (via pinyin) a large number of characters, but cannot recollect their shape in abstraction.

I think that's just because as a foreigner learning chinese in the modern world I've never had to learn this skill.

The difference between Recollection and Recognition.

gurkendoktor · on Aug 21, 2016

Same here - and strangely enough, it's rarely a problem. Faking characters by using the correct radical and a random homophone base character works okay in a pinch.

But because I never write characters by hand, I have a really hard time reading handwritten notes, and that is a problem.

mistercow · on Aug 20, 2016

> or autocorrect can fix

If you're bringing computers into it, isn't text entry in Japanese usually done phonetically anyway?

sampo · on Aug 20, 2016

> For example English has according to the Oxford dictionary 171,476 words in current use excluding inflections, and several technical and regional vocabularies.

Here is a website which questions you with some random sample of words from an English dictionary, mixed with randomly generated non-words. Then it estimates the percentage of English words you know.

http://vocabulary.ugent.be/wordtest/start

I am a non-native speaker, and I have scored in the 77% to 89% range, when doing this test several times.

tempestn · on Aug 20, 2016

I'm curious: did you only answer yes to the words whose meanings you knew, or to anything that you knew was indeed a word? There were some that were pretty obviously words, but I wasn't certain the exact meaning (although I could guess), so I answered no. Ended up with 77% (as a native speaker). Apparently average for native speakers is 67%, so 77-89 as a non-native speaker sounds really good.

wingerlang · on Aug 20, 2016

I just did it, and I answered yes to words I knew, or knew that were actual words but I didn't know the exact meaning of. Like Argon, I know it is something related to chemistry but I don't actually know what it is. Some words were compound words which I am not sure would be in a dictionary, but still valid words.

I got 73% and I didn't say 'yes' to any fake words.

73% is apparently "This is a high level for a native speaker."

fenomas · on Aug 20, 2016

> I also think that the Latin alphabet could be easily used for Japanese

Writing Japanese entirely in Latin characters would be no different from writing it entirely in hiragana. Have you ever tried reading that way?

gizmo686 · on Aug 20, 2016

Kind of. In my first semester of japanese we worked in hiragana+spaces.

Having read English language papers on Japanese linguistics, I can also say that reading the Latin is easy too.

fenomas · on Aug 20, 2016

Sure, I didn't mean to suggest it can't be done in short spurts. But reading a novel that way would be hellish.

The larger point being, Japanese isn't locked into using a logographic system - it already has two phonetic syllabaries that people could start using exclusively if there was some advantage to doing so.

cthalupa · on Aug 20, 2016

That sounds like an absolutely miserable experience. I'd rather be forced to look up every 3rd or 4th kanji than try to deal with all hiragana writing.

jacobolus · on Aug 20, 2016

> I also do not think that the Latin alphabet could be easily used for Japanese, [...]

You stuck an extra “do not” in your sentence

* * *

As far as alphabets go, the Phoenician/Greek/Etruscan/Latin alphabet is pretty ad hoc and mediocre. But hey, it’s what we know. At this point, I think we’re stuck with it.

Similar story for modern Hindu/Arabic/European numeral glyphs. Learning arithmetic would be noticeably simpler if the glyphs expressed some of the symmetries of the number system. Alas.

gizmo686 · on Aug 20, 2016

Removed the "do not"

As far as the alphabet itself goes, I do not think that Latin is that bad. All symbols have a canonical sound associated with them. The problem is that our usage of the alphabet is horribly inconsistent. This is partially due to the fact that English has sounds that cannot be expressed using the "pure" alphabet. Arguably Japanese has this same problem in their system, with the ゃ、ょ、ゅ modifiers. But at least they distinguish those from や、よ、ゆ by size, and are disciplined about their usage, so we can consider the set of compounds to be their own characters and not have a mess.

Of course you still have the ず/づ issue, and the pronunciation of は and を as わ and お in their most common usage. But, even in modern Japanese, these oddities are not universal.

Out of curiousity, are you aware of any numeral system that beats Arabic? By pre-Arabic European standards, Arabic numerals are a masterpiece of symmetry.

jacobolus · on Aug 20, 2016

Here’s my proposal for base twelve numerals, http://i.imgur.com/UobIObq.jpg ; multiplication mod twelve, http://i.imgur.com/dRielBv.jpg

It can also be nice to use a “balanced base”, with digits for negative numbers, e.g. in a base ten context you’d have digits for –4 to 5 (or if you’re willing to have multiple expressions for the same number, –5 to 5).

A balanced base twelve multiplication table might look like this: http://i.imgur.com/quEcxH0.png

bhaak · on Aug 20, 2016

> As far as alphabets go, the Phoenician/Greek/Etruscan/Latin alphabet is pretty ad hoc and mediocre. But hey, it’s what we know. At this point, I think we’re stuck with it.

You mix the whole development line of that Latin alphabet into one dismissive argument. I see lots of difference between the Phoenician and the Latin alphabet and FWIW, the Latin alphabet is quite versatile as its wide application shows.

It wonder what do you consider mediocre about them?

> Similar story for modern Hindu/Arabic/European numeral glyphs. Learning arithmetic would be noticeably simpler if the glyphs expressed some of the symmetries of the number system. Alas.

I don't think learning arithmetic would be much simpler with other numerals. Even the Romans could do it and they had one of the worst possible numerical systems.

I find our numerals quite fine. My daughter was recognizing numbers before she turned 2. There is some mnemonic to the first four (1 line, 2 corners on the left, 3 corners on the left, 4 corners overall) and most are quite distinct from our Latin letters. 6 and 9 are annoyingly symmetrical of each other, though.

jacobolus · on Aug 21, 2016

Writing a less dismissive / more serious argument about the Latin alphabet would take a few hundred pages. You’re right though, I’m not a speaker of (or expert in) ancient Phoenician, perhaps their alphabet was a bit better structured for that language (it looks pretty ad hoc though). I can primarily speak to the Latin alphabet’s irregularity and mediocrity for representing modern English/Spanish/etc., though it doesn’t seem to have been much better for Greek or Latin. Obviously it works well enough to be the practical anchor for written culture, and I can certainly imagine worse systems (little Egyptian-style pictographs for letters for example). But it’s hardly elegant or systematic. The ordering of the letters is also pretty much arbitrary, and has nothing to do with the separation between consonants and vowels, or the relationship between particular sounds.

For an example of a better designed alphabet, check out Korean Hangul.

* * *

The numerals 1, 2, 3 come from just writing strokes, like tally marks, which over time became connected in handwriting. The other numbers were mostly fairly arbitrary symbols, which morphed slowly over time with occasional replacements and swaps. Otherwise, the symbols have absolutely nothing to do with the numbers they represent or with the base ten number system. Overall, I’d say numbers 0 and 1 are pretty effective. The rest are a huge waste of potential.

Same story for the words/names used to represent the numbers. They are made of arbitrary sounds in arbitrary numbers of syllables, reveal nothing about the theoretical properties of the numbers, some of them are hard to say or easy to mistake, etc. Especially for numbers beyond ten, the names are irregular and confusing. This has a real practical impact. Counting is notably easier for Chinese speaking children than for English speakers.

> I don't think learning arithmetic would be much simpler with other numerals. Even the Romans could do it and they had one of the worst possible numerical systems.

In general, Romans did their arithmetic using little pebbles (“calculus”) on counting board (“abacus”), and used written symbols only for recording the output of their calculations. This made some types of computation very difficult (because using pebbles to record every step gets cumbersome), which helps explain why science has taken off in the past 500 years in Europe after we started developing better notational conventions and using Hindu–Arabic numerals and later decimal fractions, logarithms, etc.

My son is about 2 weeks old, so I can’t tell you yet how well he learns arithmetic using a different set of numerals. Ask me again in about 10 years.

mcguire · on Aug 20, 2016

We should switch to Fëanorean script. It's almost IPA without the notational horrors.

rogual · on Aug 20, 2016

> the Japanese phonetic system writes voiced sounds as a modification of their unvoiced counterparts. why can't we all do that.

Fun fact, we do do that in English, at least for C and G. (G was introduced as a modified C to indicate the voicing).

Jack000 · on Aug 20, 2016

by that measure we should forget about historical languages and learn something constructed like esperanto.

languages are not solely a means of communication but a part of a people's cultural identity. I think the greater dependence on contextual cues and ambiguity in Chinese/Japanese lends itself much better for linguistic art forms like poetry and literature.

steve19 · on Aug 20, 2016

I think the debate is more Logographic vs. Alphabet, rather than Logographic vs. the Latin Alphabet.

There are pros and cons. A big con with Alphabets is that words lose their meaning over times. I find reading Old English (1500 years old) to be less comprehensible than "modern" Latin, despite being a native english speaker, and only knowing a little latin.

I find reading even Early Modern English (400 years old) an effort initially before I get reacquainted with it (Shakespeare).

In 300 years time I hate to think what English speakers will think of our texts.

That said, if I had to choose another language to learn, it would be one with an Alphabet, which seems far easier to me to learn, and type, than memorizing 1000s of symbols.

Margh · on Aug 20, 2016

If you replace kanji with katakana and keep hiragana for particles and conjugation, you can call it a day.

Easy to learn, no more trying to guess if it's on/kunyomi, immune to mispronunciation from using a foreign alphabet, the list goes on.

fenomas · on Aug 20, 2016

Speaking as someone who started as an adult and is now fluent, I think this would make Japanese much, much harder to learn.

Or rather, the 2-3 months would be ten times easier and everything after would be ten times harder.

etatoby · on Aug 20, 2016

Why?

What is the advantage of using a different symbol for each word, that offsets the huge disadvantages of having to learn and remember a different symbol for each word?

Especially considering that the spoken language already distinguishes between all possible words through pronunciation (and context in the case of homophones.)

fenomas · on Aug 20, 2016

It's hard to explain. In English, spelling, pronunciation, and meaning are all more or less interrelated, right? In Japanese, writing (kanji) correlates to pronunciation and to meaning, but pronunciation and meaning are mostly unrelated to each other. Kanji is what disambiguates them.

So, obviously learning 1000 kanji isn't easy. But doing that is what makes it possible to learn 100,000+ words whose pronunciations and meanings would be otherwise largely unrelated.

It's quite similar to the role that Latin/Greek roots play in English. When you see a word that includes "-graph-" you know it probably involves writing, and similarly when a student of Japanese sees a word with "間 (kan)" they know it involves an interval or space. Throw away the kanji, and your student now just sees "kan" - which means the word will probably involve an interval -- or a barrier, or emotion, or appearance, or a tube, or a building, a warship, a crown, an ending, China, a publication, a government ministry, or.. you get the idea.

Zakiazigazi · on Aug 20, 2016

A lot of people think that and personally as someone fluent in Japanese (as a second, well rather something like fourth, language) I also sort of feel the same way. However if you look at it without the learned biases, there is a great example where a country with fairly similar language in terms of grammar and sounds that had used to use chinese characters switched to a phonetic alphabet and are not noticeably worse off for it: Korea.

fenomas · on Aug 20, 2016

This has been brought up and replied to lots of places elsewhere on the page.

x3al · on Aug 20, 2016

There are way too much homophones and you don't always have the luxury of the context. Learning a symbol for each root (not word!) is not that bad, English spelling is almost as bad, actually.

Spoken language is quite limited compared to written Japanese.

anatoly · on Aug 20, 2016

Do Japanese audiobooks exist?

Assuming yes, do their users have significant problems understanding the written text when pronounced in an audiobook? Are there well-known conventions or shortcuts or explanations that audiobook readers insert into their speech to signal the correct meaning of the word?

Do Japanese audiobooks provide evidence for or against the idea that doing away with kanji in writing would not harm understanding significantly?

x3al · on Aug 22, 2016

Fiction audiobooks do exists (although not nearly as common as in English-speaking countries), but audiobooks can't possibly work with non-fiction and especially technical texts unless you are going to use English words for literally every single term. I mean, Japanese has only about 100 moraes and way too much words are just 2-3 moraes long.

mcguire · on Aug 20, 2016

We should all speak Hawaiian.

Sniffnoy · on Aug 20, 2016

Not sure how great an explanation that really is. I like Zompist's explanation: http://www.zompist.com/yingzi/yingzi.htm

bemmu · on Aug 20, 2016

That certainly looks more complete. I was mostly wanting to see if I can pull the reader into learning Japanese without realizing that they were doing so.

HaloZero · on Aug 20, 2016

Your ___domain was a bit of a hint. ^_^

danvayn · on Aug 20, 2016

Thanks for this, it's a great primer.

zatkin · on Aug 20, 2016

Aren't we sort of starting to doing this with the introduction of emojis? They're a little bit ambiguous, but they do have meaning behind them, nonetheless. ‍️

alannallama · on Aug 20, 2016

That's why you see services with lots of pictographic functions eclipse pure test-based ones in Asia, because they are very used to that kind of symbolic communication. And why complex emoji using text characters in creative ways were developed in Asia in the first place.

For example, Twitter never really took off in southeast Asia, but Line is incredibly popular. Why? Stickers. Line offers endless little pictures you can use with your messages, while Twitter doesn't.

Now stickers (and emoji) are taking off a lot more in the West, because they are super compact and effective communication symbols. I think we'll see more and more of it.

dhfromkorea · on Aug 20, 2016

Another interesting side-effect that the compactness of Chinese symbols (other variations: Hanja in Korean, Kanji in Japanese and so forth) allowed was a higher chance of survival against natural disasters like wild fires or crimes like thefts or vandalism.

It was/is far easier to ensure redundancy of scripts and books since the costs of reprinting/copying was far lower compared to other forms of phonetic systems.

The compactness explains how so many archaic, buddhist scripts could survive to this day.

dctoedt · on Aug 20, 2016

A counterpoint: When a message is written with an alphabet, it's not as compact, but its meaning can be guessed at even if significant portions of the message are missing (known as lacunae). See, e.g., the TV game show Wheel of Fortune; another example is the Dead Sea Scrolls and other ancient manuscripts that have deteriorated over time.

dhfromkorea · on Aug 20, 2016

That may be true. And perhaps the same argument could be made for Chinese characters?

Could you elaborate on why the alphabetic system is intrinsically more efficient than Chinese characters in terms of recovering messages from partial loss of texts?

dctoedt · on Aug 20, 2016

Spatial dispersal of the glyphs means that fewer glyphs would be taken out by any given insect-gnawed hole, UV-radiation fading, hurled paint glob, etc., and thus less of the overall message would be lost to that single incident of damage.

It's the same principle as how soldiers are trained to spread out when in battle: If they bunch up, it increases the risk that a single mortar shell (or artillery round or machine-gun burst) could take out a lot of troops.

dhfromkorea · on Aug 21, 2016

Oh, now I see. Thank you for explaining your point succinctly.

Though I do not have data to back up my argument, I still reckon the Chinese glyphs/scriptures would have had a better chance of survival.

While I think your point is valid, its disadvantages outweigh the advantage, at least since paper/papyrus was invented.

Being spread out to double in length (double being an arbitrary multiplier) would still be inferior to being dispersed to two physical locations (redundancy). I think this is where don't put all your eggs in one basket holds true.

Plus, important docs must have been actively maintained by hired librarians(?). With human maintenance involved, less in volume could have been an advantage for it is easier to move around and maintain the docs. Ofc, when left out in the wild, it is a different story.

Personally I do not like Chinese character system as it has so high a barrier to entry for learners. I love alphabets, Korean Hangeul, or Japanese Hira/Katakana for this matter. Have you tried learning any of those? :-)

edwinyzh · on Aug 20, 2016

Anybody care to explain why "costs of reprinting/copying was far lower compared to other forms of phonetic systems"? I know nothing about printing technology but I'm a Chinese.

_pfxa · on Aug 20, 2016

Far many more symbols are required to represent a word with alphabetical systems. I used 65 symbols and 11 spaces to write this sentence whereas with ideograms I'd need about 13 symbols.

edwinyzh · on Aug 20, 2016

OIC, To express the same meaning, using Chinese needs far more less physical space than using English. And you know what? Classical Chinese takes the compactness to the next level ;D

I'll try give an example: English: The quick brown fox jumps over the lazy dog

Chinese: 敏捷的棕毛狐狸从懶狗身上跃过

Classical Chinese：棕色敏狐跃懶犬 Note: This is composed by me, maybe not very well-written, and maybe it can be even more compact, but you see what I mean ;)

Grue3 · on Aug 20, 2016

入 means "enter", 口 means "mouth". 入口 means... "entrance". Actually for most kanji there is no single meaning. Some meanings might even have nothing in common with each other, because they've been based on ancient Chinese wordplay or something.

edwinyzh · on Aug 20, 2016

But 口 also means "loophole", so 入口 means "entrance" is perfectly logical - "The loophole that allows you to enter another building/strucutre".

a_bonobo · on Aug 22, 2016

Fun coincidence:

A vomitorium (any modern person associates that with vomiting, i.e., stuff coming out of your mouth) was the name for entrances in Roman amphitheatres.

colejohnson66 · on Aug 20, 2016

Entering the mouth of the building?

minikomi · on Aug 20, 2016

I think the character does double service in that it has meanings which are very much mouth-related - 薄口 (thin-mouthed - like weak taste) 後口 (after-mouth - after taste); but it also has a ton of meanings which are like opening / spout / hole / crater.

http://compling.hss.ntu.edu.sg/omw/cgi-bin/wn-gridx.cgi?usrn...

It's not that a land's-mouth is a crater, more like, mouth and opening are more synonymous in feeling in Japanese.

force_reboot · on Aug 20, 2016

Almost certainly. One fascinating aspect of language is that many metaphors that are baked into language appear in many languages. E.g. In English we can form the future tense with modal verbs, "I will..." and "I am going to..." and in Chinese there are similar modal verbs "我要..." and "我去...". In both languages the idea of intention, or motion, are used as a metaphor in forming the future tense. Or 加油, an expression of encouragement similar to "put your foot on it" which has no equivalent in English, but does in Danish, "giv det gas".

Symbiote · on Aug 20, 2016

"Put your foot on it" means the accelerator / gas pedal, that seems very much equivalent.

force_reboot · on Aug 20, 2016

I mean in English it isn't used as a generic encouragement, while in Mandarin and Danish it is.

gayprogrammer · on Aug 20, 2016

put the pedal to the metal

(idiomatic) To exert maximum effort.

https://en.wiktionary.org/wiki/put_the_pedal_to_the_metal#En...

force_reboot · on Aug 21, 2016

Yes, that is an equivalent phrase, but much less commonly used than 加油 and "gi' det gas".

daveheq · on Aug 20, 2016

"Since we already have symbols for all the sounds we can pronounce"... Not with 26 letters we don't. Other languages have other sounds that English can only try to emulate, and even English has sounds that require multiple letters.