Who knew Unicode was so versatile?

renzev@lemmy.world · 3 months ago

Who knew Unicode was so versatile?

calcopiritus@lemmy.world · 3 months ago

The purpose of Unicode is to be able to represent everything humans have written. Doesn’t matter if correct or not.

There are some Chinese characters that appear only once in written text, but they happen to be just typos of copying other text. They exist in Unicode.

renzev@lemmy.world · 3 months ago

With all of those obscure characters that they keep on adding, you’d think they’d have the decency to have separate sets for japanese and chinese characters. But nope, those are all lumped together into the CJK Unified Ideographs block. Whether a character shows up chinese-style or japanese-style depends on the font.

calcopiritus@lemmy.world · 3 months ago

I have absolutely no idea about Chinese or Japanese characters, but if they did that there’s probably a technical reason like retro compatibility or something. Unicode has free space left for millions or billions of characters.

renzev@lemmy.world · 3 months ago

Unicode has free space left for millions or billions of characters.

I might be wrong, but isn’t unicode essentially unlimited? Like, they’re just assigning numbers (codepoints) to individual characters. Any limitation would come from encodings like utf-8, no?

MarcomachtKuchen@feddit.org · 3 months ago

That’s correct. The mistake made was making an error while transcribing the same symbol into Unicode