IRC log of #maemo for Wednesday, 2020-10-14

*** infobot has joined #maemo 00:16
*** ChanServ sets mode: +v infobot 00:16
*** Oksana_ has joined #maemo 00:31
*** Pali has quit IRC 00:52
<brolin_empey> Maxdamantus: OK, thank you for the informative answer. I will try to get around to opening your two links. I only know how to write in languages that use an alphabet, so I was curious how text written in a language such as Chinese or Japanese that does not use an alphabet is sorted, assuming that it can be sorted. My friend from Beijing showed me how he uses an IME to write in Chinese on Android, but I do not know enough about a language that does not use an alphabet to write in the language. 01:04
<brolin_empey> He said he thinks the stroke order or the number of strokes (I do not remember which one he said) is used for sorting text, but at least one other person I asked said they do not think text written in Chinese can be sorted. I kept meaning to try using software that supports Chinese text to sort Chinese text to see what it does, but I ran out of time and then forgot about it or had other, higher-priority things to do. 01:08
*** Kilroo has joined #maemo 01:11
<Maxdamantus> It will likely be ad-hoc to the writing system. I'm not sure how Chinese logograms work exactly, but in general I would expect a writing system to be made of a relatively small number of primitive concepts. 01:25
<Maxdamantus> eg, if you look at Hangul, you might have thousands of "characters", but each one is really just a combination of up to three primitive symbols denoting any start/middle/end sounds for a syllable. 01:26
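The Hangul point above can be checked directly, since precomposed Hangul syllables are laid out arithmetically in Unicode. The following is a minimal sketch, not anything posted in the channel; the function name `decompose_hangul` is made up for illustration, and the constants are the standard Hangul syllable block parameters.

```python
import unicodedata

# Standard parameters of the precomposed Hangul syllable block.
S_BASE, L_BASE, V_BASE, T_BASE = 0xAC00, 0x1100, 0x1161, 0x11A7
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28


def decompose_hangul(syllable):
    """Split one precomposed syllable into lead consonant, vowel, optional tail."""
    index = ord(syllable) - S_BASE
    if not 0 <= index < L_COUNT * V_COUNT * T_COUNT:
        raise ValueError("not a precomposed Hangul syllable")
    lead = chr(L_BASE + index // (V_COUNT * T_COUNT))
    vowel = chr(V_BASE + (index % (V_COUNT * T_COUNT)) // T_COUNT)
    tail_index = index % T_COUNT  # 0 means the syllable has no final consonant
    tail = chr(T_BASE + tail_index) if tail_index else ""
    return lead + vowel + tail


han = "\ud55c"  # the syllable "han": start, middle and end sound
ga = "\uac00"   # the syllable "ga": start and middle sound only
print([hex(ord(ch)) for ch in decompose_hangul(han)])
print([hex(ord(ch)) for ch in decompose_hangul(ga)])
```

The result should agree with what `unicodedata.normalize("NFD", ...)` returns for the same syllable, which is a convenient cross-check.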
<Maxdamantus> (Japanese kana are similar, but with the exception of "-n", their syllables all consist of one vowel, possibly preceded by a consonant, so only two primitive concepts in each glyph) 01:28
<L29Ah> no? 01:28
<Maxdamantus> and since that combination in Japanese kana only leads to around 50 symbols (5*10), it doesn't need to be as regular as Hangul. 01:29
<Maxdamantus> No what? 01:29
<L29Ah> ah nvm, for the ordering reason it's ok 01:30
<L29Ah> there's ゃ, ゅ and ょ to have a little fun with 01:30
<L29Ah> anyway though i don't see why don't you just grab unicode code points and be done with it 01:31
<Maxdamantus> Because Unicode code point ordering might not follow a well-understood pattern. It just depends on who designed the layout for that script in Unicode. 01:43
<Maxdamantus> Even in Latin-based scripts, you don't have that. An obvious example would be 'ı' in Turkish. 01:43
<Maxdamantus> or simply 'ü' in German. 01:44
<L29Ah> i think it can even change between languages using the same character set 01:44
<Maxdamantus> I imagine there are languages using Latin-based scripts that have orders that are inconsistent with English. 01:45
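To illustrate the 'ü' example: a plain sort compares code points, so 'ü' (U+00FC) lands after 'z' (U+007A). The sketch below contrasts that with a toy key that strips combining marks so that 'ü' sorts alongside 'u', which is roughly what German dictionary ordering does as far as I understand it. The helper name `fold_key` is hypothetical, and this is not a substitute for real locale-aware collation: for instance it does nothing for Turkish 'ı', which has no decomposition.

```python
import unicodedata


def fold_key(word):
    """Toy sort key: compare base letters first, then the original spelling."""
    decomposed = unicodedata.normalize("NFD", word)
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return (base.casefold(), word)


words = ["zu", "\u00fcber", "uns"]  # "\u00fcber" is "über"

by_code_point = sorted(words)          # raw code point comparison
by_folded = sorted(words, key=fold_key)

print(by_code_point)
print(by_folded)
```

With raw code points "über" ends up last; with the folded key it comes first, before "uns".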
<Maxdamantus> also, I know that in Arabic there are at least two well-known orderings of letters (one starts with "alef, ba, gim, dal" like in Greek, the other starts with "alef, ba, ta, tha") 01:46
<Maxdamantus> and people use those different Arabic orders in different contexts. 01:46
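Two orderings over the same letters can be modelled as two rank tables feeding the same sort. This is only a sketch: the names `ABJADI`, `HIJAI` and `order_key` are made up here, and the lists contain just the six letters mentioned above, placed in the relative order I believe each full ordering gives them (that restriction is an assumption beyond what was said in the channel).

```python
# Letters written as escapes to avoid right-to-left display issues.
ALEF, BEH, TEH, THEH, JEEM, DAL = (
    "\u0627", "\u0628", "\u062a", "\u062b", "\u062c", "\u062f",
)

# Partial alphabets, limited to six letters (assumed relative order).
ABJADI = [ALEF, BEH, JEEM, DAL, TEH, THEH]  # "alef, ba, gim, dal" first
HIJAI = [ALEF, BEH, TEH, THEH, JEEM, DAL]   # "alef, ba, ta, tha" first


def order_key(alphabet):
    """Build a sort key that ranks each character by its position in alphabet."""
    rank = {letter: position for position, letter in enumerate(alphabet)}

    def key(word):
        # Characters outside the table sort after all known letters.
        return [rank.get(ch, len(rank)) for ch in word]

    return key


letters = [DAL, TEH, BEH, JEEM]
abjadi_sorted = sorted(letters, key=order_key(ABJADI))
hijai_sorted = sorted(letters, key=order_key(HIJAI))
print(abjadi_sorted == hijai_sorted)  # the two orders disagree
```

The same input list comes out in a different order depending on which table is used, which is the situation a single code point order cannot express.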
* enyc meows 01:47
<brolin_empey> $ cat /dev/urandom >enyc 01:47
*** inz has quit IRC 02:04
*** geaaru has quit IRC 02:22
*** Oksana_ is now known as Oksana 02:23
*** chainsawbike has quit IRC 02:23
*** chainsawbike has joined #maemo 02:23
*** FalconSpy has quit IRC 02:25
*** inz has joined #maemo 02:27
*** FalconSpy has joined #maemo 02:28
*** geaaru has joined #maemo 02:34
*** florian has quit IRC 02:42
*** xkr47 has quit IRC 02:43
*** xkr47 has joined #maemo 02:55
*** jskarvad has quit IRC 03:00
*** drathir_tor has quit IRC 04:32
*** tm has quit IRC 04:33
*** tm has joined #maemo 04:36
*** minicom has quit IRC 05:17
*** minicom7 has joined #maemo 05:18
*** pagurus has quit IRC 06:58
*** Kilroo has quit IRC 06:58
*** infobot has quit IRC 07:25
*** DocScrutinizer05 has quit IRC 07:27
*** DocScrutinizer05 has joined #maemo 07:27
*** infobot has joined #maemo 07:38
*** ChanServ sets mode: +v infobot 07:38
*** peetah has quit IRC 08:05
*** drathir_tor has joined #maemo 08:50
*** drathir_tor has quit IRC 08:55
*** drathir_tor has joined #maemo 09:16
*** Pali has joined #maemo 09:40
*** Pali has quit IRC 10:05
*** jskarvad has joined #maemo 11:26
*** xmn has quit IRC 11:36
*** florian_kc has joined #maemo 11:42
*** esaym153 has quit IRC 11:50
*** florian_kc is now known as florian 12:19
*** minicom7 is now known as minicom 12:57
*** esaym153 has joined #maemo 14:08
*** ab has quit IRC 14:40
*** peetah has joined #maemo 14:41
*** chainsawbike has quit IRC 14:41
*** chainsawbike has joined #maemo 14:41
*** ab has joined #maemo 14:42
*** ab has joined #maemo 14:42
*** Linkandzelda has quit IRC 15:14
*** Linkandzelda has joined #maemo 15:17
*** drathir_tor has quit IRC 15:48
*** drathir_tor has joined #maemo 16:00
*** drathir_tor has quit IRC 16:00
*** drathir_tor has joined #maemo 16:05
*** drathir_tor has quit IRC 16:14
*** florian_kc has joined #maemo 16:23
*** drathir_tor has joined #maemo 16:35
<CcxWrk> You don't sort by codepoints, there's a whole Unicode Collation Algorithm: https://www.unicode.org/reports/tr10/ 16:55
<L29Ah> > Siniform ideographs — most notably modern CJK (Han) ideographs — and Hangul syllables are not explicitly mentioned in the default table. Ideographs are mapped to collation elements that are derived from their Unicode code point value as described in Section 10.1.3, Implicit Weights. 16:56
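As a rough picture of what "derived from their Unicode code point value" means, the sketch below computes the pair of primary weights in the way I recall Section 10.1.3 describing it: a base value plus the high bits of the code point, followed by the low 15 bits with the top bit set. The function name `implicit_primary_weights` is made up, and the range checks are a simplification standing in for the Unified_Ideograph property; only the main CJK block and Extensions A and B are recognised here, so treat the base selection as an assumption and check the specification before relying on it.

```python
def implicit_primary_weights(code_point):
    """Return the two primary weights (AAAA, BBBB) for an unlisted code point."""
    if 0x4E00 <= code_point <= 0x9FFF:
        base = 0xFB40  # main CJK Unified Ideographs block (approximated by range)
    elif 0x3400 <= code_point <= 0x4DBF or 0x20000 <= code_point <= 0x2A6DF:
        base = 0xFB80  # Extension A and Extension B only (incomplete list)
    else:
        base = 0xFBC0  # anything else, e.g. unassigned code points
    aaaa = base + (code_point >> 15)
    bbbb = (code_point & 0x7FFF) | 0x8000
    return aaaa, bbbb


def implicit_key(text):
    """Sort key built from implicit weights only (ignores the real default table)."""
    return [implicit_primary_weights(ord(ch)) for ch in text]


chars = ["\u4e2d", "\u4e00", "\u4eba"]  # three ideographs from the main block
print([hex(w) for w in implicit_primary_weights(0x4E00)])
print(sorted(chars, key=implicit_key) == sorted(chars))
```

Within one base the weights grow with the code point, so for ideographs from the same block this default ordering matches plain code point order.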
<CcxWrk> Hm, even libicu pages on this seem to be full of TODOs http://site.icu-project.org/design/collation/script-reordering 17:02
<CcxWrk> Heh and the official document on Collation points to … PowerPoint file? :] 17:05
<CcxWrk> But no, we better focus on adding more emoji combinations /s 17:05
<KotCzarny> sticking to those funny chars is like keeping ebcdic around 17:07
<KotCzarny> sure, some legacy code uses it, but whole thing should be deprecated 17:07
<L29Ah> indeed, latin should be deprecated in favour of han 17:08
<KotCzarny> i think you've meant emojis 17:10
<L29Ah> nah emojis are ideographs like han, they're fine 17:11
*** florian has quit IRC 18:46
*** drathir_tor has quit IRC 18:47
*** florian_kc has quit IRC 18:49
*** sedate has joined #maemo 18:55
*** sedate has quit IRC 19:01
*** drathir_tor has joined #maemo 19:04
*** Pali has joined #maemo 19:45
*** drathir_tor has quit IRC 20:33
*** drathir_tor has joined #maemo 20:37
*** xmn has joined #maemo 21:08
*** drathir_tor has quit IRC 21:12
*** drathir_tor has joined #maemo 21:30
*** drathir_tor has quit IRC 21:37
*** drathir_tor has joined #maemo 21:54
*** florian has joined #maemo 22:11
*** luke-jr has quit IRC 22:22
*** luke-jr has joined #maemo 22:22
*** akossh has joined #maemo 22:50
*** luke-jr has quit IRC 23:10
*** luke-jr has joined #maemo 23:12
*** luke-jr has quit IRC 23:23
*** luke-jr has joined #maemo 23:24
*** florian_kc has joined #maemo 23:27
*** florian has quit IRC 23:28
*** luke-jr has quit IRC 23:45
*** drathir_tor has quit IRC 23:47
*** florian_kc has quit IRC 23:51
*** luke-jr has joined #maemo 23:55
*** drathir_tor has joined #maemo 23:56
