Should you just stick to basic ASCII stuff for databases? For usernames, user input (such as posts), passwords, etc. Or should you embrace unicode and all of the potential problems it can cause? I've heard of some people saying they even have emojis in their passwords. I guess that makes it more secure in the sense that there are more combinations for any given password length, but at the same time, sometimes there are weird issues with unicode characters, like that one iOS bug that caused phones to crash when they received a combination of left-to-right and right-to-left Unicode characters.It could be a potential security issue too, or there might be issues with deserialization as well.
I would imagine ASCII-only makes regular expression easier too. I don't even know all of the characters in Unicode, or how you would test for certain ranges of characters. Plus, Unicode adds new characters all the time.
Yes, absolutely. UTF-8 should be embraced wherever, and as developers we should accept the challenges that come with it. Being able to seamlessly support other alphabets is quite valuable, and embraces the miracle that we have such a global standard.
>>11 my dubs don't need unicode, they can be represented by ASCII
Name:
Anonymous2019-08-27 5:58
Separating emoji from the rest of unicide is sutismic. Just support it, all the real languages do it out of the box anyway.
Name:
Anonymous2019-08-27 6:50
>>8 35 year-old Millenial here, i also prefer ASCII and think unicode is waste of space, especially the huge fonts to display east asian hieroglyphics(reminds me of chinese typewriter with 1000's of tiny buttons). Regional encodings were messy but they didn't require complex processing and entire libraries to "determine the shape of grapheme and its position".
>>16 By using latin syllables representing the kanji instead of thousands of graphemes?
Name:
Anonymous2019-08-27 15:24
We do not want Unicode, just 8-bit characters. We need the twelve window characters added to the VGA font 0x02-0x0D. Japan, China and Korea must switch to alphabets. Maybe, the United States will change to metric, out of good will.
>>8 Wouldn't this make it a GenX? >>17 Laughed. >>19 This lol. Why aren't computer scientist codifying in hangul? It's byte safe!
Entire thread can be summarized: ”I interpret data in some wacky language than numerics and address space. Also, I want my data represented in base85 for "cleanliness". Fuck binary. Fuck GPG.” Edited on 27/08/2019 19:54.
>>8
Wouldn't this make it a genx?
Wouldn't this make it a GenX?
>>17
Laughed.
>>19
This lol. Why aren't computer scientist codifying in hangul? It's byte safe!
Entire thread can be summarized: ”I interpret data in some language some wacky language than numerics and address space. Also, I want my data represented in base85 for "cleanliness". Fuck binary.”
Entire thread can be summarized: ”I interpret data in some wacky language than numerics and address space. Also, I want my data represented in base85 for "cleanliness". Fuck binary. Fuck GPG.”