ASCII, UNICODE et UTF8 - Spé NSI - Première Informatique
C++ : How to get Unicode for Chracter strings(UTF-8) in c or c++ language (Linux)
Unicode: Going Down the Rabbit Hole - Peter Bindels - CppCon 2019
Комментарии
1:36 Single-Byte Encodings
7:24 Variable-Length Encodings
12:11 Unicode 1.0
19:14 Unicode Encodings Today
41:45 Dynamic Composition
44:14 Unicode Types in C++
50:11 "Length"
52:17 "Equality"
59:37 Ordering
1:02:13 Other Text Operations
1:07:33 International Components for Unicode (ICU)
1:11:51 Boost.Locale and Boost.Regex
1:18:04 Proposals for Standardization
1:21:38 Questions
(Slides are in the description)
remram
this is one of the most useful talks i have watched this year so far. kudos to the presenter.
victornoagbodji
Excellent intro and very competent. Recommended.
t
There should be a template specialization std::basic_string<codepoint> that stores the string internally like Python does (if possible use Latin1 bytes, if not then UCS2 and if that isn't possible UCS4/UTF-32) and iterates over code points like u32string would. "codepoint" would be a class that is basically a char32_t. Maybe you could give it more functionality, e.g. only allowing valid code points.