Convert a unicode String In C++ To Upper Case
How we can convert a multi language string or unicode string to upper/lower case in C or C++.
If your system is already in UTF-8, by using std::use_facet
, you can write:
#include <iostream>
#include <locale.h>
int main() {
std::locale::global(std::locale("")); // (*)
std::wcout.imbue(std::locale());
auto& f = std::use_facet<std::ctype<wchar_t>>(std::locale());
std::wstring str = L"Zoë Saldaña played in La maldición del padre Cardona.";
f.toupper(&str[0], &str[0] + str.size());
std::wcout << str << std::endl;
return 0;
}
And you get (http://ideone.com/AFHoHC):
ZOË SALDAÑA PLAYED IN LA MALDICIÓN DEL PADRE CARDONA.
If it don't work you will have to change (*) into std::locale::global(std::locale("en_US.UTF8"));
or an UTF-8 locale you actually have on the plateform.
I found 2 solution of that problem_
1. setlocale(LC_CTYPE, "en_US.UTF-8"); // the locale will be the UTF-8 enabled English
std::wstring str = L"Zoë Saldaña played in La maldición del padre Cardona.ëèñ";
std::wcout << str << std::endl;
for (wstring::iterator it = str.begin(); it != str.end(); ++it)
*it = towupper(*it);
std::wcout << "toUpper_onGCC_LLVM_1 :: "<< str << std::endl;
this is working on LLVM GCC 4.2 Compiler.
2. std::locale::global(std::locale("en_US.UTF-8")); // the locale will be the UTF-8 enabled English
std::wcout.imbue(std::locale());
const std::ctype<wchar_t>& f = std::use_facet< std::ctype<wchar_t> >(std::locale());
std::wstring str = L"Chloëè";//"Zoë Saldaña played in La maldición del padre Cardona.";
f.toupper(&str[0], &str[0] + str.size());
std::wcout << str << std::endl;
This is working in Apple LLVM 4.2.
Both case i ran on Xocde. But I am finding a way to run this code in Eclipse with g++ Compiler.