Difference: UnicodeGuide (r26 vs. r25)
All the text we type is stored in some "character encoding" (sometimes referred to as a page code, or more commonly a code page). English and many other Latin-script languages are usually encoded in the ISO-8859-1 page code. Here are some more examples of other page codes:
Note: a language doesn't necessarily have only one page code. Also, Rockbox may support one of a language's page codes but not the others.
Unicode, as defined by Wikipedia, is "an industry standard whose goal is to provide the means by which text of all forms and languages can be encoded for use by computers." To put it simply, Unicode supports multiple languages, which eliminates the need to use a different page code for every single language. More about the use of Unicode is explained below.
Tags are usually encoded in the default page code of the OS on the PC where they were written. This is the case for most tags, except those created very recently by programs such as Mp3tag and other taggers that encode tags in Unicode by default. When a tag is encoded in a page code rather than in Unicode, Rockbox can only display it if you select the matching "default page code" and a suitable "font".
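To make the difference between the two kinds of tag concrete, here is a minimal Python sketch of how a reader might decode the body of an ID3v2 text frame. It assumes the tag is ID3v2, where the first byte of a text frame names the encoding (0 for ISO-8859-1, 1 for UTF-16 with a byte order mark, and in ID3v2.4 also 2 for UTF-16BE and 3 for UTF-8). The function name `decode_text_frame` and its `default_page_code` parameter are made up for this illustration and are not Rockbox code.

```python
# Encoding byte -> Python codec for the Unicode variants of an ID3v2 text frame.
# Byte 0 officially means ISO-8859-1, but taggers often store the OS page code
# there, so the reader has to be told which page code to assume.
ID3V2_UNICODE_CODECS = {1: "utf-16", 2: "utf-16-be", 3: "utf-8"}


def decode_text_frame(body: bytes, default_page_code: str = "iso-8859-1") -> str:
    """Decode a text frame body (hypothetical helper, not Rockbox's own code)."""
    if not body:
        return ""
    encoding_byte, payload = body[0], body[1:]
    # Unicode frames say how to decode themselves; anything else falls back
    # to the page code chosen by the user.
    codec = ID3V2_UNICODE_CODECS.get(encoding_byte, default_page_code)
    # Undecodable bytes become replacement characters instead of raising.
    return payload.decode(codec, errors="replace").rstrip("\x00")


# A Unicode tag decodes correctly whatever the default page code is.
unicode_frame = b"\x03" + "音乐".encode("utf-8")
title_from_unicode = decode_text_frame(unicode_frame)

# A page-code tag only decodes correctly when the default matches.
legacy_frame = b"\x00" + "音乐".encode("gbk")
title_with_match = decode_text_frame(legacy_frame, default_page_code="gbk")
title_with_mismatch = decode_text_frame(legacy_frame)
```

In this sketch the Unicode frame needs no setting at all, while the page-code frame depends entirely on the value passed as `default_page_code`, which is the role the "default page code" setting plays.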
Note: In Windows XP, the default page code setting can be found at "Control Panel/Regional and Language Options/Advanced tab/Language for non-Unicode programs".
To explain further, say you have an ID3 tag with Chinese info, encoded in the OS's default page code (in this case Chinese). This tag will show as garbage on Rockbox unless you choose Chinese as your default page code and Unifont as your font. With those settings, Chinese songs and English songs (ISO-8859-1) will all display properly, since the Chinese page codes keep the basic ASCII characters that English uses.
The problem with Unifont is that it might not be optimized for the WPS of your choice. The 6+12x13 font, added on Feb. 12 2006, supports Japanese and Korean characters and tends to be compatible with a larger number of WPSs. It is also smaller than Unifont and allows more characters to fit on the screen. You may wish to use 6+12x13 instead of Unifont if you only need support for Japanese and Korean characters.
If you have ID3 tags in more than one foreign language, then the above solution isn't perfect for you. Say that in addition to Chinese ID3s, you also have Arabic, Greek, Hebrew, Korean or Japanese ID3s (any combination of two or more of these languages runs into the same problem). For my example, I'll choose Arabic as the second foreign language in my ID3s. If you're using Windows, having ID3s in two of these languages means that you need to switch your default page code depending on which language you want to display. If I want to display an Arabic-encoded ID3, I have to choose Arabic as my default page code. If I want to display a Chinese-encoded ID3, I have to choose Chinese. If Arabic is set as the default page code, Arabic songs display fine, but Chinese songs show as garbage. The same is true for Rockbox. If your default page code on Rockbox is Chinese, Chinese ID3s show fine, but Arabic ID3s show as garbage.
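The garbage effect is easy to reproduce on a PC. The following Python sketch is only an illustration: it assumes GBK as the Chinese page code and Windows-1256 (`cp1256`) as the Arabic one, and the helper name `show_tag` is invented here.

```python
def show_tag(raw: bytes, page_code: str) -> str:
    """Decode tag bytes the way a player would under a given default page code."""
    # errors="replace" keeps the demo from raising on bytes the page code lacks.
    return raw.decode(page_code, errors="replace")


# The same kind of tag, written on a Chinese-locale and an Arabic-locale PC.
chinese_raw = "音乐".encode("gbk")
arabic_raw = "سلام".encode("cp1256")

# Default page code set to Chinese: the Chinese tag is fine, the Arabic one is not.
chinese_ok = show_tag(chinese_raw, "gbk")
arabic_garbled = show_tag(arabic_raw, "gbk")

# Default page code set to Arabic: now it is the other way around.
arabic_ok = show_tag(arabic_raw, "cp1256")
chinese_garbled = show_tag(chinese_raw, "cp1256")
```

Printing the four results shows that no single page code setting gets both tags right at the same time.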
There is a solution! Enter Unicode. Rather than encoding each tag in its own language's page code, such as encoding an Arabic tag in the Arabic page code or a Chinese tag in the Chinese page code, we encode ALL tags in Unicode, regardless of the language. This way, you do not need to tell Rockbox (or Windows, or any other OS) which page code to use. Simply choose the font "Unifont" in Rockbox, and all the tags will show with no problem! You can then play an Arabic song, followed by a Chinese one, then Greek, then Korean and so on, and all the tags will show properly, without even changing Rockbox's default page code!
The drawback of using Unicode-encoded ID3s is that not all PC MP3 players and DAPs support Unicode.
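As a rough sketch of what converting to Unicode means, the Python snippet below re-encodes tag values from three different page codes into UTF-8, after which they can all be read back the same way. UTF-8 is assumed here as the Unicode encoding, the page codes (GBK, `cp1256`, ISO-8859-7) are example choices, and `to_unicode_tag` is a hypothetical helper rather than part of any tagging program.

```python
def to_unicode_tag(raw: bytes, source_page_code: str) -> bytes:
    """Re-encode a page-code tag value as UTF-8 (the page code must be known once)."""
    return raw.decode(source_page_code).encode("utf-8")


# Tag values as they might sit in files tagged on three differently configured PCs.
legacy_tags = [
    ("音乐".encode("gbk"), "gbk"),                   # Chinese
    ("سلام".encode("cp1256"), "cp1256"),            # Arabic
    ("Μουσική".encode("iso-8859-7"), "iso-8859-7"),  # Greek
]

# One-time conversion: each tag is decoded with its own page code.
unicode_tags = [to_unicode_tag(raw, page_code) for raw, page_code in legacy_tags]

# Afterwards every tag is read the same way, with no page code to choose.
titles = [tag.decode("utf-8") for tag in unicode_tags]
```

The page code only matters during the conversion step. Once the tags are stored as Unicode, the reader no longer needs to know which language each one is in.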
The problem with displaying tags in foreign languages is that you have to use Unifont, which may not appeal to everyone. One solution is to use 6+12x13, which as mentioned above works well if you only need Japanese and Korean characters. A fairly large number of WPSs are compatible with the 6+12x13 font. Alternatively, you can create two different .cfg files on Rockbox. Make one for English music (or anything using ISO-8859-1) which has your favorite font (Snap, Chicago etc...) and your favorite WPS. Then have a second .cfg for "international" or "world" music, using Unifont and an appropriate WPS of your choice. See also the list of Rockbox Unicode fonts.
stevenyu from Mistic River brought to my attention an H100 WPS optimized for use with Unifont. Go to the WpsGallery and scroll down to FrejBon's Uniskin. Hopefully we'll have more WPSs optimized for Unifont in the future.
Notice that the current song in the screenshots shows in Japanese, and the next song that shows at the bottom is in Korean.
It seems that on Windows, RealPlayer does not support Unicode. However, foobar2000, Windows Media Player, iTunes and Winamp all support Unicode. Even Explorer can display Unicode ID3 tags. For Linux, I read that XMMS doesn't handle Unicode very well (I didn't try it myself). I read that Rhythmbox and Amarok display Unicode fine.
Here are a bunch of useful links for you:
Note: It won't hurt to use multiple tagging programs if none of them have all the features you want. Most likely I'll use Mp3tag for converting to Unicode, and continue to use ID3-TagIT 3 for tagging.
You can help! If you know of other MP3 players or tagging tools (for Windows or Linux) that support Unicode, let us know!
Other than using Unicode for ID3 tags, you can also use it to customize your WPS. For example, you can have the words "Battery", "Next Song" or "Playing Now" written in Korean, Greek, Russian and so on. This makes it easier to localize the way your player looks. All you have to do is make sure your .wps file is saved and encoded in Unicode (UTF-8 or UTF-16), and that you use Unifont in Rockbox.
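If your text editor cannot save in Unicode directly, a short script can do the re-encoding. This is only a sketch under stated assumptions: the .wps file is treated as plain text, UTF-8 is used as the target encoding, and the function names `save_wps_as_utf8` and `is_valid_utf8` are invented for this example.

```python
from pathlib import Path


def save_wps_as_utf8(source: str, target: str, source_page_code: str) -> None:
    """Read a text file written in a page code and write it back out as UTF-8."""
    text = Path(source).read_text(encoding=source_page_code)
    # Python's "utf-8" codec writes no byte order mark.
    Path(target).write_text(text, encoding="utf-8")


def is_valid_utf8(path: str) -> bool:
    """Return True if the whole file decodes cleanly as UTF-8."""
    try:
        Path(path).read_bytes().decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True
```

For example, `save_wps_as_utf8("greek.wps", "greek-utf8.wps", "iso-8859-7")` would convert a file saved in the Greek page code (both file names are placeholders), and `is_valid_utf8` gives a quick check of the result before you copy it to the player.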
r28 - 19 Aug 2010 - 09:22:17 - SeanInglis
Revision r26 - 24 Apr 2009 - 05:30 - ChristopheGragnic
Revision r25 - 04 Oct 2008 - 05:17 - TomerShalev
Copyright © by the contributing authors.