This is a list of the encodings supported by various versions of Opera. As the support evolved over time, there are indicators noting which version of Opera the support appeared in. If no such indicator is available, the encoding was supported in the first Unicode release (Opera 6). The tables-*.txt file used to generate the chartables.bin needs to be tagged with the appropriate version, see the table below for a summary.
| Symbol | Code version required | Version tag |
|---|---|---|
| None | Opera 6 core | @version 6 |
| O7 | Opera 7 core / Rosetta 1 encodings module | @version 7 |
| R3 | Rosetta 3 encodings module | @version 8 |
| R6 | Current core mainline (Rosetta 6 encodings module, and any PPP release) | @version 8 |
Below is the list of support encodings in Opera, as they would normally appear in the View — Encoding menu (for desktop versions at least), and their corresponding internal names used by the support code in Opera. Please note that the name of the encodings do not necessarily correspond to the name of the data tables, refer to the annotated list of data tables for the list of tables needed.
| Encoding (natural name as in menu) | Tag (used by code) |
|---|---|
| Automatic detection | ""
|
| Unicode based | |
| Unicode ▸ | |
| UTF-8 | "utf-8"
|
| UTF-16 | "utf-16"
|
| UTF-32 | "utf-32" O7 6
|
| UTF-7 | "utf-7" O7 0
|
| Latin based | |
| US English ▸ | |
| US ASCII | "us-ascii" 1
|
| Western European ▸ | |
| ISO 8859-1 | "iso-8859-1" 1
|
| Windows-1252 | "windows-1252"
|
| ISO 8859-15 | "iso-8859-15"
|
| Central European ▸ | |
| ISO 8859-2 | "iso-8859-2"
|
| Windows-1250 | "windows-1250"
|
| ISO 8859-16 | "iso-8859-16"
|
| Southern European ▸ | |
| ISO 8859-3 | "iso-8859-3"
|
| Baltic ▸ | |
| ISO 8859-13 | "iso-8859-13"
|
| Windows-1257 | "windows-1257"
|
| ISO 8859-4 | "iso-8859-4"
|
| Nordic ▸ | |
| ISO 8859-10 | "iso-8859-10"
|
| Celtic ▸ | |
| ISO 8859-14 | "iso-8859-14"
|
| Turkish ▸ | |
| ISO 8859-9 | "iso-8859-9"
|
| Windows-1254 | "windows-1254"
|
| Vietnamese ▸ | |
| Windows-1258 | "windows-1258"
|
| VISCII | "viscii"
|
| Greek & Cyrillic | |
| Cyrillic ▸ | |
| Automatic detection | "AUTODETECT-RU" O7
|
| ISO 8859-5 | "iso-8859-5"
|
| KOI8-R | "koi8-r"
|
| KOI8-U | "koi8-u"
|
| Windows-1251 | "windows-1251"
|
| Codepage 866 | "ibm866"
|
| Greek ▸ | |
| ISO 8859-7 | "iso-8859-7"
|
| Windows-1253 | "windows-1253"
|
| Right-to-left scripts | |
| Arabic ▸ | |
| ISO 8859-6 | "iso-8859-6"
|
| Windows-1256 | "windows-1256"
|
| Hebrew ▸ | |
| ISO 8859-8 | "iso-8859-8"
|
| Windows-1255 | "windows-1255"
|
| Singlebyte far east | |
| Thai ▸ | |
| ISO 8859-11/TIS-620/Windows-874 | "iso-8859-11" 2
|
| Doublebyte far east | |
| Chinese ▸ | |
| Automatic detection | "AUTODETECT-ZH"
|
| Big5 (Traditional) | "big5"
|
| Big5-HKSCS (Traditional) | "big5-hkscs" O7
|
| EUC-TW (Traditional) | "euc-tw"
|
| GBK/GB2312 (Simplified) | "gbk" 3
|
| GB18030 (Simplified) | "gb18030" O7 4
|
| HZ-GB2312 (Simplified) | "hz-gb-2312"
|
| ISO 2022-CN | "iso-2022-cn" O7 5
|
| Japanese ▸ | |
| Automatic detection | "AUTODETECT-JP"
|
| Shift-JIS | "shift_jis"
|
| ISO 2022-JP | "iso-2022-jp"
|
| ISO 2022-JP-1 | "iso-2022-jp-1" R6
|
| EUC-JP | "euc-jp"
|
| Korean ▸ | |
| Automatic detection | "AUTODETECT-KO" R3
|
| EUC-KR | "euc-kr"
|
| ISO 2022-KR | "iso-2022-kr" R3
|
0. Not suitable for HTML (nor anything else, really).
1. Decoder allows all windows-1252 codes, encoder is strict.
2. TIS-620 is a subset of ISO 8859-11. Windows-874 is a superset.
3. GB2312 is a subset of GBK and will be handled as if it was GBK (both decoder and encoder).
4. GB18030 contains all of Unicode, so calling it "Simplified" is a bit of a misnomer. But since it is also a superset of GBK/GB2312 it is reasonable.
5. ISO 2022-CN encodes both simplified (GB2312 character set) and traditional (CNS 11643 character set; same as EUC-TW, similar to Big5).
6. No longer supported as of Core 2.2. Removed because of HTML5 (see bug #316395 for more info)