Copyright © 1995-2012 Opera Software ASA. All rights reserved. This file is part of the Opera web browser. It may not be distributed under any circumstances.
Some of the Unicode algorithms have been tailored to suit Opera and the web. This document lists the current tailoring.
No tailoring have been performed.
No tailoring have been performed.
(AL | IS) SP+ ÷ IS before (AL | IS) × IS.
PR % PR before PR ^ PR.
AL ÷ OP before
(AL | NU) × OP.
No tailoring have been performed.
The Extend class is currently not considered to include checks for the Other_Grapheme_Extend and Grapheme_Extend classes. This tailoring/unimplemented feature is being considered lifting.
U+003B semicolon (
),
U+FE14 presentation form for vertical
semicolon (;
)
and
U+FF1B fullwidth semicolon
(︔
)
are changed from the MidNum class to Other.
;
This is to allow semicolon-separated records of words ending with digits
or digit groups to be parsed properly.
U+002E full stop (
)
is changed from the MidNumLet class to MidNumLet.
.
This is done for backwards compatibility with word segmentation
done with previous versions of this module.
It does mean that abbreviations on the form "U.S.A." are not properly
segmented.
No tailoring have been performed.