path: root/textproc/icu/patches/patch-common_putil.cpp
Age | Commit message | Author | Files | Lines
2020-11-05 | icu: Update to 68.1 | ryoon | 1 | -4/+4
Changelog:

Common Changes
* CLDR 38
  * Support for units of measurement in inflected languages (phase 1)
  * 14 new measurement units: candela, imperial quart, etc.
  * Improved locale ID canonicalization spec & data
  * New language at Modern coverage: Norwegian Nynorsk
  * New languages at Moderate coverage: Fulah (Adlam), Chakma, Asturian
  * New languages at Basic coverage: Dogri, Sanskrit
* Measurement unit preferences (ICU-20568)
  * New usage() option on NumberFormatter to select the most appropriate unit for a locale and context
  * New outputUnit() getter on FormattedNumber to get the output unit after resolution
  * In skeletons, specify the context using the "usage" stem
  * Example: locale "en-GB", skeleton "usage/person unit-width-full-name unit/kilogram", input 80 (expressed in kilograms), output "12 stone, 8.4 pounds"
  * Usages are pulled in from CLDR (e.g. CLDR v38 Unit Preferences).
* PluralRules selection for ranges of numbers (ICU-21190)
* Locale ID canonicalization now conforms to the CLDR spec including edge cases; co-developed with CLDR spec & data improvements (ICU-21236, ICU-21115 & others)
* New LocaleMatcher options: custom threshold (ICU-21144), no default locale (ICU-21029)
* DateIntervalFormat supports output options such as capitalization (ICU-20651)
* Uppercasing for the Armenian language (hy) now maps ligature to (ICU-13416)
* Data size reduction: Rule-based segmentation data files (RBBI) use a more compact data format and are now half as large (ICU-13565)
* Measurement units are normalized in skeleton string output: i.e., calling toSkeleton() on a NumberFormatter returns "unit/meter" instead of "measure-unit/length-meter"
* The ICU User Guide has been migrated to Markdown format, hosted via GitHub Pages: https://unicode-org.github.io/icu/userguide/
* Removed usage of terms like "blacklist" (ICU-21176), "master" (ICU-21242), and "grandfathered" (ICU-21184) as much as possible.
* Time zone data (tzdata) version 2020d (2020-oct-21)
* PluralRules category for compact notation numbers in French (ICU-13836)

ICU4C Specific Changes
* New C API for number range formatting (unicode/unumberrangeformatter.h), for example "750 m - 1.2 km" (ICU-21182)
2017-04-22 | Changes 59.1: | adam | 1 | -3/+3
* Emoji 5.0 data
* Includes bidi data files from Unicode 10 beta.
* Includes segmentation data files and rules from Unicode 10 beta and CLDR 31.0.1.
* Does not yet include the Emoji_Component property.
* Otherwise ICU 59 continues to use Unicode 9 data.

CLDR 31.0.1
* Including updates for emoji 5.0, for example local names for England, Scotland, and Wales.
* GMT and UTC are no longer unified, and CLDR provides distinct UTC display names, avoiding confusion with standard (winter) time in Britain.
* See the CLDR download page for other CLDR features and migration issues in CLDR v31.

New case mapping API (C++ & Java classes CaseMap) supports styled text.
2015-10-09 | Update to 56.1 | ryoon | 1 | -3/+3
Changelog:

Release Overview
The features for this release include support of CLDR 28 and Unicode 8.0. For more details, including migration issues, see below.

Common Changes
* CLDR 28: For details of the many changes in CLDR, see CLDR 28.
* Unicode data updated to Unicode 8.0: 41 new emoji characters, 5,771 new ideographs for Chinese/Japanese/Korean, 6 new scripts, improved character properties data, etc.
* ICU data size reduced by about 7.2% (1.8MB) via sharing string values across resource bundles. [#11537]
* DateIntervalFormat now handles intervals with seconds, and sets FieldPosition more consistently. [#11706, #11726]
* DateFormat::createInstanceForSkeleton() caches DateFormat patterns rather than DateTimePatternGenerator instances, for better performance (for cache hits) and lower heap memory consumption. [#11780]
* StringSearch (based on collation) defaults to matches on normalization boundaries rather than grapheme cluster boundaries, which yields more matches on Indic text. [#11750]
* RuleBasedNumberFormat (spelled-out numbers) now handles rounding (Java only), infinity, NaN. [#11653, #11760, #8223]
* Most of the old Normalizer/unorm.h had been replaced by (and reimplemented via) Normalizer2, and is now deprecated. [#7303]
* COLON has been withdrawn as a date pattern character corresponding to the date field [UDAT_]TIME_SEPARATOR_FIELD; there is currently no pattern character corresponding to that field. [#11773]
* Support for locale key "cf" to specify currency format style, and interaction with NumberFormat values for UNumberFormatStyle: [#11787]
  * For NumberFormat style UNUM_CURRENCY / CURRENCYSTYLE, the default is "standard" currency style (typically using minus sign for negative numbers), but the new locale key "cf" may be used with values "standard" or "account" to specify currency format style ("account" indicates accounting style, often using parentheses for negative numbers).
  * For other NumberFormat styles, the locale key "cf" is ignored (they override the locale preference):
    * UNUM_CURRENCY_ISO / ISOCURRENCYSTYLE
    * UNUM_CURRENCY_PLURAL / PLURALCURRENCYSTYLE
    * UNUM_CURRENCY_ACCOUNTING / ACCOUNTINGCURRENCYSTYLE
    * UNUM_CASH_CURRENCY / CASHCURRENCYSTYLE
  * A new NumberFormat style is available to explicitly specify standard style, ignoring the locale key "cf":
    * UNUM_CURRENCY_STANDARD / STANDARDCURRENCYSTYLE

ICU4C Specific Changes
* C API support for CompactDecimalFormat via UNumberFormatStyle additions: UNUM_DECIMAL_COMPACT_SHORT, UNUM_DECIMAL_COMPACT_LONG [#11693]
* Larger UnicodeString object stores more characters inside the object without heap allocation; the UnicodeString object size is now build-time-configurable. [#11551]
  * On 64-bit machines, increase from object size 40 bytes with 15 internal UChars to a new default of 64 bytes with 27 UChars.
* Some C++ classes now have swap() and moveFrom() methods, and support C++11 move semantics on compilers that support them. [#10086]
  * UnicodeString, LocalPointer, LocalArray
* DecimalFormat code refactored to fix bugs, improve maintainability, and improve performance. [#10458]
* New FilteredBreakIterator suppresses certain segment boundaries. For example, it can suppress the sentence boundary in the middle of "Mr. Smith". [#11248]
* The internal, shared cache has been changed from unbounded to bounded. [#11767]
* For [U]BreakIterator with type UBRK_SENTENCE, the locale key "ss" can now be used with value "standard" to specify that standard sentence break suppression data should be used, or with value "none" to indicate that no break suppression data should be used (the default). [#11770]
* Collator: first-time startup time improved 20% due to precalculated unsafe-backward table [#11886]
* A number of memory leaks and buffer overruns have been fixed based on static code analysis, mostly in data build tools
2014-10-26 | Fix compilation on Mac OS 10.4. From Sevan Janiyan in PR pkg/49077. | bsiegert | 1 | -0/+19