summaryrefslogtreecommitdiff
path: root/textproc/py-Unidecode
AgeCommit message (Collapse)AuthorFilesLines
2021-10-07textproc: Remove SHA1 hashes for distfilesnia1-2/+1
2021-02-05py-Unidecode: updated to 1.2.0adam3-8/+13
unidecode 1.2.0 * Add 'errors' argument that specifies how characters with unknown replacements are handled. Default is 'ignore' to replicate the behavior of older versions. * Many characters that were previously replaced with '[?]' are now correctly marked as unknown and will behave as specified in the new errors='...' argument. * Added some missing ligatures and quotation marks in U+1F6xx and U+27xx ranges. * Add PEP 561-style type information (thanks to Pascal Corpet) * Support for Python 2 and 3.5 to be removed in next release.
2020-12-21py-Unidecode: updated to 1.1.2adam2-7/+7
unidecode 1.1.2 * Add some missing replacements in the U+23xx page. * Fix U+204A "TIRONIAN SIGN ET" replacement.
2019-07-02py-Unidecode: updated to 1.1.1adam2-7/+7
unidecode 1.1.1 * Fix tests failing on PyPy 7.1.1
2019-06-18py-Unidecode: updated to 1.1.0adam3-9/+12
unidecode 1.1.0 * Add more Latin letter variants in U+1F1xx page. * Make it possible to use the Unidecode command-line utility via "python -m unidecode" * General clean up of code and documentation
2018-11-21py-Unidecode: updated to 1.0.23adam4-11/+12
unidecode 1.0.23 * Improve transliteration of Hebrew letters * Add transliterations for the phonetic block U+1D00 - U+1D7F * Transliterate SI "micro" prefix as "u" instead of "micro" in the U+33xx block. * Add U+33DE SQUARE V OVER M and U+33DF SQUARE A OVER M. * Drop support for Python 2.6 and 3.3
2018-02-26py-Unidecode: updated CATEGORIES and HOMEPAGEadam1-4/+4
2018-01-28py-Unidecode: update to 1.0.22.wiz2-8/+7
2018-01-05 unidecode 1.0.22 * Move to semantic version numbering, no longer following version numbers from the original Perl module. This fixes an issue with setuptools (>= 8) and others expecting major.minor.patch format. (https://github.com/avian2/unidecode/issues/13) * Add transliterations for currency signs U+20B0 through U+20BF (thanks to Mike Swanson) * Surround transliterations of vulgar fractions with spaces to avoid incorrect combinations with adjacent numerals (thanks to Jeffrey Gerard)
2017-09-04Follow some redirects.wiz1-2/+2
2017-07-23Removed - from ALTERNATIVESadam1-1/+1
2017-07-23unidecode 0.04.21adam4-11/+32
* Add U+2116 NUMERO SIGN * Add U+05BE HEBREW PUNCTUATION MAQAF unidecode 0.04.20: * Fixed transliteration of circled Latin letters and numbers * Add square unit symbols. * Add Latin variants in U+20xx and U+21xx pages. * Fix U+02B1 MODIFIER LETTER SMALL H WITH HOOK. * Fix U+205F MEDIUM MATHEMATICAL SPACE. * Add "DIGIT ... COMMA" and "PARANTHESIZED LATIN CAPITAL LETTER" in U+1F1xx page. * Add missing vulgar fractions and a/c, a/s, c/o, c/u symbols. * Add universal Wheel release
2016-06-08Switch to MASTER_SITES_PYPI.wiz1-2/+2
2015-11-04Add SHA512 digests for distfiles for textproc categoryagc1-1/+2
Problems found locating distfiles: Package cabocha: missing distfile cabocha-0.68.tar.bz2 Package convertlit: missing distfile clit18src.zip Package php-enchant: missing distfile php-enchant/enchant-1.1.0.tgz Otherwise, existing SHA1 digests verified and found to be the same on the machine holding the existing distfiles (morden). All existing SHA1 digests retained for now as an audit trail.
2012-10-25Drop superfluous PKG_DESTDIR_SUPPORT, "user-destdir" is default these days.asau1-3/+1
2012-05-30Initial import of py-Unidecode-0.04.9:wiz4-0/+603
It often happens that you have text data in Unicode, but you need to represent it in ASCII. For example when integrating with legacy code that doesn't support Unicode, or for ease of entry of non-Roman names on a US keyboard, or when constructing ASCII machine identifiers from human-readable Unicode strings that should still be somewhat intelligeble (a popular example of this is when making an URL slug from an article title). Note that this module generally produces better results than simply stripping accents from characters (which can be done in Python with built-in functions). It is based on hand-tuned character mappings that for example also contain ASCII approximations for symbols and non-Latin alphabets. This is a Python port of Text::Unidecode Perl module by Sean M. Burke.