diff options
author | wiz <wiz@pkgsrc.org> | 2017-04-19 17:24:16 +0000 |
---|---|---|
committer | wiz <wiz@pkgsrc.org> | 2017-04-19 17:24:16 +0000 |
commit | d8e1d68f9d640d11e005c91cf71e1d5d77e01709 (patch) | |
tree | 28515340f797e1f46a90c0641e5b604b4cc805cc /converters | |
parent | 2f8a8d21a94254193f38cf42b29e9b5f5b4d74ad (diff) | |
download | pkgsrc-d8e1d68f9d640d11e005c91cf71e1d5d77e01709.tar.gz |
Updated py-chardet to 3.0.2.
chardet 3.0.2
Fixes an issue where detect would sometimes return None instead of a dict with the keys encoding, language, and confidence (Issue #113, PR #114).
chardet 3.0.1
This bugfix release fixes a crash in the EUC-TW prober when it encountered certain strings (Issue #67).
chardet 3.0.0
This release is long overdue, but still mostly serves as a placeholder
for the impending 4.0.0 release, which will have retrained models
for better accuracy. For now, this release will get the following
improvements up on PyPI:
Added support for Turkish ISO-8859-9 detection (PR #41, thanks @queeup)
Commented out large unused sections of Big5 and EUC-KR tables to save memory (8bc4b89)
Removed Python 3.2 from testing, but add 3.4 - 3.6
Ensure that stdin is open with mode 'rb' for chardetect CLI. (PR #38, thanks @lpsinger)
Fixed chardetect crash with non-ascii file names (PR #39, thanks @nkanaev)
Made naming conventions more Pythonic throughout (no more mTypicalPositiveRatio, and instead typical_positive_ratio)
Modernized test scripts and infrastructure so we've got Travis testing and all that stuff
Rename filter_without_english_words to filter_international_words and make it match current Mozilla implementation (PR #44, thanks @rsnair2)
Updated filter_english_letters to match C implementation (c665459)
Temporarily disabled Hungarian ISO-8859-2 and Windows-1250 detection because it is very inaccurate (da6c0a0)
Allow CLI sub-package to be importable (PR #55)
Add a hypotheis-based test (PR #66, thanks @DRMacIver)
Strip endianness from UTF with BOM predictions so that the encoding can be passed directly to bytes.decode() (PR #73, thanks @snoack)
Fixed broken links in docs (PR #90, thanks @roskakori)
Added early exit to chardetect when encoding is detected instead of looping through entire file (PR #103, thanks @jpz)
Use bytearray objects internally instead of wrap_ord calls, which provides a nice performance boost across the board (PR #106)
Add language property to probers and UniversalDetector results (PR #180)
Mark the 5 known test failures as such so we can have more useful Travis build results in the meantime (d588407)
Diffstat (limited to 'converters')
-rw-r--r-- | converters/py-chardet/Makefile | 10 | ||||
-rw-r--r-- | converters/py-chardet/PLIST | 23 | ||||
-rw-r--r-- | converters/py-chardet/distinfo | 10 |
3 files changed, 27 insertions, 16 deletions
diff --git a/converters/py-chardet/Makefile b/converters/py-chardet/Makefile index b92fead6fef..96c86325e25 100644 --- a/converters/py-chardet/Makefile +++ b/converters/py-chardet/Makefile @@ -1,7 +1,6 @@ -# $NetBSD: Makefile,v 1.16 2017/01/03 13:23:01 jperkin Exp $ +# $NetBSD: Makefile,v 1.17 2017/04/19 17:24:16 wiz Exp $ -DISTNAME= chardet-2.3.0 -PKGREVISION= 1 +DISTNAME= chardet-3.0.2 PKGNAME= ${PYPKGPREFIX}-${DISTNAME} CATEGORIES= converters python MASTER_SITES= ${MASTER_SITE_PYPI:=c/chardet/} @@ -11,7 +10,10 @@ HOMEPAGE= http://pypi.python.org/pypi/chardet COMMENT= Character encoding auto-detection in Python LICENSE= gnu-lgpl-v2.1 -REPLACE_PYTHON= chardet/chardetect.py +# TEST_DEPENDS; however, no tests found +BUILD_DEPENDS+= ${PYPKGPREFIX}-test-[0-9]*:../../devel/py-test +BUILD_DEPENDS+= ${PYPKGPREFIX}-hypothesis-[0-9]*:../../devel/py-hypothesis +BUILD_DEPENDS+= ${PYPKGPREFIX}-py-[0-9]*:../../devel/py-py post-install: ${MV} ${DESTDIR}${PREFIX}/bin/chardetect ${DESTDIR}${PREFIX}/bin/chardetect-${PYVERSSUFFIX} || ${TRUE} diff --git a/converters/py-chardet/PLIST b/converters/py-chardet/PLIST index a7d406805ec..8337fbe044c 100644 --- a/converters/py-chardet/PLIST +++ b/converters/py-chardet/PLIST @@ -1,4 +1,4 @@ -@comment $NetBSD: PLIST,v 1.8 2016/02/05 12:40:56 wiz Exp $ +@comment $NetBSD: PLIST,v 1.9 2017/04/19 17:24:16 wiz Exp $ bin/chardetect-${PYVERSSUFFIX} ${PYSITELIB}/${EGG_INFODIR}/PKG-INFO ${PYSITELIB}/${EGG_INFODIR}/SOURCES.txt @@ -14,9 +14,6 @@ ${PYSITELIB}/chardet/big5freq.pyo ${PYSITELIB}/chardet/big5prober.py ${PYSITELIB}/chardet/big5prober.pyc ${PYSITELIB}/chardet/big5prober.pyo -${PYSITELIB}/chardet/chardetect.py -${PYSITELIB}/chardet/chardetect.pyc -${PYSITELIB}/chardet/chardetect.pyo ${PYSITELIB}/chardet/chardistribution.py ${PYSITELIB}/chardet/chardistribution.pyc ${PYSITELIB}/chardet/chardistribution.pyo @@ -26,18 +23,24 @@ ${PYSITELIB}/chardet/charsetgroupprober.pyo ${PYSITELIB}/chardet/charsetprober.py ${PYSITELIB}/chardet/charsetprober.pyc ${PYSITELIB}/chardet/charsetprober.pyo +${PYSITELIB}/chardet/cli/__init__.py +${PYSITELIB}/chardet/cli/__init__.pyc +${PYSITELIB}/chardet/cli/__init__.pyo +${PYSITELIB}/chardet/cli/chardetect.py +${PYSITELIB}/chardet/cli/chardetect.pyc +${PYSITELIB}/chardet/cli/chardetect.pyo ${PYSITELIB}/chardet/codingstatemachine.py ${PYSITELIB}/chardet/codingstatemachine.pyc ${PYSITELIB}/chardet/codingstatemachine.pyo ${PYSITELIB}/chardet/compat.py ${PYSITELIB}/chardet/compat.pyc ${PYSITELIB}/chardet/compat.pyo -${PYSITELIB}/chardet/constants.py -${PYSITELIB}/chardet/constants.pyc -${PYSITELIB}/chardet/constants.pyo ${PYSITELIB}/chardet/cp949prober.py ${PYSITELIB}/chardet/cp949prober.pyc ${PYSITELIB}/chardet/cp949prober.pyo +${PYSITELIB}/chardet/enums.py +${PYSITELIB}/chardet/enums.pyc +${PYSITELIB}/chardet/enums.pyo ${PYSITELIB}/chardet/escprober.py ${PYSITELIB}/chardet/escprober.pyc ${PYSITELIB}/chardet/escprober.pyo @@ -92,6 +95,9 @@ ${PYSITELIB}/chardet/langhungarianmodel.pyo ${PYSITELIB}/chardet/langthaimodel.py ${PYSITELIB}/chardet/langthaimodel.pyc ${PYSITELIB}/chardet/langthaimodel.pyo +${PYSITELIB}/chardet/langturkishmodel.py +${PYSITELIB}/chardet/langturkishmodel.pyc +${PYSITELIB}/chardet/langturkishmodel.pyo ${PYSITELIB}/chardet/latin1prober.py ${PYSITELIB}/chardet/latin1prober.pyc ${PYSITELIB}/chardet/latin1prober.pyo @@ -119,3 +125,6 @@ ${PYSITELIB}/chardet/universaldetector.pyo ${PYSITELIB}/chardet/utf8prober.py ${PYSITELIB}/chardet/utf8prober.pyc ${PYSITELIB}/chardet/utf8prober.pyo +${PYSITELIB}/chardet/version.py +${PYSITELIB}/chardet/version.pyc +${PYSITELIB}/chardet/version.pyo diff --git a/converters/py-chardet/distinfo b/converters/py-chardet/distinfo index 1540c96f937..4fd7684a4ed 100644 --- a/converters/py-chardet/distinfo +++ b/converters/py-chardet/distinfo @@ -1,6 +1,6 @@ -$NetBSD: distinfo,v 1.6 2015/11/03 01:43:53 agc Exp $ +$NetBSD: distinfo,v 1.7 2017/04/19 17:24:16 wiz Exp $ -SHA1 (chardet-2.3.0.tar.gz) = 50af8f8771ecbeb7a22567129c6c281b8bec3b1c -RMD160 (chardet-2.3.0.tar.gz) = b68ed94bccfd97f23cc0a6288ee8c7914f4c7d04 -SHA512 (chardet-2.3.0.tar.gz) = d554c9b4564905a2b6c6971aaea2a812174d86b58137c6a2f9d54e2abd2395e6529968b2ce3ca2d97c717d2bd0bd80ab5b98cae9afac1a4add9ea305145030dd -Size (chardet-2.3.0.tar.gz) = 164346 bytes +SHA1 (chardet-3.0.2.tar.gz) = 6234b04c99465f0661301088f0ab47a77ea97031 +RMD160 (chardet-3.0.2.tar.gz) = a4f0bab40259ef8042ae562f78ac964eddcb2f4d +SHA512 (chardet-3.0.2.tar.gz) = d7b0e5ebeb9a0d5830857decbe2b11309fbb421c31cd391c143bf6f00e2c3d9eec3640954478e71be88ea3dac158b1e1df177a5e2b8b9749b27d2a956542cbf9 +Size (chardet-3.0.2.tar.gz) = 1866978 bytes |