summaryrefslogtreecommitdiff
path: root/textproc/p5-Text-Unidecode
diff options
context:
space:
mode:
authorwen <wen>2016-02-18 03:38:36 +0000
committerwen <wen>2016-02-18 03:38:36 +0000
commit3f937735ea045aa305ab6c7b90406f3fc037e633 (patch)
tree1d612e413715b37dddb095046e0b22abd8ccc462 /textproc/p5-Text-Unidecode
parent084e1e6343e54d0a7d9d16701054649e4d180f3e (diff)
downloadpkgsrc-3f937735ea045aa305ab6c7b90406f3fc037e633.tar.gz
Update to 1.27
Upstream changes: 2015-10-21 Sean M. Burke sburke@cpan.org * RELEASE 1.27. (Stable.) The release, 1.25_01, didn't blow up, so this is just a re-release of it as a normal ("stable") version. * Minor changes to the documentation. Nothing substantial. * Release 1.26 had a confusing mistake in the ChangeLog. Ignore v1.26. 2015-10-21 Sean M. Burke sburke@cpan.org * RELEASE 1.26. Mistake. See above for change notes between v1.25_01 and v1.27. 2015-10-16 Sean M. Burke sburke@cpan.org * RELEASE 1.25_01. * !DEVELOPER RELEASE!, OH GOD HELP US ALL! * Here's a new thing that makes me nervous and hesitant, and that I've been talking myself into for weeks: ************************************************************** * I've switched to accepting values in the range 0x80-0x9F * * as if they are the Windows-1252 ("ANSI") characters. * ************************************************************** Previously they had all mapped to emptystring. Technically, Unicode specifies those codepoints as control characters that I've never heard of, "C1 Controls"... ... U+0087 ESA - End of Selected Area U+0088 HTS - Character (Horizontal) Tabulation Set U+0089 HTJ - Character (Horizontal) Tabulation with Justification ... ( See "C1" in https://en.wikipedia.org/wiki/C0_and_C1_control_codes ) And Unidecode mapped all of those to emptystring. Now they are treated as if you fed the Windows-1252 characters, as that is an extremely common thing to have happen. So if you feed character value 0x80 to it, it is taken to mean "��" (which Unidecode then decodes as "EUR", at the moment at least). (This doesn't interfere with the fact that U+20AC is the proper Unicode place for the "��" to be found.) And the smartquotes at 0x91 to 0x94, �� �� �� �� turn into ' ' " " so yaaaay! Note that in theory, according to C1 Controls, 0x85 is "NEL: Next Line", "Equivalent to CR+LF. Used to mark end-of-line on some IBM mainframes." I could map this to \n or \r\n or whatever, but I've never seen 0x85 in use in the wild, and I never heard anyone complain about my not having mapped it to "\n" in all the Unidecode versions since the first, in 2001. So instead, Unidecode takes 0x85 as its Windows-1252 value, the ellipsis "��" which of course it Unidecodes as "..." I'm not thrilled with the idea of going off spec but I think this should be okay, and it has massive DWIM value. Let's hope I'm not dividing Unicode times infinity by zero and then the whole universe will disa That's why I'm making this a developer release. Unless anything besplodes by November 1st, I'll re-issue this as a stable release.
Diffstat (limited to 'textproc/p5-Text-Unidecode')
-rw-r--r--textproc/p5-Text-Unidecode/Makefile4
-rw-r--r--textproc/p5-Text-Unidecode/distinfo10
2 files changed, 7 insertions, 7 deletions
diff --git a/textproc/p5-Text-Unidecode/Makefile b/textproc/p5-Text-Unidecode/Makefile
index 27c720d0cf8..2af8d79d7da 100644
--- a/textproc/p5-Text-Unidecode/Makefile
+++ b/textproc/p5-Text-Unidecode/Makefile
@@ -1,6 +1,6 @@
-# $NetBSD: Makefile,v 1.14 2015/08/28 22:46:28 mef Exp $
+# $NetBSD: Makefile,v 1.15 2016/02/18 03:38:36 wen Exp $
-DISTNAME= Text-Unidecode-1.24
+DISTNAME= Text-Unidecode-1.27
PKGNAME= p5-${DISTNAME}
CATEGORIES= textproc perl5
MASTER_SITES= ${MASTER_SITE_PERL_CPAN:=Text/}
diff --git a/textproc/p5-Text-Unidecode/distinfo b/textproc/p5-Text-Unidecode/distinfo
index 9913c8ad60e..5a2d86cd324 100644
--- a/textproc/p5-Text-Unidecode/distinfo
+++ b/textproc/p5-Text-Unidecode/distinfo
@@ -1,6 +1,6 @@
-$NetBSD: distinfo,v 1.6 2015/11/04 01:59:54 agc Exp $
+$NetBSD: distinfo,v 1.7 2016/02/18 03:38:36 wen Exp $
-SHA1 (Text-Unidecode-1.24.tar.gz) = eb492ce66f856d709a54fe5244f424a6555bf580
-RMD160 (Text-Unidecode-1.24.tar.gz) = 601464595b2e0942c72e74cf0e09d4ea3f930b8b
-SHA512 (Text-Unidecode-1.24.tar.gz) = d9abcc2b3425457a814ffd2c1061d232d851633fca5780c87b4bfe0fcfa7025f1519776a433a650fe91f431f76ccf05b4e548f2a760acbd6fb2629675867fec0
-Size (Text-Unidecode-1.24.tar.gz) = 131589 bytes
+SHA1 (Text-Unidecode-1.27.tar.gz) = 221442bbf1fcb3a1df4b8988033154e3934124e9
+RMD160 (Text-Unidecode-1.27.tar.gz) = 07411c625707f3a2ec0adf98b419641e7deb27d2
+SHA512 (Text-Unidecode-1.27.tar.gz) = c124e09b75050717fc13716b46ca54e607fd1e093f6ce06db466cda669d772661173a394eac81b5073a757f7af5e0174aa23eac037a356f008268b2bd767428c
+Size (Text-Unidecode-1.27.tar.gz) = 134929 bytes