summaryrefslogtreecommitdiff
path: root/devel/pcre2
diff options
context:
space:
mode:
authorbsiegert <bsiegert@pkgsrc.org>2015-12-29 14:40:20 +0000
committerbsiegert <bsiegert@pkgsrc.org>2015-12-29 14:40:20 +0000
commit7cc91e7ad3c74508eb8a766af8591f280529f5d4 (patch)
treec6e769b2fbc028a153d1f7483219e3a0fd8bcae5 /devel/pcre2
parent9dfcf0e4efd53007f2b9e94b0c8def7a2177b6ad (diff)
downloadpkgsrc-7cc91e7ad3c74508eb8a766af8591f280529f5d4.tar.gz
Update pcre2 to 10.20. Fix CVE-2015-8381.
Version 10.20 30-June-2015 -------------------------- 1. Callouts with string arguments have been added. 2. Assertion code generator in JIT has been optimized. 3. The invalid pattern (?(?C) has a missing assertion condition at the end. The pcre2_compile() function read past the end of the input before diagnosing an error. This bug was discovered by the LLVM fuzzer. 4. Implemented pcre2_callout_enumerate(). 5. Fix JIT compilation of conditional blocks whose assertion is converted to (*FAIL). E.g: /(?(?!))/. 6. The pattern /(?(?!)^)/ caused references to random memory. This bug was discovered by the LLVM fuzzer. 7. The assertion (?!) is optimized to (*FAIL). This was not handled correctly when this assertion was used as a condition, for example (?(?!)a|b). In pcre2_match() it worked by luck; in pcre2_dfa_match() it gave an incorrect error about an unsupported item. 8. For some types of pattern, for example /Z*(|d*){216}/, the auto- possessification code could take exponential time to complete. A recursion depth limit of 1000 has been imposed to limit the resources used by this optimization. This infelicity was discovered by the LLVM fuzzer. 9. A pattern such as /(*UTF)[\S\V\H]/, which contains a negated special class such as \S in non-UCP mode, explicit wide characters (> 255) can be ignored because \S ensures they are all in the class. The code for doing this was interacting badly with the code for computing the amount of space needed to compile the pattern, leading to a buffer overflow. This bug was discovered by the LLVM fuzzer. 10. A pattern such as /((?2)+)((?1))/ which has mutual recursion nested inside other kinds of group caused stack overflow at compile time. This bug was discovered by the LLVM fuzzer. 11. A pattern such as /(?1)(?#?'){8}(a)/ which had a parenthesized comment between a subroutine call and its quantifier was incorrectly compiled, leading to buffer overflow or other errors. This bug was discovered by the LLVM fuzzer. 12. The illegal pattern /(?(?<E>.*!.*)?)/ was not being diagnosed as missing an assertion after (?(. The code was failing to check the character after (?(?< for the ! or = that would indicate a lookbehind assertion. This bug was discovered by the LLVM fuzzer. 13. A pattern such as /X((?2)()*+){2}+/ which has a possessive quantifier with a fixed maximum following a group that contains a subroutine reference was incorrectly compiled and could trigger buffer overflow. This bug was discovered by the LLVM fuzzer. 14. Negative relative recursive references such as (?-7) to non-existent subpatterns were not being diagnosed and could lead to unpredictable behaviour. This bug was discovered by the LLVM fuzzer. 15. The bug fixed in 14 was due to an integer variable that was unsigned when it should have been signed. Some other "int" variables, having been checked, have either been changed to uint32_t or commented as "must be signed". 16. A mutual recursion within a lookbehind assertion such as (?<=((?2))((?1))) caused a stack overflow instead of the diagnosis of a non-fixed length lookbehind assertion. This bug was discovered by the LLVM fuzzer. 17. The use of \K in a positive lookbehind assertion in a non-anchored pattern (e.g. /(?<=\Ka)/) could make pcre2grep loop. 18. There was a similar problem to 17 in pcre2test for global matches, though the code there did catch the loop. 19. If a greedy quantified \X was preceded by \C in UTF mode (e.g. \C\X*), and a subsequent item in the pattern caused a non-match, backtracking over the repeated \X did not stop, but carried on past the start of the subject, causing reference to random memory and/or a segfault. There were also some other cases where backtracking after \C could crash. This set of bugs was discovered by the LLVM fuzzer. 20. The function for finding the minimum length of a matching string could take a very long time if mutual recursion was present many times in a pattern, for example, /((?2){73}(?2))((?1))/. A better mutual recursion detection method has been implemented. This infelicity was discovered by the LLVM fuzzer. 21. Implemented PCRE2_NEVER_BACKSLASH_C. 22. The feature for string replication in pcre2test could read from freed memory if the replication required a buffer to be extended, and it was not working properly in 16-bit and 32-bit modes. This issue was discovered by a fuzzer: see http://lcamtuf.coredump.cx/afl/. 23. Added the PCRE2_ALT_CIRCUMFLEX option. 24. Adjust the treatment of \8 and \9 to be the same as the current Perl behaviour. 25. Static linking against the PCRE2 library using the pkg-config module was failing on missing pthread symbols. 26. If a group that contained a recursive back reference also contained a forward reference subroutine call followed by a non-forward-reference subroutine call, for example /.((?2)(?R)\1)()/, pcre2_compile() failed to compile correct code, leading to undefined behaviour or an internally detected error. This bug was discovered by the LLVM fuzzer. 27. Quantification of certain items (e.g. atomic back references) could cause incorrect code to be compiled when recursive forward references were involved. For example, in this pattern: /(?1)()((((((\1++))\x85)+)|))/. This bug was discovered by the LLVM fuzzer. 28. A repeated conditional group whose condition was a reference by name caused a buffer overflow if there was more than one group with the given name. This bug was discovered by the LLVM fuzzer. 29. A recursive back reference by name within a group that had the same name as another group caused a buffer overflow. For example: /(?J)(?'d'(?'d'\g{d}))/. This bug was discovered by the LLVM fuzzer. 30. A forward reference by name to a group whose number is the same as the current group, for example in this pattern: /(?|(\k'Pm')|(?'Pm'))/, caused a buffer overflow at compile time. This bug was discovered by the LLVM fuzzer. 31. Fix -fsanitize=undefined warnings for left shifts of 1 by 31 (it treats 1 as an int; fixed by writing it as 1u). 32. Fix pcre2grep compile when -std=c99 is used with gcc, though it still gives a warning for "fileno" unless -std=gnu99 us used. 33. A lookbehind assertion within a set of mutually recursive subpatterns could provoke a buffer overflow. This bug was discovered by the LLVM fuzzer. 34. Give an error for an empty subpattern name such as (?''). 35. Make pcre2test give an error if a pattern that follows #forbud_utf contains \P, \p, or \X. 36. The way named subpatterns are handled has been refactored. There is now a pre-pass over the regex which does nothing other than identify named subpatterns and count the total captures. This means that information about named patterns is known before the rest of the compile. In particular, it means that forward references can be checked as they are encountered. Previously, the code for handling forward references was contorted and led to several errors in computing the memory requirements for some patterns, leading to buffer overflows. 37. There was no check for integer overflow in subroutine calls such as (?123). 38. The table entry for \l in EBCDIC environments was incorrect, leading to its being treated as a literal 'l' instead of causing an error. 39. If a non-capturing group containing a conditional group that could match an empty string was repeated, it was not identified as matching an empty string itself. For example: /^(?:(?(1)x|)+)+$()/. 40. In an EBCDIC environment, pcretest was mishandling the escape sequences \a and \e in test subject lines. 41. In an EBCDIC environment, \a in a pattern was converted to the ASCII instead of the EBCDIC value. 42. The handling of \c in an EBCDIC environment has been revised so that it is now compatible with the specification in Perl's perlebcdic page. 43. Single character repetition in JIT has been improved. 20-30% speedup was achieved on certain patterns. 44. The EBCDIC character 0x41 is a non-breaking space, equivalent to 0xa0 in ASCII/Unicode. This has now been added to the list of characters that are recognized as white space in EBCDIC. 45. When PCRE2 was compiled without Unicode support, the use of \p and \P gave an error (correctly) when used outside a class, but did not give an error within a class. 46. \h within a class was incorrectly compiled in EBCDIC environments. 47. JIT should return with error when the compiled pattern requires more stack space than the maximum. 48. Fixed a memory leak in pcre2grep when a locale is set.
Diffstat (limited to 'devel/pcre2')
-rw-r--r--devel/pcre2/Makefile4
-rw-r--r--devel/pcre2/PLIST4
-rw-r--r--devel/pcre2/buildlink3.mk4
-rw-r--r--devel/pcre2/distinfo10
4 files changed, 12 insertions, 10 deletions
diff --git a/devel/pcre2/Makefile b/devel/pcre2/Makefile
index 863c28739a7..5b16e3db222 100644
--- a/devel/pcre2/Makefile
+++ b/devel/pcre2/Makefile
@@ -1,6 +1,6 @@
-# $NetBSD: Makefile,v 1.1 2015/04/19 19:18:22 wiz Exp $
+# $NetBSD: Makefile,v 1.2 2015/12/29 14:40:20 bsiegert Exp $
-DISTNAME= pcre2-10.10
+DISTNAME= pcre2-10.20
CATEGORIES= devel
MASTER_SITES= ftp://ftp.csx.cam.ac.uk/pub/software/programming/pcre/ \
${MASTER_SITE_SOURCEFORGE:=pcre/}
diff --git a/devel/pcre2/PLIST b/devel/pcre2/PLIST
index 153a2033126..885156116de 100644
--- a/devel/pcre2/PLIST
+++ b/devel/pcre2/PLIST
@@ -1,4 +1,4 @@
-@comment $NetBSD: PLIST,v 1.1 2015/04/19 19:18:22 wiz Exp $
+@comment $NetBSD: PLIST,v 1.2 2015/12/29 14:40:20 bsiegert Exp $
bin/pcre2-config
bin/pcre2grep
bin/pcre2test
@@ -12,6 +12,7 @@ man/man1/pcre2-config.1
man/man1/pcre2grep.1
man/man1/pcre2test.1
man/man3/pcre2.3
+man/man3/pcre2_callout_enumerate.3
man/man3/pcre2_code_free.3
man/man3/pcre2_compile.3
man/man3/pcre2_compile_context_copy.3
@@ -95,6 +96,7 @@ share/doc/pcre2/html/README.txt
share/doc/pcre2/html/index.html
share/doc/pcre2/html/pcre2-config.html
share/doc/pcre2/html/pcre2.html
+share/doc/pcre2/html/pcre2_callout_enumerate.html
share/doc/pcre2/html/pcre2_code_free.html
share/doc/pcre2/html/pcre2_compile.html
share/doc/pcre2/html/pcre2_compile_context_copy.html
diff --git a/devel/pcre2/buildlink3.mk b/devel/pcre2/buildlink3.mk
index 222865c0dfb..136b1fb98cd 100644
--- a/devel/pcre2/buildlink3.mk
+++ b/devel/pcre2/buildlink3.mk
@@ -1,11 +1,11 @@
-# $NetBSD: buildlink3.mk,v 1.1 2015/04/19 19:18:22 wiz Exp $
+# $NetBSD: buildlink3.mk,v 1.2 2015/12/29 14:40:20 bsiegert Exp $
BUILDLINK_TREE+= pcre2
.if !defined(PCRE2_BUILDLINK3_MK)
PCRE2_BUILDLINK3_MK:=
-BUILDLINK_API_DEPENDS.pcre2+= pcre2>=10.10
+BUILDLINK_API_DEPENDS.pcre2+= pcre2>=10.20
BUILDLINK_PKGSRCDIR.pcre2?= ../../devel/pcre2
.endif # PCRE2_BUILDLINK3_MK
diff --git a/devel/pcre2/distinfo b/devel/pcre2/distinfo
index 72d4eafa661..e57c6be48f1 100644
--- a/devel/pcre2/distinfo
+++ b/devel/pcre2/distinfo
@@ -1,6 +1,6 @@
-$NetBSD: distinfo,v 1.2 2015/11/03 03:29:02 agc Exp $
+$NetBSD: distinfo,v 1.3 2015/12/29 14:40:20 bsiegert Exp $
-SHA1 (pcre2-10.10.tar.bz2) = a27c0beb6bfb828586d22769f5111183122ab42d
-RMD160 (pcre2-10.10.tar.bz2) = 0b1c4554c312faeca3e207ebd0ad60f9cf5d9f1f
-SHA512 (pcre2-10.10.tar.bz2) = c012022793cb6e569009590e12aee3ce847064fe09358fe98da9d67f4d150b798a6a92d54b2df31a352a21e79a098aac9ea801d7fa8d37cdcc77b6d0d6bdb5a7
-Size (pcre2-10.10.tar.bz2) = 1339199 bytes
+SHA1 (pcre2-10.20.tar.bz2) = b2f69ea90ae8e2a9d4f62e2bce04dd5df10f97d2
+RMD160 (pcre2-10.20.tar.bz2) = 11084c0047df0768299039f620ec408fa04a0a89
+SHA512 (pcre2-10.20.tar.bz2) = 3fcad35581a9d8e3b84b3509ada618165c0b53edb9622aa6ad92e83103eddabd6cfa8ce3aa9339bf5e0cf560b6f4ed07f37fcd3faa3b977964e610f23c99f639
+Size (pcre2-10.20.tar.bz2) = 1358380 bytes