Age | Commit message (Collapse) | Author | Files | Lines |
|
licensed under the GPL, has a new home and the following changes:
Multiple-line DOCTYPE declarations are now accepted (HTMLControl.C:352).
------------------------------
Bad initialisation of "fd_" in "urlistream" fixed (urlistream.h), so that an
error message is printed if a remote document could not be retrieved.
------------------------------
A missing node now is interpreted as node '/' (urlistream.C).
------------------------------
Closing DIV, FORM and BLOCKQUOTE tags as well as the closing UL, OL and PRE
tags are now optional (HTMLParser.y), so that the program won't die any longer
if one of them is omited in the document.
------------------------------
Some dificulties with non-ASCII chars fixed (HTMLControl.C).
------------------------------
Colons in elements and attributes now are tolerated and elements are not any
longer implecitly closed at a newline (HTMLControl.C), in order to handly
MS-Word's HTML better.
------------------------------
We do not use '/dev/stdin' as file descriptor any longer (html2text.C and
urlistream.C), in order to make reading from STDIN finaly work.
------------------------------
Block elements are not enclosed in -implecit- Paragraphs any longer
(HTMLParser.y), in order to avoid superfluous newlines in the output.
------------------------------
Fixed segmentation fault on tables with "border" attribute: we now assume that
any TABLE has at least one row and one column (table.C).
------------------------------
Fixed format.C to avoid excessive runtime increment on parsing much nested
block elements.
------------------------------
Some changes in configure, Makefile.in and the documentation.
------------------------------
------------------------------
Ported to g++ version 3.0.
This uses the 'istream.h' header file from the g++3's 'backward' directory.
------------------------------
Bugfix: '-' did not work as synonym for STDIN.
------------------------------
Added support for the EURO-sign (well, almost).
------------------------------
Finaly the GNU GPL as new copyright terms for all parts of the program, after
GMRS agreed to change the program's license terms to it.
------------------------------
------------------------------
Fixed coredump when parsing empty <SCRIPT> or <STYLE> elements (HTMLControl.C).
------------------------------
New image handling: <IMG alt=""> does no longer return the value of the SRC
attribute nor "[]". Added new ~/.html2textrc options: IMG.replace.{all noalt}
and IMG.alt.{prefix suffix} with new defaults in pretty-style mode. Added
method for checking whether an attribut was set, even to a zero-value.
------------------------------
(New) Copyright terms for all changes we made since version 1.2.2.
------------------------------
Some minor changes in configure and html2text.C.
------------------------------
Updated the documentation.
------------------------------
------------------------------
Fixed segmention fault when parsing <H5> tags (typo in format.C).
------------------------------
|
|
fixes build on systems which have libpcre installed as the configure
script's detection of it is slightly broken.
|
|
|
|
|
|
NetBSD 1.6.1/shark.
|
|
also provided the buildlink2ification.
|
|
|
|
can once again find the iconv functions on systems without iconv
in libc.
|
|
|
|
variables from the catalogs.mk file to buildlink2.mk. These may be needed in
a package that does *not* provide catalog files (i.e., all the functionality
in catalogs.mk is not required).
|
|
will give it to us, and if we don't this breaks the build.
|
|
|
|
|
|
systems using gcc from a package.
|
|
- Use INSTALL_DATA_DIR instead of MKDIR.
- Capitalize comment.
|
|
|
|
|
|
|
|
parsing library.
|
|
libCrun and libm if SunPro.
fixes build with SunPro.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
New since 1.0.30:
- bugfixes: xsl:copy on namespace nodes, AVT for xsl:sort order, fix for
the debugger (Keith Isdale), output filename limitation, trio.h and
triodef.h added (Albert Chin), EXSLT node-set (Peter Breitenlohner),
xsltChoose and whitespace (Igor Zlatkovic),
stylesheet compilation (Igor Zlatkovic), NaN and sort (William Brack),
RVT bug introduced in 1.0.30
- avoid generating " (fix in libxml2-2.5.8)
- fix 64bit cleaness problem and compilation troubles introduced in
1.0.30
- Windows makefile generation (Igor Zlatkovic)
- HP-UX portability fix
|
|
Changes since 2.5.7:
- bugfixes: XPath, XInclude, file/URI mapping, UTF-16 save (Mark
Itzcovitz), UTF-8 checking, URI saving, error printing (William Brack),
PI related memleak, compilation without schemas or without xpath (Joerg
Schmitz-Linneweber/Garry Pennington), xmlUnlinkNode problem with DTDs,
rpm problem on , i86_64, removed a few compilation problems from 2.5.7,
xmlIOParseDTD, and xmlSAXParseDTD (Malcolm Tredinnick)
- portability: DJGPP (MsDos) , OpenVMS (Craig A. Berry)
- William Brack fixed multithreading lock problems
- IPv6 patch for FTP and HTTP accesses (Archana Shah/Wipro)
- Windows fixes (Igor Zlatkovic, Eric Zurcher), threading (Stéphane
Bidoul)
- A few W3C Schemas Structure improvements
- W3C Schemas Datatype improvements (Charlie Bozeman)
- Python bindings for thread globals (Stéphane Bidoul), and method/class
generator
- added --nonet option to xmllint
- documentation improvements (John Fleck)
|
|
sed, or perhaps GNU sed.
packges should define USE_GNU_SED if GNU sed is really required,
otherwise we provide ${SED} in the buildlink/bin dir, unless the sed
provided on a given platform is known to be severely broken.
[one could argue Solaris' xpg4 sed falls in the "broken" category, but
for almost all intents and purposes, it is acceptable - define
USE_GNU_SED if a package still fails.]
XXX IRIX is currently listed in _INCOMPAT_SED - this should be removed
if sed on IRIX is known to be good.
|
|
backup FTP repositories (MASTER_SITE_BACKUP) in preference to the IBM
one.
|
|
|
|
|
|
|
|
|
|
Add mdoc.local from src/gnu/usr.bin/groff/tmac.
Convert to USE_NEW_TEXINFO.
Bump PKGREVISION.
|
|
instead of share/doc/examples. Noted by Kimmo Suominen.
|
|
Many bugfixes and some new features. See CHANGES for the
full details.
|
|
|
|
environment. This makes all targets work (where-as before install wouldn't
so I'm not sure how this ever functioned..)
|
|
|
|
which now includes powerpc and sparc.
Bump PKGREVISION.
|
|
|
|
to better match other perl5 module packages. This release mirrors the
Xerces-C 1.7.0 releases and implements a DOM API (though that seems to
be broken). Also, several major memory leaks were fixed.
|
|
in the Xerces-C 1.x line and contains many fixes and improvements over
version 1.3.0. Make this package compile with ICU>=2.4.
|
|
features and new and modified APIs from version 2.4:
* Added support for Unicode 4.0
* Added support for Unicode regular expressions
* Enhanced sorting
* Added support for international domain names
* Added service registration for pluggable ICU modules
* Added layout engine API for language-specific glyphs
* Separated currencies from locales
* Added POSIX-like API for message catalogs
* Added new charset converters
|
|
This package extracts DVD subtitles from a subtitle stream and
converts it to pgm or ppm images or into VobSub format. The main
purpose is to provide the required input to OCR software to convert
the subtitle images into ASCII text. Please note that the conversion
into ASCII is not part of this package but requires an OCR program
like gocr.
|
|
GOCR is an optical character recognition program, released under the GNU
General Public License. It reads images in many formats (pnm, pbm, pgm, ppm,
some pcx and tga image files (or PNM from stdin); if PNM-tools installed and
running linux-like system you can also use pnm.gz, pnm.bz2, png, jpg, tiff,
gif, bmp and others) and outputs a text file. You do not have to train the
program or store large font bases. Simply call gocr from the command line
and get your results.
|
|
|
|
* List convrtrs.txt as a SUPPORT_FILE instead of a CONF_FILE, since the
user isn't really expected to edit it.
|
|
${PKGNAME}, so that the WRKSRC definition is immune from any subsequent
PKGREVISION changes. With thanks to jmc for the nudge.
|