summaryrefslogtreecommitdiff
path: root/converters/doc2html
AgeCommit message (Collapse)AuthorFilesLines
2001-05-14Initial import of doc2html-2.1 into the packages collection.agc6-0/+135
Provided in PR 12884 by Jesse Off (joff@newmonics.com) "External converter script for ht://Dig (version 3.1.4 and later), that converts Microsoft Word, Excel and Powerpoint files, and PDF, PostScript, RTF, and WordPerfect files to text (in HTML form) so they can be indexed. Uses a variety of conversion programs: wp2html - to convert Wordperfect and Word7 & 97 documents to HTML catdoc - to extract text from Word documents rtf2html - to convert RTF documents to HTML pdftotext - to extract text from Adobe PDFs ps2ascii - to extract text from PostScript pptHtml - to convert Powerpoint files to HTML xlHtml - to convert Excel spreadsheets to HTML or xls2csv - to obtain data from Excel spreadsheets. Written by David Adams (University of Southampton), and based on the conv_doc.pl script by Gilles Detillieux."