summaryrefslogtreecommitdiff
path: root/converters/doc2html/DESCR
blob: 78391896a5de932aa0a11b591ebe5464245eeaa4 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
External converter script for ht://Dig (version 3.1.4 and later), that
converts Microsoft Word, Excel and Powerpoint files, and PDF,
PostScript, RTF, and WordPerfect files to text (in HTML form) so they
can be indexed.  Uses a variety of conversion programs:

	wp2html		- to convert Wordperfect and Word7 & 97 documents to HTML
	catdoc		- to extract text from Word documents
	rtf2html	- to convert RTF documents to HTML
	pdftotext	- to extract text from Adobe PDFs
	ps2ascii 	- to extract text from PostScript
	pptHtml		- to convert Powerpoint files to HTML
	xlHtml		- to convert Excel spreadsheets to HTML
or
	xls2csv		- to obtain data from Excel spreadsheets.

Written by David Adams (University of Southampton), and based on the
conv_doc.pl script by Gilles Detillieux.