Page 1 of 1

sphinx

Posted: 22 Mar 2010, 15:53
by archiseek
if i install sphinx do i still need

# PdfToText [ http://www.foolabs.com/xpdf/ ]
# html2text [ http://www.mbayer.de/html2text/ ]
# UnRTF [ http://www.gnu.org/software/unrtf/unrtf.html ]

Re: sphinx

Posted: 22 Mar 2010, 16:53
by RussH
sphinx speeds the searches after they've been converted to text -therefore tools to convert PDF/HTML/DOC/RTF to TXT are still required

Re: sphinx

Posted: 22 Mar 2010, 17:31
by archiseek
so if i can manage to install those, they will automatically convert when the upload is made?

Re: sphinx

Posted: 22 Mar 2010, 17:45
by RussH
archiseek wrote:so if i can manage to install those, they will automatically convert when the upload is made?
Yep - that's what they're there for!

I don't believe the .doc app (antiword) supports docx as yet - there is a sourceforge project for this (http://docx2txt.sourceforge.net/) which released files in September 2009, but no-one has integrated this into OpenCats (as yet)