TextSearch (Content plugin for fulltext search in DOC)

m^2
Power Member
Posts: 1413
Joined: 2006-07-12, 10:02 UTC
Location: Poland

Post by *m^2 »

AbiWord Portable runs from whatever directory you put it in. It is a matter of preparing the working environment; I don't know exactly how. The NSIS sources are included.
Alextp
Power Member
Posts: 2321
Joined: 2004-08-16, 22:35 UTC
Location: Russian Federation

Post by *Alextp »

Plugin updated.

RTF conversion is now built-in, DOC can be searched using GetTextIFilter (link in Readme).
Alextp
Power Member
Posts: 2321
Joined: 2004-08-16, 22:35 UTC
Location: Russian Federation

Post by *Alextp »

Plugin updated.
Added support for RTF and for the UTF-16 (BE and LE) codepages.


2Ghisler
Christian, if you find the plugin useful, please add it to your Plugins section (on ghisler.com).
ghisler(Author)
Site Admin
Posts: 48070
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland

Post by *ghisler(Author) »

Your plugin looks very useful! Please use the form from here:
http://ghisler.ch/board/viewtopic.php?t=199
to tell me how I should put it on my addons page (link, direct download, etc.). If you change the plugin very often, it's better to keep it on your own site, or on totalcmd.net, where you can update it without my help.
Author of Total Commander
https://www.ghisler.com
Alextp
Power Member
Posts: 2321
Joined: 2004-08-16, 22:35 UTC
Location: Russian Federation

Post by *Alextp »

2ghisler(Author)
Thanks. I have sent you the plugin form by email.
ghisler(Author) wrote: If you change the plugin very often, it's better to keep it on your own site, or on totalcmd.net
Thanks for the hint(!) :)
ghisler(Author)
Site Admin
Posts: 48070
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland

Post by *ghisler(Author) »

:)
The big advantage of totalcmd.net vs. ghisler.com is that the plugin authors can change their plugin themselves. I find this too risky for our own company web site...
Alextp
Power Member
Posts: 2321
Joined: 2004-08-16, 22:35 UTC
Location: Russian Federation

Post by *Alextp »

Plugin updated. It is now possible to search in multiple codepages at once, which Total Commander itself cannot do yet!
See the Readme for a description.
dott
Senior Member
Posts: 285
Joined: 2007-03-04, 00:58 UTC
Location: La Plata, Argentina

Post by *dott »

2Alextp
Look at this converter; it is freeware, a 110 KB zip.

xdoc2txt: Japanese: 1, 2 _ EDIT

It can convert the following formats to plain text:
    .rtf Rich text
    .docx Microsoft WORD 2007 (OOXML)
    .xlsx Microsoft Excel 2007 (OOXML)
    .pptx Microsoft PowerPoint 2007 (OOXML)
    .doc Microsoft WORD ver5.0/95/97/2000/XP/2003
    .xls Microsoft Excel ver5.0/95/97/2000/XP/2003
    .ppt Microsoft PowerPoint 97/2000/XP/2003
    .sxw/.sxc/.sxi/.sxd OpenOffice.org
    .odt/.ods/.odp/.odg Open Document
    .jaw/jtw Ichitaro ver5
    .jbw/juw Ichitaro ver6
    .jfw/jvw Ichitaro ver7
    .jtd/jtt Ichitaro ver8/9/10/11/12
    .oas/oa2/oa3 OASYS/Win
    .bun New pine/pine 5/pine 6
    .wj2/wj3/wk3/wk4/123 Lotus 123
    .wri Windows3.1 Write
    .pdf Adobe PDF
    .mht Web archive
    .html HTML
    .eml The export type of OutlookExpress
Last edited by dott on 2007-09-13, 15:13 UTC, edited 1 time in total.
Probably not
dott
Senior Member
Posts: 285
Joined: 2007-03-04, 00:58 UTC
Location: La Plata, Argentina

Post by *dott »

I've run some tests with xdoc2txt and TextSearch, and whatever setting I use, I always get an error message:

Cannot run converter for "doc".
Command: "c:\Archivos de programa\totalcmd\plugins\wdx\TextSearch\Conv\XDOC\xdoc2txt.exe" "c:\Documents and settings\Daniel\Mis documentos\Mis archivos recibidos\test.doc" "C:\DOCUME~1\Daniel\CONFIG~1\Temp\TextSrch.txt"
I have tried:
EXT=FULL_PATH_TO_CONVERTER\xdoc2txt.exe or
EXT=Conv\XDOC\xdoc2txt.exe
and every combination of "{In}", "{Out}", {InShort}, {OutShort}, even in>out, and I always get the ERROR :cry:

However, when run from the command line it produces a TextSearch.txt (in>out) or shows the text in the console.

Probably I'm missing something :oops:
:?:
Alextp
Power Member
Posts: 2321
Joined: 2004-08-16, 22:35 UTC
Location: Russian Federation

Post by *Alextp »

2dott
It seems you didn't double-quote the Exe filename.
You need to write:

EXT="FULL_PATH_TO_CONVERTER\xdoc2txt.exe" "{In}" "{Out}"
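To illustrate why the quotes matter, here is a minimal Python sketch. It is not how TextSearch itself is implemented: the function names and the paths are hypothetical, and `shlex` in non-POSIX mode only roughly approximates how Windows splits a command line. It fills the `{In}` and `{Out}` placeholders of a converter line and counts the resulting arguments.

```python
import shlex


def expand_command(template: str, in_path: str, out_path: str) -> str:
    """Fill the {In} and {Out} placeholders of a converter template."""
    return template.replace("{In}", in_path).replace("{Out}", out_path)


def count_arguments(command: str) -> int:
    # posix=False keeps backslashes literal and treats "..." as one token,
    # which roughly approximates Windows command-line splitting.
    return len(shlex.split(command, posix=False))


# Hypothetical paths containing spaces, for illustration only.
in_path = r"c:\Documents and settings\Daniel\test.doc"
out_path = r"C:\Temp\TextSrch.txt"

quoted = expand_command(
    r'"C:\My Tools\xdoc2txt.exe" "{In}" "{Out}"', in_path, out_path
)
unquoted = expand_command(
    r"C:\My Tools\xdoc2txt.exe {In} {Out}", in_path, out_path
)

print(count_arguments(quoted))    # 3: exe, input file, output file
print(count_arguments(unquoted))  # 6: every space starts a new argument
```

With the quotes, each path stays a single argument even though it contains spaces; without them, the executable name and the file names are broken apart.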
dott
Senior Member
Posts: 285
Joined: 2007-03-04, 00:58 UTC
Location: La Plata, Argentina

Post by *dott »

2Alextp
I've tried that too; same ERROR message.
theosdikaios
Senior Member
Posts: 228
Joined: 2006-02-04, 13:02 UTC

Post by *theosdikaios »

2dott
Thanks for the fine tool xdoc2txt. Unfortunately, I found only a non-English (Japanese?) description.

It works fine with TextSearch if TextSearch.ini contains these lines:
TextSearch.ini wrote:
; xdoc2txt
XDOC=%COMSPEC% /CCONV\XDOC\XDOC2TXT.EXE "{In}">"{Out}"
sxw=XDOC
sxc=XDOC
sxi=XDOC
sxd=XDOC
odt=XDOC
ods=XDOC
odp=XDOC
odg=XDOC
docx=XDOC
docm=XDOC
xlsx=XDOC
xlsm=XDOC
pptx=XDOC
pptm=XDOC
doc=XDOC
xls=XDOC
ppt=XDOC
rtf=XDOC
jaw=XDOC
jtw=XDOC
jbw=XDOC
juw=XDOC
jfw=XDOC
jvw=XDOC
jtd=XDOC
jtt=XDOC
oas=XDOC
oa2=XDOC
oa3=XDOC
bun=XDOC
wj2=XDOC
wj3=XDOC
wk3=XDOC
wk4=XDOC
123=XDOC
wri=XDOC
pdf=XDOC
mht=XDOC
html=XDOC
eml=XDOC
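The lines above use an alias: each extension points to the name `XDOC`, and `XDOC` holds the actual command. The command goes through `%COMSPEC% /C` because the `>` redirection is handled by the command interpreter, not by xdoc2txt itself. The following Python sketch shows one way such a lookup could work. It is an assumption for illustration only: the thread does not show how TextSearch really parses its ini file, and the names `parse_converters` and `resolve` are hypothetical.

```python
SAMPLE_INI = r'''
; xdoc2txt
XDOC=%COMSPEC% /CCONV\XDOC\XDOC2TXT.EXE "{In}">"{Out}"
doc=XDOC
odt=XDOC
html=XDOC
'''


def parse_converters(ini_text: str) -> dict:
    """Collect key=value lines, skipping blanks and ';' comments."""
    entries = {}
    for line in ini_text.splitlines():
        line = line.strip()
        if not line or line.startswith(";"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            # Keys are stored lowercase so lookups are case-insensitive.
            entries[key.strip().lower()] = value.strip()
    return entries


def resolve(entries: dict, ext: str):
    """Return the command for an extension, following one alias level."""
    value = entries.get(ext.lower())
    if value is not None and value.lower() in entries:
        # The value names another entry (e.g. doc=XDOC), so use that one.
        return entries[value.lower()]
    return value


entries = parse_converters(SAMPLE_INI)
print(resolve(entries, "doc"))   # the XDOC command line
print(resolve(entries, "txt"))   # None: no converter configured
```

Keeping the command in one place means a change to the converter line applies to every extension mapped to it.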
"Since there are many things which have never happened and never will happen,
and which nevertheless are clearly conceivable, and imply no contradiction,
how can one say they are absolutely impossible?" Leibniz
Alextp
Power Member
Posts: 2321
Joined: 2004-08-16, 22:35 UTC
Location: Russian Federation

Post by *Alextp »

2theosdikaios
Thank you. I'll add an xdoc2txt link to the converters list in the Readme.
dott
Senior Member
Posts: 285
Joined: 2007-03-04, 00:58 UTC
Location: La Plata, Argentina

Post by *dott »

2theosdikaios

Thanks for the advice, it works perfectly :D

The program itself prints:

Usage: xdoc2txt [-s|-e|-j][-c][-f][-r=(0|1|2)] <filename...>
        -s : convert to ShiftJIS(default)
        -e : convert to EUC
        -j : convert to JIS
        -f : output to file
        -c : activate PDF cache
        -p : print property
        -n : ignore permission on PDF;(require cryptlib.dll)
        -r=(0|1|2) : ruby style(0:suppress 1:parentheses 2:aozora bunko
        -o=0 : other option; -o=0:no show PDF page#
        -g=# : PDF gap parameter
        -v : show version number
The Japanese page adds (translated through Babel Fish):

-p 		for OLE2 compound documents, show the document properties (valid for Office and Ichitaro files)
-r=		-r=2 uses the << >> Aozora Bunko style
-o=		-o=1 deletes carriage returns in PDF (use when vertically written text starts a new paragraph at every character)
Carolina
Junior Member
Posts: 2
Joined: 2007-11-27, 10:36 UTC

Post by *Carolina »

theosdikaios wrote: It works fine with TextSearch if TextSearch.ini contains this lines:
TextSearch.ini wrote:
; xdoc2txt
XDOC=%COMSPEC% /CCONV\XDOC\XDOC2TXT.EXE "{In}">"{Out}"

html=XDOC
eml=XDOC
First of all, I'm new to Total Commander ;-), but I can't get this plugin to work.
I've created a new subfolder XDOC (like the PDF and ODF subfolders that come with the plugin) and extracted xdoc2txt.exe (v1.26) into it.
Then I updated TextSearch.ini with the text from above, but when I search ~300 .html files I don't get any match.