xPDFSearch 1.11 - Content plugin to search text in PDF files

Discuss and announce Total Commander plugins, addons and other useful tools here, both their usage and their development.

Moderators: white, Hacker, petermad, Stefan2

Post Reply
AmberBand
Junior Member
Junior Member
Posts: 6
Joined: 2014-02-16, 10:47 UTC

Post by *AmberBand »

First: Thank you for this very, very helpful Plugin!
If you are scanning a local folder for a word which should be included in one of the PDF how does xPDFSearch works? It opens every PDF Document in the Temp-Folder and scans the text?
I ask because I am looking for a possibility to scan some PDF documents which are saved in a Webdav folder or Cloud. I can't get it to work with the Webdav plugin and I don't know if it's not possible because of technical limitations or it might work if I have the right configuration.
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2AmberBand
If you are scanning a local folder for a word which should be included in one of the PDF how does xPDFSearch works? It opens every PDF Document in the Temp-Folder and scans the text?
The file is opened where it is not in the temp folder.
I ask because I am looking for a possibility to scan some PDF documents which are saved in a Webdav folder or Cloud. I can't get it to work with the Webdav plugin and I don't know if it's not possible because of technical limitations or it might work if I have the right configuration.
I guess the best way is to have a powerful server that provides this functionality. Downloading the file first and scan it locally sounds cumbersome. In TC you can use content plugins only for local and UNC paths.
User avatar
white
Power Member
Power Member
Posts: 4593
Joined: 2003-11-19, 08:16 UTC
Location: Netherlands

Re: xPDFSearch 1.07 - Content plugin to search text in PDF f

Post by *white »

Lefteous in first post wrote: Current version 1.07
A new version is ready for download. The new version supports 64 bit.

The complete list of changes can be viewed online.

Plugin page: Download, description und documentation
totalcmd.net
"Plugin page: Download, description und documentation" contains information and download of version 1.05....
Last edited by white on 2022-03-21, 16:22 UTC, edited 1 time in total.
Reason: fixed BBCode
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2white
Thanks for the hint. I will fix it as soon as possible.
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2Ovg
Sorry for the delay. I have now continued my work on the plugin. So far I can only confirm that search for cyrillic text doesn't work.
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

2Lefteous

Thank you for answer and very useful plugin. I'll wait... :-)
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2Ovg
I cannot say for sure if I could support Unicode characters which would include Cyrillic characters using the code base I use for my plugin but it seems that TC doesn't support Unicode for this kind of searches. See here:
http://www.ghisler.ch/board/viewtopic.php?p=166149#166149
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

Current version 1.10
A new version is ready for download. The new version is based on xpdf version 3.04. The plugin now supports Unicode in filenames and contains now a Croatian translation. I would like to thank all who contributed to this plugin.

The complete list of changes can be viewed online.

Plugin page: Download, description und documentation
xPDFSearch Homepage
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

2Lefteous

Unfortunately it doesn't work with Cyrillic, but there is not any problem with English texts :-). BTW Thank you once more!!!
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2Ovg
Yes I know - did you read my post before the release post? Don't hesitate o convince Christian Ghisler that Unicode support for text search plugins would be a nice feature.
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

2Lefteous

Yes, I read and I'll try to ask Christian for this support.
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48021
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) »

Hmm, what should ft_fulltextw be? UTF-16? or UTF-8?
Author of Total Commander
https://www.ghisler.com
User avatar
milo1012
Power Member
Power Member
Posts: 1158
Joined: 2012-02-02, 19:23 UTC

Post by *milo1012 »

ghisler(Author) wrote:Hmm, what should ft_fulltextw be? UTF-16? or UTF-8?
If it's called ft_fulltextw, it should use Wide strings, meaning UTF-16.
(and also conforming to the Unicode functions TC already uses)

While UTF-8 could be possible, I think it's way easier for plugin authors to handle UTF-16, because of the native Win-API functions,
and UTF-8 needs special treatment, especially when the current text Window (current text portion) relies on characters that are in the past/next Window.
Of course, you'd also need to take care on double-word UTF-16 characters (High and Low Surrogates), but it's still a lot easier to handle than the UTF-8 encoding scheme.
TC plugins: PCREsearch and RegXtract
iana
Senior Member
Senior Member
Posts: 345
Joined: 2010-07-27, 22:00 UTC

Post by *iana »

I'm not really an expert here but Cyrillic should be supported with the 8 bit ASCII code page, isn't Unicode needed for scripts with huge amount of chars like Chinese/Japanese/Korean?

ps I tested the custom fields and it displays Cyr chars properly, the searches works with Cyrillic too.

@Ovg
Unfortunately it doesn't work with Cyrillic
you should set regional settings to your language that might be the issue.
Post Reply