xPDFSearch 1.11 - Content plugin to search text in PDF files

Discuss and announce Total Commander plugins, addons and other useful tools here, both their usage and their development.

Moderators: Stefan2, white, sheep, Hacker

Post Reply
krapet
Junior Member
Junior Member
Posts: 18
Joined: 2007-08-02, 06:39 UTC
Location: Czech

Post by *krapet » 2016-02-18, 14:30 UTC

I'm using plugin in to change file attributes. See Image: http://i.imgur.com/J1RtYeU.png

For most files it is working. Unfortunately I have trouble to get Modified date and time for some PDF files,
for example this file: http://www.farnell.com/datasheets/327249.pdf

Explorer - General Tab Image: http://imgur.com/5jGuwvx.png
Explorer - PDF Tab Image: http://imgur.com/5NiC5mS.png

User avatar
Lefteous
Power Member
Power Member
Posts: 9457
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous » 2016-02-20, 18:51 UTC

2krapet
Thanks for your report. I can confirm the issue. I'm currently testing a fixed version. I will announce it here when it's done.

The problem is that different programs display different values for the date. Especially the time zone information is handled differently. So what is the reference application?

krapet
Junior Member
Junior Member
Posts: 18
Joined: 2007-08-02, 06:39 UTC
Location: Czech

Post by *krapet » 2016-02-20, 21:08 UTC

PDF format was developed by Adobe so I prefer to use as reference their Acrobat (Reader).

stanly01@excite.com
Junior Member
Junior Member
Posts: 6
Joined: 2016-06-10, 01:12 UTC

Post by *stanly01@excite.com » 2016-06-10, 12:25 UTC

How to search if a PDF file is searchable, editable or not?

User avatar
Lefteous
Power Member
Power Member
Posts: 9457
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous » 2016-06-10, 12:31 UTC

2stanly01@excite.com
The plugin respects these properties. The 'Editable' property doesn't matter to searching.

User avatar
Lefteous
Power Member
Power Member
Posts: 9457
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous » 2016-06-12, 21:59 UTC

Here is a first beta version that supports Unicode text extraction. I have a collection of test files and the results where mixed.

One document which seems to works quite well is the following:
http://lefteous.totalcmd.net/tc/archives/xpdfsearch/sample.pdf

One known problem is that German umlauts are not decoded in some files.

http://lefteous.totalcmd.net/tc/archives/xpdfsearch/wdx_xpdfsearch_2.00_beta_1.zip

User avatar
Ovg
Power Member
Power Member
Posts: 594
Joined: 2014-01-06, 16:26 UTC
Location: MOW

Post by *Ovg » 2016-06-13, 07:33 UTC

2Lefteous
Unfortunately this beta doesn't work for me at all. Even doesn't find "Die" in your sample.pdf. I have tested both x32/x64 versions of TC 9.0 β1 at Windows 7 SP1 x64... It seems that finding process doesn't start - not found return immediately. Any suggesting will be appreciate.
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 9.21a x64, Windows 7 SP1 x64

User avatar
Lefteous
Power Member
Power Member
Posts: 9457
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous » 2016-06-13, 07:39 UTC

2Ovg
Thanks for testing this early version. Please focus your test on the 'Document start' field not the fulltext search.

User avatar
Ovg
Power Member
Power Member
Posts: 594
Joined: 2014-01-06, 16:26 UTC
Location: MOW

Post by *Ovg » 2016-06-13, 07:54 UTC

2Lefteous
Thanks for reply! I have tested - no luck.
Now I get error:

---------------------------
Total Commander 9.0Я1
---------------------------
Access violation.
Access violation
Windows 7 SP1 6.1 (Build 7601), base: 0400000

Please report this error to the Author, with a description
of what you were doing when this error occurred!

Stack trace (x64):91C842
7700530020002C
Press Ctrl+C to copy this report!
Continue execution?
---------------------------
Yes No
---------------------------

In standalone search I get similar error:
http://rgho.st/8MhGTd2FS (x32) or simple "Access violation" for x64 ....
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 9.21a x64, Windows 7 SP1 x64

User avatar
Lefteous
Power Member
Power Member
Posts: 9457
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous » 2016-06-13, 07:56 UTC

2Ovg
OK thanks for your reply. If you have a certain pdf that you can to see included in my tests please send it to me.

User avatar
Ovg
Power Member
Power Member
Posts: 594
Joined: 2014-01-06, 16:26 UTC
Location: MOW

Post by *Ovg » 2016-06-13, 08:05 UTC

2Lefteous
Ok, I'll try!
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 9.21a x64, Windows 7 SP1 x64

User avatar
Ovg
Power Member
Power Member
Posts: 594
Joined: 2014-01-06, 16:26 UTC
Location: MOW

Post by *Ovg » 2016-06-13, 09:28 UTC

2Lefteous

I have found that beta plugin working fine for me from general tab of File Find dialog. (Yes, I can find Cyrillic text! :mrgreen:) From plugins tab doesn't work at all with errors which I have mentioned above.
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 9.21a x64, Windows 7 SP1 x64

User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 37349
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) » 2016-06-13, 16:53 UTC

Thanks, I will test it too, the error may be on TC's side.
Author of Total Commander
http://www.ghisler.com

User avatar
Lefteous
Power Member
Power Member
Posts: 9457
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous » 2016-06-13, 21:01 UTC

I can confirm that the search from main site passes with my test file collection using xpdfsearch.Document start will the same field leads to an error message similar to the above posted.

User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 37349
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) » 2016-06-15, 21:08 UTC

There were a few errors: It doesn't really work when you report the field as Ansi and then returned Unicode. This should work fine now. Also it only searched the first block, which should be fixed now too!

I noticed that the Chinese in your PDF is a bit strange: I cannot copy it with Foxit Reader nor with the Firefox PDF viewer, and entering it manually couldn't find it either. Any ideas?
Author of Total Commander
http://www.ghisler.com

Post Reply