Search in files

Thomrima · Post by *Thomrima » 2012-06-01, 12:20 UTC

Hello
I'm using the search function to seach a string in multiple text filese. Unfortunately some of the files are Unicode files, others are UTF-8 files (perhaps some ANSI files). If I click the checkbox Unicode TC finds it only in Unicode files, the same with UTF-8.
This behaviour seams to be new in TC 8.0 but I'm not sure about it.
Is there an ability to search in all files regardles their coding type?

Post by *ghisler(Author) » 2012-06-03, 15:01 UTC

No there isn't, sorry. It's not that easy to detect the encoding of a file. Some files like EXE files may even contain both ANSI text and Unicode UTF-16 (e.g. strings and resources).

Thomrima · Post by *Thomrima » 2012-06-04, 07:33 UTC

Ok, I see the problem. However what do you think about this:
Make it possible to check unicode and utf-8 (and ASCII) at the same time and search the files twice?

Thomrima · Post by *Thomrima » 2012-06-04, 07:39 UTC

And ...
what does the TC search if non of the three boxes (unicode,ASCII and UTF) is checked?

umbra · Post by *umbra » 2012-06-04, 08:00 UTC

Thomrima wrote:what does the TC search if non of the three boxes (unicode,ASCII and UTF) is checked?

By default, it presumes ANSI encoding.

Post by *ghisler(Author) » 2012-06-04, 15:10 UTC

Yes, ANSI encoding with the current codepage for non-Unicode programs.

Valentino · Post by *Valentino » 2012-06-04, 19:46 UTC

I support the idea to allow selecting several encoding checkboxes at once. It's not trivial to make it fast, plus co-operation with Lister but it would be awesome.

Post by *ghisler(Author) » 2012-06-07, 10:49 UTC

OK, moving thread to suggestions forum.

Sob · Post by *Sob » 2012-06-07, 19:21 UTC

It can't be too hard. It just needs one thread for reading data from disk to memory buffer and then separate threads for every search working on it. Result is zero slowdown on disk operations (everything is still read only once) and with no more than three parallel searches (ANSI, Unicode, UTF-8, although in theory even more different encodings could be added) it can slow down only some older single-core PCs. And even there, if user needs to search for all three variants, it's still better to do just one slower search than three separate ones.

Total Commander

Search in files

Search in files

Suggestion

And ...

Re: And ...