[9.2x] TC display garbled characters Unicode filename in compressed package

Please report only one bug per message!

Moderators: white, Hacker, petermad, Stefan2

Post Reply
User avatar
thomasmolover
Member
Member
Posts: 167
Joined: 2016-12-12, 01:32 UTC

[9.2x] TC display garbled characters Unicode filename in compressed package

Post by *thomasmolover »

When using TC to browse the files in the compressed package, the filename in the compressed package are compressed by Unicode - such as the linux compressed package, which will display garbled characters. but in any other compress app, they are displayed correct.

here is the package

https://drive.google.com/open?id=1Q--TliqT8H1QcgqkGd7FjFOlnYwEBDCu

LEFT is the correct in 7z, Right is wrong in TC

https://imgur.com/a/XY0MaMf
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48077
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Re: [9.2x] TC display garbled characters Unicode filename in compressed package

Post by *ghisler(Author) »

Strange, I get the same names as 7zip here. Which language do you use for non-Unicode programs on your system?
Control panel -> Regional and language ioptions - last tab - language for non-Unicode programs
Author of Total Commander
https://www.ghisler.com
User avatar
MVV
Power Member
Power Member
Posts: 8702
Joined: 2008-08-03, 12:51 UTC
Location: Russian Federation

Re: [9.2x] TC display garbled characters Unicode filename in compressed package

Post by *MVV »

Since it is a ZIP archive, there is a problem of missing the only standard for Unicode names...

However in my TC and 7Z names look like on the left screenshot. I have Russian as non-Unicode language. TC 9.21a 32 bit, 7-Zip 16.04.
User avatar
thomasmolover
Member
Member
Posts: 167
Joined: 2016-12-12, 01:32 UTC

Re: [9.2x] TC display garbled characters Unicode filename in compressed package

Post by *thomasmolover »

ghisler(Author) wrote: 2018-10-25, 09:33 UTC Strange, I get the same names as 7zip here. Which language do you use for non-Unicode programs on your system?
Control panel -> Regional and language ioptions - last tab - language for non-Unicode programs
I set to language with Simple Chinese, all my friend use Chinese have the problem.

One of my friends tould me that he guess Unix compress filename with UTF8noBOM, and TC can display ANSI with local or Unicode16LE,
it display the utf8noBOM in ANSI.
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48077
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Re: [9.2x] TC display garbled characters Unicode filename in compressed package

Post by *ghisler(Author) »

ZIP has a special flag for Unicode names in its standard. This shouldn't happen if the ZIP file follows the ZIP standard. I will have to analyze the file in detail to find out what's wrong.
Author of Total Commander
https://www.ghisler.com
User avatar
Usher
Power Member
Power Member
Posts: 1675
Joined: 2011-03-11, 10:11 UTC

Re: [9.2x] TC display garbled characters Unicode filename in compressed package

Post by *Usher »

2thomasmolover
It seems that you use different fonts in 7zip (what version?) and TC. Change fonts in TC, restart Windows and stop digging in fonts when testing software, please.

You can also read https://winaero.com/blog/rebuild-font-cache-windows-10/ or find similar hints for older Windows versions.
Andrzej P. Wozniak
Polish subforum moderator
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48077
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Re: [9.2x] TC display garbled characters Unicode filename in compressed package

Post by *ghisler(Author) »

I have tested this archive: The UTF-8 flag is NOT set in the headers. The problem is that in Chinese, both the UTF-8 encoding and the Chinese ANSI encoding share valid characters. Does anyone know how to reliably determine that it's UTF-8 and not local encoding? I know how to detect valid UTF-8, but this will also detect many non-UTF-8 names falsely as UTF-8.
Author of Total Commander
https://www.ghisler.com
Post Reply