TC unicode conversion - status update

English support forum

Moderators: white, Hacker, petermad, Stefan2

User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48083
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) »

2BeckYang
Please click on the "font" button and choose a bit larger font.

Btw, good news - I have implemented UTF-8 support via on the fly conversion of the current line to UTF-16, and it's working very well! But currently you can compare only two UTF-8 files with each other. I will implement also the comparison of an UTF-8 file with an UTF-16 file, but probably not an UTF-8 file with an Ansi file.
Author of Total Commander
https://www.ghisler.com
User avatar
Clo
Moderator
Moderator
Posts: 5731
Joined: 2003-12-02, 19:01 UTC
Location: Bordeaux, France
Contact:

The most---

Post by *Clo »

2ghisler(Author)

:) Good evening !

• That's really nice to hear, thanks ! I missed to compare some language files UTF-8 –> <–UTF-8 (i.e. for 7Z…).
- It's the most needed and useful, I guess.

:mrgreen: VG
Claude
Clo
#31505 Traducteur Français de TC French translator Aide en Français Tutoriels Français English Tutorials
BeckYang
Junior Member
Junior Member
Posts: 29
Joined: 2006-04-02, 10:33 UTC

Post by *BeckYang »

ghisler(Author) wrote:2BeckYang
Please click on the "font" button and choose a bit larger font.

Btw, good news - I have implemented UTF-8 support via on the fly conversion of the current line to UTF-16, and it's working very well! But currently you can compare only two UTF-8 files with each other. I will implement also the comparison of an UTF-8 file with an UTF-16 file, but probably not an UTF-8 file with an Ansi file.
I try the "font" button and change font charset.
But it does not help, no font support "UTF-8" charset.
I need TC convert the string to unicode or ??? like TC's internal viewer does.
And make compare result windows display these chars correctly.

For example, here is the screen capture of file1.txt
Image: http://server2.uploadit.org/files/ylonggyahoo-tc_view.jpg
The left side display file in "Text" and right side display the file in "UTF-8".

Referent the compare result window that I posted before.
Image: http://server2.uploadit.org/files/ylonggyahoo-tc_compare.jpg
The chars in compare result window is the same as "Text" which is not able to identify by us. What we need is display it as "UTF-8".
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48083
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) »

I try the "font" button and change font charset.
But it does not help, no font support "UTF-8" charset.
I need TC convert the string to unicode or ??? like TC's internal viewer does.
You misunderstood me. I wrote that I just implemented it, it's for the next big version 7.5! Currently there is no UTF-8 support in Compare by contents, sorry. You have to convert it to UTF-16 (Unicode in Notepad) or Ansi.
Author of Total Commander
https://www.ghisler.com
StatusQuo
Power Member
Power Member
Posts: 1524
Joined: 2007-01-17, 21:36 UTC
Location: Germany

Post by *StatusQuo »

2ghisler(Author)
but probably not an UTF-8 file with an Ansi file.
Why not, is converting on the fly more difficult there?
It would e.g. be useful to compare REGEDIT4 and REGEDIT 5.0 files.

In C there are WideCharToMultiByte() and MultiByteToWideChar().
Usage example by MS
Who the hell is General Failure, and why is he reading my disk?
-- TC starter menu: Fast yet descriptive command access!
BeckYang
Junior Member
Junior Member
Posts: 29
Joined: 2006-04-02, 10:33 UTC

Post by *BeckYang »


You misunderstood me. I wrote that I just implemented it, it's for the next big version 7.5! Currently there is no UTF-8 support in Compare by contents, sorry. You have to convert it to UTF-16 (Unicode in Notepad) or Ansi.
I apologize for it. :wink:

Please consider to support this fucntion in future version.
Thanks for your wonderful work. TC really help me so much~
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2ghisler(Author)
What is the current status - two months later?
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48083
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) »

In the meantime, I have converted also the following functions to Unicode:
- background transfer manager
- synchronize dirs
- encode/decode
- split/combine
- create/verify CRC checksums
- most configuration dialogs
- all trees
- search function
- port connection
- ignore list
- ZIP extraction and creation (info-zip and pkzip methods)
- FTP (user can choose any encoding he likes, including UTF8)
- print functions
- Associate
- Compare by contents
- Set Volume Label

I haven't converted the following yet:
- comments - I see no way to convert this so far
- Custom Columns setup - or is that included in "custom view".
- Choose Command - the commands are in English, so it should be no problem
Author of Total Commander
https://www.ghisler.com
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2ghisler(Author)
Thanks again for the detailed update :-)
User avatar
Lefteous
Power Member
Power Member
Posts: 9535
Joined: 2003-02-09, 01:18 UTC
Location: Germany
Contact:

Post by *Lefteous »

2ghisler(Author)
doesn't work with servers where UTF-8 is enabled by default and cannot be turned off. Any ideas how to auto-detect these?
The question is if all these servers where UTF-8 is enabled by default can be detected as such. In my case the server lists TVFS as supported feature which indicates UTF-8 is enabled by default. Maybe there are other servers with similar feaures which also indicate that UTF-8 is supported? If this is the case a list of servers which support UTF-8 by a certain feature could be maintained.

BTW: Have you completed Unicode support meanwhile? It seems you are already working on other features.
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48083
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) »

I'm still fixing bugs regarding Unicode features which show up only over the time, but I have finished most of the conversion. I couldn't add unicode support to descript.ion because it would break the format, though.
Author of Total Commander
https://www.ghisler.com
oversky
Junior Member
Junior Member
Posts: 9
Joined: 2005-07-15, 20:49 UTC

Post by *oversky »

I have a lot of media files that have UTF-8 filenames. With TC7, I can't open these files directly by double click. I have to use the open in the right click menu. Is this the problem of associate described above?
User avatar
ghisler(Author)
Site Admin
Site Admin
Posts: 48083
Joined: 2003-02-04, 09:46 UTC
Location: Switzerland
Contact:

Post by *ghisler(Author) »

Do these files have an alternate DOS name which isn't Unicode? You can check this by turning off long names temporarily in the configuration. If they haven't, then there can be problems opening them.
Author of Total Commander
https://www.ghisler.com
oversky
Junior Member
Junior Member
Posts: 9
Joined: 2005-07-15, 20:49 UTC

Post by *oversky »

That's it. The .rmvb has a short name extension .rmv which is not assigned to the player I use. After registering the extension type, the problem is solved.

ghisler(Author) wrote:Do these files have an alternate DOS name which isn't Unicode? You can check this by turning off long names temporarily in the configuration. If they haven't, then there can be problems opening them.
User avatar
byblo
Senior Member
Senior Member
Posts: 270
Joined: 2005-02-20, 21:13 UTC
Contact:

Post by *byblo »

Any news ?
Post Reply