Page 1 of 1

Error with md5-file in UTF format

Posted: 2012-05-31, 14:02 UTC
by LonerD
If the md5-file in UTF format, TC can not correctly identify the name of the file and check the checksum.
Error with Cyrillic file names.
Example:
http://rghost.ru/38396006

Re: Error with md5-file in UTF format

Posted: 2012-05-31, 16:54 UTC
by white
LonerD wrote:If the md5-file in UTF format, TC can not correctly identify the name of the file and check the checksum.
Error with Cyrillic file names.
There is no BOM in the md5 file. Should the md5 file be detected as UTF?

Posted: 2012-05-31, 20:06 UTC
by ghisler(Author)
TC currently only detects UTF-8 checksum files by BOM. UTF-16 checksum files are also detected without BOM by the location of 0 bytes within the file, which do not appear in ANSI or UTF-8 files.

Posted: 2012-06-03, 13:32 UTC
by LonerD
ghisler
Can you add UTF-16 ?
Some programs create md5-checksum files in this format, and Total can't check this checksum correctly.

Posted: 2012-06-03, 14:07 UTC
by white
LonerD wrote:ghisler
Can you add UTF-16 ?
Some programs create md5-checksum files in this format, and Total can't check this checksum correctly.
It should work. Can you supply a test file?

Posted: 2012-06-03, 15:15 UTC
by ghisler(Author)
Actually UTF-16 should already work with and without BOM.

Posted: 2012-06-03, 21:25 UTC
by LonerD
ghisler(Author)
Please, add all varians for checksum files - UTF-16 (LE/BE) and UTF-8 with ВОМ/without BOM.

Posted: 2012-06-04, 07:28 UTC
by white
LonerD wrote:ghisler(Author)
Please, add all varians for checksum files - UTF-16 (LE/BE) and UTF-8 with ВОМ/without BOM.
All variants UTF-16, LE/BE and with ВОМ/without BOM, should already work. Do you have an example that does not work?

Posted: 2012-06-05, 17:33 UTC
by LonerD
Do you have an example that does not work?
In first post.
Files with Cyrillic names.

Posted: 2012-06-05, 20:27 UTC
by white
[mod]Moved to Suggestions forum.

White (moderator)
[/mod]

Posted: 2013-08-28, 12:35 UTC
by LonerD
Actually.
Example:
http://rghost.ru/48427211
File with cyrillic name Файл.txt
md5-file generated with Fsum Frontend
TC cann't check this file with command cm_CRCcheck.
Error:
"Cannot open input file Файл.txt !"

Posted: 2013-08-28, 15:41 UTC
by ghisler(Author)
Unfortunately the file is missing the UTF-8 byte order marker (BOM). Therefore TC assumes that it's an ANSI file. If you open the file with notepad and just save it, then the check works.

Posted: 2013-08-28, 16:23 UTC
by MaxX
I've tested such scenario:
1. Gen md5 file by TC (in cp1251)
2a. ReSave this md5 file to utf8 by AkelPad
2b. ReSave this md5 file to utf16le by AkelPad

Both "2a" and "2b" files are valid for TC and md5-check works fine.