Chinese and Taiwan Regex Issue

English support forum

Moderators: white, Hacker, petermad, Stefan2

Post Reply
User avatar
makinero
Senior Member
Senior Member
Posts: 268
Joined: 2013-10-26, 10:05 UTC

Chinese and Taiwan Regex Issue

Post by *makinero »

Need Regex
Chinese, Taiwan all type.

How to Find files with names containing at least one letter Chinese.

[A-Za-z] English

[.....] Chinese, Taiwan ?
User avatar
MVV
Power Member
Power Member
Posts: 8702
Joined: 2008-08-03, 12:51 UTC
Location: Russian Federation

Post by *MVV »

Try specifying Unicode ranges, e.g. [\x3000-\x4000] for searching characters in range 0x3000-0x4000 (type desired starting and ending Unicode character codes).
User avatar
makinero
Senior Member
Senior Member
Posts: 268
Joined: 2013-10-26, 10:05 UTC

Post by *makinero »

[\x3000-\x4000]


It does not work properly because there are other names

Any one contains chars Taiwan, Chinese etc.
(Include)
姐(Master Remix).mp3
姐(Master (姐Master Remix姐).mp3


No Chinese
(Master Remix).mp3 (Ignore) (wrong found)
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

2makinero
[\x{3000}-\x{9999}]
Last edited by Ovg on 2017-02-06, 19:15 UTC, edited 1 time in total.
It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
makinero
Senior Member
Senior Member
Posts: 268
Joined: 2013-10-26, 10:05 UTC

Post by *makinero »

Ovg wrote:2makinero
may be?

Code: Select all

^.*[\u3000-\u9999].*$
No. Wrong Regex.
I'm Search for files on disk
Found:
Find all files, so bad regex.


must contain at least one Chinese (taiwan) letter.
Traditional and Simplified Chinese (Big5) or similar.
User avatar
Ovg
Power Member
Power Member
Posts: 756
Joined: 2014-01-06, 16:26 UTC

Post by *Ovg »

It's impossible to lead us astray for we don't care even to choose the way.
#259941, TC 11.01 x64, Windows 7 SP1 x64
User avatar
MVV
Power Member
Power Member
Posts: 8702
Joined: 2008-08-03, 12:51 UTC
Location: Russian Federation

Post by *MVV »

makinero,
As I said, you should type desired Unicode ranges yourself, I don't know these languages and their ranges, but you should know them if you have such names.
User avatar
makinero
Senior Member
Senior Member
Posts: 268
Joined: 2013-10-26, 10:05 UTC

Post by *makinero »

Ovg - Your new regex - now works correctly. Finds the correct Chinese names.
User avatar
makinero
Senior Member
Senior Member
Posts: 268
Joined: 2013-10-26, 10:05 UTC

Post by *makinero »

Ovg wrote:2makinero
[\x{3000}-\x{9999}]
[\x{3400}-\x{9fff}\x{f900}-\x{fa2d}] shows CJK ideographs.

or \p{Han} (Onigmo Regex)
Post Reply