regex for han unification

matches han unification, or cjk (chinese (hanzi), japanese (kanji), and korean (hanja))

^[\u4E00-\u9FFF\u3400-\u4DBF\u20000-\u2A6DF\u2A700-\u2B73F\u2B740-\u2B81F\u2B820-\u2CEAF\u2CEB0-\u2EBEF\u30000-\u3134F\uF900-\uFAFF\u2E80-\u2EFF\u31C0-\u31EF\u3000-\u303F\u2FF0-\u2FFF\u3300-\u33FF\uFE30-\uFE4F\uF900-\uFAFF\u2F800-\u2FA1F\u3200-\u32FF\u1F200-\u1F2FF\u2F00-\u2FDF]+$

Han characters are a common feature of written Chinese (hanzi), Japanese (kanji), and Korean (hanja)


Cheatsheet

expr usage
/\w/ matches any word character (a-z, A-Z, 0-9, _)
/[0-9]/ matches all digits
/^/ matches beginning of a line
/$/ matches end of a line
iHateRegex
by geon