admin管理员组

文章数量:1402915

I want to use regex for string replace with Cyrillic characters. I want to use exact match option. My string replace is working with Latin characters and is looking like that:

'Edin'.replace(/\Edin\b/gi, '');  // Output is ""

The same expression is not working with Cyrillic characters

'Един'.replace(/\Един\b/gi, '');  // Output is still 'Един'

I want to use regex for string replace with Cyrillic characters. I want to use exact match option. My string replace is working with Latin characters and is looking like that:

'Edin'.replace(/\Edin\b/gi, '');  // Output is ""

The same expression is not working with Cyrillic characters

'Един'.replace(/\Един\b/gi, '');  // Output is still 'Един'
Share Improve this question asked Sep 26, 2014 at 7:55 user732456user732456 2,6982 gold badges38 silver badges51 bronze badges 1
  • regex101./r/lS5tT3/42 – vks Commented Sep 26, 2014 at 8:39
Add a ment  | 

2 Answers 2

Reset to default 7

The problem here is \b word boundary chracter, which matches position at a word boundary. Word boundary is defined as (^\w|\w$|\W\w|\w\W). And in its turn word character \w is a set of ASCII characters [A-Za-z0-9_]. Obviously Cyrillic characters don't fall into this set.

For example, for the same reason /\w+/ regular expression will not match Cyrillyc string.

As dfsq wrote the problem is with word boundary. If you remove \b you will get desired output, but it is quite different regex. It will replace Един also in cases where it is a part of word. To avoid that you can use negative lookahead and define which letters shouldn't appear behind, because they could be a part of word.

'Един'.replace(/\Един(?![A-я])/gi, ''); 

本文标签: javascriptString replace exact match in cyrillicStack Overflow