Jason Orendorff (2013-10-26T13:58:46.000Z)
domenic at domenicdenicola.com (2013-10-28T14:51:53.229Z)
On Fri, Oct 25, 2013 at 11:42 PM, Norbert Lindenberg <ecmascript at lindenbergsoftware.com> wrote: > Not if the RegExp is case insensitive, or uses a character class, or ".", or a quantifier - these all require looking at code points rather than UTF-16 code units in order to support the full Unicode character set. Can you explain this more? ISTM case insensitive searches and character classes don't require finding boundaries in the string being searched. Matching /./ does, sometimes. The common use is /.*/ and in that case you don't have to find all boundaries in the text being matched, only at the end or (again, only in certain cases) if you have to backtrack. Of course all those things have code-point-oriented *semantics*, which is great. But the implementation can be faster than that. I'd like to know what you have in mind regarding quantifiers though.