FROM : Jens Alfke
DATE : Fri Jun 06 23:29:39 2008
On 6 Jun '08, at 8:13 AM, glenn andreas wrote:
> One other possible solution is to use the JavaScriptCore and make a
> JSStringRef (which works with unichars like NSString), and use
> JavaScript's regex support - that way the results will at least have
> consistent indices, work well with non-ASCII characters, etc...
JavaScriptCore is just using PCRE*. That basically supports Unicode,
but I have had problems in the past with non-Roman text in JS regular
expressions (particularly with word breaks in Japanese text). I think
ICU is a better bet.
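To make the word-break problem concrete, here is a minimal sketch. It uses Python's `re` module as a stand-in for any engine whose `\b` is defined purely by `\w`/`\W` transitions; it is not PCRE or JavaScriptCore, and the sample strings and helper names are made up for illustration. Since Japanese is written without spaces, such an engine finds no boundaries inside a sentence at all. ICU, by contrast, offers a `UREGEX_UWORD` flag that switches `\b` to the Unicode TR 29 word-boundary rules.

```python
import re

# Hypothetical sample sentences; Japanese has no spaces between words.
JAPANESE = "私は学生です"      # "I am a student"
ENGLISH = "I am a student"


def regex_words(s):
    """Return the runs of \\w characters, i.e. what a \\w-based \\b treats as words."""
    return re.findall(r"\w+", s)


def regex_boundaries(s):
    """Return every offset at which the \\b assertion matches."""
    return [m.start() for m in re.finditer(r"\b", s)]


# English: whitespace produces the \w/\W transitions, so each word is found.
print(regex_words(ENGLISH))

# Japanese: every character counts as \w, so the whole sentence is one "word"
# and the only boundaries are at the start and end of the string.
print(regex_words(JAPANESE))
print(regex_boundaries(JAPANESE))
```

An engine with an ASCII-only `\w` fails the other way round (every Japanese character is `\W`), but the result is the same: no usable word boundaries inside the sentence.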
—Jens
* and until a month or two ago it was a very ancient version of PCRE,
with prominent security holes; that's how that guy won a MacBook Pro
in the "Pwn2Own" contest.






Cocoa mail archive

