FROM : Deborah Goldsmith
DATE : Wed May 07 22:28:00 2008
On Apr 28, 2008, at 5:35 AM, John Joyce wrote:
> The range of the word is up to you to find and depends on the
> language. If it is any common language from Europe, your job is a
> lot easier. You mainly need to work with whitespace and punctuation.
> If you're working with Japanese, you'll need to learn of lot of
> complex tricks to identify the range of words...
There should be no need to implement your own word breaking. Please
use one of the system APIs that will do this for you, all of which
have been mentioned in this thread. For the usage of words described
here, CFStringTokenizer is probably best as it has the sophistication
necessary to break text in Chinese and Japanese into words.
Deborah Goldsmith
Apple Inc.
<email_removed>
DATE : Wed May 07 22:28:00 2008
On Apr 28, 2008, at 5:35 AM, John Joyce wrote:
> The range of the word is up to you to find and depends on the
> language. If it is any common language from Europe, your job is a
> lot easier. You mainly need to work with whitespace and punctuation.
> If you're working with Japanese, you'll need to learn of lot of
> complex tricks to identify the range of words...
There should be no need to implement your own word breaking. Please
use one of the system APIs that will do this for you, all of which
have been mentioned in this thread. For the usage of words described
here, CFStringTokenizer is probably best as it has the sophistication
necessary to break text in Chinese and Japanese into words.
Deborah Goldsmith
Apple Inc.
<email_removed>






Cocoa mail archive

