[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: new draft



Jorgen Schaefer scripsit:

> Just as an example, implementing a string API based on grapheme
> clusters (or abstract characters, which I think most people here
> mean) on top of codepoint vectors is not much of a problem.

The Unicode definition of "abstract character" is vague.  There is an
abstract character corresponding to each Unicode encoded character,
but there's no saying what other abstract characters may or may not
exist.  Consequently, an API based on abstract characters is a purely
abstract API.

Grapheme clusters have a sharp definition:  a grapheme cluster is a
single base character followed by zero or more nonspacing or enclosing
marks, or else a sequence of Hangul jamos constituting a single syllable.

-- 
John Cowan  jcowan@xxxxxxxxxxxxxxxxx  www.reutershealth.com  www.ccil.org/~cowan
Assent may be registered by a signature, a handshake, or a click of a computer
mouse transmitted across the invisible ether of the Internet. Formality
is not a requisite; any sign, symbol or action, or even willful inaction,
as long as it is unequivocally referable to the promise, may create a contract.
       --Specht v. Netscape