[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Surrogates and character representation

This page is part of the web mail archives of SRFI 75 from before July 7th, 2015. The new archives for SRFI 75 contain all messages, not just those from before July 7th, 2015.

To: Tom Emerson <tree@xxxxxxxxxxxxx>
Subject: Re: Surrogates and character representation
From: "John.Cowan" <jcowan@xxxxxxxxxxxxxxxxx>
Date: Sun, 24 Jul 2005 01:37:13 -0400
Cc: Thomas Bushnell BSG <tb@xxxxxxxxxx>, srfi-75@xxxxxxxxxxxxxxxxx
Delivered-to: srfi-75@xxxxxxxxxxxxxxxxx
In-reply-to: <17122.31220.22073.72951@xxxxxxxxxxxxxxxxxxxxxx>
References: <1122002894.6607.29.camel@xxxxxxxxxxxxxx> <17120.28178.788826.533753@xxxxxxxxxxxxxxxxxxxxxx> <20050722040917.GB7576@NYCMJCOWA2> <17120.30080.768671.539970@xxxxxxxxxxxxxxxxxxxxxx> <878xzykn0y.fsf@xxxxxxxxxxxxxxxxx> <17122.31220.22073.72951@xxxxxxxxxxxxxxxxxxxxxx>
User-agent: Mutt/1.4.2.1i

Tom Emerson scripsit:

> Surrogates are a side-effect of UTF-16. Period. Application-level code
> just doesn't see them. This entire discussion about whether or not a
> CHAR should include surrogate code points is, IMHO, a waste of
> everyones talents here. It's much ado about nothing.

I agree that applications developers rarely have to think about surrogates,
but language/library designers (whose job it is to make corner cases
unsuprising) do have to think about them.

FWIW, I now think (after some talk on a private Unicode list) that it's
correct to allow surrogates as Scheme characters; that is, the range of
char->integer should be 0 to #x10FFFF.

-- 
John Cowan  jcowan@xxxxxxxxxxxxxxxxx  www.reutershealth.com  www.ccil.org/~cowan
It's the old, old story.  Droid meets droid.  Droid becomes chameleon. 
Droid loses chameleon, chameleon becomes blob, droid gets blob back
again.  It's a classic tale.  --Kryten, Red Dwarf

Follow-Ups:
- Re: Surrogates and character representation
  - From: Shiro Kawai
- Re: Surrogates and character representation
  - From: Tom Emerson
- Re: Surrogates and character representation
  - From: Alan Watson

References:
- Re: the "Unicode Background" section
  - From: Thomas Lord
- Surrogates and character representation
  - From: Tom Emerson
- Re: Surrogates and character representation
  - From: John.Cowan
- Re: Surrogates and character representation
  - From: Tom Emerson
- Re: Surrogates and character representation
  - From: Thomas Bushnell BSG
- Re: Surrogates and character representation
  - From: Tom Emerson

Prev by Date: Re: the "Unicode Background" section
Next by Date: Re: Surrogates and character representation
Previous by thread: Re: Surrogates and character representation
Next by thread: Re: Surrogates and character representation
Index(es):
- Date
- Thread