[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Surrogates and character representation

This page is part of the web mail archives of SRFI 75 from before July 7th, 2015. The new archives for SRFI 75 contain all messages, not just those from before July 7th, 2015.

To: srfi-75@xxxxxxxxxxxxxxxxx
Subject: Re: Surrogates and character representation
From: Thomas Lord <lord@xxxxxxx>
Date: Sat, 23 Jul 2005 12:04:15 -0700
Delivered-to: srfi-75@xxxxxxxxxxxxxxxxx

Tom Emerson:
> Surrogate codepoints have a character property. They should be usable
> in a string, and individually can be considered a character. 

Thomas Bushnell:
> This is exactly part of the reason why char=codepoint is such a lose.

Nah..... because:

> Most code doesn't *want* to see this kind of garbage; 

Nobody disagrees.


> it's an encoding
> issue.  

Everybody agrees.

> I want chars where the *computer* takes care of the coding.  I
> want chars that are fully-understood characters, not little pieces of
> a character.

Hopefully, since it's not clear exactly how to give you what 
you want, R6RS will give you an environment in which you can
elaborate that idea in a portable way, propose it as standard,
and have lots of implementors try it out in their implementation
to see how they feel about it.

-t

Follow-Ups:
- Re: Surrogates and character representation
  - From: Thomas Bushnell BSG

Prev by Date: Re: Surrogates and character representation
Next by Date: Re: the "Unicode Background" section
Previous by thread: suggested rule of thumb for editors
Next by thread: Re: Surrogates and character representation
Index(es):
- Date
- Thread