[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Surrogates and character representation

This page is part of the web mail archives of SRFI 75 from before July 7th, 2015. The new archives for SRFI 75 contain all messages, not just those from before July 7th, 2015.

To: tree@xxxxxxxxxxxxx
Subject: Re: Surrogates and character representation
From: Thomas Bushnell BSG <tb@xxxxxxxxxx>
Date: Sat, 23 Jul 2005 00:19:09 -0700
Cc: "John.Cowan" <jcowan@xxxxxxxxxxxxxxxxx>, srfi-75@xxxxxxxxxxxxxxxxx
Delivered-to: srfi-75@xxxxxxxxxxxxxxxxx
In-reply-to: <17120.30080.768671.539970@xxxxxxxxxxxxxxxxxxxxxx> (Tom Emerson's message of "Fri, 22 Jul 2005 00:26:40 -0400")
References: <1122002894.6607.29.camel@xxxxxxxxxxxxxx> <17120.28178.788826.533753@xxxxxxxxxxxxxxxxxxxxxx> <20050722040917.GB7576@NYCMJCOWA2> <17120.30080.768671.539970@xxxxxxxxxxxxxxxxxxxxxx>
User-agent: Gnus/5.1007 (Gnus v5.10.7) Emacs/21.4 (gnu/linux)

Tom Emerson <tree@xxxxxxxxxxxxx> writes:

> Surrogate codepoints have a character property. They should be usable
> in a string, and individually can be considered a character. 

This is exactly part of the reason why char=codepoint is such a lose.
Most code doesn't *want* to see this kind of garbage; it's an encoding
issue.  I want chars where the *computer* takes care of the coding.  I
want chars that are fully-understood characters, not little pieces of
a character.

Follow-Ups:
- Re: Surrogates and character representation
  - From: Tom Emerson
- Re: Surrogates and character representation
  - From: Ken Dickey

References:
- Re: the "Unicode Background" section
  - From: Thomas Lord
- Surrogates and character representation
  - From: Tom Emerson
- Re: Surrogates and character representation
  - From: John.Cowan
- Re: Surrogates and character representation
  - From: Tom Emerson

Prev by Date: Re: the "Unicode Background" section
Next by Date: Re: the "Unicode Background" section
Previous by thread: Re: Surrogates and character representation
Next by thread: Re: Surrogates and character representation
Index(es):
- Date
- Thread