
Re: the discussion so far

This page is part of the web mail archives of SRFI 75 from before July 7th, 2015. The new archives for SRFI 75 contain all messages, not just those from before July 7th, 2015.

To: Thomas Bushnell BSG <tb@xxxxxxxxxx>
Subject: Re: the discussion so far
From: Alex Shinn <alexshinn@xxxxxxxxx>
Date: Wed, 20 Jul 2005 11:48:51 +0900
Cc: Matthew Flatt <mflatt@xxxxxxxxxxx>, srfi-75@xxxxxxxxxxxxxxxxx
Delivered-to: srfi-75@xxxxxxxxxxxxxxxxx
In-reply-to: <87fyuatx8e.fsf@xxxxxxxxxxxxxxxxx>
References: <E1Dtlz7-0000Mq-00@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> <5fb7e087050718200423778e82@xxxxxxxxxxxxxx> <87fyuatx8e.fsf@xxxxxxxxxxxxxxxxx>
Reply-to: Alex Shinn <alexshinn@xxxxxxxxx>

On 7/20/05, Thomas Bushnell BSG <tb@xxxxxxxxxx> wrote:
> Alex Shinn <alexshinn@xxxxxxxxx> writes:
> 
> >   CHAR-*CASE, CHAR-CI=?
> >     - as in R5RS
> >     - folds ASCII *only* (please don't encourage bad code)
> 
> I'm ok with this, but with bear's amendment: put "ascii" in the name.

I had originally suggested a name with ASCII (and note that this is not
an encoding-based name, as bear said, but a char-set-based name).

The primary argument in favor of keeping the names as-is is partial
backwards compatibility with R5RS.  Character-level case operations are
currently used in programs for one of two semantic reasons - either
ASCII-based parsing or linguistic case mapping.  In the former case,
keeping the current R5RS names means no changes are needed and the
program continues to function properly.  In the latter case, the code is
fundamentally broken and needs to be rewritten to use string-level
operations anyway.  Unfortunately, in the latter case the code will
continue to work for English-speaking authors, so the rewrite is
unlikely to happen.  Do we favor backwards compatibility as much as
possible, or do we introduce a deliberate incompatibility and force
people to stop using broken concepts?
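
To make the two uses concrete, here is a minimal sketch.  It is written
in Python purely as a neutral illustration; ascii_char_upcase and
ascii_char_ci_eq are hypothetical stand-ins for ASCII-only versions of
CHAR-UPCASE and CHAR-CI=?, not names proposed in this SRFI.

```python
def ascii_char_upcase(ch: str) -> str:
    """Upcase one character, touching only ASCII a-z (hypothetical name)."""
    if "a" <= ch <= "z":
        return chr(ord(ch) - 32)
    return ch


def ascii_char_ci_eq(a: str, b: str) -> bool:
    """Case-insensitive comparison that folds ASCII letters only."""
    return ascii_char_upcase(a) == ascii_char_upcase(b)


# ASCII-based parsing: comparing a keyword character by character is fine.
keyword_matches = all(ascii_char_ci_eq(x, y) for x, y in zip("Define", "DEFINE"))

# Linguistic case mapping: a per-character map returns exactly one
# character per input character, so it cannot produce the two-character
# uppercase form of sharp s.  A string-level operation can.
per_char = "".join(ascii_char_upcase(c) for c in "straße")
whole_string = "straße".upper()
```

Here per_char keeps the sharp s unchanged, while whole_string comes out
one character longer than its input, which is the kind of result a
character-to-character procedure cannot express.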

This decision is also affected by the overall naming convention of the
SRFI.  If we are to have separate ASCII-based procedures and
Unicode-aware procedures, in general are the R5RS procedures thought of
as ASCII or as Unicode?  This is subjective - people may want to keep
the R5RS names for the semantics they use most often, but this will be
different depending on the type of programming you do.

On another note, so far the conversation is neglecting the predicates
CHAR-*CASE?.  Since these are defined as Unicode properties of
individual characters, it does make sense to keep them as
character-level operations.
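
As a rough sketch of why the predicates are well defined per character,
the following Python looks up each character's Unicode general category.
The names char_upper_case and char_lower_case are hypothetical, and
testing for the categories Lu and Ll is an assumption made for
illustration; it only approximates the full Unicode Uppercase and
Lowercase properties.

```python
import unicodedata


def char_upper_case(ch: str) -> bool:
    """True if the character's general category is Lu (uppercase letter)."""
    return unicodedata.category(ch) == "Lu"


def char_lower_case(ch: str) -> bool:
    """True if the character's general category is Ll (lowercase letter)."""
    return unicodedata.category(ch) == "Ll"


# Each answer depends on the single character alone, with no need to
# look at neighbouring characters or at a locale.
checks = [(c, char_upper_case(c), char_lower_case(c)) for c in "AÉß1"]
```

Unlike case conversion, these lookups never need to return more than
one character, so nothing is lost by keeping them at the character
level.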

-- 
Alex

Follow-Ups:
- Re: the discussion so far
  - From: Thomas Bushnell BSG

References:
- the discussion so far
  - From: Matthew Flatt
- Re: the discussion so far
  - From: Alex Shinn
- Re: the discussion so far
  - From: Thomas Bushnell BSG
