Re: Mixing characters and bytes

This page is part of the web mail archives of SRFI 68 from before July 7th, 2015. The new archives for SRFI 68 contain all messages, not just those from before July 7th, 2015.

To: Michael Sperber <sperber@xxxxxxxxxxxxxxxxxxxxxxxxxxx>
Subject: Re: Mixing characters and bytes
From: Alex Shinn <alexshinn@xxxxxxxxx>
Date: Fri, 26 Aug 2005 10:46:49 +0900
Cc: Per Bothner <per@xxxxxxxxxxx>, srfi-68@xxxxxxxxxxxxxxxxx
Delivered-to: srfi-68@xxxxxxxxxxxxxxxxx
In-reply-to: <y9lbr3mc5zr.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
References: <430C20B1.6010102@xxxxxxxxxxx> <y9ly86rjl1x.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> <5fb7e08705082418402cd0f486@xxxxxxxxxxxxxx> <y9lbr3mc5zr.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>

On 8/26/05, Michael Sperber <sperber@xxxxxxxxxxxxxxxxxxxxxxxxxxx> wrote:
> 
> Alex Shinn <alexshinn@xxxxxxxxx> writes:
> 
> > On 8/25/05, Michael Sperber <sperber@xxxxxxxxxxxxxxxxxxxxxxxxxxx> wrote:
> >>
> >> The string ports specified in SRFI 6 can support byte operations
> >> perfectly meaningfully.  I believe SRFI 68 contains a variation of it.
> >
> > This is difficult to use in a Scheme that puts tight restrictions on
> > its strings, such as requiring them to be valid UTF-8, or if it performs
> > character-level semantic operations such as automatically
> > normalizing all strings.
> 
> If you feed a string output port a bad encoding, sure you get bad
> data.  This is a matter of specifying what happens in that case, which
> the SRFI does.

I don't see where the SRFI specifies what happens in this case.

However, it isn't just about bad encodings, but about string semantics.
It's much easier to perform automatic normalization if you only have to
work at the character level.
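
To illustrate the point, here is a minimal sketch in Python (not SRFI 68 code, and not any particular Scheme's behaviour). It assumes a string type that must always hold valid, NFC-normalized text, and compares a port that only ever receives whole characters with one that receives arbitrary byte writes. The function names `normalize_chars` and `normalize_bytes_naively` are hypothetical and exist only for this example.

```python
import unicodedata

def normalize_chars(chunks):
    """Character-level port: every write is a sequence of whole characters,
    so the accumulated text can simply be joined and normalized to NFC."""
    return unicodedata.normalize("NFC", "".join(chunks))

def normalize_bytes_naively(byte_chunks):
    """Byte-level port (naive): each write is decoded and normalized on its
    own, so a write boundary may fall inside an encoded character."""
    out = []
    for chunk in byte_chunks:
        # Invalid or truncated sequences become U+FFFD replacement characters.
        text = chunk.decode("utf-8", errors="replace")
        out.append(unicodedata.normalize("NFC", text))
    return "".join(out)

# "e" followed by COMBINING ACUTE ACCENT (U+0301); NFC composes this to U+00E9.
decomposed = "e\u0301"
encoded = decomposed.encode("utf-8")  # three bytes: 0x65 0xCC 0x81

# Character-level writes: the boundary falls between two whole characters.
char_result = normalize_chars(["e", "\u0301"])

# Byte-level writes: the boundary falls inside the two-byte combining accent.
byte_result = normalize_bytes_naively([encoded[:2], encoded[2:]])

print(repr(char_result))
print(repr(byte_result))
```

A byte-level port could of course buffer incomplete sequences before decoding, but that is extra state and extra specification (what to do with bytes that never form a valid character), which a character-only interface avoids.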

-- 
Alex
