[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Unicode surrogates

This page is part of the web mail archives of SRFI 75 from before July 7th, 2015. The new archives for SRFI 75 contain all messages, not just those from before July 7th, 2015.




On Mon, 13 Mar 2006, Tom Emerson wrote:

>rgburger@xxxxxxxxxxx writes:

>> For example, I can create a file called "\uD802.ss" in Windows.  How
>> would I be able to open this file in Scheme with the given proposal?
>
>Well, U+D802 is invalid, since it must be paired.

That doesn't matter, really.  The fact that it's in violation of
the unicode standard does not make it cease to exist or solve the
problem it creates.

To put it another way, Windows allows characters that are not part
of Unicode to be used to name files.  If we restrict our character
set for filenames to Unicode-only, we will not be able to open
those files.  That problem is real.

Hmmm.... can we use read-byte and write-byte to read and write
filenames?

				Bear