Re: Encodings.

This page is part of the web mail archives of SRFI 52 from before July 7th, 2015. The new archives for SRFI 52 contain all messages, not just those from before July 7th, 2015.



On Thursday 12 February 2004 06:45 pm, bear wrote:
> > Defining valid identifier syntax such that case folding of
> > (unnormalized) identifier literals should be sufficient.
> >
> > What am I missing?
>
> You're missing all the tools and utilities out there that are
> programmed with the expectation and requirement that they can
> arbitrarily impose or change normalization forms without changing the
> text of the documents they handle.  There is no escaping this; even
> Emacs and Notepad do it.

Ah!  So a broken language (huge tables and complex processing) must be defined 
to deal with broken tools which do not write out Unicode data in a canonical 
format.

What about creating a tool which reads bizarre Unicode and writes it out in a 
canonical format, and then requiring portable Scheme programs to pass through 
it?

Sounds like a service to the entire Unicode community.  It could be written in 
portable Scheme and serve as a (presumably good) advertisement for the 
language.
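
As a rough illustration of the kind of filter proposed above, here is a
minimal sketch. It is written in Python rather than portable Scheme, only
because Python's standard library exposes Unicode normalization directly.
It assumes NFC is the chosen canonical form (the message does not name
one), and the names canonicalize and normalize_stream are hypothetical.

```python
import io
import unicodedata


def canonicalize(text, form="NFC"):
    """Return text rewritten in one Unicode normalization form.

    NFC is an assumption here; any of NFC, NFD, NFKC, NFKD is accepted.
    """
    return unicodedata.normalize(form, text)


def normalize_stream(src, dst, form="NFC"):
    """Read all text from src and write the normalized text to dst."""
    dst.write(canonicalize(src.read(), form))


# Usage: an identifier spelled with "e" + COMBINING ACUTE ACCENT (U+0301)
# comes out spelled with the single precomposed character U+00E9.
src = io.StringIO("(define caf\u0065\u0301 1)\n")
dst = io.StringIO()
normalize_stream(src, dst)
result = dst.getvalue()
```

Run once over each source file before it reaches the reader, a filter like
this would let the language itself compare identifiers without carrying
normalization tables, which is the trade-off being argued for.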

Don't complexify the implementation, simplify the problem!


$0.02,
-KenD

[No Unicode was normalized in the creation of this missive ;^].