Re: Why are byte ports "ports" as such?

This page is part of the web mail archives of SRFI 91 from before July 7th, 2015. The new archives for SRFI 91 contain all messages, not just those from before July 7th, 2015.

To: "Jonathan S. Shapiro" <shap@eros-os.org>
Subject: Re: Why are byte ports "ports" as such?
From: Per Bothner <per@bothner.com>
Date: Tue, 23 May 2006 13:08:58 -0700
Cc: srfi-91@srfi.schemers.org
Delivered-to: srfi-91@srfi.schemers.org
In-reply-to: <1148413901.10739.87.camel@mikado64.cs.jhu.edu>
References: <443E9048.8000804@mazama.net> <3E48308A-DDE7-4CA3-A497-9D16079F0353@iro.umontreal.ca> <443EF463.6080907@mazama.net> <A58B440E-FFFF-4912-8731-1058B24961A1@iro.umontreal.ca> <Pine.LNX.4.58.0605210717440.14258@bolt.sonic.net> <1148225285.17773.59.camel@vmx.eros-os.org> <87lksvpe4i.fsf@qrnik.zagroda> <1148229386.17773.127.camel@vmx.eros-os.org> <20060523181545.GM2798@ccil.org> <1148409836.10739.80.camel@mikado64.cs.jhu.edu> <44735B18.1070103@bothner.com> <1148413901.10739.87.camel@mikado64.cs.jhu.edu>
User-agent: Thunderbird 1.5 (X11/20060313)

Jonathan S. Shapiro wrote:

On Tue, 2006-05-23 at 11:57 -0700, Per Bothner wrote:

What is the use-case for read-char, as you define it?
What is the use-case for a "character" data type that is
*not* a codepoint data type?


We are getting to the jagged edge of what I know about UNICODE,


A little knowledge is a dangerous thing ...

but here is the situation as I understand it.

The underlying issue within UNICODE is the existence of the so-called
"combining characters". There exist characters that have no single
defining codepoint. These exist primarily in Asian languages, for
example in the form of multiple code points that together form a single
"glyph".


You're using the wrong terminology here, I think, but never mind.

The use case, then, seems self evident: programs that must be aware of
these at the code-point level.


You're contradicting yourself: I asked about a use-case for *character*
as a separate *data type*.

You've given no such use-case.

The codepoint==char presumption is simply untrue in some non-western
languages.


We know that.  However, there is still no need for "character" [in the
Unicode sense] as a separate data type:

Code that works on compound characters *as a unit* can and should use a
string type.  Code that needs to look *inside* a compound character
needs to work with codepoints.

In Java, "character" is actually a Unicode code-point.  This is how it
should be in Scheme, though we might want to replace the 16-bit size
by a 20-bit size to avoid the complexities of surrogate characters.
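
To make the string-versus-codepoint distinction above concrete, here is a
minimal sketch in Java. The class name CodePointDemo and its helper methods
are hypothetical, chosen only for illustration. It relies on the standard
String.codePointCount and String.codePoints methods; note that in current
Java a `char` value is a 16-bit UTF-16 code unit, and full code points are
reached through these code-point methods. The compound character is kept as
a string when handled as a unit, and iterated by code point when its parts
matter.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo class: not part of any library or of the SRFI discussion.
final class CodePointDemo {
    /** Number of Unicode code points in s (a surrogate pair counts once). */
    static int codePointCount(String s) {
        return s.codePointCount(0, s.length());
    }

    /** The code points of s, in order, formatted like "U+0301". */
    static List<String> codePoints(String s) {
        List<String> out = new ArrayList<>();
        s.codePoints().forEach(cp -> out.add(String.format("U+%04X", cp)));
        return out;
    }

    public static void main(String[] args) {
        // 'e' followed by COMBINING ACUTE ACCENT: one compound character,
        // held as a string, made of two code points.
        String composed = "e\u0301";
        // MUSICAL SYMBOL G CLEF (U+1D11E): one code point that needs a
        // surrogate pair, i.e. two 16-bit char values.
        String clef = new String(Character.toChars(0x1D11E));

        System.out.println(codePoints(composed));
        System.out.println(composed.length() + " chars, "
                + codePointCount(composed) + " code points");
        System.out.println(clef.length() + " chars, "
                + codePointCount(clef) + " code points");
    }
}
```

Code that treats the accented letter as a unit passes `composed` around as
a string; code that needs to look inside it walks the code points. The clef
case shows the surrogate complexity mentioned above: a single code point
that occupies two 16-bit units.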
--
	--Per Bothner
per@bothner.com   http://per.bothner.com/

Follow-Ups:
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: John Cowan <cowan@ccil.org>
- Re: Why are byte ports "ports" as such?
  - From: Thomas Bushnell BSG <tb@becket.net>

References:
- Why are byte ports "ports" as such?
  - From: Ben Goetter <goetter@mazama.net>
- Re: Why are byte ports "ports" as such?
  - From: Marc Feeley <feeley@iro.umontreal.ca>
- Re: Why are byte ports "ports" as such?
  - From: Ben Goetter <goetter@mazama.net>
- Re: Why are byte ports "ports" as such?
  - From: Marc Feeley <feeley@iro.umontreal.ca>
- Re: Why are byte ports "ports" as such?
  - From: bear <bear@sonic.net>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: Marcin 'Qrczak' Kowalczyk <qrczak@knm.org.pl>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: John Cowan <cowan@ccil.org>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: Per Bothner <per@bothner.com>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
