[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Why are byte ports "ports" as such?

This page is part of the web mail archives of SRFI 91 from before July 7th, 2015. The new archives for SRFI 91 contain all messages, not just those from before July 7th, 2015.

To: Per Bothner <per@bothner.com>
Subject: Re: Why are byte ports "ports" as such?
From: Thomas Bushnell BSG <tb@becket.net>
Date: Tue, 23 May 2006 20:33:52 -0700
Cc: srfi-91@srfi.schemers.org
Delivered-to: srfi-91@srfi.schemers.org
In-reply-to: <4473991E.8040900@bothner.com> (Per Bothner's message of "Tue, 23 May 2006 16:22:06 -0700")
References: <443E9048.8000804@mazama.net> <3E48308A-DDE7-4CA3-A497-9D16079F0353@iro.umontreal.ca> <443EF463.6080907@mazama.net> <A58B440E-FFFF-4912-8731-1058B24961A1@iro.umontreal.ca> <Pine.LNX.4.58.0605210717440.14258@bolt.sonic.net> <1148225285.17773.59.camel@vmx.eros-os.org> <87lksvpe4i.fsf@qrnik.zagroda> <1148229386.17773.127.camel@vmx.eros-os.org> <20060523181545.GM2798@ccil.org> <1148409836.10739.80.camel@mikado64.cs.jhu.edu> <44735B18.1070103@bothner.com> <1148413901.10739.87.camel@mikado64.cs.jhu.edu> <44736BDA.8010705@bothner.com> <u1h1wuk8mcm.fsf@kempis.becket.net> <44738088.9060706@bothner.com> <u1hpsi475vd.fsf@kempis.becket.net> <4473991E.8040900@bothner.com>
User-agent: Gnus/5.110004 (No Gnus v0.4) Emacs/21.4 (gnu/linux)

Per Bothner <per@bothner.com> writes:

>> Except that text is an assemblage of characters, not of code points.
>> The editor needs functions like "display this character",
>
> No, it needs functions like "display this string".

Do you use emacs?  Do you ever use C-x =?

> Yes, but this is no different from "move to next word".  It doesn't
> need to work on the character except as *part of the buffer*.

Right.  And what is most convenient for the editor is to just
increment a pointer.  You want the editor to need to peek inside and
suddenly care about encodings and whatnot, things it otherwise need
not attend to.

Encodings should only matter to the editor when exporting and
importing files.  The rest of the time, the editor should be
encoding-blind. 

>> What I want is a *character* type for a text editor.
>
> What you want and what you need are not the same thing.
> Somebody who uses a text editor does not need characters;
> they need strings.  

Sorry, but I think of a string as an array of characters.  

> When you implement a text editor, characters
> can be useful, but having them as a separate data type is just
> pointless overhead.  

Great, then you don't need characters.  But *certainly* this is not an
argument for taking code points and *calling* them characters.

>> What is *certainly* useless is a "code point" type.
>
> They're useless - except for implementing strings and buffers.

Except that a string is an array of characters.

> Of course it does.  Fonts are indexed by code-point.

No, they are not.  They are indexed by character.  Consider an
accented character that is represented by several code points.  

> Well, at some point you're going to have to ask "is this a digit"
> or "is this a space".  To do that correctly and portably, you need
> to index the Unicode tables, which are indexed by code-points.  Of
> course that is rather low-level: instead, I'm arguing for an api
> like "is the character at this position in this string/buffer
> white-space".  This is a special case of the more general: "does
> the substring after this position match this regular expression".

Except that white-space might not be a regex. ;)

> Anyway, this is all irrelevant.  Until you specify an actual
> "character API" and propose a practical implementation strategy,
> then I think the discussion is pointless.

I see, I think I already had.  Was that missing?  I'm content with the
scheme character API, with the case-related functions fixed or
removed.

Thomas

Follow-Ups:
- Re: Why are byte ports "ports" as such?
  - From: Per Bothner <per@bothner.com>

References:
- Why are byte ports "ports" as such?
  - From: Ben Goetter <goetter@mazama.net>
- Re: Why are byte ports "ports" as such?
  - From: Marc Feeley <feeley@iro.umontreal.ca>
- Re: Why are byte ports "ports" as such?
  - From: Ben Goetter <goetter@mazama.net>
- Re: Why are byte ports "ports" as such?
  - From: Marc Feeley <feeley@iro.umontreal.ca>
- Re: Why are byte ports "ports" as such?
  - From: bear <bear@sonic.net>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: Marcin 'Qrczak' Kowalczyk <qrczak@knm.org.pl>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: John Cowan <cowan@ccil.org>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: Per Bothner <per@bothner.com>
- Re: Why are byte ports "ports" as such?
  - From: "Jonathan S. Shapiro" <shap@eros-os.org>
- Re: Why are byte ports "ports" as such?
  - From: Per Bothner <per@bothner.com>
- Re: Why are byte ports "ports" as such?
  - From: Thomas Bushnell BSG <tb@becket.net>
- Re: Why are byte ports "ports" as such?
  - From: Per Bothner <per@bothner.com>
- Re: Why are byte ports "ports" as such?
  - From: Thomas Bushnell BSG <tb@becket.net>
- Re: Why are byte ports "ports" as such?
  - From: Per Bothner <per@bothner.com>

Prev by Date: Re: Why are byte ports "ports" as such?
Next by Date: Re: Why are byte ports "ports" as such?
Previous by thread: Re: Why are byte ports "ports" as such?
Next by thread: Re: Why are byte ports "ports" as such?
Index(es):
- Date
- Thread