[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Comments on SRFI 69

This page is part of the web mail archives of SRFI 69 from before July 7th, 2015. The new archives for SRFI 69 contain all messages, not just those from before July 7th, 2015.

To: srfi-69@xxxxxxxxxxxxxxxxx
Subject: Comments on SRFI 69
From: David Van Horn <dvanhorn@xxxxxxxxxxxxxxx>
Date: Thu, 11 Aug 2005 10:23:47 -0400
Delivered-to: srfi-69@xxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (X11; U; SunOS sun4u; en-US; rv:1.6b) Gecko/20031206 Thunderbird/0.4

The SRFI document states the following in the Abstract:

   This SRFI specifies an API for basic hash tables. Hash tables are data
   structures that provide a mapping from some set of keys to some set of
   values associated to those keys.

From your description of what hash tables are, the name "hash table" seemstoo specific of a term. This issue was raised earlier by Marc Feeley, and asfar as I can tell was never addressed in the document or discussion list.Perhaps, I'm just missing it. However, the more important issue here is thatwhat follows in the document is far from a basic API for structures thatprovide a mapping from some set of keys to some set of values associated tothose keys.

One of the most common ways SRFIs go wrong is that their purpose is notclearly articulated. Without a clear thesis, it is impossible to evaluatedesign choices and rationales, or even provide helpful suggestions. Luckilythis SRFI does state its aims, however there are conflicts with what isstated. Further, design choices have been made that violate these aims, andrationales rarely appeal to the aims in a consistent manner.


The SRFI document states the following in the Rationale:

   The primary aim of this SRFI is to provide a simple and generic hash table
   API that will answer most of users' needs for basic usage of hash tables.

This conflates two disparate and competing aims into one; that of providing asimple and generic API for a data structure which "provides a mapping fromsome set of keys to some set of values associated to those keys," and that ofcovering the general usage patterns of hash tables (whatever those patternsmay be). The abstract makes no mention of most common hash table usage, sothis second aim seems out of the scope of this SRFI. But several choices aremade contrary to the aim of simplicity and generality, such as the ad-hoccollection of type-specialized hash table procedures and their hash functioncounterparts. Appeals for generality in the API have been discounted by theauthor saying such things as, "I'd rather define these routines to account forthe most common situation(s) and be done with it."

On the other hand, there is widespread use of immutable hash tables, tableswith weakly held keys, concurrency, and GC-sensitive tables, but this SRFIaddresses none of those common usages or the issues that arise in theirpresence, and is therefore deficient on this second stated aim.

By conflating these aims, the design choices lack a clear purpose and oftenseem to reflect the authors personal preferences rather than a reasonedrationale. Indeed, it is difficult if not impossible to evaluate a designchoice when there is not a clear and consistent aim for the SRFI. I thinkthis document would greatly benefit a more explicit statement of its purpose,resolving the conflict of the current aims. As it exists now, the SRFIneither provides a basic API for a key-value mapping datastructure, norprovides an API covering most, or even common, users' hash table needs.

Also, I think a key aim that this SRFI should have, but does not, is theselection of names that represent consistent conventions with existing Schemepractices.


The SRFI document states the following in the Rationale:

   Hash tables are widely recognized as a fundamental data structure for many
   kinds of computational tasks. Almost every non-minimal Scheme
   implementation provides some kind of hash table functionality.

This is certainly true. The majority of Scheme's I'm familiar with include ahash table datastructure and their common operations, however the names varyhighly. This highlights what should be a primary concern in the design ofthis SRFI, but which has been neglected; to identify a consistent, andportable set of names and parameter conventions.

The author has chosen the name and parameter conventions that run counter toseveral existing Scheme conventions, such as previous SRFIs, RnRS, andnumerous Scheme implementations. Some choices have no precedent whatsoever.To choose such conventions is perfectly allowable, but the advantage of thesenew conventions must be thoroughly articulated and compelling. I don't thinkthat is the case here. Further, if the aim of this goal is to cover commonhash table usage, unprecedented names and conventions run counter to this aim;something which has never been used before is not common.

My preference for this SRFI is the following. Drop the aim of covering commonhash table usage. Writing such a SRFI is a very ambitious and difficult thingto do, and it requires an extensive amount of surveying common use. I wouldexpect such a SRFI to discuss the design choices taken by most Schemeimplementations, the several related SRFIs, as well as similar libraries fromrelated languages such as ML and Lisp. SRFI 1 is a good example of such a"common use" SRFI. Shivers surveyed R4RS/R5RS Scheme, MIT Scheme, Gambit,RScheme, MzScheme, slib, Common Lisp, Bigloo, guile, T, APL and the SMLstandard basis in designing that library. A good common use hash table SRFIwould need to do likewise.

Instead, this SRFI should focus on providing a simple and generic API for datastructures that provide a mapping from some set of keys to some set of valuesassociated to those keys. All parts of this SRFI that do not contribute tothat aim should be dropped. All rationales that do not appeal this aim,should be abandoned. I would like to see this API be consistent with theexisting datastructure API conventions that exist in Scheme. Most notably,this SRFI should be consistent in its choice of names and parameterconventions with SRFI 44 [1]. If the author chooses against theseconventions, this SRFI then conflicts and competes with SRFI 44 (and others)and as such the "rationale should explain why the present proposal is asubstantial improvement" over these existing conventions, as required by theprocess document.

This SRFI will be an important one. People will turn to it regardless of howwell or poorly constructed it is. Without a clear and consistent aim, which Ibelieve is the case now, such a SRFI can do a great deal of harm.


David

[1] This issue of SRFI 44 names was raised as the first comment during thediscussion period by Scott Miller, to which Bear voiced criticism over SRFI 44on implementation and usability grounds. This is irrelevant. SRFI 44included a great deal of work on identifying the proper names for preciselythis kind of datastructure. The choices include rationales, some of whichhave been appealed to in deciding names in this SRFI. If this SRFI is notgoing to use the names of SRFI 44, it *must* include compelling rationales forthese names over the names (and rationales) identified in SRFI 44. Many ofthe choices made thus far in the SRFI directly conflict with SRFI 44,sometimes in very confusing ways, such as hash-table-equivalence-function,which a reader of SRFI 44 would expect to return an equivalence over the itemsin the hash table collection, i.e. key value pairs, whereashash-table-key-equivalence-function would return what SRFI 69 returns forhash-table-equivalence-function.

Follow-Ups:
- Re: Comments on SRFI 69
  - From: Panu Kalliokoski
- Re: Comments on SRFI 69
  - From: felix winkelmann

Prev by Date: Re: error in hash table reference implementation
Next by Date: Re: Comments on SRFI 69
Previous by thread: Draft period extension, new draft
Next by thread: Re: Comments on SRFI 69
Index(es):
- Date
- Thread