[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Should SRFI-115 character sets match extended grapheme clusters?

This page is part of the web mail archives of SRFI 115 from before July 7th, 2015. The new archives for SRFI 115 contain all messages, not just those from before July 7th, 2015.

To: Mark H Weaver <mhw@xxxxxxxxxx>
Subject: Re: Should SRFI-115 character sets match extended grapheme clusters?
From: Alex Shinn <alexshinn@xxxxxxxxx>
Date: Tue, 13 May 2014 23:07:02 +0900
Cc: SRFI-115 discussion list <srfi-115@xxxxxxxxxxxxxxxxx>
Delivered-to: srfi-115@xxxxxxxxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=xt+Lg+lijFvzaL4I9rbmLQFEzTNs9UKDgOzY2LUf4jU=; b=i9yE6+It7WQVmyFxiA8tZ2jIc1bMrsjVwmCgr6Im/XKkB+JKh5+fJUU/2UHEsOmlfi C84jYlWZCPuN17lTO20yPV9XWIu4ycUMJ3uaAhXKfTbTWyU+DkveYYpoQhpk6W5PJMDq HiFxqosooskUeBE+pBjjMIQpde6a0Y/lFpl2WYED5LH1bXvGAQw8MYyX0xy7K5aT7jcd RkZWvu4lHIBWAoLKFIWAZLcPGG7cbU6+pEtI0GE0bXgO0biH1/Eaz7LhKxR9SvBXl/6J l0CrNtVkS9F+efe5HWcySqoSEKY66dAFblxF2gDOgmkLy/2Xw0ldwTDg0kCL7NfosMnH alvA==
In-reply-to: <CAMMPzYMZGvLnL0SVQ1Jzz=rgQ-YX_cwvTHzYhw-hWiT5nxesGQ@mail.gmail.com>
References: <87bnv4ifwu.fsf@yeeloong.lan> <CAMMPzYMZGvLnL0SVQ1Jzz=rgQ-YX_cwvTHzYhw-hWiT5nxesGQ@mail.gmail.com>

On Mon, May 12, 2014 at 11:35 AM, Alex Shinn <alexshinn@xxxxxxxxx> wrote:

[...] Even simple mappings of large Unicode

char-sets can be expensive to compute (until the ref impl optimized
known case-insensitive char-sets I believe (w/nocase letter) took

over a minute to iterate over all 10k+ Letter code-points, look up all
their case variants an insert them into a new set).

I exaggerate - it's more like 3 seconds for 100k+ code points,

in an Scheme optimized for space over speed, using a char-set

lib optimized for space over speed. That's still slow enough to

be a concern.

Alex

References:
- Should SRFI-115 character sets match extended grapheme clusters?
  - From: Mark H Weaver
- Re: Should SRFI-115 character sets match extended grapheme clusters?
  - From: Alex Shinn

Prev by Date: Re: revised w/nocase text, considering titlecase and cased
Next by Date: Re: revised w/nocase text, considering titlecase and cased
Previous by thread: Re: Should SRFI-115 character sets match extended grapheme clusters?
Next by thread: new draft available
Index(es):
- Date
- Thread