Re: Match-result submatch extraction is weird

This page is part of the web mail archives of SRFI 115 from before July 7th, 2015. The new archives for SRFI 115 contain all messages, not just those from before July 7th, 2015.

To: SRFI-115 discussion list <srfi-115@xxxxxxxxxxxxxxxxx>

Subject: Re: Match-result submatch extraction is weird

From: Alex Shinn <alexshinn@xxxxxxxxx>

Date: Wed, 16 Oct 2013 08:04:52 +0900

Delivered-to: srfi-115@xxxxxxxxxxxxxxxxx

Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=veuyGKlzUuE5H+DRMgCgCOBZPbKa0fDx0GRgIfoj0i0=; b=iD4GBUshI2Daiej0Q0UOsGwI42kqbB77d4xb2szTWzqm+7BV4NFyBtTF+wuoiTvw1q bq4SjC6s5muFUjWEWJz4do8O5QYniMqyYueqEjm8L6jRBK13AIg2PwUzGpWMD89sfq2X 8XGQZux/P4kmYcLD0jxBXsGN1PSf6w4CILSzhruyHtYYxDt36ZPFhMVrr1VpxzLIxre2 WErzxto6oDaDPVOW2H8yLelXXrGxwKt9a3hesZ/JF/XxC/RzvNyNYEi4caGe85ryDxnD oMVA3z/oO9IlBIwIUfcW5J30SsiquR3hNw7+GMEAHT6NIY5bBAFkQ6IDkcEcp57BR7is 0hFA==

In-reply-to: <20131015192356.GC17096@frohike.xs4all.nl>

References: <20131015192356.GC17096@frohike.xs4all.nl>

On Wed, Oct 16, 2013 at 4:23 AM, Peter Bex <Peter.Bex@xxxxxxxxx> wrote:

Hi all,

After clearing the initial roadblocks, I tried to use the library.
Being used to irregex I naively typed away and got this:

(define m (regexp-match '(seq ($ "x") "y") "xy"))
(rx-match-submatch m 1) => ERROR!

So, I looked up the syntax again and was confused: why does the
rx-match-submatch procedure require the input string again?

Actually I can't remember why I made this change.

The advantage is that it uses one word less memory

for each match (and while running for each active

state in the search). It's also nicer on GC if you have

a long-lived match object but don't actually need to

extract any substrings (unlikely).

On the other hand it's less convenient, and can give

weird results if passed the wrong string as you point

out. It's also a bad design if you want a chunked

string API like in IrRegex.

Unless anyone has any other objections, I'll change the

rx-match-submatch API to omit the string input.

Alex