Re-port new integer-scanning utility methods #127

headius · 2024-12-12T22:41:10Z

This is a re-port of the C code from @byroot for fast base 10 and base 16 integer scanning.

In #125, @kou pointed out there's an intermittent failure in the JRuby extension. We were unable to confirm exactly the circumstances that cause that failure, but this re-port should at least help reduce the change it is a bug in the original Java code.

This may help prevent ArrayIndexOutOfBounds randomly seen in CI. See ruby#125

* Parse base should be from current pointer so add that to the get calls. * Pull out ascii check into util method. * Rename curr local var to ptr to better match C version.

This is to align the base10 code with the freshly-ported base16 code. See ruby#125

headius · 2024-12-12T22:42:03Z

@kou @byroot this should be ready to go but I have one concern about the C code...

It doesn't seem to update curr after the parse? I have that in the Java code because it seemed necessary.

This is temporary until we're sure that the AIOOB from ruby#125 has been fixed by ruby#127.

byroot · 2024-12-12T22:57:45Z

It doesn't seem to update curr after the parse?

It does inside strscan_parse_integer

headius · 2024-12-12T23:00:20Z

It does inside strscan_parse_integer

Aha, so it does! I will tweak this to move the final integer parse and curr update into a similar method, so it aligns better with the C code.

This aligns with C code that does the final parse and curr update in strscan_parse_integer.

kou · 2024-12-13T01:38:05Z

Thanks!

@byroot

Fix GH-152 CRuby can walk off the end because there's always a null byte. In JRuby, the byte array is often (usually?) the exact size of the string. So we need to check if len++ walked off the end. This code was ported from a version by @byroot in #127 but I missed adding this check due to a lack of tests. A test is included for both "-" and "+" parsing.

@byroot

(ruby/strscan#153) Fix ruby/strscan#152 CRuby can walk off the end because there's always a null byte. In JRuby, the byte array is often (usually?) the exact size of the string. So we need to check if len++ walked off the end. This code was ported from a version by @byroot in ruby/strscan#127 but I missed adding this check due to a lack of tests. A test is included for both "-" and "+" parsing. ruby/strscan@1abe4ca556

headius added 3 commits December 12, 2024 16:26

Re-port scan_base16_integer from the C code

acf7b67

This may help prevent ArrayIndexOutOfBounds randomly seen in CI. See ruby#125

Additional tweaks for scan_base16_integer

854c3e3

* Parse base should be from current pointer so add that to the get calls. * Pull out ascii check into util method. * Rename curr local var to ptr to better match C version.

Re-port scan_base10_integer from the C code

de7d9c0

This is to align the base10 code with the freshly-ported base16 code. See ruby#125

Add env for JRuby to force compile for better errors

79f682f

This is temporary until we're sure that the AIOOB from ruby#125 has been fixed by ruby#127.

Add strscanParseInteger equivalent

9b6f02c

This aligns with C code that does the final parse and curr update in strscan_parse_integer.

kou merged commit ea0786b into ruby:master Dec 13, 2024
37 checks passed

headius mentioned this pull request May 3, 2025

jruby: Check if len++ walked off the end #153

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Re-port new integer-scanning utility methods #127

Re-port new integer-scanning utility methods #127

Uh oh!

headius commented Dec 12, 2024

Uh oh!

headius commented Dec 12, 2024

Uh oh!

byroot commented Dec 12, 2024

Uh oh!

headius commented Dec 12, 2024

Uh oh!

Uh oh!

kou commented Dec 13, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Re-port new integer-scanning utility methods #127

Re-port new integer-scanning utility methods #127

Uh oh!

Conversation

headius commented Dec 12, 2024

Uh oh!

headius commented Dec 12, 2024

Uh oh!

byroot commented Dec 12, 2024

Uh oh!

headius commented Dec 12, 2024

Uh oh!

Uh oh!

kou commented Dec 13, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants