Align the grammar of the `Decimal` string constructor with `float`'s #128185

KommuSoft · 2024-12-22T19:42:39Z

The grammar does not cover the case with underscores between the digits of a Decimal. PEP 515 introduced underscores for integers, floating-point numbers, and decimals. The documentation for integers and floating-point numbers has been updated to cover underscores, but the grammar documentation for Decimal has not.

Linked PRs

gh-128185: Align the grammar of the Decimal string constructor with float's #128315
gh-128185: align Decimal docs with spec (case irrelevant for nan/inf's) #128323

picnixz · 2024-12-22T20:24:53Z

The grammar does not cover the case with underscores between the digits of a Decimal

What do you mean by digits of a decimal? A Decimal object is constructed from either a float or a string. Are you talking about the string constructor? AFAIK Decimal("123_456") is allowed.

Do you mean it's not specified in the docs, namely that you can have _ in the string form?
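
For reference, a minimal sketch of the behavior mentioned above; the variable names are only illustrative.

```python
from decimal import Decimal

# Underscores inside the digit string are accepted by the string constructor.
grouped = Decimal("123_456")
plain = Decimal("123456")

print(grouped)           # 123456
print(grouped == plain)  # True
```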

KommuSoft · 2024-12-22T20:48:18Z

Yes, but the documentation on that seems outdated, since the grammar mentioned in the docs does not mention the underscore style.

picnixz · 2024-12-22T21:00:28Z

Well... it says (emphasis mine):

it should conform to the decimal numeric string syntax after leading and trailing whitespace characters, as well as underscores throughout, are removed:

So technically, it's fine? We don't want to overload the grammar, I think.
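
The quoted rule can be sketched as a two-step check: clean the string, then match it against the grammar. This is only an illustration. The regular expression is an approximation of the documented grammar written for this example (ASCII digits only, case-insensitive keywords), and `conforms_after_cleanup` and `decimal_accepts` are hypothetical helper names, not part of the `decimal` module.

```python
import re
from decimal import Decimal, InvalidOperation

# Approximation of the documented numeric-string grammar (an assumption
# made for this sketch, restricted to ASCII digits).
NUMERIC_STRING = re.compile(
    r"""
    [+-]?                                   # optional sign
    (?:
        (?:[0-9]+\.[0-9]*|\.?[0-9]+)        # decimal-part
        (?:[eE][+-]?[0-9]+)?                # optional exponent-part
      | inf(?:inity)?                       # infinity
      | s?nan[0-9]*                         # quiet or signaling NaN
    )
    """,
    re.VERBOSE | re.IGNORECASE,
)


def conforms_after_cleanup(text):
    """Strip outer whitespace, drop every underscore, then match the grammar."""
    cleaned = text.strip().replace("_", "")
    return NUMERIC_STRING.fullmatch(cleaned) is not None


def decimal_accepts(text):
    """Report whether the real constructor accepts the string."""
    try:
        Decimal(text)
    except InvalidOperation:
        return False
    return True


samples = [" 1_000.5 ", "1__.__2", "sNaN12", "-Inf", "1 000", "1e", "."]
for text in samples:
    print(f"{text!r}: sketch={conforms_after_cleanup(text)}, Decimal={decimal_accepts(text)}")
```

For these samples the sketch and the constructor agree, which is consistent with the prose rule describing the behavior without putting `_` into the grammar itself.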

KommuSoft · 2024-12-22T21:03:18Z

Well, the grammar for float(..) includes the underscore, and except for signaling NaNs, it is identical for Decimals, so now it is inconsistent. I think the documentation should either mention the underscores, and do not add it to the grammar, or include it in the grammar, but not present it in two different ways.

picnixz · 2024-12-22T21:35:57Z

I see. I think it would be fine if we remove the mention of underscores from the text and add it to the grammar. Concerning leading and trailing whitespace, I think this should stay outside the grammar. WDYT?

skirpichev · 2024-12-22T21:52:37Z

it is identical for Decimals

It's not.

>>> from decimal import Decimal
>>> Decimal('1__.__2')
Decimal('1.2')
>>> float('1__.__2')
Traceback (most recent call last):
  File "<python-input-3>", line 1, in <module>
    float('1__.__2')
    ~~~~~^^^^^^^^^^^
ValueError: could not convert string to float: '1__.__2'

That's known, see e.g. #88433
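
The same comparison can be run over several strings at once. This is a small sketch; `accepted_by` is a hypothetical helper and the sample list is chosen only for illustration.

```python
from decimal import Decimal, InvalidOperation


def accepted_by(constructor, text):
    """Return True if constructor(text) succeeds, False if it rejects the string."""
    try:
        constructor(text)
    except (ValueError, InvalidOperation):
        # float raises ValueError; Decimal raises InvalidOperation.
        return False
    return True


samples = ["1_000.5", "1__.__2", "_1", "1_", "1_e5"]
for text in samples:
    print(
        f"{text!r}: Decimal={accepted_by(Decimal, text)}, "
        f"float={accepted_by(float, text)}"
    )
```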

now it is inconsistent

It's ok. So far we have an important module in the stdlib (called "this").

I think the documentation should either mention the underscores, and do not add it to the grammar

That's what it does: "If value is a string, it should conform to the decimal numeric string syntax after leading and trailing whitespace characters, as well as underscores throughout, are removed: [grammar follows]".

picnixz · 2024-12-22T22:12:35Z

That's what it does: "If value is a string, it should conform to the decimal numeric string syntax after leading and trailing whitespace characters, as well as underscores throughout, are removed: [grammar follows]".

What I meant is to remove "as well as underscores throughout" and put the _ in the grammar definition. For floats, we have:

but for decimals, we have

so we could update decimal-part to include _ as for the floats. That would at least align the two grammars a bit more.
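
To see what such an alignment would and would not describe, here is a hedged sketch. It assumes the float-style digit run allows at most one underscore between two digits; `DIGITPART`, `ALIGNED_DECIMAL_PART`, and `fits_aligned_grammar` are hypothetical names invented for this example, not anything in the docs or in CPython.

```python
import re
from decimal import Decimal

# Assumed float-style digit run: digits with at most one underscore
# between neighbouring digits.
DIGITPART = r"[0-9](?:_?[0-9])*"

# Hypothetical "aligned" decimal-part built from that digit run.
ALIGNED_DECIMAL_PART = re.compile(
    rf"(?:{DIGITPART}\.(?:{DIGITPART})?|\.?{DIGITPART})"
)


def fits_aligned_grammar(text):
    """Check a string against the hypothetical aligned decimal-part."""
    return ALIGNED_DECIMAL_PART.fullmatch(text) is not None


for text in ("1_000.000_1", "1__.__2"):
    print(f"{text!r}: aligned grammar={fits_aligned_grammar(text)}")

# The current constructor still accepts the second string.
print(Decimal("1__.__2"))
```

Under this assumption the aligned grammar would be stricter than what the constructor accepts today, so the two would describe different sets of strings.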

skirpichev · 2024-12-29T03:40:44Z

I still think it's not a documentation issue, at least. If we leave the current parsing intact, IMO it's better to be more vague in the description of how exactly underscores are removed.

Maybe it's OK to revisit the parsing of decimal literals. Here is some motivation from Stefan Krah for the current version:

I'd keep it simple for Decimal: Remove left and right whitespace (we're already doing this), then remove underscores from the remaining string (which must not contain any further whitespace), then use the IBM grammar.

We could add a clause to the PEP that only those strings that follow the spirit of the PEP are guaranteed to be accepted in the future.

One reason for keeping it simple is that I would not like to slow down string conversion, but thinking about two grammars is also a problem -- part of the string conversion in libmpdec is modeled in ACL2, which would be invalidated or at least complicated with two grammars.

But the patch from #88433 doesn't look complex to me (if it's correct ;-)). However, if the current behavior is to be changed, it should be properly deprecated first.

On the other hand, even after this, the decimal grammar will be slightly different from floatvalue: Python floats don't support signaling NaNs, nor "diagnostic" information for NaNs.
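
A short sketch of that difference; `float_accepts` is a hypothetical helper written for this example.

```python
from decimal import Decimal


def float_accepts(text):
    """Return True if float(text) succeeds."""
    try:
        float(text)
    except ValueError:
        return False
    return True


quiet = Decimal("NaN123")    # quiet NaN carrying the diagnostic digits 123
signaling = Decimal("sNaN")  # signaling NaN

print(quiet.is_qnan(), signaling.is_snan())
print(str(quiet))

for text in ("nan", "snan", "nan123"):
    print(f"float({text!r}) accepted: {float_accepts(text)}")
```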

Also, the current docs don't mention that case is insignificant. The specification says: "the characters in the strings accepted for infinity and nan may be in any case".
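
The case-insensitivity can be checked with a few spellings. This is a sketch with sample strings picked only for illustration.

```python
from decimal import Decimal

infinities = ["inf", "INF", "Infinity", "iNfInItY"]
nans = ["nan", "NaN", "NAN", "sNaN", "SNAN"]

for text in infinities:
    print(f"{text!r} -> {Decimal(text)!r}")

for text in nans:
    print(f"{text!r} -> {Decimal(text)!r}")
```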

Removing the "easy" label, as it's not an easy issue at all.

KommuSoft added the docs Documentation in the Doc dir label Dec 22, 2024

github-project-automation bot added this to docs issues Dec 22, 2024

github-project-automation bot moved this to Todo in docs issues Dec 22, 2024

picnixz added the pending The issue will be closed if no feedback is provided label Dec 22, 2024

picnixz removed the pending The issue will be closed if no feedback is provided label Dec 22, 2024

picnixz changed the title ~~The grammar of the Decimal seems to be outdated~~ Align the grammar of the Decimal string constructor with float's Dec 22, 2024

skirpichev added the pending The issue will be closed if no feedback is provided label Dec 22, 2024

skirpichev removed the pending The issue will be closed if no feedback is provided label Dec 22, 2024

picnixz added the easy label Dec 22, 2024

bedevere-app bot mentioned this issue Dec 28, 2024

gh-128185: Align the grammar of the Decimal string constructor with float's #128315

Open

skirpichev removed the easy label Dec 29, 2024

skirpichev added a commit to skirpichev/cpython that referenced this issue Dec 29, 2024

pythongh-128185: align Decimal docs with spec (case irrelevant for nan/inf's)

c480e71

bedevere-app bot mentioned this issue Dec 29, 2024

gh-128185: align Decimal docs with spec (case irrelevant for nan/inf's) #128323

Open
