Skip to content

Conversation

@InfinityMod
Copy link
Contributor

@InfinityMod InfinityMod commented Sep 9, 2024

Hi, Pull Request #1807 ends in an infinite loop while running the test setup for the distributions, causing the tests to end until the time limit is reached.

The misbehavior can be tested just by running the test_distributions setup or only especially the
test_vmapped_binominal_p0 function solely with the following code included directly after the imports in test_distributions.py:

# Enable 64bit support for higher accuracy
if sys.maxsize > 2**32:
    jax.config.update("jax_enable_x64", True)

While I can't 100% say why this endless loop only occurs when increasing the numeric accuracy to 64 bits, I'm sure to have a suitable solution for the behavior (which also seems faulty at 32 bits).

The problem stems from the _binomial_dispatch function in util.py (see the description commit). In short, zero values for the probability p are also passed to the dispatch function, even if filtered out afterward via lax.cond. Therefore, the underlying while_loop in the _binomial_inversion function runs infinitely.

This merge request solves this issue by ensuring that no p values equal to zero are passed to the underlying functions. As lax.cond is still filtering out the results of the zero-corrected values, there's no change for instances using the mentioned methods.

def _binomial_dispatch(key, p, n):
def dispatch(key, p, n):
is_le_mid = p <= 0.5
#Make sure p=0 is never taken into account as a fix for possible zeros in p.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could we just simply clip p to tiny? jnp.clip(p, minval=jnp.finfo(p.dtype).tiny)

Copy link
Contributor Author

@InfinityMod InfinityMod Sep 10, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This works for me: jnp.clip(p, jnp.finfo(jnp.float32).tiny)
There's a change with a depreciation of the argument a_min, which changes to min for jnp.clip.
So, to support future and early versions, I set the min argument as a positional argument.
However, this can also be an error if the argument order changes.

jnp.finfo(p.dtype) doesn't seem to work, with float32 it's working.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this might affect samples with small p in x64. How about updating _binomial_inversion to

geom = jnp.floor(jnp.log1p(-u) / log1_p) + 1
...
log1_p = jnp.log1p(-p)
log1_p = jnp.where(log1_p == 0, -jnp.finfo(log1_p.dtype).tiny, log1_p)

The issue seems to come from log1_p=0 and jnp.log1p(-u) < 0, which leads to a negative geom

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you also fix the lint issue? I think you need a space # comment for the comment.

Copy link
Member

@fehiepsi fehiepsi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @InfinityMod! This issue is subtle.

@fehiepsi fehiepsi merged commit f5aca91 into pyro-ppl:master Sep 10, 2024
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants