Names to ids

Peter Corlett abuse at
Wed Jan 25 15:46:17 GMT 2006

Simon Wistow <simon at> wrote:
> But this means that we need to have a consistent filename to integer
> mapping so that it remains consistent between runs. This means that
> collisions should be kept to an absolute minimum because if a
> totally black image's filename gets mapped to id 88888 and then a
> totally white image's filename gets mapped to the same then we're
> going to get very bogus results.

With such a tiny namespace as 2^32, the birthday paradox is going to
bite you. You have a 50% chance of two pictures having the same hash
when there are 77,162 pictures.
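
For anyone who wants to check that figure, here is a rough sketch in Python
using the usual birthday approximation. It assumes the hash spreads filenames
uniformly over the 2^32 ids, and the function names are made up for this
example rather than taken from any library.

```python
import math

def collision_probability(n, space=2**32):
    """Approximate chance that at least two of n uniformly random ids collide."""
    # Standard birthday approximation: p ~= 1 - exp(-n(n-1) / (2 * space)).
    return -math.expm1(-n * (n - 1) / (2 * space))

def items_for_probability(p, space=2**32):
    """Approximate number of ids at which the collision chance reaches p."""
    # Invert the approximation above, ignoring the small n vs n-1 difference.
    return math.sqrt(2 * space * math.log(1 / (1 - p)))

# How many pictures until a collision is about as likely as not?
print(round(items_for_probability(0.5)))

# Collision chance at the picture count quoted above.
print(collision_probability(77162))
```

The first number should land very close to the count quoted above, and the
second very close to 0.5. Passing a larger space (say 2**64) shows how quickly
the problem recedes with a wider id.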

PGP key ID E85DC776 - finger abuse at for full key
