Bayesian Classification of CPAN Module Failures (Re: Module dependencies and test results)

David Cantrell david at
Sun Aug 5 21:17:03 BST 2007

On Sun, Aug 05, 2007 at 12:29:39PM +0100, Andy Wardley wrote:

> P(deps) is the product of P(dep) for dep over deps, and P(dep) is the 
> probability of a module appearing as a dependency.

The data I'm spitting out are insufficient for calculating this.  If
module A depends on B and C, both of which depend on D, then D appears
as a dependency twice, but I only list it once, so you can't recover
from my output how often a module occurs as a dependency.

Also, I'm not convinced that the rest of the sums are right.  If D fails
when you try to install it as a dependency of B, then the probability of
it failing when you try to install it as a dependency of C is 1.  These
are not independent failures, so simply multiplying the probabilities
together doesn't work.
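
To make the A/B/C/D example concrete, here is a rough Python sketch.  The
graph is the one described above; the per-module success probabilities
and the helper names (DEPS, P_OK, dep_occurrences, unique_deps) are made
up purely for illustration and are not taken from any real test data.
It only shows how counting D once per path differs from counting it once
per module.  Even the deduplicated figure assumes that distinct modules
fail independently, which may not hold either.

```python
from math import prod

# The example graph: A depends on B and C, both of which depend on D.
DEPS = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}

# Invented probabilities that each module installs cleanly (assumption).
P_OK = {"B": 0.9, "C": 0.9, "D": 0.8}


def dep_occurrences(module, deps=DEPS):
    """Return every dependency, once per path by which it is reached."""
    found = []
    for dep in deps[module]:
        found.append(dep)
        found.extend(dep_occurrences(dep, deps))
    return found


def unique_deps(module, deps=DEPS):
    """Return each dependency once, as a flattened list would show it."""
    return sorted(set(dep_occurrences(module, deps)))


occurrences = dep_occurrences("A")  # D shows up twice here
unique = unique_deps("A")           # D shows up once here

# Naive product: treats the two occurrences of D as independent trials.
p_naive = prod(P_OK[d] for d in occurrences)

# D either installs or it doesn't, so it should only be counted once.
p_dedup = prod(P_OK[d] for d in unique)

print("occurrences:", occurrences)
print("unique:     ", unique)
print("naive P(all ok) = %.4f" % p_naive)
print("dedup P(all ok) = %.4f" % p_dedup)
```

With these invented numbers the naive product comes out lower than the
deduplicated one, because D's chance of failure gets charged twice.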

David Cantrell | Hero of the Information Age

"IMO, the primary historical significance of Unix is that it marks the
time in computer history where CPUs became so cheap that it was possible
to build an operating system without adult supervision."
                         -- Russ Holsclaw in a.f.c
