An AI startup ran five simulations, each controlled by a different model. The results varied wildly.