Statistics for Statistical Learning in Drug Discovery via Clustering and Mixtures