Gilles Vandewiele et al’s 2020 paper Overly Optimistic Prediction Results on Imbalanced Data: Flaws and Benefits of Applying Over-sampling provides a sobering reminder to take Machine Learning studies with a grain of salt: Almost 50% of the 24 peer-reviewed studies that use machine learning based on a particular publicly-available dataset, were fundamentally flawed. These studies claimed near-perfect accuracy at predicting the risk of pre-term birth for a patient; after correcting the metho...