Our paper introducing SIMPD is now out. SIMPD is an algorithm for creating training/test sets for molecular
#machinelearning based on an analysis of a large number of real-world medchem projects.
link.springer.com/article/10.1...
#opensource code and data are on GitHub.
github.com/rinikerlab/m...