File Name : figure s1_hires.tif
Caption : Fig. S1 – Extracted CSD solvents. Shown in white are the relative frequencies of single-solvent experiments in the CSD. The x-axis is log-scaled to improve visibility.

File Name : figure s2_hires.tif
Caption : Fig. S2 – Cluster sizes. Shown is the number of molecules found within each of the 50 largest clusters.

File Name : figure s3_hires.tif
Caption : Fig. S3 – Basic scaffolds for public compound clusters. Shown are the general scaffolds of each cluster for the oxdb data set. Compounds in each cluster vary only in the individual R-groups.

File Name : figure s4_hires.tif
Caption : Fig. S4 – ER diagram. Shown is the entity-relationship diagram for the Oracle SQL database used to store all experiments and their outcomes. The schema also includes advanced functionality for storing X-ray diffraction sets and outcomes.