Determining the association constant between a cyclodextrin and a guest molecule is an important task for various applications in various industrial and academic fields. However, such a task is time consuming, tedious and requires samples of both molecules. A significant number of association constants and relevant data is available from the literature. The availability of data makes the use of machine learning techniques to predict association constants possible. However, such data is mainly available from tables in articles or appendices. It is necessary to make them available in a computer friendly format and to curate them. Furthermore, the raw data need to be enriched with physicochemical information about each molecule and when such information does not allow to discriminate molecules, some additional data is needed. Here is presented a dataset built from data gathered from the literature. The dataset contains both the original raw data from the articles and the enriched ones. The scripts used to curate and enrich the raw data are also provided.
Tahıl, Gökhan and Delorme, Fabien and Le Berre, Daniel and Monflier, Eric and Sayede, Adlane and Tilloy, Sebastien, Curated Dataset of Association Constants between a Cyclodextrin and a Guest for Machine Learning. Chemical Data Collections 2023, 101022. http://dx.doi.org/10.2139/ssrn.4345654