This is a demo store. No orders will be fulfilled.
Integrating machine learning and nano-QSAR models to predict the oxidative stress potential caused by single and mixed carbon nanomaterials in algal cells
In silico methods are increasingly important in predicting the ecotoxicity of engineered nanomaterials (ENMs), encompassing both individual and mixture toxicity predictions. It is widely recognized that ENMs trigger oxidative stress effects by generating intracellular reactive oxygen species (ROS), serving as a key mechanism in their cytotoxicity studies. However, existing in silico methods still face significant challenges in predicting the oxidative stress effects induced by ENMs. Herein, we utilized laboratory-derived toxicity data and machine learning methods to develop quantitative nanostructure-activity relationship (nano-QSAR) classification and regression models, aiming to predict the oxidative stress effects of five carbon nanomaterials (fullerene, graphene, graphene oxide, single-walled carbon nanotubes, and multi-walled carbon nanotubes) and their binary mixtures on Scenedesmus obliquus cells. We constructed five nano-QSAR classification models by combining zeta potential (ζP) with the C4.5 decision tree, support vector machine, artificial neural network, naive Bayes, and K-nearest neighbor algorithms. Moreover, we constructed three classification models by integrating the features including ζP, hydrodynamic diameter (DH), and specific surface area (SSA) with the logistic regression, random forest, and Adaboost algorithms. The Accuracy, Recall, Precision and harmonic mean of Precision and Recall (F1-score) values of these models were all higher than 0.600, indicating an excellent performance in distinguishing whether CNMs have the potential to generate ROS. In addition, using the ζP, DH, and SSA descriptors, we combined decision tree regression, random forest regression, gradient boosting, and the Adaboost algorithm, and successfully constructed four nano-QSAR regression models with applicable application domains (all training and testing data points lie within 95% confidence intervals), goodness-of-fit (Rtrain2 ≥ 0.850), and robustness (cross-validation R2 ≥ 0.650) as well as predictive power (Rtest2 ≥ 0.610). The method developed would establish a fundamental basis for more precise evaluations of ecological risks posed by these materials from a mechanistic standpoint.