Silico Methodologies Modelling of Aquatic Toxicity in Tetrahymena Pyriformis Via Aromatic Amines
Nadia Ziani1,5, Khadidja Amirat2,5, Souhaila Meneceur3,5, Fatiha Mebarki4,5, Abderrhmane Bouafia3*
1Faculty of Science, Chemistry Department Badji Mokhtar University Annaba, Annaba, Algeria.
2Faculty of Science, Department of Chemistry University of Sétif 1 - Ferhat Abbas, El Bez, Setif 19000.
3Department of Process Engineering and Petrochemistry, Faculty of Technology,
University of El Oued, 39000 El-Oued, Algeria.
4Faculty of Science and Technology, Department of material sciences,
Amine Elokkal El hadjMoussa Eg Akhamouk University - Tamanrasset, Algeria.
5Renewable Energy Development Unit in Arid Zones (UDERZA), University of El Oued El-Oued, Algeria.
*Corresponding Author E-mail: abdelrahmanebouafia@gmail.com
ABSTRACT:
Being used to a great degree as industrial chemicals, Aromatic amines—anilines and their related derivate compounds—represent a significant portion of environmental pollutants. Aniline is the molecule from which an immense set of aromatic amines find their origin. It has occupied a place in the top hundred most important building blocks of chemistry since its discovery in 1826. In a plethora of varied applications, Aniline and its derivatives—comprising of chloro substituents—are utilized in the form of intermediates.
In June 2007, Registration, Evaluation, Authorization and Restriction of Chemicals (REACH) legislation was set in motion1. This has led to a tremendous wave of interest in the field of silico methodologies. By inaugurating and enforcing the concept of ‘no data, no market’, REACH made certain that manufacturing and importation abided by a high standard of safety for both human health and the environment. With turning away from non-essential testing (notably animal testing) being a goal that is stated explicitly by REACH, the latter offered the parties involved a plethora of options that could potentially allow them to seek out the accomplishment of such objective. This includes alternative test methods like in vitro and in silico methodologies.
Quantitative structure-activity relationships (QSAR) analysis emerged as one of the aforementioned methodologies. Due to its low cost and time requirements, it has proven itself extremely valuable2–11.
What constitutes QSAR analysis is a miscellaneous set of mathematical and statistical procedures that are characterized by the purpose of discovering functional relationships between chemical compound structure—as depicted through experimental or theoretical variables labelled molecular descriptors 12—, and their measured properties and activities.
To develop a model that can simulate the relationship between the structures of 48 aromatic amines and their theoretical descriptors, we have, in our work, conducted a QSAR study. And in order to pinpoint and select the most informative descriptors out of the entirety of the descriptors calculated by E-calc (Version 5.4) software, we used the genetic algorithm. By relying on MLR and ANN (artificial neutral networks) as different approaches, we developed a model through the selected descriptors. The usefulness of such model lies in predicting the log (1/CIC50) (decimal logarithm of the inverts of the 50%inhibitory growth concentration (IGC50) of Tetrahymenapyriformis. In order to make sure the models are valid, we have divided the data set into a training set (35 compounds) and a test set (13 compounds), using the Kennard and Stones algorithm. Furthermore, to emphasize the structural prerequisites for an ideal aquatic toxicity, various statistical methods were core to the model’s development. Our research purpose can be divided into three major aims: First, the exploration of the relationships between structure and activity when it comes to the aquatic toxicity of compounds; second, the selection of the best possible predictive model among other similar chemometric models of aquatic toxicity, and third, using the two approaches MLR and ANN to test the level of performance and the stability of the obtained model.
METHODS:
The anilines and their derivate compounds were subject to a QSPR study. It was conducted thanks to the assistance of a data set selection, molecular descriptors generation, Multiple linear regression statistical analysis, as well as model validation techniques. Moreover, to further test how applicable the model is, the predicted data was plotted against the experimental data of compounds. The TSAR simulation software was the tool we relied on to perform all regression analysis. The following statistical parameters served as a testing tool for predicting the efficiency of the obtained model: correlation coefficient (r), coefficient of determination (r2), standard deviation (s), and null hypothesis test (F-test). The cross-validation coefficient (rcv2) was crucial for testing the model’s validity via relying on the “leave-one-out” feature in the software Moby Digs for Windows13. Otherapproaches such as Artificial neural networks14affirmed stability and prediction accuracy.
Experimental Data:
The recognized experimental values of pIGC50 (Table I) represent a distinctly dissimilar data set that is made up of 48 aromatic amines compounds. It was taken from literature on the subject matter15.
Calculation of descriptors:
Through HYPERCHEM Software(2002), each compound’s chemical structure is outlined in a computer 16, and then, it is pre-optimized via the MM+ molecular mechanics method (Polak-Ribiere algorithm). In order to find the conclusive geometries of minimum energy conformation, the PM3 method—semi-empirical—was used at a restricted Hartree-Fock level, plus no configuration interaction, and while a gradient norm limit of 0.001 kcal Å−1 mol−1 is applied as a stopping criterion. After that, the geometries were used as input for the generation of 1664 descriptors via Dragon software (Version 5.4)17.
Kennard and Stones algorithm:
One of the most commonly used algorithms for separating a given data sets into two distinguishable subsets is the Kennard and Stones algorithm. In essence, at first it attempts to detect two samples on the basis of the farthermost input variables from one another. The samples are then taken out from the original set of data, and placed into a calibration set. Rinse and repeat until the sample number selected in the calibration set proves satisfactory. The scope of these calibration samples is always within the measured region of the overall input variable space. It also is compatible with the induced metric, and the no validation samples are always placed outside the score of the measured region. These are the main advantages of this particular algorithm, and that is why it is deemed as one of the most important methods when it comes to build training and test sets. 18,19
Descriptor selection:
By relying on the Ordinary Least Square Regression method and the GA-variable subset selection 13, we conducted both the MLR analysis and variable selection via Todeschini et al’ Moby Digs software. GAs result in a population of one hundred regression models that are organized in accordance with decreasing internal predictive performance. As they are examined through Q2, the lower the latter is, the fewer the descriptors.
a) Electrotopological descriptor:
Kier and Hall introduced an entirely new paradigm, and it was reexamined lately. This paradigm is called the electrotopological state. It is meant to represent both the electronic and topological properties of the atom12. Together with the perturbation exercised by other atoms in the same molecule, the intrinsic value of an atom—in its valence state—represent its E-state. The Kier-Hall electronegativity model constitutes the basis for an atom’s intrinsic state. It is deduced from the electro negativity to skeletal bonds o ratio.
(1)
and
are the molecularconnectivity
values. They are given as follows (for
first row atoms)
(2)
With
=
-, h = skeleton connections number(2)
where
is electrons number in T orbitals; h is
the number of atom-bonded hydrogenatoms.
represents valence electrons number;
, on the other hand, is the number of
electrons in
orbitals;
is the lone pair number of electrons; N
is the principal quantum number of the valence shell for that atom. We
can say that
-
isequal to the numberof
and lone pair electrons. It was shown by
Kier and Hall to be proportional to the valence state electronegativity
b) Estimation of Octanol/Water partition coefficient
AlogP (Atomic increment system-based Ghose and Cripen model)19
When it comes to fragments being defined on an exclusively atomic level, several models dealing with that can be found in the literature. The advantage of such models lies in the reduced complexity of both fragment recognition and calculation—correction substructures unapplied (see Eq. (1)). As for the most common atomic increment system, AlogP, it originates in the works of Ghose and Crippen20. In this system, the neighboring environment of atoms is what classifies them. For carbon atoms, it is also their hybridization. The estimated logP for a given compound is as follows:
Alogp = ∑ni ai (3)
ni stands for ith atom type occurrence, and ai is the hydrophobicity constant. In the DRAGON software, the AlogP model at hand was assessed through a group of 2648 compounds—known experimental logP come from the NCI open data base—the resulting correlation coefficient r was equal to 0.915.
Chemometric Methods. (ANN-MLR):
a)MLR:
MLR can be defined as a statistical tool that functions by regressing independent variables against a dependent variable. MLR’s purpose is determining the property of interest’s linear model. It is expressed as:
y = a0 +𝛴ai xi (4)
y stands for the property, i.e. the dependent variable; xi, on the other hand, represents the molecular descriptors; and ai descriptor coefficient, while a0 is the symbol for the equation intercept.
b)ANN:
Is it possible that the human brain and its functions are being simulated and modelled by artificial systems? Such a question lies at the core principle of ANN. The latter is an approach that is comprised of the following elements: Processing elements, i.e. nodes, connection topology between nodes, and the acquisition of the principle that govern the encoding of new information into the network. There are certainly several models of ANN, nevertheless, the most used in QSAR is represented by the three-layered feed-forward network14. In this particular type, neurons are placed into an input, a hidden layer, and an output layer. Any given neuron in any given layer is characterized by a connection with the neuron of the layer that comes after it. On the other hand, connection between same-layer neurons is inexistent.
Model development and validation:
We used the features of Ordinary Least Squares Regression (OLS) and GA-VSS (GA for variable subset selection) in the software packing Moby Digs for Windows 13 to conduct both the multiple linear regression analysis and the variable selection. The extent to which the model is functioning was tested via the multiple determination coefficient, the R2, as well as the standard deviation error in calculation (SDEC). It is penned as follows:
(5)
Assessing internal prediction levels (Q2LMOcross-validation, bootstrap) and model robustness model (Q2LOO cross-validation, Y-scrambling) can be achieved through cross validation techniques.
Cross validation via the LOO (leave-one-out) technique relies on an n training set of n-1 objects that are included in the test set, and on the process of predicting the excluded ones. The cross validated explained Q2LOO is defined as:
=1- (6)
With y I and
representing
respectively the measured and averaged (over the entire data set) values of the
dependent variable ; y ii denotes the response of the i-th
estimated by using a model without the i-th object; the summations
run over all compounds in the training set. The dispersion of predicted values
is measured via the PRESS (predictive residual sum of squares). It defines Q2,
and SDEP.
(7)
A value Q2> 0.5 is generally regarded as a good result and Q2> 0.9 as excellent 21.
Several works, however, proved that despite Q2 being essential for a high level of prediction in a model, it still is not enough.
When it comes to the method of bootstrap validation K, the generation of n-dimensional groups is conducted via a selection of n-objects—repeated randomly—from original data set. The first selected objects provide a first model that allows for the prediction of excluded sample. Only then is Q2 is measured for each and every model. The process of bootstrapping is conducted for 8000 times for each valid model. Through the chosen model 22, test objects response values are calculated; prediction quality is determined in terms of Q2ext23:
(8)
next being a representative of the total of objects belonging to the set left out by bootstrap, i.e. the external set. While ntr stands for the number of training set objects.
R² is also a very useful factor for validating chemicals via implementing the resulting model on the training set. An equally important one is the SPEDext (external standard deviation error of prediction) and it is penned as follows:
(9)
The sum summarizes the total of the set objects of the test (n)ext.
The following equation determines the external R²CVext for the test set:
(10)
and
constitute,
respectively, the observed and calculated response values. While
tra
represents the training set’s averaged value for the response
variable; the total sums up the test set.
QSAR AD (Applicability Domain):
The Williams plot of jackknife residuals vs the leverages, i.e. diagonal values (hi), was utilized to examine the Applicability Domain. Jackknife residuals—studentized residuals—can be defined as standardized and cross-validated. Each residual’s standard deviation is calculated with no reference to the i-th observation. Afterwards, each residual is divided by its corresponding SD. The leverage (hi) is the chemical’s value in the original variable space; it is as follows:
hi= xi (X T X)-1xiT (i=1,…..,n) (11)
The query compound’s descriptor row-vector is expressed by xi. For n training set compounds, the n (p+1) matrix of p model parameter values is represented by X. The matrix/vector’s transpose is denoted by the superscript T, while (3p + 1)/n is what defines the warning leverage value of (h*)24. Furthermore, when the value h of a compound is lower than h *, the predicted and actual values are characterized by a likelihood of compatibility equal to that of training set compounds. For instance, a given chemical with an h i> h* might support the model, if part of the training set. Nevertheless, it still remains unreliable when it comes to predicted data and the associated validation set. It could, in addition, not seem as an outlier due to the residual related to it being low. That is the reason why for a reliable portrayal of the AD, the leverage and the jackknife residual ought to be combined.
Permutation Test:
For the purpose of evaluating how reliable is the model training process, we conducted a randomization test. Selecting features and building predictive models were both conducted through the use of 75 randomly-labeled samples. As a deciding index, we used the models’ training set prediction accuracy, so that the level of performance of both the integrated model and the Y-randomization model can be compared to the five leading molecular descriptors and a signature gene 2526.
RESULTS AND DISCUSSION:
Applying the GA-VSS resulted in various reliable models related to the prediction that is based on several sets of molecular descriptors. The best possible model came as a result of using the electrotopological descriptor. In Table 1, the partition coefficient data of descriptor value and aquatic toxicity are listed.
Table1. Compounds and predicted results of the biological activity IGC50
|
|
Y |
X1=Alogp |
X2=ET |
ANN (3-5-1) |
MLR-LOO (2D) |
|
|
1 |
2-Ethylaniline |
-0.25 |
1.89 |
2.1029 |
-0.0939671 |
0.00548834 |
|
2 |
2-Propylaniline |
0.06 |
2.43 |
2.1592 |
0.297784 |
0.398934 |
|
3 |
2-Isopropylaniline |
0.10 |
2.27 |
4.2891 |
0.0104779 |
0.0575593 |
|
4 |
2-Phenylaniline |
0.86 |
2.84 |
0 |
0.85433 |
0.902153 |
|
5 |
2-Fluoroaniline |
-0.31 |
1.15 |
0 |
-0.356635 |
-0.355481 |
|
6 |
2-Chlorolaniline |
-0.09 |
1.59 |
0 |
-0.145657 |
-0.0175844 |
|
7 |
2-Bromoaniline |
0.46 |
1.79 |
0 |
-0.0209466 |
0.0951579 |
|
8 |
2-Iodoaniline |
0.35 |
2.09 |
0 |
0.199351 |
0.345024 |
|
9 |
3-Ethylaniline |
-0.12 |
1.84 |
2.1198 |
-0.12489 |
-0.0410103 |
|
10 |
3-Phénylaniline |
0.78 |
2.67 |
0 |
0.702071 |
0.773676 |
|
11 |
3-Fluoroaniline |
0.04 |
1.37 |
0 |
-0.261625 |
-0.214968 |
|
12 |
3-Chloroaniline |
0.09 |
1.94 |
0 |
0.0844917 |
0.245315 |
|
13 |
3-Iodoaniline |
0.61 |
2.37 |
0 |
0.434101 |
0.548842 |
|
14 |
4-Méthylaniline |
-0.02 |
1.38 |
2.0411 |
-0.343727 |
-0.407787 |
|
15 |
4-Propylaniline |
0.49 |
2.38 |
2.1769 |
0.254551 |
0.343707 |
|
16 |
4-Isopropylaniline |
0.21 |
2.26 |
4.3443 |
7.40E-05 |
0.0268159 |
|
17 |
4-Butylaniline |
1.05 |
2.88 |
2.2057 |
0.697883 |
0.700846 |
|
18 |
4-Pentylaniline |
1.67 |
3.68 |
2.2252 |
1.38809 |
1.26643 |
|
19 |
4-Hexylaniline |
2.04 |
4.21 |
2.2393 |
1.72414 |
1.63012 |
|
20 |
4-Octylaniline |
2.34 |
5.27 |
2.25 |
2.10909 |
2.55298 |
|
21 |
2,4-diméthylaniline |
-0.30 |
1.96 |
4.0781 |
-0.164913 |
-0.125714 |
|
22 |
2,6-diméthylaniline |
-0.43 |
1.88 |
4.0324 |
-0.20433 |
-0.170498 |
|
23 |
3,4-diméthylaniline |
-0.29 |
1.88 |
4.1394 |
-0.209397 |
-0.19994 |
|
24 |
3,5-diméthylaniline |
-0.37 |
1.90 |
4.09 |
-0.196955 |
-0.167349 |
|
25 |
2,4-dichloroaniline |
0.56 |
2.52 |
0 |
0.56709 |
0.669191 |
|
26 |
2,6-dichloroaniline |
0.33 |
2.29 |
0 |
0.364833 |
0.503171 |
|
27 |
3,4-dichloroaniline |
1.14 |
2.89 |
0 |
0.898491 |
0.92366 |
|
28 |
3,5-dichloroaniline |
0.71 |
2.98 |
0 |
0.976852 |
1.0228 |
|
29 |
2-chlorol-6-méthylaniline |
0.12 |
2.08 |
1.9318 |
0.0428721 |
0.151938 |
|
30 |
3-chlorol-2-méthylaniline |
0.45 |
2.43 |
1.9005 |
0.320031 |
0.4113 |
|
31 |
4-chloro-2-méhtylaniline |
0.35 |
2.32 |
1.9318 |
0.225811 |
0.327434 |
|
32 |
2,3,4-trichloroanilinel |
1.35 |
3.56 |
0 |
1.42571 |
1.44058 |
|
33 |
2,4,6-trichloroaniline |
1.01 |
3.21 |
0 |
1.16808 |
1.18599 |
|
34 |
3,4,5-trichloroaniline |
1.51 |
3.91 |
0 |
1.63719 |
1.71746 |
|
35 |
2,6-dichlorol-3-méthylaniline |
0.69 |
2.78 |
1.884 |
0.634712 |
0.672164 |
|
36 |
2-Methylaniline |
-0.55 |
1.39 |
1.9954 |
-0.426605 |
-0.269964 |
|
37 |
3-Methylaniline |
-0.43 |
1.40 |
2.0242 |
-0.423541 |
-0.314015 |
|
38 |
3-Bromoaniline |
0.52 |
2.07 |
0 |
0.465436 |
0.395733 |
|
39 |
4-Ethylaniline |
0.04 |
1.88 |
2.1305 |
-0.0634685 |
-0.0288554 |
|
40 |
4-Phénylaniline |
0.95 |
2.64 |
0 |
0.88294 |
0.813623 |
|
41 |
2-5-diméthylaniline |
-0.35 |
1.89 |
4.0462 |
-0.3253 |
-0.251059 |
|
42 |
2,3-dichloroaniline |
1.02 |
2.63 |
0 |
0.877406 |
0.790005 |
|
43 |
2,5-dichloroaniline |
0.58 |
2.63 |
0 |
0.877406 |
0.884344 |
|
44 |
2-chloro-4-méthylaniline |
0.24 |
2.07 |
1.9775 |
0.145656 |
0.125929 |
|
45 |
2-iodo-5-méthylaniline |
0.20 |
2.09 |
1.9789 |
0.165235 |
0.145551 |
|
46 |
3-chloro-4-méthylaniline |
0.45 |
2.42 |
1.9462 |
0.486763 |
0.38 |
|
47 |
3-chloro-2-méthylaniline |
0.51 |
2.43 |
1.9501 |
0.495109 |
0.377357 |
|
48 |
2,4,5-trichloroaniline |
1.30 |
3.56 |
0 |
1.19252 |
1.69921 |
Table 2. Regression line of the MLR model.
|
R2 |
Q2 |
Q2boot |
Q2ext |
R2adj |
LOF |
AIC |
SDEC |
SDEP |
SDEPboot |
SDEPext |
Kx |
Kxy |
|
92.18 |
90.51 |
89.26 |
95.26 |
91.69 |
0.046 |
0.047 |
0.191 |
0.21 |
15.61 |
0.149 |
7.56 |
50.1 |
|
F |
SE |
d.f. |
RSS |
PRESS |
TSS |
AVH |
DK |
DQ |
Rp |
Rn |
||
|
188.5 |
0.2 |
32 |
1.274 |
1.546 |
16.29 |
0.086 |
0.425 |
-0 |
0.053 |
-0.199 |
MLR results:
The linear function that results from the two molecular descriptors and the training set can be penned in the following form:
Y = - 1.20 + 0.739 X1 - 0.0969 X2 (12)
The statistical parameters of the QSAR models obtained using two molecular descriptors to predict aquatic toxicity are listed as follows:
From Figure 01, we can see that the calculated activity values and their experimental counterparts are for the most part similar.
Figure 1. Log (1/IGC) observed experimentally vs log (1/IGC) predicted by MLR
Applicability Domains of MLR:
Despite its potential, importance, and level of validation, a QSAR model cannot entirely and accurately anticipate the modeled attributes of all compounds. Consequently, we can say that prior to deploying a QSAR model for compound screening, its AD ought to be determined. Only the predictions of compounds parts of said AD can be deemed valid and reliable. The Williams Plot (Figure 02) was used to analyze the MLR model’s AD. Only one outlier with a leverage higher than warning limit 0.257 and that is compound 20. Not a single outlier with a residual more than ±3’s exists in the training set. If these two are removed, the Q2 between the experimental values of pIGC50 and the selected descriptors can be improved.
Figure2. Williams plot of the current QSAR model
Y-randomization test:
R2 and Q2 low values when it comes to the order model point towards the selected model not being the result of mere chance.
Figure 1.Randomization testassociated to theQSAR model
ANN:
Of all ANN architectures, the 3-5-1 is the soundest 1 (Q2 = 94.79, RMSE= 0.16, Q2ext= 91.71, RMSEext= 0.18). The results of MLR and ANN indicate that the model is stable, significant, and enjoys a great deal of robustness.
CONCLUSION:
Our work consists of a new modeling method that was significantly better when it comes to performance. This was achieved through integrating molecular descriptors and biological data.
A substantially useful QSAR equation was developed. It was derived from several theoretical chemical descriptors that are linked with the aquatic toxicity characteristics of forty-eight amines.
1664 descriptors and 20 classes of DRAGON descriptors were calculated for each and every compound. Data was split into training and tests in a measured manner. This assured molecular diversity in every subset thanks to the Kennard and Stones Algorithm. The most sensible set of descriptors was then chosen via the GA feature of the Moby Digs software, giving us a model of high quality statistically, and with low prediction errors.
Overall, for this particular data set, combining linear modeling methods proved to be a net improvement for said models. The results clearly demonstrate that the two selected descriptors are major contributors to the aquatic toxicity of amines structures.
CONFLICT OF INTEREST:
The authors declare no conflict of interest in this reported work.
REFERENCES:
1. Cassotti M, Ballabio D, Todeschini R, Consonni V. A similarity-based QSAR model for predicting acute toxicity towards the fathead minnow (Pimephales promelas) SAR QSAR Environ Res. 2015; 26(3):217-243. doi:10.1080/1062936X.2015.1018938
2. Rewatkar P V, Kokil GR. QSAR Studies of Novel 1-and 8-Substituted-3-Furfuryl Xanthines: An Adenosine Receptor Antagonist Asian J Res Chem. 2010; 3(2):416-420
3. Dwivedi SD, Bharadwaj A, Shrivastava A. Application of Topological Descriptor: QSAR Study of Chalcone Derivatives Asian J Res Chem. 2010; 3(4):1030-1034
4. Bhardwaj S, Dubey S. Qsar and Docking Studies of Some Novel Piperine Analogues as Monoamine Oxidase Inhibitors 2022
5. Dubey S, Bhardwaj S, Parbhakaran P, Singh E. In silico Prediction of Pyrazoline Derivatives as Antimalarial agents Asian J Pharm Res. 2022; 12(2):119-124
6. Madhawai K, Rishipathak D, Chhajed S, Kshirsagar S. Predicting the Anti-Inflammatory Activity of Novel 5-Phenylsulfamoyl-2-(2-Nitroxy)(Acetoxy) Benzoic acid derivatives using 2D and 3D-QSAR (kNN-MFA) Analysis Asian J Res Pharm Sci. 2017; 7(4):227-234
7. Dixit S, Sikarwar AK. Modeling of Acute Toxicity of Phenol Derivatives using Computational Methods Asian J Res Chem. 2017; 10(5):626-628
8. Dighe NS, Shinde P, Anap H, Bhawar S, Musmade DS. QSAR Study and Synthesis of some new 2, 5-disubstituted 1, 3, 4-oxadiazole derivatives as Anti-microbial and Anti-inflammatory Agents Asian J Pharm Res. 2014; 4(4):174-179
9. Ganatra SH, Patle MR, Bhagat GK. Studies of Quantitative Structure-Activity Relationship (QSAR) of Hydantoin Based Active Anti-Cancer Drugs Tc. 2011; 1(2.124):8-9351
10. Dutta S, Banik S, Sutradhar S, Dubey S, Sharma I. 4D-QSAR: New Perspectives in Drug Design Asian J Res Chem. 2011; 4(6):857-862
11. Sherikar AS. Quantitative Structure Activity Relationship of Some Substituted 2-Aminopyridines and Fused Bicyclic Rings as inhibitors of Nitric Oxide Synthases Asian J Res Chem. 2011; 4(10):1625-1629
12. Consonni V, Todeschini R. Molecular Descriptors for Chemoinformatics: Volume I: Alphabetical Listing/Volume II: Appendices, References John Wiley & Sons; 2009
13. Todeschini R, Ballabio D, Consonni V, Mauri A, Pavan M. MOBYDIGS, Software for Multilinear Regression Analysis and Variable Subset Selection by Genetic Algorithm. Release 1.1 for windows 2009
14. Gasteiger J, Zupan J. Neural Networks in Chemistry Angew Chemie Int Ed English. 1993; 32(4):503-527. doi:https://doi.org/10.1002/anie.199305031
15. Schultz TW, Lin DT, Wilke TS, Arnold LM. Quantitative structure-activity relationships for the Tetrahymena pyriformis population growth endpoint: a mechanism of action approach Pract Appl Quant Struct relationships Environ Chem Toxicol. 1990:241-262
16. Laxmi D, Priyadarshy S. HyperChem 6.03 Biotech Softw Internet Rep Comput Softw J Sci. 2002; 3(1):5-9
17. Todeschini R, Consonni V, Mauri A, Pavan M. DRAGON; Version 5.3 Talete SRL Milan, IT. 2005
18. Tropsha A, Gramatica P, Gombar VK. The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models QSAR Comb Sci. 2003; 22(1):69-77. doi:https://doi.org/10.1002/qsar.200390007
19. Ziani N, Amirat K, Messadi D. Inhibition of Tetrahymena pyriformis growth by Aliphatic Alcohols and Amines: a QSAR Study Synthèse Rev des Sci la Technol. 2014; 29:51-58
20. Ziani N, Amirat K, Messadi D. Chemometric modeling to predict aquatic toxicity of benzene derivatives in Pimephales Promelas Chafic-Thomas Salame P, ed. Manag Environ Qual An Int J. 2016; 27(3):299-312. doi:10.1108/MEQ-05-2015-0082
21. Bouaoune A, Lourici L, Haddag H, Messadi D. Inhibition of Microbial Growth by anilines: A QSAR study J Environ Sci Eng A. 2012; 1(5A)
22. Zine M, Bouakkadia A, Lourici L, Messadi D. Quantitative structure-property relationship studies for the prediction of the vapor pressure of volatile organic compounds J Serbian Chem Soc. 2019; 84(12):1405-1414. doi:10.2298/JSC190306059Z
23. Kiralj R, Ferreira MMC. Basic validation procedures for regression models in QSAR and QSPR studies: theory and application J Braz Chem Soc. 2009; 20(4):770-787. doi:10.1590/S0103-50532009000400021
24. Eriksson L, Jaworska J, Worth AP, Cronin MTD, McDowell RM, Gramatica P. Methods for reliability and uncertainty assessment and for applicability evaluations of classification- and regression-based QSARs. Environ Health Perspect. 2003; 111(10):1361-1375. doi:10.1289/ehp.5758
25. Shen M, Béguin C, Golbraikh A, Stables JP, Kohn H, Tropsha A. Application of Predictive QSAR Models to Database Mining: Identification and Experimental Validation of Novel Anticonvulsant Compounds J Med Chem. 2004; 47(9):2356-2364. doi:10.1021/jm030584q
26. Sharma BK, Singh P, Pilania P, Sarbhai K, Prabhakar YS. CP-MLR/PLS directed QSAR study on apical sodium-codependent bile acid transporter inhibition activity of benzothiepines Mol Divers. 2011; 15(1):135-147. doi:10.1007/s11030-009-9220-2
Received on 08.08.2022 Modified on 10.10.2022
Accepted on 15.11.2022 ©AJRC All right reserved
Asian J. Research Chem. 2023; 16(1):1-7.