Comprehensive machine learning capabilities powered by scikit-learn, featuring 8+ algorithms for classification, regression, clustering, and dimensionality reduction.
Complete machine learning toolkit integrated into your workflows.
8+ ML algorithms from scikit-learn
Classification (Logistic Regression, Random Forest, SVM, KNN)
Regression (Linear, Random Forest, SVR)
Clustering (K-Means, Agglomerative)
Dimensionality reduction (PCA)
Model persistence (save/load)
Evaluation metrics and visualization
Cross-validation support
Comprehensive collection of machine learning algorithms from scikit-learn.
Linear model for binary and multi-class classification
Ensemble of decision trees with bagging
Maximum margin classifier with kernel methods
Instance-based learning algorithm
Simple linear model for continuous targets
Ensemble method for non-linear regression
Epsilon-insensitive loss regression
Partition-based clustering algorithm
Hierarchical bottom-up clustering
Principal Component Analysis for feature reduction
Comprehensive metrics for model evaluation and validation.
Overall correct predictions
Positive predictive value
True positive rate
Harmonic mean of precision and recall
Area under ROC curve
True vs predicted classes
Mean Squared Error
Root Mean Squared Error
Mean Absolute Error
Coefficient of determination
Variance explained by model
Cluster separation quality
Within-cluster sum of squares
Cluster similarity measure
Between/within cluster ratio
Typical machine learning workflows in computational chemistry.
Build predictive models for molecular properties
Binary classification for bioactivity
Group similar molecules