Related Projects¶
Below is a list of sister-projects, extensions and domain specific packages.
Related Packages¶
Other packages useful for data analysis and machine learning.
- Pandas Tools for working with heterogeneous and columnar data, relational queries, time series and basic statistics.
- sklearn_pandas bridge for scikit-learn pipelines and pandas data frame with dedicated transformers.
- Scikit-Learn Laboratory A command-line wrapper around scikit-learn that makes it easy to run machine learning experiments with multiple learners and large feature sets.
- theano A CPU/GPU array processing framework geared towards deep learning research.
- Statsmodel Estimating and analysing statistical models. More focused on statistical tests and less on prediction than scikit-learn.
- PyMC Bayesian statistical models and fitting algorithms.
- sklearn_theano scikit-learn compatible estimators, transformers, and datasets which use Theano internally
Extensions and Algorithms¶
Libraries that provide a scikit-learn like interface and can be used with scikit-learn tools.
- pylearn2 A deep learning and neural network library build on theano with scikit-learn like interface.
- lightning Fast state-of-the-art linear model solvers (SDCA, AdaGrad, SVRG, SAG, etc...).
- Seqlearn Sequence classification using HMMs or structured perceptron.
- HMMLearn Implementation of hidden markov models that was previously part of scikit-learn.
- PyStruct General conditional random fields and structured prediction.
- py-earth Multivariate adaptive regression splines
- sklearn-compiledtrees Generate a C++ implementation of the predict function for decision trees (and ensembles) trained by sklearn. Useful for latency-sensitive production environments.
- lda: Fast implementation of Latent Dirichlet Allocation in Cython.
- Sparse Filtering Unsupervised feature learning based on sparse-filtering
- Kernel Regression Implementation of Nadaraya-Watson kernel regression with automatic bandwidth selection
- gplearn Genetic Programming for symbolic regression tasks.
Domain Specific Packages¶
- scikit-image Image processing and computer vision in python.
- Natural language toolkit (nltk) Natual language processing and some machine learning.
- NiLearn Machine learning for neuro-imaging.
- AstroML Machine learning for astronomy.
- MSMBuilder Machine learning for protein conformational dynamics time series.