For users who are still using our previous feature selection repository implemented in Matlab, please find the old project webpage here


scikit-feature is an open-source feature selection repository in Python developed at Arizona State University. It is built upon one widely used machine learning package scikit-learn and two scientific computing packages Numpy and Scipy. scikit-feature contains around 40 popular feature selection algorithms, including traditional feature selection algorithms and some structural and streaming feature selection algorithms. It serves as a platform for facilitating feature selection application, research and comparative study. It is designed to share widely used feature selection algorithms developed in the feature selection research, and offer convenience for researchers and practitioners to perform empirical evaluation in developing new feature selection algorithms.

Installing scikit-feature



After you download, unzip the archive.

For Linux users, you can install the repository by the following command:

python install

For Windows users, you can also install the repository by the following command: install

Source code is available on GitHub.

For scikit-feature API usage, please refer scikit-feature feature selection repository API Document.

A brief introduction on how to perform feature selection with the scikit-feature repository scikit-feature feature selection tutorial.


If you find scikit-feature feature selection repository useful in your research, please consider cite the following paper [pdf] :

  title={Feature Selection: A Data Perspective},
  author={Li, Jundong and Cheng, Kewei and Wang, Suhang and Morstatter, Fred and Robert, Trevino and Tang, Jiliang and Liu, Huan},


Jundong Li


Kewei Cheng


Huan Liu