Data scientists spend a huge amount of time on data pre-processing and transformation. It would be great (we thought back in the day) to gather the most frequently used data pre-processing techniques and transformations in a library, from which we could pick and choose the transformation that we need, and use it just like we would use any other sklearn class. This was the original vision for Feature-engine.

Feature-engine is an open source Python package originally designed to support the online course Feature Engineering for Machine Learning in Udemy, but has now gained popularity and supports transformations beyond those taught in the course. It was launched in 2017, and since then, several releases have appeared and a growing international community is beginning to lead the development.


The decision making process and governance structure of Feature-engine is laid out in the governance document.

Core contributors

The following people are currently core contributors to Feature-engine’s development and maintenance:

Soledad Galli

Chris Samiullah

Nicolas Galli


You can learn more about Feature-engine’s Contributors in the GitHub contributors page.

Citing Feature-engine

Coming soon