awesome-feature-engineering
github.com/aikho/awesome-feature-engineering ↗A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
599
GitHub Stars
68
Curated Resources
6
Categories
20 min ago
Last Refreshed
Numeric DataTextual DataImage DataCategorical DataTime Series DataGeospatial Data
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me mean encoding resources from awesome-feature-engineering"
Installation instructions →What's inside
Categorical Data
- Adding variance column when mean encodingMean Encoding
- Dummy Coding: The how and whyDummy Encoding
- Feature engineering: Count encodingCount Encoding
- Feature engineering: Label encodingLabel Encoding
- Feature hashing and Extraction in VowpalWabbitHashing
- Feature hashing in scikit-learnHashing
Textual Data
- A Gentle Introduction to the Bag-of-Words ModelBag of Words
- Bag-of-words modelBag of Words
- ClearTK - Feature Extraction TutorialPattern Features
Feature Extraction Tutorial
- Do Pretrained Embeddings Give You The Extra Edge?Word Embeddings
- fastTextWord Embeddings
- Gensim: models.word2vec – Word2vec embeddingsWord Embeddings
Image Data
- A Python wrapper for Google TesseractOCR Features
- Feature extraction and similar image search with OpenCV for newbiesComputer Vision Algorithm Features
- ImageStat Module -- PillowImage Statistics Features
- Keras pre-trained models feature extractionDeep Learning Features
- OpenCV -- Feature Detection and DescriptionComputer Vision Algorithm Features
- Scikit-image feature moduleComputer Vision Algorithm Features
Time Series Data
Numeric Data
- Automatic feature extraction with t-SNEt-SNE Features
- Bucketing Continuous Variables in pandasQuantization and Binning
- Data BinningQuantization and Binning
- FeaturetoolsFeature Interactions
- How to create New Features using Clustering!!Clustering Features
- pandas.catQuantization and Binning
Showing a sample of 68 resources. View the full list on GitHub →