You have survived, EVERY SINGLE bad day so far. Anonymous
Idea Transcript
Well, Technically
Feature Hashing, or the “hashing trick” Feature hashing, or the “hashing trick,” is a clever method of dimensionality reduction that uses some of the important aspects of a good hash function to do some otherwise heavy lifting in NLP. This is a good blog post with the fundamentals of how and why the hashing trick works when working with a large, sparse set of vectors: Hashing Language (http://blog.someben.com/2013/01/hashing-lang/) Feature hashing is an elegant solution to the otherwise hairy problem of fighting the curse of dimensionality. It turned out to be extremely useful for a project I’m currently working on for a course at Columbia: Computational Models of Social Meaning (http://www1.cs.columbia.edu/~smara/teaching/E6998/S15/). Scikit-Learn has an implementation of the hashing trick (http://scikit-learn.org/stable/modules/feature_extraction.html#vectorizing-a-large-text-corpus-with-the-hashing-trick) if you’d like to read more about it.
Report this ad
May 8, 2015
premgane
Report this ad curse of dimensionality, dimensionality reduction, feature hashing, hashing trick, nlp Blog at WordPress.com.