Feature Hashing, or the “hashing trick” | Well, Technically


Well, Technically

Feature Hashing, or the “hashing trick”

Feature hashing, or the “hashing trick,” is a clever method of dimensionality reduction. It relies on the key properties of a good hash function, namely that it is deterministic and spreads its inputs roughly evenly over a fixed range, to do some otherwise heavy lifting in NLP: each feature (a word, say) is hashed straight to a column index in a fixed-size vector, so no vocabulary has to be built or stored.

This is a good blog post with the fundamentals of how and why the hashing trick works when working with a large, sparse set of vectors: Hashing Language (http://blog.someben.com/2013/01/hashing-lang/)

Feature hashing is an elegant solution to the otherwise hairy problem of fighting the curse of dimensionality. It turned out to be extremely useful for a project I’m currently working on for a course at Columbia: Computational Models of Social Meaning (http://www1.cs.columbia.edu/~smara/teaching/E6998/S15/).

Scikit-Learn has an implementation of the hashing trick (http://scikit-learn.org/stable/modules/feature_extraction.html#vectorizing-a-large-text-corpus-with-the-hashing-trick) if you’d like to read more about it.
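To make the idea concrete, here is a minimal sketch of the hashing trick in plain Python. It is only an illustration under some assumptions of my own: the function name `hashed_features`, the choice of MD5 as the hash, and the default of 16 output dimensions are all hypothetical and are not taken from the linked post or from Scikit-Learn, whose implementation differs in its details.

```python
import hashlib


def hashed_features(tokens, n_features=16):
    """Map a list of tokens to a fixed-length vector via the hashing trick.

    Hypothetical helper for illustration only; MD5 is used simply because
    it is in the standard library and gives the same result on every run.
    """
    vec = [0] * n_features
    for tok in tokens:
        digest = hashlib.md5(tok.encode("utf-8")).digest()
        # The first 8 bytes of the digest choose the column for this token.
        index = int.from_bytes(digest[:8], "big") % n_features
        # One further bit chooses a sign, so that tokens colliding in the
        # same column tend to cancel rather than always pile up.
        sign = 1 if digest[8] & 1 == 0 else -1
        vec[index] += sign
    return vec


doc = "the quick brown fox jumps over the lazy dog".split()
print(hashed_features(doc))
```

However many distinct words the corpus contains, the output always has `n_features` entries. The price is that two different words can land in the same column; a larger `n_features` makes such collisions rarer.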


May 8, 2015

premgane



Tags: curse of dimensionality, dimensionality reduction, feature hashing, hashing trick, nlp
