Word2vec

April 04, 2020

Page content

This is a draft.

Most important in NLP I think

In 2020, NLP is a study to try to understand human languages with computers. From my common sense, it is pragmatically impossible for human to “understand” our languages in numbers.

Yes, of course, scientism peopls could say “our world constitutes of quantum mechanical particles, and if we have enough machine power we can simulate a humam. So language is understandable with numbers.” But you should think once how many resource do you need to compute a human brain. This is why I said “pragmatically impossible.”

But this challenge is very very interesting for me! I started to lean NLP because I want to see the development of this challenge. I’m not a specialist in this field at all, but in the very first step of the learning I recognized most important part of the challenge are laid in this word-number translations.

As of 2020 April, I’ve seen in the Internet that Word2Vec is a common way to do that so I start to investigate it.

The idea is very simple so I thought it was invented around more than 20 years ago, but according to the Wikipedia page, the idea was published in 201 the idea was published in 2013.