For our vacation this year, we picked Vietnam: Christmas in the touristy region of Mui Ne, then New Year in Saigon (aka Ho Chi Minh City ). This post is a short travel log with pictures, highlig...
https://radimrehurek.com/2016/02/christmas-in-vietnam-mui-ne-and-saigon/
One year ago, Tomáš Mikolov (together with his colleagues at Google) made some ripples by releasing word2vec , an unsupervised algorithm for learning the meaning behind words. In this blog pos...
The latest gensim release of 0.10.3 has a new class named Doc2Vec . All credit for this class, which is an implementation of Quoc Le & Tomáš Mikolov: “Distributed Representations of Sentenc...
Latent Dirichlet Allocation (LDA), one of the most used modules in gensim, has received a major performance revamp recently. Using all your machine cores at once now, chances are the new LdaMul...
https://radimrehurek.com/2014/09/multicore-lda-in-python-from-over-night-to-over-lunch/
Now that I have a blog, I figured I could start posting more info about our travels. So here’s a little digest from one of our recent trips. I’m hoping it will be useful to other tourists lo...
There are tools and concepts in computing that are very powerful but potentially confusing to novices. One such concept is DATA STREAMING (aka lazy evaluation), which can be realized neatly and ...
https://radimrehurek.com/2014/03/data-streaming-in-python-generators-iterators-iterables/
MALLET, “MAchine Learning for LanguagE Toolkit” is a brilliant software tool. Unlike gensim, “topic modelling for humans” , which uses Python, MALLET is written in Java and spells “to...
https://radimrehurek.com/2014/03/tutorial-on-mallet-in-python/
I never got round to writing a tutorial on how to use word2vec in gensim. It’s simple enough and the API docs are straightforward, but I know some people prefer more verbose formats. Let this...
Previous posts explained the whys & whats of nearest-neighbour search, the available OSS libraries and Python wrappers. We converted the English Wikipedia to vector space, to be used as our t...
https://radimrehurek.com/2014/01/performance-shootout-of-nearest-neighbours-querying/
The end of the year is proving crazy busy as usual, but gensim acquired a cool new feature that I just had to blog about. Ben Trahan sent a patch that allows automatic tuning of Latent Dirichl...
https://radimrehurek.com/2013/12/python-lda-in-gensim-christmas-edition/