word embedding; keyword identification; natural language processing; category distribution