Rakuten Automated

mecab does not work on Ubuntu + apt? Then, source compile!

Summary

  • Mecab(japanese tokenizer) does not work correctly when one installed it with apt on Ubuntu
  • the solution is to compile from source.

Ubutn+apt to install mecab?

and then…. you’d like to know where dictionaries exist.

It does not exist!

So where is the dictionary?

so let’s find it!

okay… it exists under /user/share/doc. So, you look at the directory, however, you find that dictionary files do not exist again!

Solution? source compile!

now it works!

For your info

A Python package JapaneseTokenizer, which I developed, could enable following options.

If you could not refer mecab-config by ENV

If you could not refer dictionary path by ENV