An N-gram Language Model Library from UC Berkeley
2016-08-23
0 0 0
no vote
Other
Earn points
This project provides a library for estimating storing large n-gram language models in memory and accessing them efficiently. It is described in this paper. Its data structures are faster and smaller than SRILM and nearly as fast as KenLM despite being written in Java instead of C++. It also achieves the best published lossless encoding of the Google n-gram corpus.
See here for some documentation.
News
July 16, 2014: The project has been migrated to github. Any future updates will happen there.
December 6, 2014: Since Google has deprec
模型
语言
uc
NGram
伯克利
Related Source Codes
EE247 Analysis and design of analog-to-digital int
0
0
no vote
Language Translation: Chinese to English
0
0
no vote
Social force model
0
0
no vote
Matlab Foundation
0
0
no vote
Uc1638 driver
0
0
no vote
No comment