NLP-test========### 自然语言处理实验    1. 正向逆向分词    2. 一元二元词频统计    3. 拼音流切分    4. HMM简易中文输入法### 目录结构    · seg.py         &nbs...">
Home » Source Code » chinese input method by python

chinese input method by python

songboyu123
2015-04-15 04:02:49
The author
View(s):
Download(s): 1
Point (s): 1 
Category Category:
AlgorithmAlgorithm PythonPython

Description

size:16px;">NLP-test
========

### 自然语言处理实验
    1. 正向逆向分词
    2. 一元二元词频统计
    3. 拼音流切分
    4. HMM简易中文输入法

### 目录结构
    · seg.py          分词脚本
    · ngram.py        统计一元词频、二元词频
    · common.py       字符串处理集(包括转换为unicode,全角转半角,半角转全角)
    · main.py         主程序入口
    
    · core/
            Graph.py                    有向图结构
            InputMethod.py              拼音串转汉字串
            Model.py                    加载语言模型



    · corpus/         96年人民日报语料
    
    · corpus_seg/     96年人民日报语料----已切分
            
    · freq/
            freq_select.py              选择高频词脚本
            word_freq.txt               一元词频表
            unigram_freq.txt            二元词频表
            unigram_freq_selected.txt   二元词频表----高频
            dict.txt                    [词/拼音]字典
            dict-selected.txt           [词/拼音]字典----高频
    
    · pinyin/
            pinyin_list.txt             有效汉语拼音列表
            Trie.py                     Trie树操作类
            pinyin.py                   拼音流切分脚本
    
    · seg_method/
            bwd_max.py                  逆向最大匹配分词算法
            fwd_max.py                  正向最大匹配分词算法
            
Sponsored links

File list

Tips: You can preview the content of files by clicking file names^_^
Name Size Date
01.97 kB
.gitattributes505.00 B2014-12-04|15:06
.gitignore1.36 kB2014-12-11|18:42
01.97 kB
.name8.00 B2014-12-04|15:26
encodings.xml171.00 B2014-12-04|15:26
01.97 kB
profiles_settings.xml241.00 B2014-12-05|10:53
Project_Default.xml1,003.00 B2014-12-05|12:36
misc.xml219.00 B2014-12-04|15:26
modules.xml279.00 B2014-12-04|15:26
NLP-test.iml355.00 B2014-12-11|14:51
01.97 kB
scope_settings.xml143.00 B2014-12-04|15:26
vcs.xml189.00 B2014-12-04|15:26
workspace.xml57.27 kB2014-12-27|11:54
common.py1.56 kB2014-12-11|19:34
common.pyc1.58 kB2014-12-11|19:40
01.97 kB
Graph.py1.62 kB2014-12-12|16:39
Graph.pyc1.82 kB2014-12-12|18:54
InputMethod.py2.88 kB2014-12-12|17:08
InputMethod.pyc2.76 kB2014-12-12|18:54
Model.py2.11 kB2014-12-12|15:43
Model.pyc2.61 kB2014-12-12|18:54
__init__.py21.00 B2014-12-07|15:19
__init__.pyc152.00 B2014-12-07|19:38
01.97 kB
01.97 kB
dict.txt1.27 MB2014-12-07|15:19
01.97 kB
dict_selected.txt1.14 MB2014-12-07|15:19
freq_select.py500.00 B2014-12-07|15:19
word_freq.txt575.58 kB2014-12-11|21:02
main.py917.00 B2014-12-11|18:26
ngram.py2.74 kB2014-12-12|15:43
01.97 kB
Pinyin.py2.66 kB2014-12-12|18:54
Pinyin.pyc1.98 kB2014-12-12|18:54
pinyin_list.txt2.07 kB2014-12-12|18:27
Trie.py2.35 kB2014-12-12|18:10
Trie.pyc2.95 kB2014-12-12|18:16
__init__.py21.00 B2014-12-06|17:25
__init__.pyc154.00 B2014-12-11|17:39
README.md1.51 kB2014-12-11|21:49
seg.py2.96 kB2014-12-12|15:43
01.97 kB
bwd_max.py1.22 kB2014-12-04|15:06
bwd_max.pyc1.31 kB2014-12-11|19:40
fwd_max.py2.12 kB2014-12-12|15:43
fwd_max.pyc2.00 kB2014-12-11|19:40
__init__.py21.00 B2014-12-04|15:06
__init__.pyc158.00 B2014-12-11|19:40
...
Sponsored links

Comments

(Add your comment, get 0.1 Point)
Minimum:15 words, Maximum:160 words
  • 1
  • Page 1
  • Total 1

chinese input method by python (1.34 MB)

Need 1 Point(s)
Your Point (s)

Your Point isn't enough.

Get 22 Point immediately by PayPal

Point will be added to your account automatically after the transaction.

More(Debit card / Credit card / PayPal Credit / Online Banking)

Submit your source codes. Get more Points

LOGIN

Don't have an account? Register now
Need any help?
Mail to: support@codeforge.com

切换到中文版?

CodeForge Chinese Version
CodeForge English Version

Where are you going?

^_^"Oops ...

Sorry!This guy is mysterious, its blog hasn't been opened, try another, please!
OK

Warm tip!

CodeForge to FavoriteFavorite by Ctrl+D