Chinese segmentation tools using LSTM-CRF model in pyTorch

ChemJeff/ChineseSeg

Chinese Word Segmentation Based on a BiLSTM-CRF Model

Requirements

python==3.x

torch==0.4

pickle (part of the Python standard library)

tensorflow (optional, for TensorBoard logging)

Usage

Both the training data and the test data are in the ./data directory.

First, run data.py to preprocess the data. This generates vocab_tag.pkl, train_corpus.pkl, and test_corpus.pkl.
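This README does not describe the format that data.py produces. As an illustration only, assuming the raw corpus is whitespace-segmented text and that the tag set is the common B/M/E/S scheme (both are assumptions, not confirmed by this repository), turning one segmented sentence into per-character tags might look like the sketch below. The function name `sentence_to_tags` is hypothetical.

```python
def sentence_to_tags(segmented):
    """Convert a whitespace-segmented sentence into (characters, tags).

    Assumed scheme (not confirmed by the repository):
    B = first character of a multi-character word,
    M = middle character, E = last character,
    S = single-character word.
    """
    chars, tags = [], []
    for word in segmented.split():
        chars.extend(word)  # one entry per character
        if len(word) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(word) - 2) + ["E"])
    return chars, tags


chars, tags = sentence_to_tags("我 喜欢 自然语言处理")
for ch, tag in zip(chars, tags):
    print(ch, tag)
```

Under this scheme, segmentation becomes a sequence labeling task with one tag per character, which is the kind of input a BiLSTM-CRF model is trained on.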

Then run train.py directly to train the model.

Alternatively, use test.py to load a pretrained model and predict the segmentation.
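How test.py turns predicted tags back into words is not shown here. A minimal sketch of that last step, again assuming B/M/E/S tags (an assumption, not confirmed by this repository), is given below. The function name `tags_to_words` is hypothetical and is not part of test.py.

```python
def tags_to_words(chars, tags):
    """Rebuild words from characters and their predicted tags.

    Assumes B/M/E/S tags. A word is closed at every E or S tag;
    characters left over at the end (for example a sequence that
    stops on B or M) are emitted as a final word.
    """
    words, current = [], ""
    for ch, tag in zip(chars, tags):
        current += ch
        if tag in ("E", "S"):
            words.append(current)
            current = ""
    if current:  # tolerate a truncated tag sequence
        words.append(current)
    return words


print(" ".join(tags_to_words(list("我喜欢分词"), ["S", "B", "E", "B", "E"])))
```

Closing a word only on E or S keeps the decoder tolerant of tag sequences that are not strictly well-formed, at the cost of silently merging characters after a stray B.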
