We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我想要在一个句子中,取出前3个使用频率最高的名词,或者说最重要的前三个词。单纯的分词实现不了。不知道jieba-php有分词功能么?
The text was updated successfully, but these errors were encountered:
@wolf8210137 有關鍵詞提取功能,使用 TF/IDF 演算法,請見 Readme 功能 3):關鍵詞提取
Sorry, something went wrong.
cut可以加一个返回完整的分词带idf和词性的数组的选项吗, 返回结果类似这样 array(21) { [0]=> array(2) { ["word"]=> string(3) "这" ["idf"]=> double(8) 1.22223333 ["tag"]=> string(1) "r" }.... } 用google的simhash算法做文章的相似度比较,需要文章的全部分词的权重,同时做情感分析需要分词的词性 也就是说,在cut的返回结果里把idf和词性也都带上就好了@fukuball
是可以花時間加上這樣的功能,看有沒有人要幫忙,或是等我有空 XD
No branches or pull requests
我想要在一个句子中,取出前3个使用频率最高的名词,或者说最重要的前三个词。单纯的分词实现不了。不知道jieba-php有分词功能么?
The text was updated successfully, but these errors were encountered: