Skip to content

m00nlight/clj-bosonnlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

clj-bosonnlp

Clojars Project

clj-bosonnlp是Boson NLP的clojure封装。

Boson NLP提供了中文自然语言处理中,包括:

  • 情感分析(Sentiment Analysis)
  • 实体识别(Name Entity Recognition)
  • 依存句法(Dependent parser)
  • 关键词提取(Keywords extraction)
  • 新闻分类(News classification)
  • 语义联想(Semantic Words Suggestion)
  • 分词和词性(Segmentation and Postag of Chinese)

更详细的信息,请参见Boson官方文档

Usage

使用Leiningin的,在project.clj文件中加入:

[clj-bosonnlp "0.1.2"]

使用maven的,在pom.xml中加入:

<dependency>
  <groupId>clj-bosonnlp</groupId>
  <artifactId>clj-bosonnlp</artifactId>
  <version>0.1.2</version>
</dependency>

然后就可以在项目中使用了,下面的是在clojure repl中使用clj-bosonnlp的例子:

user=> (use '[clj-bosonnlp.core])
nil  
;; initialize with you api-token
user=> (initialize "<your-api-token>")
"<you-api-token>"
;; sentiment example 
user=> (sentiment ["这个世界好复杂", "计算机是科学么"]
[[0.17128982245610536 0.8287101775438946] \
[0.3028239178178842 0.6971760821821158]]
user=> (suggest "粉丝" 20)
[[0.9999999999999997 "粉丝/n"] [0.4860246796131101 "脑残粉/n"] \
[0.47638025976400966 "听众/n"] [0.4574711603743687 "球迷/n"] \
[0.44279396622121586 "观众/n"] [0.4399638841304087 "喷子/n"] \
[0.4370675116868156 "乐迷/n"] [0.4365171009654033 "鳗鱼/n"] \
[0.4357353461210972 "水军/n"] [0.43320908113367257 "好友/n"] \
[0.4321432244549219 "歌迷/n"] [0.4218593870538608 "影迷/n"] \
[0.4179423555308083 "前辈/n"] [0.4142211812540118 "网民/n"] \
[0.40556773652629086 "参赛者/n"] [0.40544885221034965 "博友/n"] \
[0.3976491020591731 "公知/n"] [0.3971053944003027 "支持者/n"] \
[0.3864395283882839 "选手/n"] [0.38543008430007086 "歌手/n"]]
user=> (tag ["这个世界好复杂", "计算机是科学么"]
[{"tag" ["DT" "M" "NN" "AD" "VA"], "word" ["" "" "世界" "" "复杂"]} \
{"tag" ["NN" "VC" "NN"