stan-cn-ner

An API wrapper based on Stanford Core NLP package for Simplified Chinese users

Лицензия

Лицензия

Группа

Группа

com.guokr
Идентификатор

Идентификатор

stan-cn-ner
Последняя версия

Последняя версия

0.0.6
Дата

Дата

Тип

Тип

jar
Описание

Описание

stan-cn-ner
An API wrapper based on Stanford Core NLP package for Simplified Chinese users
Ссылка на сайт

Ссылка на сайт

https://github.com/guokr/stan-cn-ner
Система контроля версий

Система контроля версий

https://github.com/guokr/stan-cn-ner

Скачать stan-cn-ner

Как подключить последнюю версию

<!-- https://jarcasting.com/artifacts/com.guokr/stan-cn-ner/ -->
<dependency>
    <groupId>com.guokr</groupId>
    <artifactId>stan-cn-ner</artifactId>
    <version>0.0.6</version>
</dependency>
// https://jarcasting.com/artifacts/com.guokr/stan-cn-ner/
implementation 'com.guokr:stan-cn-ner:0.0.6'
// https://jarcasting.com/artifacts/com.guokr/stan-cn-ner/
implementation ("com.guokr:stan-cn-ner:0.0.6")
'com.guokr:stan-cn-ner:jar:0.0.6'
<dependency org="com.guokr" name="stan-cn-ner" rev="0.0.6">
  <artifact name="stan-cn-ner" type="jar" />
</dependency>
@Grapes(
@Grab(group='com.guokr', module='stan-cn-ner', version='0.0.6')
)
libraryDependencies += "com.guokr" % "stan-cn-ner" % "0.0.6"
[com.guokr/stan-cn-ner "0.0.6"]

Зависимости

compile (2)

Идентификатор библиотеки Тип Версия
com.guokr : stan-cn-com jar 0.0.6
com.guokr : stan-cn-seg jar 0.0.6

test (1)

Идентификатор библиотеки Тип Версия
junit : junit jar 4.11

Модули Проекта

Данный проект не имеет модулей.

stan-cn-ner

stan-cn-* family are wrappers based on Stanford CoreNLP for the convenience of Chinese users. This package provide a Chinese naming entity recognizor.

stan-cn-* family are including:

  • stan-cn-com: Common code base
  • stan-cn-seg: Chinese segmentation and related data model
  • stan-cn-ner: Naming entity recognization and related data model
  • stan-cn-tag: POS tagging and related data model

Purpose of the package

The original Stanford CoreNLP packages with default language settings in Maven central is only for English. If you are dealing with simplified Chinese, you still need to download the Chinese model and fix some configuration files.

The burden is not too much, but if you deploy these packages to a server cluster, this burden might be amplified.

Whatever you face a single node or a server farm, it would be a pleasurable solution to provide packages with default settings of Chinese language models. That is what we do.

Comments, reviews, bug reports and patches are welcomed.

Current version

Current version is 0.0.6 and based on Stanford CoreNLP 3.4 with minor fixes.

including below dependency:

  • maven:
    <dependency>
        <groupId>com.guokr</groupId>
        <artifactId>stan-cn-ner</artifactId>
        <version>0.0.6</version>
    </dependency>
  • leiningen:
    [com.guokr/stan-cn-ner "0.0.6"]
  • sbt:
    libraryDependencies += "com.guokr" % "stan-cn-ner" % "0.0.6"

Simplified API

We use a very simple API to reduce the complexity.

    new NerWrapper(settings).recognize(text);

Or if you want to use the default language models, just use

    __NER__.INSTANCE.recognize(text);

Preparation for release

Before release this package to maven central, please execute below commands:

  • mvn clean source:jar javadoc:jar package
  • export MAVEN_OPTS=-Xmx2048m
  • mvn release:clean
  • mvn release:prepare
  • mvn release:perform

Authors

License

GPLv2, just same as the license of Stanford CoreNLP package

com.guokr

Guokr.com

Версии библиотеки

Версия
0.0.6
0.0.4
0.0.3