metafacture-mediawiki

Provides Metafacture modules and auxiliary classes for processing wiki pages from Mediawikis such as Wikipedia

Group: org.culturegraph
Artifact ID: metafacture-mediawiki
Latest version: 4.0.0
Type: jar
Website: http://github.com/culturegraph/metafacture-mediawiki
Source control: http://github.com/culturegraph/metafacture-mediawiki

Download metafacture-mediawiki

How to add the latest version

Maven:

<!-- https://jarcasting.com/artifacts/org.culturegraph/metafacture-mediawiki/ -->
<dependency>
    <groupId>org.culturegraph</groupId>
    <artifactId>metafacture-mediawiki</artifactId>
    <version>4.0.0</version>
</dependency>

Gradle (Groovy DSL):

// https://jarcasting.com/artifacts/org.culturegraph/metafacture-mediawiki/
implementation 'org.culturegraph:metafacture-mediawiki:4.0.0'

Gradle (Kotlin DSL):

// https://jarcasting.com/artifacts/org.culturegraph/metafacture-mediawiki/
implementation ("org.culturegraph:metafacture-mediawiki:4.0.0")

Buildr:

'org.culturegraph:metafacture-mediawiki:jar:4.0.0'

Ivy:

<dependency org="org.culturegraph" name="metafacture-mediawiki" rev="4.0.0">
  <artifact name="metafacture-mediawiki" type="jar" />
</dependency>

Grape:

@Grapes(
@Grab(group='org.culturegraph', module='metafacture-mediawiki', version='4.0.0')
)

SBT:

libraryDependencies += "org.culturegraph" % "metafacture-mediawiki" % "4.0.0"

Leiningen:

[org.culturegraph/metafacture-mediawiki "4.0.0"]

Dependencies

compile (3)

Library identifier                     Type   Version
org.sweble.wikitext : swc-engine       jar    1.1.0
de.fau.cs.osr.ptk : ptk-json-tools     jar    1.1.0
org.slf4j : slf4j-api                  jar    1.7.21

provided (1)

Library identifier                     Type   Version
org.culturegraph : metafacture-core    jar    4.0.0

test (3)

Library identifier                     Type   Version
junit : junit                          jar    4.12
org.mockito : mockito-core             jar    2.6.2
org.slf4j : slf4j-simple               jar    1.7.21

Project Modules

This project has no modules.

Metafacture-Mediawiki

Data extraction from MediaWiki pages made easy.

About Metafacture-Mediawiki

Metafacture-Mediawiki is a plugin for Metafacture. It provides modules for extracting information from MediaWiki pages such as Wikipedia articles. Currently, modules for extracting links and templates exist. Adding new extraction modules is easy.

The plugin relies on the excellent Sweble wikitext parser for parsing wikitext into abstract syntax trees.

Key Features

  • Extracts basic metadata about pages from MediaWiki XML documents
  • Extracts simple information from wikitext using regular expressions (fast but not suitable for complex tasks)
  • Wraps the Sweble wikitext parser for conveniently parsing wikitext into an abstract syntax tree within a Flux flow
  • Extracts links and templates from abstract syntax trees created by Sweble and turns them into a Metafacture event stream
  • Makes writing additional extraction modules easy
  • Supports running multiple extraction modules hassle-free
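
To make the regular-expression approach mentioned in the feature list more concrete, here is a minimal sketch of pulling internal link targets out of raw wikitext. It uses only the Java standard library and does not use the plugin's own classes: the class name `WikiLinkSketch`, the method `extractLinkTargets`, and the pattern itself are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Illustrative only: a hypothetical helper, not a class from Metafacture-Mediawiki. */
public class WikiLinkSketch {

    // Matches [[Target]] and [[Target|label]]; group 1 captures the link target.
    // Labels containing brackets (nested links) are deliberately not matched.
    private static final Pattern LINK =
            Pattern.compile("\\[\\[([^\\[\\]|]+)(?:\\|[^\\[\\]]*)?\\]\\]");

    /** Returns the targets of all simple internal links found in the wikitext. */
    public static List<String> extractLinkTargets(String wikitext) {
        List<String> targets = new ArrayList<>();
        Matcher matcher = LINK.matcher(wikitext);
        while (matcher.find()) {
            targets.add(matcher.group(1).trim());
        }
        return targets;
    }

    public static void main(String[] args) {
        String wikitext = "[[Berlin]] is the capital of [[Germany|the Federal Republic]].";
        System.out.println(extractLinkTargets(wikitext)); // prints [Berlin, Germany]
    }
}
```

A pattern like this handles flat links quickly, but it skips the enclosing link as soon as constructs are nested (for example, a link inside an image caption). That is the kind of case where parsing into an abstract syntax tree is the more robust choice.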

Download and Install

Metafacture-Mediawiki can be used as a plugin in the Metafacture distribution or as a Java library in your own programs.

Plugin Usage

The plugin can be downloaded from the releases page. To use it, drop the plugin jar into the /plugins folder of the metafacture-runner.

Java Library Usage

Metafacture-Mediawiki is available on Maven Central. To use it, add the following dependency declaration to your pom.xml:

<dependency>
    <groupId>org.culturegraph</groupId>
    <artifactId>metafacture-mediawiki</artifactId>
    <version>4.0.0</version>
</dependency>

Additionally, you need to add the metafacture-core package as a dependency:

<dependency>
    <groupId>org.culturegraph</groupId>
    <artifactId>metafacture-core</artifactId>
    <version>4.0.0</version>
</dependency>

Our integration server automatically publishes successful builds of the master branch as snapshot versions to the Sonatype OSS repository.

Documentation

The documentation of Metafacture-Mediawiki can be found in the Wiki.

License

Copyright 2013, 2015 Deutsche Nationalbibliothek.

Metafacture-Mediawiki is distributed under the Apache 2.0 License.

Library Versions

Version
4.0.0
3.0.0
0.0.0