Simple pipeline parser for HTML

Simple parser for HTML, using the pipelines library. This is not intended to be a strict parser of HTML5; the main planned use is to help with "screen-scraping" of HTML websites. It may also find use as a tool for testing HTML generation.

Categories: Network
Group: net.pwall.html
Artifact ID: html-pipeline
Latest version: 0.1
Type: jar
Website: https://github.com/pwall567/html-pipeline
Version control: https://github.com/pwall567/html-pipeline

Download html-pipeline

How to add the latest version

Maven

    <!-- https://jarcasting.com/artifacts/net.pwall.html/html-pipeline/ -->
    <dependency>
        <groupId>net.pwall.html</groupId>
        <artifactId>html-pipeline</artifactId>
        <version>0.1</version>
    </dependency>

Gradle

    // https://jarcasting.com/artifacts/net.pwall.html/html-pipeline/
    implementation 'net.pwall.html:html-pipeline:0.1'

Gradle (kts)

    // https://jarcasting.com/artifacts/net.pwall.html/html-pipeline/
    implementation("net.pwall.html:html-pipeline:0.1")

Buildr

    'net.pwall.html:html-pipeline:jar:0.1'

Ivy

    <dependency org="net.pwall.html" name="html-pipeline" rev="0.1">
      <artifact name="html-pipeline" type="jar" />
    </dependency>

Grape

    @Grapes(
        @Grab(group='net.pwall.html', module='html-pipeline', version='0.1')
    )

SBT

    libraryDependencies += "net.pwall.html" % "html-pipeline" % "0.1"

Leiningen

    [net.pwall.html/html-pipeline "0.1"]

Dependencies

compile (3)

Library identifier                          Type  Version
net.pwall.util : pipelines                  jar   0.8
net.pwall.html : htmlutil                   jar   1.1
org.jetbrains.kotlin : kotlin-stdlib-jdk8   jar   1.3.50

test (2)

Library identifier                          Type  Version
org.jetbrains.kotlin : kotlin-test-junit    jar   1.3.50
net.pwall.dom : dom-kotlin                  jar   0.1.1

Project Modules

This project has no modules.

html-pipeline

Simple parser for HTML, using the pipelines library. This is not intended to be a strict parser of HTML5; the main planned use is to help with "screen-scraping" of HTML websites. It may also find use as a tool for testing HTML generation.

Quick Start

Create a pipeline that feeds data into an HTMLPipeline object. The result of the pipeline is an org.w3c.dom.Document object.

    val htmlPipeline = DecoderFactory.getDecoder(Charsets.UTF_8, HTMLPipeline()).apply {
        accept(inputStream)
    }
    val document = htmlPipeline.result
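
Since the result is a standard org.w3c.dom.Document, a screen-scraping step can work on it with the ordinary JDK DOM API. The following is only a sketch of what that might look like: extractLinks and sampleDocument are hypothetical helpers, not part of html-pipeline, and the sample document is built by hand so that the example runs without the library. Tag names are compared case-insensitively because this sketch makes no assumption about how the parser cases element names.

```kotlin
import javax.xml.parsers.DocumentBuilderFactory
import org.w3c.dom.Document
import org.w3c.dom.Element

// Hypothetical helper (not part of html-pipeline): collect the href
// attribute of every <a> element, in document order.
fun extractLinks(document: Document): List<String> {
    val all = document.getElementsByTagName("*") // "*" matches every element
    val links = mutableListOf<String>()
    for (i in 0 until all.length) {
        val element = all.item(i) as Element
        // Compare case-insensitively; anchors without href are skipped
        if (element.tagName.equals("a", ignoreCase = true) && element.hasAttribute("href"))
            links.add(element.getAttribute("href"))
    }
    return links
}

// Hypothetical stand-in for a parsed page, built with the JDK DOM API.
// In real use the Document would come from htmlPipeline.result instead.
fun sampleDocument(): Document {
    val doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument()
    val html = doc.createElement("html")
    val body = doc.createElement("body")
    doc.appendChild(html)
    html.appendChild(body)
    body.appendChild(doc.createElement("a").apply { setAttribute("href", "https://example.com/one") })
    body.appendChild(doc.createElement("A").apply { setAttribute("href", "/two") })
    body.appendChild(doc.createElement("a").apply { setAttribute("name", "top") })
    return doc
}

fun main() {
    println(extractLinks(sampleDocument()))
}
```

Scanning every element keeps the helper independent of element-name casing, at the cost of visiting nodes that are not links; for large pages a more targeted traversal may be preferable.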

Dependency Specification

The latest version of the library is 0.1, and it may be obtained from the Maven Central repository.

Maven

    <dependency>
      <groupId>net.pwall.html</groupId>
      <artifactId>html-pipeline</artifactId>
      <version>0.1</version>
    </dependency>

Gradle

    implementation 'net.pwall.html:html-pipeline:0.1'

Gradle (kts)

    implementation("net.pwall.html:html-pipeline:0.1")

Peter Wall

2020-03-01

Library versions

Version
0.1