Pdf2Dom

Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2Dom may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.

License

License

Categories

Categories

Net PDF Data
GroupId

GroupId

net.sf.cssbox
ArtifactId

ArtifactId

pdf2dom
Last Version

Last Version

2.0.1
Release Date

Release Date

Type

Type

jar
Description

Description

Pdf2Dom
Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. The inline CSS definitions contained in the resulting document are used for making the HTML page as similar as possible to the PDF input. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2Dom may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox.
Project URL

Project URL

http://cssbox.sourceforge.net/pdf2dom
Source Code Management

Source Code Management

https://github.com/radkovo/Pdf2Dom

Download pdf2dom

How to add to project

<!-- https://jarcasting.com/artifacts/net.sf.cssbox/pdf2dom/ -->
<dependency>
    <groupId>net.sf.cssbox</groupId>
    <artifactId>pdf2dom</artifactId>
    <version>2.0.1</version>
</dependency>
// https://jarcasting.com/artifacts/net.sf.cssbox/pdf2dom/
implementation 'net.sf.cssbox:pdf2dom:2.0.1'
// https://jarcasting.com/artifacts/net.sf.cssbox/pdf2dom/
implementation ("net.sf.cssbox:pdf2dom:2.0.1")
'net.sf.cssbox:pdf2dom:jar:2.0.1'
<dependency org="net.sf.cssbox" name="pdf2dom" rev="2.0.1">
  <artifact name="pdf2dom" type="jar" />
</dependency>
@Grapes(
@Grab(group='net.sf.cssbox', module='pdf2dom', version='2.0.1')
)
libraryDependencies += "net.sf.cssbox" % "pdf2dom" % "2.0.1"
[net.sf.cssbox/pdf2dom "2.0.1"]

Dependencies

compile (4)

Group / Artifact Type Version
org.apache.pdfbox : pdfbox jar 2.0.24
net.mabboud.fontverter : FontVerter jar 1.2.22
commons-io : commons-io jar 2.11.0
org.slf4j : slf4j-api jar 1.7.32

test (6)

Group / Artifact Type Version
junit : junit jar 4.13.2
org.jsoup : jsoup jar 1.14.3
org.hamcrest : hamcrest-all jar 1.3
commons-codec : commons-codec jar 1.15
org.slf4j : slf4j-simple jar 1.7.32
net.mabboud.gfxassert : GfxAssert jar 1.0.4

Project Modules

There are no modules declared in this project.

Versions

Version
2.0.1
2.0.0
1.9
1.8
1.7
1.6
1.5
1.4
1.3
1.2