Группа библиотек: org.archive.heritrix

Heritrix 3: 'modules' subproject (reusable components)

org.archive.heritrix : heritrix-modules

This project contains some of the configurable modules used within the Heritrix application to crawl the web. The modules in this project can be used in applications other than Heritrix, however.

Последняя версия: 3.4.0-20210923

Дата:

Heritrix 3: 'engine' subproject

org.archive.heritrix : heritrix-engine

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Последняя версия: 3.4.0-20210923

Дата:

Heritrix 3: 'commons' subproject (utility classes)

org.archive.heritrix : heritrix-commons

The Archive Commons Code Libraries project contains general Java utility libraries, as used by the Heritrix crawler and other projects.

Последняя версия: 3.4.0-20210923

Дата:

Heritrix 3 (distribution bundles)

org.archive.heritrix : heritrix

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Последняя версия: 3.4.0-20210923

Дата:

Heritrix 3: 'contrib' subproject

org.archive.heritrix : heritrix-contrib

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Последняя версия: 3.4.0-20210923

Дата:

  • 1