Project Group: cn.wanghaomiao

JsoupXpath

cn.wanghaomiao : JsoupXpath

一个非常好用而且强大的基于xpath的html解析器。html的DOM树生成依赖Jsoup。Lexer 和 Parser基于Antlr4,支持完备的W3C XPATH 1.0标准语法,W3C规范:http://www.w3.org/TR/1999/REC-xpath-19991116。

Last Version: 2.5.1

Release Date:

SeimiCrawler

cn.wanghaomiao : SeimiCrawler

一个支持分布式的可以高效开发且可以高效运行的爬虫框架。设计思想上融合了spring与scrapy的优点。An powerful,agile,powerful,distributed crawler framework.

Last Version: 2.1.2

Release Date:

lambda-factory

cn.wanghaomiao : lambda-factory

Sonatype helps open source projects to set up Maven repositories on https://oss.sonatype.org/

Last Version: 2.0.2

Release Date:

SeimiCrawler Package Plugin

cn.wanghaomiao : maven-seimicrawler-plugin

Package seimicrawler project so that can be fast and standalone deployed.It is based on maven-war-plugin and modified. 这是专为SeimiCrawler工程专门定制的一个maven发布工具,意在简化开发者项目发布与部署流程。本插件是基于Apache的maven-war-plugin修改而来,依然采用Apache License Version2.0发布。

Last Version: 1.3.0

Release Date:

  • 1