Semargl is a modular framework for crawling linked data from structured documents. The main goal of the project is to provide a lightweight, performant tool without excess dependencies.
This module integrates with Apache Clerezza to provide direct access to the RDFa parser using the Clerezza Parser APIs.
<dependency>
    <groupId>org.semarglproject</groupId>
    <artifactId>semargl-clerezza</artifactId>
    <version>0.7</version>
</dependency>
TODO: Code example
To build the framework, just run mvn clean install.