Package de.fuberlin.wiwiss.marbles.loading

The classes of this package are used to retrieve content from the Web.

Interface Summary
DereferencingListener Listener interface for handling DereferencingResults.
 

Class Summary
CacheController Implements caching of data retrieved from HTTP URLs using a Sesame repository. Uses the HTTP in RDF namespace, but currently does not follow the ontology, as this would require deeper nesting with b-nodes, which would be an overcomplication for the current use cases.
ContentTypes Utilities for identifying RDF content types.
DereferencerBatch Starting with one URL, the DereferencerBatch handles the nested retrieval of data by following known predicates in retrieved data, and processing retrieval results with data providers.
DereferencerThread The DereferencerThread executes a given DereferencingTask.
DereferencingResult The dereferencing result contains information about the success or failure of a DereferencingTask's execution.
DereferencingTaskQueue The DereferencingTaskQueue is a thread which observes the DereferencerThreads.
ExtendedDereferencingTask Extends DereferencingTask with the redirectStep, done and forceReload properties.
HttpStatusCodes Provides utilities for working with HTTP status codes.
RDFGraphInserter An RDF handler that adds all processed statements to a graph.
SemanticWebClient Provides functionality to load URLs and to discover related data by means of data providers.
SpongerProvider Provides access to the Sponger functionality of a local Virtuoso Server installation, which extracts RDF from a multitude of formats, including GRDDL, RDFa and Microformats.
 

Package de.fuberlin.wiwiss.marbles.loading Description

The classes of this package are used to retrieve content from the Web. Classes from the Semantic Web Client library were adapted to the Sesame framework and modified to perform retrieval using the Apache Commons HttpClient. A CacheController manages access to cached data. A SpongerProvider provides access to the Sponger functionality of a local Virtuoso Server installation, which extracts RDF from a multitude of formats, including GRDDL, RDFa and Microformats.
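
The nested retrieval that the class summary attributes to DereferencerBatch (start with one URL, then follow known predicates found in the retrieved data) can be pictured with the sketch below. It is only an illustration of the general idea and does not use or reproduce the Marbles API: the class NestedRetrievalSketch, its Triple type, the in-memory map standing in for the Web, the depth limit and the example.org URLs are all assumptions made for this example.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Collections;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical illustration; none of these names belong to the Marbles package. */
final class NestedRetrievalSketch {

    /** Minimal stand-in for an RDF statement. */
    static final class Triple {
        final String subject;
        final String predicate;
        final String object;

        Triple(String subject, String predicate, String object) {
            this.subject = subject;
            this.predicate = predicate;
            this.object = object;
        }
    }

    /** Predicates whose objects are treated as further URLs to retrieve (an assumed selection). */
    static final Set<String> FOLLOW = new HashSet<>(Arrays.asList(
            "http://www.w3.org/2000/01/rdf-schema#seeAlso",
            "http://www.w3.org/2002/07/owl#sameAs"));

    /** In-memory substitute for the Web: URL -> statements "retrieved" from that URL. */
    static Map<String, List<Triple>> sampleWeb() {
        String a = "http://example.org/a";
        String b = "http://example.org/b";
        String c = "http://example.org/c";
        String d = "http://example.org/d";
        Map<String, List<Triple>> web = new HashMap<>();
        web.put(a, Arrays.asList(
                new Triple(a, "http://www.w3.org/2000/01/rdf-schema#seeAlso", b),
                new Triple(a, "http://xmlns.com/foaf/0.1/name", "Alice")));
        web.put(b, Arrays.asList(
                new Triple(b, "http://www.w3.org/2002/07/owl#sameAs", c)));
        web.put(c, Arrays.asList(
                new Triple(c, "http://www.w3.org/2000/01/rdf-schema#seeAlso", d)));
        return web;
    }

    /**
     * Breadth-first retrieval from startUrl. Only objects of statements whose
     * predicate is in followPredicates are queued, up to maxDepth link hops.
     * Returns the URLs that were retrieved, in retrieval order.
     */
    static Set<String> retrieve(String startUrl,
                                Map<String, List<Triple>> web,
                                Set<String> followPredicates,
                                int maxDepth) {
        Set<String> visited = new LinkedHashSet<>();
        Deque<String> currentLevel = new ArrayDeque<>();
        currentLevel.add(startUrl);

        for (int depth = 0; depth <= maxDepth && !currentLevel.isEmpty(); depth++) {
            Deque<String> nextLevel = new ArrayDeque<>();
            for (String url : currentLevel) {
                if (!visited.add(url)) {
                    continue; // already retrieved in this batch
                }
                List<Triple> statements = web.getOrDefault(url, Collections.emptyList());
                for (Triple t : statements) {
                    if (followPredicates.contains(t.predicate)) {
                        nextLevel.add(t.object);
                    }
                }
            }
            currentLevel = nextLevel;
        }
        return visited;
    }

    public static void main(String[] args) {
        // Retrieve the start URL plus everything reachable within two hops.
        System.out.println(retrieve("http://example.org/a", sampleWeb(), FOLLOW, 2));
    }
}
```

In this sketch the foaf:name statement is ignored because its predicate is not in the followed set, and the depth limit keeps the batch from expanding indefinitely. A real batch would additionally dereference each URL over HTTP and consult a cache, which the sketch leaves out.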