RISE to Bloome Software
Log In    
r2bsoftware.se r2bsoftware.se
Click to hide navigation tree


Marshal implements a general harvester, the TCP JSON Harvester, that allows external applications to implement the harvesting process and act as if they where an integral part of Marshal. These external harvesters, referred to as routers, allows browsing the data source and continuous, asynchronous, data retrieval, i.e. they provide functionality equal to that of the ODBC harvester. This means that Routers mimic relational databases in the sense that the returned data is represented as table-rows with columns.

Router flow overview

The image above shows how a router works at a conceptual level. It simply routes the Marshal harvester to the data source, hiding any and all data source specific details. 

The purpose of the router mechanism is to accomplish code separation in a platform independent way. However, it also opens for distribution of the harvesting software across multiple computers. When deploying the router on a computer other than the one running Marshal (Archiver or Editor) make sure you've properly secured access to it.

How to configure a router

All routers share the same set of configuration properties, see below. The interpretation of most of these settings is left to the specific router. The Host and Port settings, however, have a common meaning since they are used by Marshal to connect to the router. 
NameJSON TCP harvester
Connection Stringsee specific router
For namesee specific router
HostThe tcp/ip address of the router, i.e. the address of the computer running the router software. Typically, this could be, indicating the router is running on the same computer as the on running Marshal.
PortThe tcp/ip port on which the router listens for connections. Typically, this must match the port value provided as part of the router configuration.
Group bysee specific router
Order bysee specific router
Query Timeoutsee specific router
Table or Viewsee specific router
Wheresee specific router
User Authentication
Passwordsee specific router
User IDsee specific router

Jdbc Router
The Marshal JDBC Router is a multithreaded, platform independent, Java middleware for accessing databases having a JDBC driver, such as Access, FileMaker, MySQL, PostgreSQL, SQL Server, Oracle, SQLite and many more.
SOAP Router
The SOAP Router allows Marshal to browse and call SOAP and WCF web services.
Site Router
The Site Router is a web crawler add-on to Marshal. Use the Site Router to harvest web pages and documents or entire web sites.
Custom Router
The Custom Code Router allows you to extend Marshals harvesting functionality with your own source code or assembly.