|
|
RoutersMarshal implements a general harvester, the TCP JSON Harvester, that allows external applications to implement the harvesting process and act as if they where an integral part of Marshal. These external harvesters, referred to as routers, allows browsing the data source and continuous, asynchronous, data retrieval, i.e. they provide functionality equal to that of the ODBC harvester. This means that Routers mimic relational databases in the sense that the returned data is represented as table-rows with columns.
The image above shows how a router works at a conceptual level. It simply routes the Marshal harvester to the data source, hiding any and all data source specific details.
The purpose of the router mechanism is to accomplish code separation in a platform independent way. However, it also opens for distribution of the harvesting software across multiple computers. When deploying the router on a computer other than the one running Marshal (Archiver or Editor) make sure you've properly secured access to it.
How to configure a routerAll routers share the same set of configuration properties, see below. The interpretation of most of these settings is left to the specific router. The Host and Port settings, however, have a common meaning since they are used by Marshal to connect to the router.
Harvester | Name | JSON TCP harvester | Router | Connection String | see specific router | For name | see specific router | Host | The tcp/ip address of the router, i.e. the address of the computer running the router software. Typically, this could be 127.0.0.1, indicating the router is running on the same computer as the on running Marshal. | Port | The tcp/ip port on which the router listens for connections. Typically, this must match the port value provided as part of the router configuration. | Settings | Group by | see specific router | Order by | see specific router | Query Timeout | see specific router | Table or View | see specific router | Where | see specific router | User Authentication | Password | see specific router | User ID | see specific router |
|
Jdbc Router
|
The Marshal JDBC Router is a multithreaded, platform independent, Java middleware for accessing databases having a JDBC driver, such as Access, FileMaker, MySQL, PostgreSQL, SQL Server, Oracle, SQLite and many more. |
|
SOAP Router
|
The SOAP Router allows Marshal to browse and call SOAP and WCF web services. |
|
Site Router
|
The Site Router is a web crawler add-on to Marshal. Use the Site Router to harvest web pages and documents or entire web sites. |
|
Custom Router
|
The Custom Code Router allows you to extend Marshals harvesting functionality with your own source code or assembly. |
|
|
|
|