Skip to content
Gary Anderson edited this page Dec 4, 2018 · 3 revisions

DataSpider

DataSpider is an application developed by Modak Analytics and GlaxoSmithKline (GSK) to provide source meta-data discovery functionalty to the GSK Big Data Platform. It is being contributed to the opensource community by Modak Analytics on behalf of GSK.

DataSpider is a meta-data crawler closely linked to the Kosh meta-data model. It is an application that can crawl across many types of enterprise data sources and populate the Kosh metadata repository with the information needed for Bots to orchestrate the ingestion of data sources into the Big Data repository as well as to perform curation activities within the system.