cancel
Showing results for 
Search instead for 
Did you mean: 
suxinji
Employee Alumni
Employee Alumni

 

Overview

In this article, we will show the steps that are required to leverage data prepared in Incorta as input for modeling, building and inference in DataRobot.
 

What you should know before reading this article

Solution

Here are the step by step instructions of creating a connection from DataRobot to Incorta.

Create New Project

suxinji_0-1649876258507.png

Select Data Source

suxinji_1-1649876266408.png

Create JDBC Connection

Select the postgreSQL database as the data store.  

suxinji_2-1649876362862.pngsuxinji_3-1649876380642.png

Best Practices

Below are some best practices when using Incorta as the source for data input for AI/ML platforms, such as DataRobot.

Only Share the Schemas Required

Incorta allows you to control who can have access to a schema.  A schema or a business schema can be shared with View only access which is generally sufficient for the consuming team to access the data.

Create Business Schemas

Both Incorta business schemas and physical schemas can be exposed and selected from the AI/ML platforms such as DataRobot.  However, it is easier for the consumer to uptake the data from Incorta when the data is already joined and a flattened view is provided.

Reuse and Share the Data across Multiple ML Projects

You should treat Incorta as the "Feature Store" to maximize the reusability.  Incorta business schemas and business views provides high level data abstraction.  This enables data scientists to focus on the ML model selection and optimization, and leave the data preparation to the data engineering team using Incorta.   Data can be joined, aggregated, and consolidated all within Incorta.

Reference

DataRobot documentation - DataRobot data connections

 

 

 

Best Practices Index
Best Practices

Just here to browse knowledge? This might help!

Version history
Last update:
‎05-25-2022 01:23 PM
Updated by: