Setup
Last updated Sep 25, 2025
Overview

Summary of Steps
To complete the setup of your data environment in Connecty AI, follow these steps:
- Step 0 – Organization Setup Our Support team will create your Organization and send you an invite. 
- Step 1 – Create a Data Environment Add a new Data Environment. 
- Step 2 – Configure a Data Connection Select your data source, enter credentials and verify the connection. 
- Step 3 – AI Sync Once the connection is verified, the system runs an automated sync: - Data Environment Catalog Discovery 
- Data Workspace Catalog Initialization 
- Inferring PII 
- Data Statistics Collection 
- Data Workspace Catalog Finalization 
 
- Step 4 – Create Data Workspaces (Optional) - A default Workspace is automatically created so you can start using the application right away. 
- You can create additional Workspaces under each Environment to organize access for specific teams or departments. Users can belong to multiple Workspaces at no extra cost. 
 
Step 1 – Create a Data Environment
- Add a new Data Environment (e.g., Production, Staging, Development, or a specific business unit). 
- Each Organization can support unlimited Data Environments (depending on your plan: Team vs Enterprise). 


Step 2 – Configure a Data Connection
Each Data Environment requires a data connection before syncing can begin.
- Select your data source (e.g., PostgreSQL, Snowflake, Databricks, BigQuery, Amazon Athena). 
- Enter the required credentials. Refer Integrations to check pre-requisites for your data source. 
- Use Test & Save to verify the connection. 
- (Optional Step for Advanced Customization) Select the scope for synchronization, execution and materialization. You can also skip changing the scope and directly click on 'Save'. 
- After you click on 'Save', the system starts the auto-sync process. 

Example:

Step 3 – AI Sync
After the connection is verified, the sync process runs automatically.
It consists of 5 stages, visible in the Environment Dashboard:
- Data Environment Catalog Discovery - Detects schemas, tables, views, and objects in your environment. 
- Shows detailed progress by columns, databases, objects, and schemas. 
 
- Data Workspace Catalog Initialization - Builds the catalog structure inside your Data Workspace. 
- Ensures that objects are organized and ready for use. 
 

- Inferring PII by AI 
- Automatically identifies Personally Identifiable Information (PII) at the column level. 
- Flags sensitive data for compliance and governance. 
- Users can review and override AI’s suggestions by marking columns as PII or Non-PII before finalizing. 

You can monitor progress with a real-time counter and sync status indicators.
- Data Statistics Collection: - During this step, the system gathers column-level metadata and statistics to help the AI learn and better understand your data. 
 
- Data Workspace Catalog Finalization: - The sync concludes by finalizing the Data Workspace Catalog, making the metadata available in a human-readable format under 'Context Engine' module for data governance.  
 
Step 4 – Create Data Workspaces (Optional)
- Within each Data Environment, you can add unlimited Workspaces. 
- Workspaces allow you to restrict access to specific departments or use cases (e.g., Marketing, Finance, Product). 
- Users can be assigned to multiple Workspaces at no extra cost. 

Monitoring Your Environment
The Data Environment Overview Dashboard provides:
- Number of configured connections. 
- Number of workspaces. 
- Last sync status (with live progress). 
This gives you a real-time snapshot of your environment’s settings.

Last updated
