Setup

Last updated Sep 25, 2025

Overview

Summary of Steps

To complete the setup of your data environment in Connecty AI, follow these steps:

  • Step 1 – Create a Data Environment: One Organization supports unlimited Data Environments (subject to your plan, i.e. Team vs Enterprise). For example: production, staging, development, or separate business units.

  • Step 2 – Configure a Data Connection: Connect each Data Environment to a data source. For example: Snowflake, BigQuery, Databricks. Read more about Integrations.

  • Step 3 – AI Sync: Once the connection is verified, the system runs an automated sync:

    1. Data Environment Catalog Discovery

    2. Data Workspace Catalog Initialization

    3. Inferring PII

    4. Data Statistics Collection

    5. Data Workspace Catalog Finalization

  • Step 4 – Create Data Workspaces (Optional)

    1. A default Workspace is automatically created so you can start using the application right away.

    2. You can create additional Workspaces under each Environment to organize access for specific teams or departments. Users can belong to multiple Workspaces at no extra cost.

Each Data Environment can have multiple Data Workspaces.

Workspaces let you limit access to a group of people by department or access permissions. For example, a Workspace can be specific to a function or team, such as Marketing, Product, or Accounting, similar to a data mart.
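The hierarchy described above (one Organization, many Data Environments, many Data Workspaces per Environment) can be pictured with a small Python sketch. This is only an illustrative model, not Connecty AI's API: the class names, method names, and the default Workspace name "Default" are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Workspace:
    name: str


@dataclass
class Environment:
    name: str
    # Assumption: the default Workspace is called "Default"; the docs only
    # state that one is created automatically.
    workspaces: List[Workspace] = field(
        default_factory=lambda: [Workspace("Default")]
    )

    def add_workspace(self, name: str) -> Workspace:
        workspace = Workspace(name)
        self.workspaces.append(workspace)
        return workspace


@dataclass
class Organization:
    name: str
    environments: List[Environment] = field(default_factory=list)

    def add_environment(self, name: str) -> Environment:
        environment = Environment(name)
        self.environments.append(environment)
        return environment


# Hypothetical organization with two Environments.
org = Organization("Acme")
production = org.add_environment("Production")
production.add_workspace("Marketing")
org.add_environment("Staging")

# Map each Environment to the names of its Workspaces.
workspace_names = {
    env.name: [ws.name for ws in env.workspaces] for env in org.environments
}
```

Each new Environment starts with its default Workspace, and additional Workspaces are attached to one Environment only.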


Step 1 – Create a Data Environment

  • Add a new Data Environment (e.g., Production, Staging, Development, or a specific business unit).

  • Each Organization can support unlimited Data Environments (depending on your plan: Team vs Enterprise).


Step 2 – Configure a Data Connection

Each Data Environment requires a data connection before syncing can begin.

  1. Select your data source (e.g., PostgreSQL, Snowflake, Databricks, BigQuery, Amazon Athena).

  2. Enter the required credentials. Refer to Integrations to check the prerequisites for your data source.

  3. Use Test & Save to verify the connection.

  4. (Optional, for advanced customization) Select the scope for synchronization, execution, and materialization. To keep the default scope, skip this step and click 'Save'.

  5. After you click 'Save', the system starts the auto-sync process.
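The order of these steps can be sketched as a small state model in Python. Everything here is hypothetical and exists only to illustrate the flow (verify first, optionally set a scope, then save to start the sync); `ConnectionSetup`, `fake_probe`, and the credential keys are assumptions, not part of Connecty AI.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ConnectionSetup:
    source: str
    credentials: Dict[str, str]
    verified: bool = False
    sync_started: bool = False
    # None means the default scope was kept (optional step 4 skipped).
    scope: Optional[Dict[str, List[str]]] = None

    def test_and_save(self, probe: Callable[[str, Dict[str, str]], bool]) -> bool:
        """Step 3: verify the connection with a caller-supplied probe."""
        self.verified = probe(self.source, self.credentials)
        return self.verified

    def save(self, scope: Optional[Dict[str, List[str]]] = None) -> None:
        """Steps 4 and 5: optionally record a scope, then start the sync."""
        if not self.verified:
            raise RuntimeError("connection must pass Test & Save first")
        self.scope = scope
        self.sync_started = True


def fake_probe(source: str, credentials: Dict[str, str]) -> bool:
    # Stand-in for a real connectivity check; the keys are hypothetical.
    return bool(credentials.get("user")) and bool(credentials.get("password"))


setup = ConnectionSetup("PostgreSQL", {"user": "analyst", "password": "example"})
setup.test_and_save(fake_probe)
setup.save()  # scope left at its defaults
```

In this sketch, calling `save()` on an unverified connection raises an error, which mirrors the requirement that the connection is verified before syncing begins.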



Step 3 – AI Sync

After the connection is verified, the sync process runs automatically.

It consists of 5 stages, visible in the Environment Dashboard:

  1. Data Environment Catalog Discovery

    • Detects schemas, tables, views, and objects in your environment.

    • Shows detailed progress by columns, databases, objects, and schemas.

  2. Data Workspace Catalog Initialization

    • Builds the catalog structure inside your Data Workspace.

    • Ensures that objects are organized and ready for use.

  3. Inferring PII by AI

    • Automatically identifies Personally Identifiable Information (PII) at the column level.

    • Flags sensitive data for compliance and governance.

    • Users can review and override the AI’s suggestions by marking columns as PII or Non-PII before finalizing.

    • Columns marked as PII are skipped in the next stage, Data Statistics Collection.

You can monitor progress with a real-time counter and sync status indicators.

  4. Data Statistics Collection

    • Gathers column-level metadata and statistics to help the AI learn and better understand your data.

  5. Data Workspace Catalog Finalization

    • Finalizes the Data Workspace Catalog, making the metadata available in a human-readable format under the 'Context Engine' module for data governance.
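The stage order and the PII rule can be illustrated with a short Python sketch. It is a simplified model under stated assumptions, not the product's implementation: the enum member names, the helper functions, and the sample columns are all hypothetical.

```python
from enum import IntEnum
from typing import Dict, List


class SyncStage(IntEnum):
    """The five sync stages, in the order listed above."""
    CATALOG_DISCOVERY = 1
    WORKSPACE_CATALOG_INITIALIZATION = 2
    PII_INFERENCE = 3
    STATISTICS_COLLECTION = 4
    WORKSPACE_CATALOG_FINALIZATION = 5


def resolve_pii(ai_flags: Dict[str, bool], overrides: Dict[str, bool]) -> Dict[str, bool]:
    """User overrides take precedence over the AI's suggestions."""
    return {**ai_flags, **overrides}


def columns_for_statistics(pii_flags: Dict[str, bool]) -> List[str]:
    """Columns marked as PII are left out of statistics collection."""
    return [column for column, is_pii in pii_flags.items() if not is_pii]


# Hypothetical AI suggestions and one user override (country is not PII).
ai_flags = {"email": True, "order_total": False, "country": True}
overrides = {"country": False}

final_flags = resolve_pii(ai_flags, overrides)
stats_columns = columns_for_statistics(final_flags)
stage_order = [stage.name for stage in SyncStage]
```

Here `email` stays marked as PII and is excluded, while `order_total` and the overridden `country` column proceed to statistics collection.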


Step 4 – Create Data Workspaces (Optional)

  • Within each Data Environment, you can add unlimited Workspaces.

  • Workspaces allow you to restrict access to specific departments or use cases (e.g., Marketing, Finance, Product).

  • Users can be assigned to multiple Workspaces at no extra cost.
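As a rough illustration of Workspace-based access, the Python sketch below models memberships as a mapping from Workspace name to a set of users. The function names and the sample users are hypothetical; the actual permission model in Connecty AI may differ.

```python
from typing import Dict, List, Set

Memberships = Dict[str, Set[str]]


def assign(memberships: Memberships, user: str, workspace: str) -> None:
    """Add a user to a Workspace; a user may belong to several."""
    memberships.setdefault(workspace, set()).add(user)


def workspaces_for(memberships: Memberships, user: str) -> List[str]:
    """List the Workspaces a user belongs to, sorted by name."""
    return sorted(ws for ws, users in memberships.items() if user in users)


def can_access(memberships: Memberships, user: str, workspace: str) -> bool:
    """Access is limited to members of the Workspace."""
    return user in memberships.get(workspace, set())


memberships: Memberships = {}
assign(memberships, "dana", "Marketing")
assign(memberships, "dana", "Finance")
assign(memberships, "lee", "Product")

dana_workspaces = workspaces_for(memberships, "dana")
lee_can_see_finance = can_access(memberships, "lee", "Finance")
```

A set per Workspace keeps membership checks simple and lets one user appear in any number of Workspaces.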


Monitoring Your Environment

The Data Environment Overview Dashboard provides:

  • Number of configured connections.

  • Number of workspaces.

  • Last sync status (with live progress).

This gives you a real-time snapshot of your environment’s settings.
