---
title: Sync data from Postgres | Tiger Data Docs
description: Sync updates to your primary Postgres database with Tiger Cloud in real time
---

Tips

**Source PostgreSQL connector vs. Livesync replication:** The source PostgreSQL connector provides **continuous ongoing replication** where PostgreSQL stays the primary and Tiger Cloud acts as a logical replica. For a **one-time migration** to Tiger Cloud with a cutover at the end, see [Livesync replication](/docs/migrate/livesync-replication/index.md) instead. Both features share the same underlying technology, but the workflows and end states differ.

You use the source PostgreSQL connector in Tiger Cloud to synchronize all data or specific tables from a PostgreSQL database instance to your service, in real time. You run the connector continuously, turning PostgreSQL into a primary database with your service as a logical replica. This enables you to leverage Tiger Cloud's real-time analytics capabilities on your replica data.

![Connectors overview in Tiger Console](/docs/_astro/tiger-console-connector-overview.C3brL-kO_Z2bCaMM.webp)

The source PostgreSQL connector in Tiger Cloud leverages the well-established PostgreSQL logical replication protocol. By relying on this protocol, Tiger Cloud ensures compatibility, familiarity, and a broader knowledge base, making it easier for you to adopt the connector and integrate your data.

You use the source PostgreSQL connector for data synchronization, rather than migration. This includes:

- Copy existing data from a PostgreSQL instance to a Tiger Cloud service:

  - Copy data at up to 150 GB/hr.

    You need at least a 4 CPU/16 GB source database, and a 4 CPU/16 GB target service.

  - Copy the publication tables in parallel.

    Large individual tables still use a single connection, except PostgreSQL declarative partitioned tables published with `publish_via_partition_root = false` (the PostgreSQL default), which are copied leaf partition by leaf partition in parallel since v0.23.0.

  - Forget foreign key relationships.

    The connector disables foreign key validation during the sync. For example, if a `metrics` table refers to the `id` column on the `tags` table, you can still sync only the `metrics` table without worrying about their foreign key relationships.

  - Track progress.

    PostgreSQL exposes `COPY` progress under `pg_stat_progress_copy`.

- Synchronize real-time changes from a PostgreSQL instance to a Tiger Cloud service.

- Add and remove tables on demand using the [PostgreSQL PUBLICATION interface](https://www.postgresql.org/docs/current/sql-createpublication.html).

- Enable features such as [hypertables](/docs/learn/hypertables/understand-hypertables/index.md), [columnstore](/docs/learn/columnar-storage/understand-hypercore/index.md), and [continuous aggregates](/docs/learn/continuous-aggregates/index.md) on your logical replica.

- Indexes, primary key, unique constraints, and sequences are **not** migrated. Create needed indexes on the target for your queries.

- TimescaleDB as source has limited support (for example, no continuous aggregates).

- Schema changes must be coordinated: apply compatible changes on the target first, then on the source.

- WAL volume on the source increases during large table copy.

- **Continuous aggregates:** The connector uses `session_replication_role=replica` during copy, so triggers (including continuous aggregate invalidation) do not run. Data synced during initial load below a continuous aggregate's materialization watermark may not appear in the aggregate until you manually refresh. If the aggregate exists on the source, include it in the connector's publication; if only on the target, use the `force` option of [refresh\_continuous\_aggregate](/docs/reference/timescaledb/continuous-aggregates/refresh_continuous_aggregate/index.md) to refresh affected ranges.

* [Tiger Console](#tab-panel-607)
* [Self-hosted PostgreSQL connector](#tab-panel-608)

## Prerequisites for this integration guide

To follow these steps, you'll need:

- A [Tiger Cloud service](/docs/get-started/quickstart/create-service/index.md).

* Your [connection details](/docs/integrate/find-connection-details/index.md).

- The [PostgreSQL client tools](/docs/integrate/query-administration/psql/index.md) installed on your sync machine.

- The source PostgreSQL instance and the target Tiger Cloud service must have the same extensions installed.

  The source PostgreSQL connector does not create extensions on the target. If the table uses column types from an extension, first create the extension on the target Tiger Cloud service before syncing the table.

## Limitations

- Indexes, including the primary key, unique constraints, and sequences are not migrated to the target Tiger Cloud service.

  We recommend that, depending on your query patterns, you create only the necessary indexes on the target Tiger Cloud service.

* Using TimescaleDB as the source has limited support (no CAGGs).

* The source must be running PostgreSQL 13 or later.

* Schema changes must be co-ordinated.

  Make compatible changes to the schema in your Tiger Cloud service first, then make the same changes to the source PostgreSQL instance.

* Ensure that the source PostgreSQL instance and the target Tiger Cloud service have the same extensions installed.

  The source PostgreSQL connector does not create extensions on the target. If the table uses column types from an extension, first create the extension on the target Tiger Cloud service before syncing the table.

* There is WAL volume growth on the source PostgreSQL instance during large table copy.

* Continuous aggregate invalidation

  The connector uses `session_replication_role=replica` during data replication, which prevents table triggers from firing. This includes the internal triggers that mark continuous aggregates as invalid when underlying data changes.

  If you have continuous aggregates on your target database, they do not automatically refresh for data inserted during the migration. This limitation only applies to data below the continuous aggregate's materialization watermark. For example, backfilled data. New rows synced above the continuous aggregate watermark are used correctly when refreshing.

  This can lead to:

  - Missing data in continuous aggregates for the migration period.
  - Stale aggregate data.
  - Queries returning incomplete results.

  If the continuous aggregate exists in the source database, best practice is to add it to the PostgreSQL connector publication. If it only exists on the target database, manually refresh the continuous aggregate using the `force` option of [refresh\_continuous\_aggregate](/docs/reference/timescaledb/continuous-aggregates/refresh_continuous_aggregate#samples/index.md).

## Set your connection string

This variable holds the connection information for the source database. In the terminal on your migration machine, set the following:

Terminal window

```
export SOURCE="postgres://<user>:<password>@<source host>:<source port>/<db_name>"
```

Important

Avoid using connection strings that route through connection poolers like PgBouncer or similar tools. This tool requires a direct connection to the database to function properly.

## Tune your source database

- [From AWS RDS/Aurora](#tab-panel-603)
- [From PostgreSQL](#tab-panel-604)

Updating parameters on a PostgreSQL instance will cause an outage. Choose a time that will cause the least issues to tune this database.

1. **Tune the Write Ahead Log (WAL) on the RDS/Aurora PostgreSQL source database**

   1. In [RDS console](https://console.aws.amazon.com/rds/home#databases:), select the RDS instance to migrate.

   2. Click `Configuration`, scroll down and note the `DB instance parameter group`, then click `Parameter Groups`.

      ![RDS parameter groups in the AWS console](/docs/_astro/awsrds-parameter-groups.CiXrFBVV_1Diea.webp)

   3. Click `Create parameter group`, fill in the form with the following values, then click `Create`:

      - `Parameter group name`, whatever suits your fancy.
      - `Description`, knock yourself out with this one.
      - `Engine type`, `PostgreSQL`
      - `Parameter group family`, the same as `DB instance parameter group` in your `Configuration`.

   4. In `Parameter groups`, select the parameter group you created, then click `Edit`.

   5. Update the following parameters, then click `Save changes`:

      - `rds.logical_replication` set to `1`: record the information needed for logical decoding.
      - `wal_sender_timeout` set to `0`: disable the timeout for the sender process.

   6. In RDS, navigate back to your [databases](https://console.aws.amazon.com/rds/home#databases:), select the RDS instance to migrate, and click `Modify`.

   7. Scroll down to `Database options`, select your new parameter group, and click `Continue`.

   8. Click `Apply immediately` or choose a maintenance window, then click `Modify DB instance`.

   Changing parameters will cause an outage. Wait for the database instance to reboot before continuing. After it comes back up, verify that the new settings are in effect on your database.

2. **Create a user for the source PostgreSQL connector and assign permissions**

   1. Create `<pg connector username>`:

      Terminal window

      ```
      psql $SOURCE -c "CREATE USER <pg connector username> PASSWORD '<password>'"
      ```

      You can use an existing user. However, you must ensure that the user has the following permissions.

   2. Grant permissions to create a replication slot:

      Terminal window

      ```
      psql $SOURCE -c "GRANT rds_replication TO <pg connector username>"
      ```

   3. Grant permissions to create a publication:

      Terminal window

      ```
      psql $SOURCE -c "GRANT CREATE ON DATABASE <database name> TO <pg connector username>"
      ```

   4. Assign the user permissions on the source database:

      Terminal window

      ```
      psql $SOURCE <<EOF
      GRANT USAGE ON SCHEMA "public" TO <pg connector username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA "public" TO <pg connector username>;
      ALTER DEFAULT PRIVILEGES IN SCHEMA "public" GRANT SELECT ON TABLES TO <pg connector username>;
      EOF
      ```

      If the tables you are syncing are not in the `public` schema, grant the user permissions for each schema you are syncing:

      Terminal window

      ```
      psql $SOURCE <<EOF
      GRANT USAGE ON SCHEMA <schema> TO <pg connector username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA <schema> TO <pg connector username>;
      ALTER DEFAULT PRIVILEGES IN SCHEMA <schema> GRANT SELECT ON TABLES TO <pg connector username>;
      EOF
      ```

   5. On each table you want to sync, make `<pg connector username>` the owner:

      Terminal window

      ```
      psql $SOURCE -c 'ALTER TABLE <table name> OWNER TO <pg connector username>;'
      ```

      You can skip this step if the replicating user is already the owner of the tables.

3. **Enable replication DELETE and UPDATE operations**

   Replica identity assists data replication by identifying the rows being modified. Your options are that each table and hypertable in the source database should either have:

   - **A primary key**: data replication defaults to the primary key of the table being replicated. Nothing to do.
   - **A viable unique index**: each table has a unique, non-partial, non-deferrable index that includes only columns marked as `NOT NULL`. If a UNIQUE index does not exist, create one to assist the migration. You can delete it after migration. For each table, set `REPLICA IDENTITY` to the viable unique index:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table name> REPLICA IDENTITY USING INDEX <_index_name>'
     ```
   - **No primary key or viable unique index**: use brute force. For each table, set `REPLICA IDENTITY` to `FULL`:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table_name> REPLICA IDENTITY FULL'
     ```
     For each `UPDATE` or `DELETE` statement, PostgreSQL reads the whole table to find all matching rows. This results in significantly slower replication. If you are expecting a large number of `UPDATE` or `DELETE` operations on the table, best practice is to not use `FULL`.

1) **Tune the Write Ahead Log (WAL) on the PostgreSQL source database**

   ```
   psql $SOURCE <<EOF
   ALTER SYSTEM SET wal_level='logical';
   ALTER SYSTEM SET max_wal_senders=10;
   ALTER SYSTEM SET wal_sender_timeout=0;
   EOF
   ```

   - [GUC "wal\_level" as "logical"](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-WAL-LEVEL)
   - [GUC "max\_wal\_senders" as 10](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-WAL-SENDERS)
   - [GUC "wal\_sender\_timeout" as 0](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-WAL-SENDER-TIMEOUT)

   This will require a restart of the PostgreSQL source database.

2) **Create a user for the connector and assign permissions**

   1. Create `<pg connector username>`:

      ```
      psql $SOURCE -c "CREATE USER <pg connector username> PASSWORD '<password>'"
      ```

      You can use an existing user. However, you must ensure that the user has the following permissions.

   2. Grant permissions to create a replication slot:

      ```
      psql $SOURCE -c "ALTER ROLE <pg connector username> REPLICATION"
      ```

   3. Grant permissions to create a publication:

      ```
      psql $SOURCE -c "GRANT CREATE ON DATABASE <database name> TO <pg connector username>"
      ```

   4. Assign the user permissions on the source database:

      ```
      psql $SOURCE <<EOF
      GRANT USAGE ON SCHEMA "public" TO <pg connector username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA "public" TO <pg connector username>;
      ALTER DEFAULT PRIVILEGES IN SCHEMA "public" GRANT SELECT ON TABLES TO <pg connector username>;
      EOF
      ```

      If the tables you are syncing are not in the `public` schema, grant the user permissions for each schema you are syncing:

      ```
      psql $SOURCE <<EOF
      GRANT USAGE ON SCHEMA <schema> TO <pg connector username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA <schema> TO <pg connector username>;
      ALTER DEFAULT PRIVILEGES IN SCHEMA <schema> GRANT SELECT ON TABLES TO <pg connector username>;
      EOF
      ```

   5. On each table you want to sync, make `<pg connector username>` the owner:

      ```
      psql $SOURCE -c 'ALTER TABLE <table name> OWNER TO <pg connector username>;'
      ```

      You can skip this step if the replicating user is already the owner of the tables.

3) **Enable replication DELETE and UPDATE operations**

   Replica identity assists data replication by identifying the rows being modified. Your options are that each table and hypertable in the source database should either have:

   - **A primary key**: data replication defaults to the primary key of the table being replicated. Nothing to do.
   - **A viable unique index**: each table has a unique, non-partial, non-deferrable index that includes only columns marked as `NOT NULL`. If a UNIQUE index does not exist, create one to assist the migration. You can delete it after migration. For each table, set `REPLICA IDENTITY` to the viable unique index:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table name> REPLICA IDENTITY USING INDEX <_index_name>'
     ```
   - **No primary key or viable unique index**: use brute force. For each table, set `REPLICA IDENTITY` to `FULL`:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table_name> REPLICA IDENTITY FULL'
     ```
     For each `UPDATE` or `DELETE` statement, PostgreSQL reads the whole table to find all matching rows. This results in significantly slower replication. If you are expecting a large number of `UPDATE` or `DELETE` operations on the table, best practice is to not use `FULL`.

## Synchronize data to your Tiger Cloud service

To sync data from your PostgreSQL database to your Tiger Cloud service using Tiger Console:

1. **Connect to your Tiger Cloud service**

   In [Tiger Console](https://console.cloud.tigerdata.com/dashboard/services), select the service to sync live data to.

2. **Prepare the source database**

   ![PostgreSQL connector wizard in Tiger Console](/docs/_astro/pg-connector-wizard-tiger-console.BPVoP0DK_Z1kktnR.webp)

   1. Click `Connectors` > `Postgres`.
   2. Set the name for the new connector by clicking the pencil icon.
   3. Check the boxes for `Set wal_level to logical` and `Update your credentials`, then click `Continue`.

3. **Connect the source database and the target service**

   ![PostgreSQL connector connection string in Tiger Console](/docs/_astro/pg-connector-connection-string.DUeVjYuI_1Kk98E.webp)

   1. Enter your database credentials or a PostgreSQL connection string. This is the connection string for [`<pg connector username>`](#tune-your-source-database).

   2. **(Recommended for private databases)** If the source database is not reachable from the public Internet, toggle `Enable SSH tunneling` and route the connector through a bastion host:

      1. Copy the public key shown in the wizard and append it to the `authorized_keys` file of the user the connector will log in as on the bastion host:

         Terminal window

         ```
         echo "<public key from the connector wizard>" >> ~/.ssh/authorized_keys


         # Verify
         cat ~/.ssh/authorized_keys
         ```

      2. Enter the bastion `Hostname`, `User`, and `Port` in the wizard.

   3. Click `Connect to database`. Tiger Console connects to the source database and retrieves the schema information.

4. **Select the data to synchronize**

   ![Starting the PostgreSQL connector in Tiger Console](/docs/_astro/pg-connector-start-tiger-console.DPSXCOf0_AQ7bm.webp)

   Choose where the connector picks tables from:

   - **From an existing publication**: select a PostgreSQL `PUBLICATION` name, then pick schemas and tables from those it includes. Use this when you cannot grant the connector user `CREATE` privilege on the source — your DBA can pre-create the publication — or when you want to apply [row or column filters](https://www.postgresql.org/docs/current/logical-replication-row-filter.html) on published tables.
   - **Directly from the source**: select schemas, then pick the tables to sync from them. Tiger Console creates the publication for you.

   Click `Select tables +`. For each selected table, Tiger Console checks the schema and, if possible, suggests the column to use as the time dimension in a hypertable.

5. **Configure Initial Data Copy workers**

   ![Configure Initial Data Copy worker count in Tiger Console](/docs/_astro/pg-connector-configure-idc-worker-count.D6aj0UEK_Z12BfdB.webp)

   Specify the number of parallel connections — Initial Data Copy (IDC) workers — used to process the initial data copy. Higher values can speed up the initial copy but may increase load on the source database. Defaults to `4`.

   Tune this based on the resources available on the source database and the number of tables being synced. Large individual tables still copy through a single connection, so higher worker counts help most when you have many small to medium tables.

   Click `Create Connector`. Tiger Console starts the source PostgreSQL connector between the source database and the target service and displays the progress.

6. **Monitor and manage the connector**

   ![Editing a PostgreSQL connector in Tiger Console](/docs/_astro/edit-pg-connector-tiger-console.CAiEG08p_ZedUDY.webp)

   1. To review the syncing progress for each table, click `Connectors` > `Source connectors`, then select the name of your connector in the table.

   2. To edit the connector, click `Connectors` > `Source connectors`, then select the name of your connector in the table. You can rename the connector, delete or add new tables for syncing.

   3. To pause a connector, click `Connectors` > `Source connectors`, then open the three-dot menu on the right and select `Pause`.

   4. To delete a connector, click `Connectors` > `Source connectors`, then open the three-dot menu on the right and select `Delete`. You must pause the connector before deleting it.

And that is it, you are using the source PostgreSQL connector to synchronize all the data, or specific tables, from a PostgreSQL database instance to your Tiger Cloud service, in real time.

## Prerequisites for this integration guide

To follow these steps, you'll need:

- A target [Tiger Cloud service](/docs/get-started/quickstart/create-service/index.md).

  Best practice is to create a Tiger Cloud service with at least 8 CPUs for a smoother experience. A higher-spec instance can significantly reduce the overall migration window.

- A migration machine to run the commands that move data from your source database to your target Tiger Cloud service.

  Best practice: use an [Ubuntu EC2 instance](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/EC2_GetStarted.html#ec2-launch-instance) hosted in the same region as your Tiger Cloud service.

- An [adjusted maintenance window](/docs/deploy/tiger-cloud/tiger-cloud-aws/upgrades#define-your-maintenance-window/index.md) to prevent maintenance from running during migration.

* The source PostgreSQL instance and the target Tiger Cloud service must have the same extensions installed.

  The source PostgreSQL connector does not create extensions on the target. If the table uses column types from an extension, first create the extension on the target Tiger Cloud service before syncing the table.

* [Docker](https://docs.docker.com/engine/install/) installed on your sync machine.

  For a better experience, use a 4 CPU/16GB EC2 instance or greater to run the source PostgreSQL connector.

* The [PostgreSQL client tools](/docs/integrate/query-administration/psql/index.md) installed on your sync machine.

  This includes `psql`, `pg_dump`, `pg_dumpall`, and `vacuumdb` commands.

## Limitations

- The schema is not migrated by the source PostgreSQL connector, you use `pg_dump`/`pg_restore` to migrate it.

* Using TimescaleDB as the source has limited support (no CAGGs).

* The source must be running PostgreSQL 13 or later.

* Schema changes must be co-ordinated.

  Make compatible changes to the schema in your Tiger Cloud service first, then make the same changes to the source PostgreSQL instance.

* Ensure that the source PostgreSQL instance and the target Tiger Cloud service have the same extensions installed.

  The source PostgreSQL connector does not create extensions on the target. If the table uses column types from an extension, first create the extension on the target Tiger Cloud service before syncing the table.

* There is WAL volume growth on the source PostgreSQL instance during large table copy.

* Continuous aggregate invalidation

  The connector uses `session_replication_role=replica` during data replication, which prevents table triggers from firing. This includes the internal triggers that mark continuous aggregates as invalid when underlying data changes.

  If you have continuous aggregates on your target database, they do not automatically refresh for data inserted during the migration. This limitation only applies to data below the continuous aggregate's materialization watermark. For example, backfilled data. New rows synced above the continuous aggregate watermark are used correctly when refreshing.

  This can lead to:

  - Missing data in continuous aggregates for the migration period.
  - Stale aggregate data.
  - Queries returning incomplete results.

  If the continuous aggregate exists in the source database, best practice is to add it to the PostgreSQL connector publication. If it only exists on the target database, manually refresh the continuous aggregate using the `force` option of [refresh\_continuous\_aggregate](/docs/reference/timescaledb/continuous-aggregates/refresh_continuous_aggregate#samples/index.md).

## Set your connection strings

The `<user>` in the `SOURCE` connection must have the replication role granted in order to create a replication slot.

These variables hold the connection information for the source database and target Tiger Cloud service. In Terminal on your migration machine, set the following:

Terminal window

```
export SOURCE="postgres://<user>:<password>@<source host>:<source port>/<db_name>"
export TARGET="postgres://tsdbadmin:<PASSWORD>@<HOST>:<PORT>/tsdb?sslmode=require"
```

You find the connection information for your Tiger Cloud service in the configuration file you downloaded when you created the service.

Important

Avoid using connection strings that route through connection poolers like PgBouncer or similar tools. This tool requires a direct connection to the database to function properly.

## Tune your source database

- [From AWS RDS/Aurora](#tab-panel-605)
- [From PostgreSQL](#tab-panel-606)

Updating parameters on a PostgreSQL instance will cause an outage. Choose a time that will cause the least issues to tune this database.

1. **Update the DB instance parameter group for your source database**

   1. In <https://console.aws.amazon.com/rds/home#databases:>, select the RDS instance to migrate.

   2. Click `Configuration`, scroll down and note the `DB instance parameter group`, then click `Parameter groups`

      ![RDS parameter groups in the AWS console](/docs/_astro/awsrds-parameter-groups.CiXrFBVV_1Diea.webp)

   3. Click `Create parameter group`, fill in the form with the following values, then click `Create`.

      - **Parameter group name** - whatever suits your fancy.
      - **Description** - knock yourself out with this one.
      - **Engine type** - `PostgreSQL`
      - **Parameter group family** - the same as `DB instance parameter group` in your `Configuration`.

   4. In `Parameter groups`, select the parameter group you created, then click `Edit`.

   5. Update the following parameters, then click `Save changes`.

      - `rds.logical_replication` set to `1`: record the information needed for logical decoding.
      - `wal_sender_timeout` set to `0`: disable the timeout for the sender process.

   6. In RDS, navigate back to your [databases](https://console.aws.amazon.com/rds/home#databases:), select the RDS instance to migrate, and click `Modify`.

   7. Scroll down to `Database options`, select your new parameter group, and click `Continue`.

   8. Click `Apply immediately` or choose a maintenance window, then click `Modify DB instance`.

      Changing parameters will cause an outage. Wait for the database instance to reboot before continuing.

   9. Verify that the settings are live in your database.

2. **Enable replication `DELETE` and `UPDATE` operations**

   Replica identity assists data replication by identifying the rows being modified. Your options are that each table and hypertable in the source database should either have:

   - **A primary key**: data replication defaults to the primary key of the table being replicated. Nothing to do.
   - **A viable unique index**: each table has a unique, non-partial, non-deferrable index that includes only columns marked as `NOT NULL`. If a UNIQUE index does not exist, create one to assist the migration. You can delete it after migration. For each table, set `REPLICA IDENTITY` to the viable unique index:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table name> REPLICA IDENTITY USING INDEX <_index_name>'
     ```
   - **No primary key or viable unique index**: use brute force. For each table, set `REPLICA IDENTITY` to `FULL`:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table_name> REPLICA IDENTITY FULL'
     ```
     For each `UPDATE` or `DELETE` statement, PostgreSQL reads the whole table to find all matching rows. This results in significantly slower replication. If you are expecting a large number of `UPDATE` or `DELETE` operations on the table, best practice is to not use `FULL`.

1) **Tune the Write Ahead Log (WAL) on the PostgreSQL source database**

   ```
   psql $SOURCE <<EOF
   ALTER SYSTEM SET wal_level='logical';
   ALTER SYSTEM SET max_wal_senders=10;
   ALTER SYSTEM SET wal_sender_timeout=0;
   EOF
   ```

   - [GUC "wal\_level" as "logical"](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-WAL-LEVEL)
   - [GUC "max\_wal\_senders" as 10](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-WAL-SENDERS)
   - [GUC "wal\_sender\_timeout" as 0](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-WAL-SENDER-TIMEOUT)

   This will require a restart of the PostgreSQL source database.

2) **Create a user for the connector and assign permissions**

   1. Create `<pg connector username>`:

      ```
      psql $SOURCE -c "CREATE USER <pg connector username> PASSWORD '<password>'"
      ```

      You can use an existing user. However, you must ensure that the user has the following permissions.

   2. Grant permissions to create a replication slot:

      ```
      psql $SOURCE -c "ALTER ROLE <pg connector username> REPLICATION"
      ```

   3. Grant permissions to create a publication:

      ```
      psql $SOURCE -c "GRANT CREATE ON DATABASE <database name> TO <pg connector username>"
      ```

   4. Assign the user permissions on the source database:

      ```
      psql $SOURCE <<EOF
      GRANT USAGE ON SCHEMA "public" TO <pg connector username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA "public" TO <pg connector username>;
      ALTER DEFAULT PRIVILEGES IN SCHEMA "public" GRANT SELECT ON TABLES TO <pg connector username>;
      EOF
      ```

      If the tables you are syncing are not in the `public` schema, grant the user permissions for each schema you are syncing:

      ```
      psql $SOURCE <<EOF
      GRANT USAGE ON SCHEMA <schema> TO <pg connector username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA <schema> TO <pg connector username>;
      ALTER DEFAULT PRIVILEGES IN SCHEMA <schema> GRANT SELECT ON TABLES TO <pg connector username>;
      EOF
      ```

   5. On each table you want to sync, make `<pg connector username>` the owner:

      ```
      psql $SOURCE -c 'ALTER TABLE <table name> OWNER TO <pg connector username>;'
      ```

      You can skip this step if the replicating user is already the owner of the tables.

3) **Enable replication DELETE and UPDATE operations**

   Replica identity assists data replication by identifying the rows being modified. Your options are that each table and hypertable in the source database should either have:

   - **A primary key**: data replication defaults to the primary key of the table being replicated. Nothing to do.
   - **A viable unique index**: each table has a unique, non-partial, non-deferrable index that includes only columns marked as `NOT NULL`. If a UNIQUE index does not exist, create one to assist the migration. You can delete it after migration. For each table, set `REPLICA IDENTITY` to the viable unique index:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table name> REPLICA IDENTITY USING INDEX <_index_name>'
     ```
   - **No primary key or viable unique index**: use brute force. For each table, set `REPLICA IDENTITY` to `FULL`:
     ```
     psql -X -d $SOURCE -c 'ALTER TABLE <table_name> REPLICA IDENTITY FULL'
     ```
     For each `UPDATE` or `DELETE` statement, PostgreSQL reads the whole table to find all matching rows. This results in significantly slower replication. If you are expecting a large number of `UPDATE` or `DELETE` operations on the table, best practice is to not use `FULL`.

## Migrate the table schema to the Tiger Cloud service

Use `pg_dump` to:

1. **Download the schema from the source database**

   Terminal window

   ```
   pg_dump $SOURCE \
   --no-privileges \
   --no-owner \
   --no-publications \
   --no-subscriptions \
   --no-table-access-method \
   --no-tablespaces \
   --schema-only \
   --file=schema.sql
   ```

2. **Apply the schema on the target service**

   Terminal window

   ```
   psql $TARGET -f schema.sql
   ```

## Convert partitions and tables with time-series data into hypertables

For efficient querying and analysis, you can convert tables which contain time-series or events data, and tables that are already partitioned using PostgreSQL declarative partition into [hypertables](/docs/learn/hypertables/understand-hypertables/index.md).

1. **Convert tables to hypertables**

   Run the following on each table in the target Tiger Cloud service to convert it to a hypertable:

   Terminal window

   ```
   psql -X -d $TARGET -c "SELECT public.create_hypertable('<table>', by_range('<partition column>', '<chunk interval>'::interval));"
   ```

   For example, to convert the *metrics* table into a hypertable with *time* as a partition column and *1 day* as a partition interval:

   Terminal window

   ```
   psql -X -d $TARGET -c "SELECT public.create_hypertable('public.metrics', by_range('time', '1 day'::interval));"
   ```

2. **Convert PostgreSQL partitions to hypertables**

   Rename the partition and create a new regular table with the same name as the partitioned table, then convert to a hypertable:

   Terminal window

   ```
   psql $TARGET -f - <<'EOF'
      BEGIN;
      ALTER TABLE public.events RENAME TO events_part;
      CREATE TABLE public.events(LIKE public.events_part INCLUDING ALL);
      SELECT create_hypertable('public.events', by_range('time', '1 day'::interval));
      COMMIT;
   EOF
   ```

## Specify the tables to synchronize

After the schema is migrated, you [`CREATE PUBLICATION`](https://www.postgresql.org/docs/current/sql-createpublication.html) on the source database that specifies the tables to synchronize.

1. **Create a publication that specifies the table to synchronize**

   A `PUBLICATION` enables you to synchronize some or all the tables in the schema or database.

   ```
   CREATE PUBLICATION <publication_name> FOR TABLE <table_name>, <table_name>;
   ```

   To add tables after to an existing publication, use [ALTER PUBLICATION](https://www.postgresql.org/docs/current/sql-alterpublication.html)\*\*

   ```
   ALTER PUBLICATION <publication_name> ADD TABLE <table_name>;
   ```

2. **Publish the PostgreSQL declarative partitioned table**

   Leave `publish_via_partition_root` at its default (`false`). Since live-sync v0.23.0, this copies the partitioned table's leaf partitions in parallel and makes the initial copy resumable, instead of streaming the whole table through a single connection:

   ```
   ALTER PUBLICATION <publication_name> SET(publish_via_partition_root=false);
   ```

   Only set it to `true` if you need the older serial, whole-table copy:

   ```
   ALTER PUBLICATION <publication_name> SET(publish_via_partition_root=true);
   ```

   To convert a partitioned table to a hypertable, follow [Convert partitions and tables with time-series data into hypertables](#convert-partitions-and-tables-with-time-series-data-into-hypertables).

3. **Stop syncing a table in the `PUBLICATION`, use `DROP TABLE`**

   ```
   ALTER PUBLICATION <publication_name> DROP TABLE <table_name>;
   ```

## Synchronize data to your Tiger Cloud service

You use the source PostgreSQL connector docker image to synchronize changes in real time from a PostgreSQL database instance to a Tiger Cloud service:

1. **Start the source PostgreSQL connector**

   As you run the source PostgreSQL connector continuously, best practice is to run it as a Docker daemon.

   Terminal window

   ```
   docker run -d --rm --name livesync timescale/live-sync:latest run \
      --publication <publication_name> --subscription <subscription_name> \
      --source $SOURCE --target $TARGET --table-map <table_map_as_json>
   ```

   - `version-tag`: The latest available version tag of the live-sync image. See [Docker Hub](https://hub.docker.com/r/timescale/live-sync).

   - `--publication`: The name of the publication as you created in the previous step. To use multiple publications, repeat the `--publication` flag.

   - `--subscription`: The name that identifies the subscription on the target Tiger Cloud service.

   - `--source`: The connection string to the source PostgreSQL database.

   - `--target`: The connection string to the target Tiger Cloud service.

   - `--table-map`: (Optional) A JSON string that maps source tables to target tables. If not provided, the source and target table names are assumed to be the same.

     For example, to map the source table `metrics` to the target table `metrics_data`:

     `--table-map '{"source": {"schema": "public", "table": "metrics"}, "target": {"schema": "public", "table": "metrics_data"}}'`

     To map only the schema, use:

     `--table-map '{"source": {"schema": "public"}, "target": {"schema": "analytics"}}'`

     This flag can be repeated for multiple table mappings.

   - `--table-sync-workers`: (Optional) The number of parallel workers to use for initial table sync. Default is 4.

   - `--copy-data`: (Optional) By default, the initial table data is copied from source to target before starting logical replication. Set to `false` so only changes made after replication slot creation are replicated. Best practice is to set to `false` during dry-run livesync so you do not copy table data.

2. **Capture logs**

   Once the source PostgreSQL connector is running as a docker daemon, you can also capture the logs:

   Terminal window

   ```
   docker logs -f livesync
   ```

3. **View the progress of tables being synchronized**

   List the tables being synchronized by the source PostgreSQL connector using the `_ts_live_sync.subscription_rel` table in the target Tiger Cloud service:

   Terminal window

   ```
   psql $TARGET -c "SELECT * FROM _ts_live_sync.subscription_rel"
   ```

   You see something like the following:

   ```
    subname  | pubname   | schemaname | tablename | rrelid | state |    lsn     |          updated_at           | last_error |          created_at           | rows_copied | approximate_rows | bytes_copied | approximate_size | target_schema | target_table
   ----------+-----------+------------+-----------+--------+-------+------------+-------------------------------+------------+-------------------------------+-------------+------------------+--------------+------------------+---------------+-------------
    livesync | analytics | public     | metrics   |  20856 | r     | 6/1A8CBA48 | 2025-06-24 06:16:21.434898+00 |            | 2025-06-24 06:03:58.172946+00 |    18225440 |         18225440 |   1387359359 |       1387359359 | public        | metrics
   ```

   The `state` column indicates the current state of the table synchronization. Possible values for `state` are:

   | state | description                                                             |
   | ----- | ----------------------------------------------------------------------- |
   | i     | initial state, table data sync not started                              |
   | d     | initial table data sync is in progress                                  |
   | f     | initial table data sync completed, catching up with incremental changes |
   | s     | synchronized, waiting for the main apply worker to take over            |
   | r     | table is ready, applying changes in real-time                           |

   To see the replication lag, run the following against the SOURCE database:

   Terminal window

   ```
   psql $SOURCE -f - <<'EOF'
   SELECT
      slot_name,
      pg_size_pretty(pg_current_wal_flush_lsn() - confirmed_flush_lsn) AS lag
   FROM pg_replication_slots
   WHERE slot_name LIKE 'live_sync_%' AND slot_type = 'logical'
   EOF
   ```

4. **Add or remove tables from the publication**

   To add tables, use [ALTER PUBLICATION .. ADD TABLE](https://www.postgresql.org/docs/current/sql-alterpublication.html)\*\*

   ```
   ALTER PUBLICATION <publication_name> ADD TABLE <table_name>;
   ```

   To remove tables, use [ALTER PUBLICATION .. DROP TABLE](https://www.postgresql.org/docs/current/sql-alterpublication.html)\*\*

   ```
   ALTER PUBLICATION <publication_name> DROP TABLE <table_name>;
   ```

5. **Update table statistics**

   If you have a large table, you can run `ANALYZE` on the target Tiger Cloud service to update the table statistics after the initial sync is complete.

   This helps the query planner make better decisions for query execution plans.

   Terminal window

   ```
   vacuumdb --analyze --verbose --dbname=$TARGET
   ```

6. **Stop the source PostgreSQL connector**

   Terminal window

   ```
   docker stop live-sync
   ```

7. **(Optional) Reset sequence nextval on the target Tiger Cloud service**

   The source PostgreSQL connector does not automatically reset the sequence nextval on the target Tiger Cloud service.

   Run the following script to reset the sequence for all tables that have a serial or identity column in the target Tiger Cloud service:

   Terminal window

   ```
   psql $TARGET -f - <<'EOF'
      DO $$
   DECLARE
     rec RECORD;
   BEGIN
     FOR rec IN (
       SELECT
         sr.target_schema  AS table_schema,
         sr.target_table   AS table_name,
         col.column_name,
         pg_get_serial_sequence(
           sr.target_schema || '.' || sr.target_table,
           col.column_name
         ) AS seqname
       FROM _ts_live_sync.subscription_rel AS sr
       JOIN information_schema.columns AS col
         ON col.table_schema = sr.target_schema
        AND col.table_name   = sr.target_table
       WHERE col.column_default LIKE 'nextval(%'  -- only serial/identity columns
     ) LOOP
       EXECUTE format(
         'SELECT setval(%L,
            COALESCE((SELECT MAX(%I) FROM %I.%I), 0) + 1,
            false
          );',
         rec.seqname,       -- the sequence identifier
         rec.column_name,   -- the column to MAX()
         rec.table_schema,  -- schema for MAX()
         rec.table_name     -- table for MAX()
       );
     END LOOP;
   END;
   $$ LANGUAGE plpgsql;
   EOF
   ```

8. **Clean up**

   When you are permanently done syncing (for example, you have fully cut over to Tiger Cloud, or you want to restart the sync from scratch), use the `drop` sub-command to remove the replication slot from the source and all subscription state from the target:

   Terminal window

   ```
   docker run -it --rm --name livesync timescale/live-sync:latest drop \
      --subscription <subscription_name> --source $SOURCE --target $TARGET
   ```

   `drop` removes:

   - The replication slot (`live_sync_<subscription_name>`) on the **source** database.
   - The subscription and internal tracking tables (`_ts_live_sync.*`) on the **target** Tiger Cloud service.

   Warning

   Run `drop` only when you intend to permanently stop or fully restart the sync. Dropping the replication slot while the source is still active causes WAL to be retained until the slot is recreated, which can grow disk usage on the source.