Reverse ETL

Reverse ETL is the data movement engine in Zeotap. A reverse ETL sync connects a Model (your data) to a Destination (where you want to send it) and runs on a schedule to keep the destination up to date.

What Is a Reverse ETL Sync?

A reverse ETL sync is a configured pipeline that:

Executes a model’s SQL query against your warehouse
Compares the results to the previous run to detect changes (new, updated, and deleted records)
Sends those changes to the destination using the appropriate API
Records the results (rows synced, errors, duration) for monitoring

Reverse ETL syncs are the final step in Zeotap’s data pipeline:


Source → Model → Reverse ETL Sync → Destination

Key Concepts

Sync Mode

The sync mode determines how Zeotap handles data at the destination:

Mode	Behavior
Upsert	Create new records or update existing ones. Never deletes.
Mirror	Keep the destination in perfect sync — creates, updates, and deletes records.
Append	Only add new records. Never updates or deletes.
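As a rough illustration of the table above, the following sketch encodes which kinds of change each mode forwards to the destination. The names `ALLOWED_OPS` and `filter_operations` are hypothetical and exist only for this example; they are not part of the Zeotap API.

```python
# Hypothetical sketch: which change types each sync mode forwards to the destination.
ALLOWED_OPS = {
    "upsert": {"create", "update"},            # never deletes
    "mirror": {"create", "update", "delete"},  # creates, updates, and deletes
    "append": {"create"},                      # never updates or deletes
}


def filter_operations(mode, operations):
    """Keep only the (op_type, record) pairs that the given mode permits."""
    allowed = ALLOWED_OPS[mode]
    return [(op, rec) for op, rec in operations if op in allowed]


ops = [("create", {"id": 1}), ("update", {"id": 2}), ("delete", {"id": 3})]
print(filter_operations("append", ops))
```

Only mirror mode lets the delete through, which is why it is the mode to enable last.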

Field Mapping

Field mapping defines how columns from your model map to fields in the destination. For example, the model column email might map to the Salesforce field Email, or the model column lifetime_value might map to a HubSpot property ltv.
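A minimal sketch of what applying a field mapping to one model row could look like. The mapping entries reuse the `source`/`destination` shape from the create-sync request later on this page; the function name `apply_field_mappings` and the choice to drop unmapped columns are assumptions made for illustration.

```python
# Same shape as the "field_mappings" array in the create-sync request.
FIELD_MAPPINGS = [
    {"source": "email", "destination": "Email"},
    {"source": "lifetime_value", "destination": "LTV__c"},
]


def apply_field_mappings(row, mappings):
    """Rename model columns to destination fields.

    Assumption: columns without a mapping are not sent to the destination.
    """
    return {m["destination"]: row[m["source"]] for m in mappings if m["source"] in row}


model_row = {"email": "ada@example.com", "lifetime_value": 1250.0, "internal_flag": True}
print(apply_field_mappings(model_row, FIELD_MAPPINGS))
```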

Scheduling

Scheduling determines when and how often the sync runs. Options include cron expressions for precise timing and interval-based schedules for regular cadences.
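The two schedule styles might be represented as follows. The interval shape is taken from the create-sync example on this page; the cron shape (the `"cron"` type and the `expression` key) is a guess at what such a configuration could look like, so check the scheduling guide for the real field names. The helper `next_interval_run` is hypothetical and only handles interval schedules.

```python
from datetime import datetime, timedelta

# Interval shape as used in the create-sync example on this page.
interval_schedule = {"type": "interval", "interval_minutes": 60}

# Hypothetical cron shape; the real key names may differ.
cron_schedule = {"type": "cron", "expression": "0 6 * * *"}  # every day at 06:00


def next_interval_run(schedule, last_run):
    """Return the next run time for an interval schedule."""
    if schedule["type"] != "interval":
        raise ValueError("only interval schedules are handled in this sketch")
    return last_run + timedelta(minutes=schedule["interval_minutes"])


print(next_interval_run(interval_schedule, datetime(2024, 1, 1, 9, 0)))
```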

Sync Runs

Each execution of a reverse ETL sync produces a sync run record with detailed information about what happened: how many rows were processed, how many were created, updated, or deleted, any errors, and the total duration.

How Reverse ETL Syncs Work

Step 1: Query Execution

When a reverse ETL sync runs, Zeotap executes the model’s SQL query against the source warehouse. The full result set is retrieved and staged for comparison.

Step 2: Change Detection

Zeotap compares the current result set with the previous run’s snapshot using the model’s primary key:

New rows — Primary key values present in the current run but not the previous run
Updated rows — Primary key values present in both runs but with different attribute values
Deleted rows — Primary key values present in the previous run but not the current run (relevant for mirror mode only)
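The comparison described above can be sketched as a dictionary diff keyed on the primary key. This is an illustrative model of the logic, not Zeotap's actual implementation; `detect_changes` and the sample rows are made up for the example.

```python
def detect_changes(previous, current, primary_key):
    """Diff two result sets (lists of row dicts) by primary key."""
    prev_by_key = {row[primary_key]: row for row in previous}
    curr_by_key = {row[primary_key]: row for row in current}

    # In the current run but not the previous one.
    new = [row for key, row in curr_by_key.items() if key not in prev_by_key]
    # In both runs, but at least one attribute value differs.
    updated = [
        row for key, row in curr_by_key.items()
        if key in prev_by_key and row != prev_by_key[key]
    ]
    # In the previous run but missing from the current one.
    deleted = [row for key, row in prev_by_key.items() if key not in curr_by_key]
    return {"new": new, "updated": updated, "deleted": deleted}


previous_run = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": "b@example.com"},
]
current_run = [
    {"id": 2, "email": "b+new@example.com"},
    {"id": 3, "email": "c@example.com"},
]
print(detect_changes(previous_run, current_run, "id"))
```

Here row 3 is new, row 2 is updated, and row 1 is deleted. The deleted row would only result in a destination write in mirror mode.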

Step 3: Destination Writes

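Based on the sync mode, Zeotap sends the appropriate operations to the destination API. Upsert sends creates and updates, mirror sends all three operation types, and append sends creates only: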

Creates — New records are inserted into the destination
Updates — Existing records are updated with changed attribute values
Deletes — Records are removed from the destination (mirror mode only)

Operations are sent in batches for efficiency, with automatic retry logic for transient failures.
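To make batching and retrying concrete, here is a generic sketch of the pattern. The batch size, attempt count, and exponential backoff are illustrative assumptions; this page does not state the values or the retry strategy Zeotap uses. `TransientError`, `chunked`, and `send_in_batches` are hypothetical names.

```python
import time


class TransientError(Exception):
    """Stand-in for a retryable failure such as a timeout or rate limit."""


def chunked(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def send_in_batches(operations, send_batch, batch_size=100, max_attempts=3, base_delay=0.5):
    """Send operations batch by batch, retrying each batch on TransientError.

    Returns the number of operations sent. Re-raises the error once a batch
    has failed max_attempts times.
    """
    sent = 0
    for batch in chunked(operations, batch_size):
        for attempt in range(1, max_attempts + 1):
            try:
                send_batch(batch)
                sent += len(batch)
                break
            except TransientError:
                if attempt == max_attempts:
                    raise
                # Exponential backoff: base_delay, 2x, 4x, ...
                time.sleep(base_delay * 2 ** (attempt - 1))
    return sent
```

The caller supplies `send_batch`, a function that writes one batch to the destination and raises `TransientError` for failures worth retrying.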

Step 4: Results Recording

After the reverse ETL sync completes, Zeotap records:

Total rows processed
Rows created, updated, and deleted
Rows that failed with errors
Total duration
Any error details for failed rows
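The recorded values could be modeled with a small data class like the one below. The class and field names are hypothetical and do not reflect the actual shape of a sync run record returned by the API.

```python
from dataclasses import dataclass, field


@dataclass
class SyncRunSummary:
    """Hypothetical container for the values listed above."""
    rows_processed: int = 0
    rows_created: int = 0
    rows_updated: int = 0
    rows_deleted: int = 0
    rows_failed: int = 0
    duration_seconds: float = 0.0
    errors: list = field(default_factory=list)  # one entry per failed row

    def record_failure(self, primary_key, message):
        """Count a failed row and keep its error detail."""
        self.rows_failed += 1
        self.errors.append({"primary_key": primary_key, "message": message})


run = SyncRunSummary(rows_processed=3, rows_created=1, rows_updated=1, duration_seconds=4.2)
run.record_failure(primary_key=42, message="missing required field: Email")
print(run)
```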

Reverse ETL Sync Lifecycle

1. Created

The reverse ETL sync is configured with a model, destination, field mapping, sync mode, and schedule. It is saved but has not yet run.

2. Active

The reverse ETL sync is running on its configured schedule. Each execution produces a sync run record.

3. Paused

The reverse ETL sync is temporarily stopped. No new runs are triggered, but historical run data is preserved. You can resume the sync at any time.

4. Error State

If a reverse ETL sync encounters persistent failures (for example, expired destination authentication or a failing model query), it enters an error state. The sync is effectively paused until the underlying issue is resolved.

5. Archived

The reverse ETL sync is no longer in use and has been archived. It can be restored later if required.
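One way to picture the lifecycle is as a small state machine. The transition table below is inferred from the descriptions above and contains assumptions, in particular which state a restored sync returns to and whether a sync in an error state can be paused. Treat it as a reading aid, not a specification.

```python
# Hypothetical transition table inferred from the lifecycle descriptions.
TRANSITIONS = {
    "created": {"active", "archived"},
    "active": {"paused", "error", "archived"},
    "paused": {"active", "archived"},
    "error": {"active", "paused", "archived"},  # assumption: returns to active once fixed
    "archived": {"paused"},                     # assumption: a restored sync comes back paused
}


def can_transition(current, target):
    """Return True if the sketch allows moving from `current` to `target`."""
    return target in TRANSITIONS.get(current, set())


print(can_transition("paused", "active"))
print(can_transition("created", "error"))
```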

API Reference

Reverse ETL syncs are managed through the Zeotap REST API:


# List all syncs
GET /api/v1/syncs
 
# Get a single sync
GET /api/v1/syncs/{id}
 
# Create a sync
POST /api/v1/syncs
 
# Update a sync
PUT /api/v1/syncs/{id}
 
# Delete a sync
DELETE /api/v1/syncs/{id}
 
# Trigger a manual run
POST /api/v1/syncs/{id}/trigger
 
# Pause a sync
POST /api/v1/syncs/{id}/pause
 
# Resume a sync
POST /api/v1/syncs/{id}/resume
 
# List sync runs
GET /api/v1/syncs/{id}/runs
 
# Get a specific run
GET /api/v1/syncs/{id}/runs/{run_id}

Example: Create a Reverse ETL Sync


curl -X POST https://composable.zeotap.com/api/v1/syncs \
  -H "Authorization: Bearer $API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Active Customers to Salesforce",
    "model_id": "mod_abc123",
    "destination_id": "dst_xyz789",
    "mode": "upsert",
    "schedule": {
      "type": "interval",
      "interval_minutes": 60
    },
    "field_mappings": [
      {"source": "email", "destination": "Email"},
      {"source": "first_name", "destination": "FirstName"},
      {"source": "last_name", "destination": "LastName"},
      {"source": "lifetime_value", "destination": "LTV__c"}
    ]
  }'
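For scripted use, the same request can be assembled in Python with the standard library. The sketch below only builds the requests so that it runs without network access; the sync ID `sync_123` is a placeholder, and the response format is not documented here, so no parsing is shown. The helper name `build_request` is hypothetical.

```python
import json
import os
import urllib.request

BASE_URL = "https://composable.zeotap.com/api/v1"


def build_request(method, path, token, payload=None):
    """Build (but do not send) an authenticated request with an optional JSON body."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    request = urllib.request.Request(BASE_URL + path, data=data, method=method)
    request.add_header("Authorization", f"Bearer {token}")
    if data is not None:
        request.add_header("Content-Type", "application/json")
    return request


token = os.environ.get("API_TOKEN", "example-token")

create = build_request("POST", "/syncs", token, {
    "name": "Active Customers to Salesforce",
    "model_id": "mod_abc123",
    "destination_id": "dst_xyz789",
    "mode": "upsert",
    "schedule": {"type": "interval", "interval_minutes": 60},
    "field_mappings": [
        {"source": "email", "destination": "Email"},
        {"source": "lifetime_value", "destination": "LTV__c"},
    ],
})

# Placeholder ID; use the ID of the sync you created.
trigger = build_request("POST", "/syncs/sync_123/trigger", token)
runs = build_request("GET", "/syncs/sync_123/runs", token)

# To actually send a request: urllib.request.urlopen(create)
print(create.get_method(), create.full_url)
```

Triggering a manual run and then listing the runs follows the first best practice below: verify the sync once before relying on its schedule.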

Best Practices

Start with manual runs — Before setting up a schedule, trigger a manual run to verify the reverse ETL sync works correctly with your field mapping
Use upsert mode first — Upsert is the safest mode. Only use mirror mode when you’re confident about deletes.
Monitor initial runs — Watch the first few sync runs to catch mapping issues or unexpected data transformations
Set appropriate schedules — Match the sync frequency to your business needs. Not everything needs to sync every hour.
Handle errors promptly — Review failed rows and fix the underlying issues (data format mismatches, missing required fields, API limits)
Use descriptive names — Name reverse ETL syncs after their purpose: “CRM Contacts - Daily” or “Google Ads Audience - Hourly”

Next Steps

Create a reverse ETL sync — Step-by-step guide
Map fields — Configure column-to-field mapping
Set a schedule — Configure when reverse ETL syncs run
Understand sync modes — Upsert, mirror, and append explained
Monitor sync runs — Track run history and results
Troubleshoot issues — Common errors and fixes