Early Access: The content on this website is provided for informational purposes only in connection with pre-General Availability Qlik Products.
All content is subject to change and is provided without warranty.
Skip to main content Skip to complementary content

Target update methods

You can set which update method to use, either during the initial task setup, or later in the data task Settings. It is not possible to change the update method once the data task has been prepared.

The available update methods are determined by task type, connector capabilities, and connector classification. The available data movement task types are landing, replication, or landing data in a data lake.

For information about your connector's classification and capabilities, see the help for your connector. Connector classification is indicated by a badge at the top of the related help page. Connectors without a badge are classified as "Standard".

For a detailed explanation of connector classifications, see Connector classifications.

Information noteWith the exception of Update methods when working with Preview connectors, all of the update methods described in this topic assume that the source connector defined for your data movement task is certified as either Lite or Standard.

Update methods when working with Preview connectors

Preview connectors are supported with Replication projects only. When working with Preview connectors, all data is propagated to the target as changes (Inserts/Updates), including the initial full load. Consequently, the only supported update method for Preview connectors are as follows:

  • For "Replicate data" tasks: The Apply changes and Store changes replication modes only. All data will propagated to the target/change tables as changes (Inserts/Updates), including the initial full load. Note that with certain table types, it is not possible to identify changes. In such cases, all of the table's data will be processed from the beginning.
  • For "Land data in data lake" tasks: The Change data capture (CDC) update method only. All data will propagated to the target as changes (Inserts/Updates), including the initial full load. Note that with certain table types, it is not possible to identify changes. In such cases, all of the table's data will be processed from the beginning.
Information noteCapture and propagation of delete operations to the target is not supported.

When working with Preview connectors you need to schedule how often to capture changes from the source. For more information, see the following topics:

Update methods when landing data

  • Change data capture (CDC) using Change Tables

    The data task starts with a full load. The target data is then kept up-to-date using incremental loading based on date fields. CDC may not be supported by all data sources.

    Information noteDELETE operations are not supported. This means that if a row is deleted in the source, it will not be deleted in the landing data. If delete handling is important, use Reload and compare instead.

    When working with Data Movement gateway and landing data from SaaS applications, you set the interval between reading changes from the source, in Settings > Change processing tuning. When working without Data Movement gateway, you set the interval using the Scheduler. For more information, see Scheduling CDC tasks when working without Data Movement gateway.

  • Reload and compare

    The data task performs full loads only from the source. This is useful if your source does not support CDC, for example, or if you want DELETE operations (which are not supported by CDC) to be propagated to the target. Reload and compare can be used with any supported data source, and can be scheduled to occur periodically.

Update methods when replicating to database or data warehouse targets

  • Full load: Loads the data from the selected source tables to the target platform and creates the target tables if necessary. The full load occurs automatically when the task is started, but can also be performed manually should the need arise. Manual full load would be required, for example, if you need to replicate updates to Views (which are not captured during CDC) or if you are replicating from a data source that does not support CDC.

  • Apply changes: Keeps the target tables up-to-date with any changes made to the source tables.

  • Store changes: Stores the changes to the source tables in Change Tables (one per source table).

    For more information, see Store changes.

When working with Data Movement gateway, changes are captured from the source in near real-time. When working without Data Movement gateway (for example, with a Qlik Talend Cloud Starter subscription or when selecting None), changes are captured according to the scheduler settings. For more information, see Data replication task settings.

Update methods when replicating to cloud storage (data lakes)

  • Change data capture (CDC) using Change Tables: Data lake landing tasks starts with a full load (during which all of the selected tables are loaded to the target). The target data is then kept up-to-date using CDC (Change Data Capture) technology.

    Information noteCDC (Change Data Capture) of DDL operations is not supported.

    When working with Data Movement gateway, changes are captured from the source in near real-time. When working without Data Movement gateway, changes are captured according to the scheduler settings. For more information, see Settings for cloud storage targets.

  • Reload: Performs a full load of the data from the selected source tables to the target platform and creates the target tables if necessary. The full load occurs automatically when the task is started, but can also be performed manually or scheduled to occur periodically as needed.

Information note

The procedure for setting up replication to cloud storage differs according to your subscription tier.

Understanding scheduled change data capture (CDC)

When working without Data Movement gateway or when using Preview connectors, changes are captured according to a scheduled interval. It is important to be aware of how the scheduling works, which is best demonstrated by way of example. In the following example, a task has been scheduled to run every 30 minutes, starting at 9:00.

  • The task starts at 9:00 with a full load.
  • The full load ends at 9:40, meaning that the 9:30 run will be skipped.
  • The next run starts at 10:00, and captures any changes committed until 10:00.
  • The 10:00 run ends at 10:15.
  • The next run starts at 10:30 and captures any changes that occurred between 10:00 and 10:30.
Information noteDepending on the load on the system, the number of concurrent running tasks, and how many pods are available, the next scheduled instance might not start at exactly the scheduled time.

Limitations

Some tables returned by the SaaS application are not supported by Change data capture (CDC). In this case you will see a warning message in Validation errors. You can either:

  • Delete the table from the data task.

  • Change the update method of the data task to Reload and compare.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!