Structured data connection

If you plan to create datasets that process data from structured data sources (databases), you must complete additional tasks to enable this processing. Fusion accesses structured data sources either through a built-in structured data processor or through a connection to Structured Data Manager. Review the requirements for each option and complete the connection tasks appropriate for your environment.

Built-in structured data processor connection tasks

NOTE: JDBC is not supported by the built-in structured data processor.

Fusion includes a built-in adapter to access and process supported structured data types using both off-cloud agent clusters and cloud agent clusters.

Prior to creating agent clusters, sources, and datasets that use the built-in structured data processor, review the following.

  • Agent clusters assigned to process structured data must have the "Enable Structured Data Manager integration" option disabled. This ensures that the agents in the cluster are not expecting a connection to Structured Data Manager.

  • A user must be defined to access the structured data sources. On the Connection page of the Source creation wizard, specify a user that has the appropriate permissions to access and read from the selected data type and connection.

Structured Data Manager connection tasks

Fusion can process supported structured data types by connecting to Structured Data Manager and using off-cloud agent clusters.

Requirements

Prior to beginning the Structured Data Manager connection tasks, you must have the following in place.

  • Structured Data Manager 23.2.0 fully installed, with Discovery enabled and the Discovery Service installed.

  • The Fusion processing agents intended to process structured data must be installed within the same network as Structured Data Manager.

  • The Fusion processing agents intended to process structured data must be organized into one or more agent clusters.

  • Agent clusters assigned to process data from Structured Data Manager must have the "Enable Structured Data Manager integration" option enabled. If this option is not enabled, Fusion will use the built-in structured data processor instead and you will not be able to perform additional Structured Data Manager actions on data processed by the built-in processor.
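
The effect of the "Enable Structured Data Manager integration" option can be summarized as a simple rule. The following Python sketch is illustrative only: `AgentCluster`, `select_processor`, and `preflight_warnings` are hypothetical names introduced here and are not part of any Fusion API. It assumes only what this page states, namely that a cluster with the option enabled uses Structured Data Manager, a cluster with it disabled uses the built-in processor, and the Structured Data Manager connection uses off-cloud agent clusters.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AgentCluster:
    """Hypothetical model of an agent cluster (not a Fusion API object)."""
    name: str
    # Mirrors the "Enable Structured Data Manager integration" option.
    sdm_integration_enabled: bool
    # True for an off-cloud agent cluster, False for a cloud agent cluster.
    off_cloud: bool


def select_processor(cluster: AgentCluster) -> str:
    """Return which structured data processor the cluster is set up to use."""
    if cluster.sdm_integration_enabled:
        return "structured-data-manager"
    return "built-in"


def preflight_warnings(cluster: AgentCluster) -> List[str]:
    """Flag settings that conflict with the requirements described on this page."""
    warnings = []
    if cluster.sdm_integration_enabled and not cluster.off_cloud:
        warnings.append(
            f"{cluster.name}: Structured Data Manager connection is documented "
            "for off-cloud agent clusters."
        )
    return warnings


if __name__ == "__main__":
    cluster = AgentCluster("example-cluster", sdm_integration_enabled=False, off_cloud=True)
    print(cluster.name, "->", select_processor(cluster))
    for warning in preflight_warnings(cluster):
        print("WARNING:", warning)
```

A check like this could be run against an inventory of planned clusters before creating sources and datasets, so that each cluster's option matches the processor you intend it to use.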

Configure Structured Data Manager

Structured Data Manager must be configured for connection to Fusion. Perform this configuration on every Structured Data Manager host server.

Configure Fusion for Structured Data Manager connection

The Fusion processing agent must be configured to connect to Structured Data Manager using the generated certificate files. Complete this configuration on each processing agent host that will process structured data.

Complete one of the following tasks to configure the connection, depending on whether you will use a certificate store (more secure) or the individual generated certificate files.

Manage structured data in Fusion

After configuring Structured Data Manager and Fusion, complete the following steps to begin managing your structured data in Fusion.

Re-scan structured data (optional)

You may want to re-scan your structured datasets if the data in the managed tables has significantly increased or changed, if the grammars assigned to the structured datasets have changed, if you added custom grammar rules to datasets that have already been scanned, or if the scan settings in Structured Data Manager have changed.
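
The re-scan conditions above can be expressed as a small checklist. The following Python sketch is a hypothetical illustration, not a Fusion or Structured Data Manager API: the `DatasetChanges` class, its field names, and `rescan_reasons` are assumptions introduced here to show how the four conditions combine. A re-scan is worth considering when any one of them is true.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatasetChanges:
    """Hypothetical record of what changed since the last scan of a structured dataset."""
    data_changed_significantly: bool = False   # managed tables grew or changed significantly
    assigned_grammars_changed: bool = False    # grammars assigned to the dataset changed
    custom_grammar_rules_added: bool = False   # custom rules added after the dataset was scanned
    sdm_scan_settings_changed: bool = False    # scan settings changed in Structured Data Manager


def rescan_reasons(changes: DatasetChanges) -> List[str]:
    """Return the reasons that suggest a re-scan; an empty list means none apply."""
    checks = [
        (changes.data_changed_significantly, "data in the managed tables increased or changed"),
        (changes.assigned_grammars_changed, "assigned grammars changed"),
        (changes.custom_grammar_rules_added, "custom grammar rules added after scanning"),
        (changes.sdm_scan_settings_changed, "Structured Data Manager scan settings changed"),
    ]
    return [reason for applies, reason in checks if applies]


if __name__ == "__main__":
    changes = DatasetChanges(custom_grammar_rules_added=True)
    reasons = rescan_reasons(changes)
    if reasons:
        print("Consider a re-scan:", "; ".join(reasons))
    else:
        print("No re-scan needed based on the listed conditions.")
```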