Use of Data Analysis (Sampling) for Class Assignment

Data Express for distributed data (both in the configuration with MVS knowledge base and in the configuration with XDB knowledge base) allows the capability of analyzing data content by using the sampling feature.

This feature helps in understanding the data element content and in assigning it to the appropriate class. The sampling process provides two types of results:

The result of the sampling can be used to make class assignment in two different ways:

The sampling process will be described more in detail in the "Using Distributed Sampling" chapter, and it will be object of a tutorial described in the present manual.

The standard sampling and compressed sampling result verification is described in details in the "Work with data store" chapter of the Front End User Guide.

The manual class assignment and the prototype definition are described in details in the "Work with data elements" chapter of the Front End User Guide.

The assignment from sampling result is described in details in the present manual, in paragraph "Sampling result" of "Importing classes" chapter.