Grammars
In Fusion, grammars identify potentially sensitive data in your environment. You use grammars to find this data when you process datasets, when you search for or filter documents in Analyze, and when you search for or filter documents within a workspace to include in a workbook or place on hold in Manage. Grammars consist of grammar sets, grammar classes, grammar types, grammar rules, and grammar values.
- Grammar values are words or phrases identified in document content or metadata because they matched grammar rules.
- Grammar rules are the patterns used to identify specific information. For example, the built-in "Email addresses" grammar rule identifies information that conforms to the common pattern for email addresses.
- Grammar types are groupings of grammar rules. The built-in grammar rules are organized by grammar types that share a similar pattern. For example, the built-in "Addresses" grammar type contains multiple grammar rules that identify the mailing address patterns specific to various countries.
- Grammar classes are groupings of grammar types and grammar rules. The built-in grammar types are organized by grammar classes that identify common sensitive data types. For example, the built-in "Government ID" grammar class contains multiple grammar types that identify data such as driver's license numbers, passport numbers, or social security numbers.
- Grammar sets are groupings of grammar classes, grammar types, and grammar rules. The built-in grammar sets include common grammar rules that pertain to region-specific regulations. For example, the built-in "EU General Protection Regulation (UK)" grammar set contains grammar rules that identify data pertinent to GDPR and are specific to data patterns for the United Kingdom (addresses, phone numbers, bank details, and so on).
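To make the hierarchy described above concrete, the following Python sketch models sets, classes, types, rules, and values as plain data structures. It is an illustration only and does not represent Fusion's implementation or API: the class names, the methods `find_values`, `iter_rules`, and `scan`, the "Contact details" type, the "Example class" and "Example set" groupings, and the simplified email regular expression are all assumptions introduced for this example. Only the "Email addresses" rule name comes from the text above, and the pattern that Fusion actually uses for it is not shown here.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field


@dataclass
class GrammarRule:
    """A named pattern; the strings it matches play the role of grammar values."""
    name: str
    pattern: re.Pattern

    def find_values(self, text: str) -> list[str]:
        return [m.group(0) for m in self.pattern.finditer(text)]


@dataclass
class GrammarType:
    """A grouping of grammar rules."""
    name: str
    rules: list[GrammarRule] = field(default_factory=list)

    def iter_rules(self):
        yield from self.rules


@dataclass
class GrammarClass:
    """A grouping of grammar types and grammar rules."""
    name: str
    types: list[GrammarType] = field(default_factory=list)
    rules: list[GrammarRule] = field(default_factory=list)

    def iter_rules(self):
        yield from self.rules
        for gtype in self.types:
            yield from gtype.iter_rules()


@dataclass
class GrammarSet:
    """A grouping of grammar classes, grammar types, and grammar rules."""
    name: str
    classes: list[GrammarClass] = field(default_factory=list)
    types: list[GrammarType] = field(default_factory=list)
    rules: list[GrammarRule] = field(default_factory=list)

    def iter_rules(self):
        yield from self.rules
        for gtype in self.types:
            yield from gtype.iter_rules()
        for gclass in self.classes:
            yield from gclass.iter_rules()

    def scan(self, text: str) -> dict[str, list[str]]:
        """Map each rule name to the values it matched in the text."""
        hits: dict[str, list[str]] = {}
        for rule in self.iter_rules():
            values = rule.find_values(text)
            if values:
                hits.setdefault(rule.name, []).extend(values)
        return hits


# Simplified, assumed pattern; real email matching is more involved.
email_rule = GrammarRule(
    "Email addresses",
    re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
)

# Hypothetical groupings, named only for this illustration.
contact_type = GrammarType("Contact details", rules=[email_rule])
example_class = GrammarClass("Example class", types=[contact_type])
demo_set = GrammarSet("Example set", classes=[example_class])

hits = demo_set.scan("Contact jane.doe@example.com for details")
print(hits)
```

In this sketch, scanning a document with a set walks down through its classes and types to the individual rules, and each matched string is reported under the rule that found it, which mirrors how grammar values relate to grammar rules in the list above.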