Grammars

In File Analysis Suite, a grammar is a group of entities (a word, phrase, or block of information) used to identify information. These entities are based on patterns that identify specific types of information, such as social security numbers, names, telephone numbers, addresses, and so on.

You use grammars and entities to search for or filter documents in Analyze and within a workspace to include in a workbook or place on hold.

File Analysis Suite includes several built-in grammars (grammar icon) that are comprised of specific built-in entities (entity icon). You can also create custom grammars (custom grammar icon) and entities (custom entity icon) to suite your specific needs, including adding a custom entity to a built-in grammar.

When managing grammars, keep the following in mind.

  • You cannot edit the name or description of built-in grammars.

  • You can add a custom entity to a built-in grammar.

  • To view additional information about a grammar, click in the row for the grammar and then click the open detail panel icon (open detail panel icon).

Entities

An entity represents a word, phrase, or block of information to be identified and are based on a regular expression pattern. In addition to the built-in entities that are included in the built-in grammars, you can create custom entities to identify information that might be specific to your environment. Entities can be defined by a specific pattern or by terms.

As an example using a pattern, you identify your customers with a customer ID that follows a specific syntax, or pattern. Over time, your customer ID pattern has changed in length to accommodate for growth. You create a custom grammar, "Customer IDs", to identify managed items that reference the customer IDs. You then create one or more custom entities within the custom grammar that define these patterns. You can create multiple custom entities, one for each customer ID pattern you want to identify, or create a single custom entity that includes multiple patterns. Consider the following:

  • If you need to discriminate between the different customer ID patterns, create multiple custom entities, one for each of your customer ID patterns. The entities display separately in the dashboards, and you will need to include all entities and "OR" them in searches to get results for both patterns.

  • If you do not need to discriminate between the different customer ID patterns (you just care about identifying customer IDs, no matter the pattern), create a single custom entity that includes multiple patterns, one for each of your customer ID patterns. All ID patterns display together as a single entity in the dashboards. In searches, you will need to include the single entity.

TIP: Entity patterns are defined using standard regular expressions. Grammars and entities in File Analysis Suite are derived from Micro Focus IDOL. For more information about the regular expression syntax for entities, see Entity syntax.

If you define the entity using terms, you can manually enter the desired terms or load the terms from an existing term list.

When managing entities, keep the following in mind.

  • You cannot edit the name or details of built-in entities.

  • You can add a custom entity to a built-in grammar.

  • To view additional information about a entity, click in the row for the entity and then click the open detail panel icon (open detail panel icon). Details about the entity display on the GENERAL tab.

  • To configure data masking for entities, see Data masking