Grammars

In File Analysis Suite, a grammar is a group of entities (a word, phrase, or block of information) used to identify information. These entities are based on patterns that identify specific types of information, such as social security numbers, names, telephone numbers, addresses, and so on.

You use grammars to search for or filter documents within a workspace to include in a workbook or place on hold.

File Analysis Suite includes several built-in grammars () that are comprised of specific built-in entities (). You can also create custom grammars () and entities () to suite your specific needs, including adding a custom entity to a built-in grammar.

When managing grammars and entities, keep the following in mind:

  • You cannot edit the name or description of built-in grammars.

  • You cannot edit the details of built-in entities.

  • You can add a custom entity to a built-in grammar.

Entities

An entity represents a word, phrase, or block of information based on a regular expression pattern. In addition to the built-in entities that are included in the built-in grammars, you can create custom entities to identify information that might be specific to your environment. Entities can be defined by a specific pattern or by terms.

As an example using a pattern, you identify your customers with a customer ID that follows a specific syntax, or pattern. Over time, your customer ID pattern has changed in length to accommodate for growth. You create a custom grammar, "Customer IDs", to identify managed items that reference the customer IDs. You then create one or more custom entities within the custom grammar that define these patterns. You can create multiple custom entities, one for each customer ID pattern you want to identify, or create a single custom entity that includes multiple patterns. Consider the following:

  • If you need to discriminate between the different customer ID patterns, create multiple custom entities, one for each of your customer ID patterns. The entities display separately in the dashboards, and you will need to include all entities and "OR" them in searches to get results for both patterns.

  • If you do not need to discriminate between the different customer ID patterns (you just care about identifying customer IDs, no matter the pattern), create a single custom entity that includes multiple patterns, one for each of your customer ID patterns. All ID patterns display together as a single entity in the dashboards. In searches, you will need to include the single entity.

TIP: Entity patterns are defined using standard regular expressions. Grammars and entities in File Analysis Suite are derived from Micro Focus IDOL. For more information about the regular expression syntax for entities, see Entity syntax.

If you define the entity using terms, you can manually entered the desired terms or load the terms from an existing term list.