Searching Tools

<< Click to Display Table of Contents >>

Navigation:  Using CloudNine LAW > Searching > Full-Text Searching Tools >

Searching Tools

FulltextDictionaryWindowIcon Fulltext Dictionary

This tool helps you quickly scan the full-text index from the active Case Directory for specific keywords. Typing directly into the Lookup field will immediately display results in the list below. These results are shown by Word, Doc Count (number of documents containing the word), and Hit Count (frequency that word is found within all documents) as they're found. You can also use this tool to add words from the list as search terms to the Full-Text Search pane of the Database Query Builder.

Options... - Opens the Full Text Search Options window (shown below).

Export... - Opens a File Explorer, allowing you to save a list of every word found in the active case's full-text index (including Doc Count and Hit Count frequencies) as a CSV file.

 

InfoIcon Using the Fulltext Dictionary

1.Start typing directly into the Lookup field to begin listing words from the active case's full-text index.

2.Scroll through the list to find appropriate search terms for the current query. Select words from the list by left-clicking on them.

3.With the desired word selected, pick a Join connector from the drop-down menu at the bottom-left, and then click on the Add (F3) button to populate the Full-Text Search pane with that search term.

i.You can also press F3 on your keyboard instead of using the Add (F3) button.

4.Once you've added all desired search terms, click Done at the bottom-right to close the Fulltext Dictionary.

FullTextSearchOptionsWindowIcon Full Text Search Options

This window provides settings that determine how LAW searches through the full-text index of the active Case Directory when performing text-based queries.

Stemming - All variants of each listed keyword are returned in the search results. For example, the search term "index" will also return "indexing", "indexed", "indexes", etc.

Phonic - Keywords sounding similar to those listed are also returned in the search results. For example, the search term "pear" will also return "pair", "pare", etc.

Fuzzy - Keywords with similar spelling to those listed are also returned in the search results. You can determine the margin of similarity with the slider, on a 1-10 scale.

Retrieval Thresholds - These options limit how long the search is performed for:

oMax Files to Retrieve - Matching records are returned until this value is reached. Leave at 0 for no limit.

oAbort Search After - Records are searched for matches until this value is met. Leave at 0 for no limit.

Search Syntax - The following options (select one) determine how search term text is interpreted:

oBoolean - Treat any Join connectors (AND/OR) exactly as they appear. All other text is treated as exact keywords or strings that are separated by these connectors.

oAll Words - Same as Boolean, but uses AND in place of all Join connectors.

oAny Words - Same as Boolean, but uses OR in place of all Join connectors.

oRegular Expression - Turns on regular expression (regex) mode, treating all text as search strings that target algorithmic patterns rather than exact keywords.

Results Options

oInclude hits by word in search results - A temporary "ft_HitsByWord" field will be included for each record returned by the query. This field displays which search terms were located, along with their frequency within.

Index Properties - Opens the Full Text Index Properties window.

InfoIcon Working With Noise Words

This topic section describes noise words, their impact on search, and how to edit the list of noise words.

What are noise words?

Noise words are words that are ignored by the indexer because they occur so commonly that they are of little use in searching content.

How noise words impact search

The effect of noise words in a search query typically manifests itself when searching for terms or word sequences. The search engine effectively treats a noise word in a search query as a word-level wildcard, matching any word that occurs at that position.

Effectively, a noise word is processed as an "any word" wildcard (*), so in the above example the search query is processed as: Harold * Cat.

A single word search query for a noise word will return no records. For example, a search query for the term "because" would yield no results.

Examples

The following table describes examples of how the noise word "The" effects search.

Request description

Matching text

Does not match

The Car

(If using Any Words/All Words syntax then use: "The Car")

"Red Car","Blue Car","Fast Car"

 

Harold The Cat (If using Any Words/All Words syntax then use: "Harold The Cat")

"Harold my cat"

"Harold My Friend" (Term search, only matches one of the required words in the sequence).

To edit the noise word list

LAW manages noise words at the application level in a file named noise.dat. This file is located in the DTConfig folder which can be found in the installation path of CloudNine™ LAW.

1.Use Notepad or some other text editor to open the noise.dat file.

2.Add or remove noise words from the list.

3.Save the file.

4.On the Tools menu click Full Text Index and then click Re-Index All Documents.

Note

Removal of noise words from noise.dat can result in reduced performance of indexing and search operations. This  is because any word removed from the noise list is indexed along with other keywords.