Managing data files

<< Click to Display Table of Contents >>

Navigation:  Administration > Database Administration > About Databases >

Managing data files

Files can be received from a client, data processing vendor, or third party. When receiving data, you should always review all files on the disk, prior to loading the data, to ensure they have the proper formats for Concordance Desktop.

Load files or delimited text files are the files used to construct your Concordance Desktop database. These files typically have extensions ending in .dat, .csv, or .txt. Each file contains record metadata, but some may also include body text. We recommend having your OCR separated into individual text files so they can be imported separately.

As an administrator, you should always make a practice of opening and reviewing your delimited text files when you receive them, as the files are not always prepared perfectly and may need to be modified.

When reviewing your data load files, always check for the following:

Field names – each line of metadata is one record, check each header column to verify data

Delimiters – unique characters that appear in the delimited text file and do not exist in your actual data

Date format – date fields are an 8-character maximum with slashes. If dates include slashes, you can import any format. If slashes are not used, then you must use the universal date format of YYYYMMDD or the mm-dd-yyyy date format with dashes.

Carriage return – a final carriage return ensures that the last record will load into the database.

Concordance Desktop Delimiters

Comma

Field break indicator, default is □ (ASCII 20), customizable, avoid characters in data

Quote

Keeps text together, default is þ (ACSII 254) and is only required around fields that have text and spaces, customizable, avoid characters in data

New Line

Manual line break and text wraps within a field, default is ® (ASCII 174), customizable, avoid characters in data

New Record

Starts a new record, final carriage return loads the last record, cannot be changed, industry standard

Delimiters are customizable for an organization's internal database design, but many organizations ask vendors to use Concordance Desktop default delimiters. If your case records contain the registered trademark symbol, you may want to consider changing the ® to another symbol in the load file.