Skip to content
English
  • There are no suggestions because the search field is empty.

System Onboarding: Data Preparation & Pre-upload Checklist

A checklist for customers to complete prior to uploading a system extract for use in a Gathid system loader.

Gathid supports .csv file extracts in automating its identity modeling. Certain constraints must be followed to ensure it can interpret the underlying data from your organization.

Please adhere to these checks to ensure effortless continuity from Gathid:

Prepare your system extract data 

☑️ Please submit one file that contains all data for a particular system, rather than in multiple files.
☑️ Review the data. Remove leading spaces in data at the source to ensure they are not added to extracts.
☑️ Remove Personally Identifiable Information (PII) that is not absolutely required.
Such as date of birth, phone number, address specifics, emergency contacts etc. Retain the common unique identifiers across systems such as email address and/or organizational data.
☑️ Remove other sensitive data such as salary information
☑️ Special characters are not present

Format

☑️ File format is .csv
☑️ Row 1 contains column headers only, no data, no blank cells
☑️ Column headers within the file are unique, have no leading spaces and no special characters present
☑️ Column heading name is unique in the file
☑️ Column header names reflect the data stored within the column
☑️ Column header names remain consistent when the extract is replaced by new data the next day
Gathid is configured based on the column headers specified in row 1 of the data file. If the column header names change the associated change will need to be manually updated in the Gathid loader via the Gathid UI. Best practice: Keep column names in files the same over time. If the column name should be different this can be changed in the Gathid metrics and reports by updating the column label field in the respective Loader. 

File Name Requirements

☑️ File name includes system name and reference to content

Naming convention: [System name] [Content]” i.e. “Okta Users”

☑️ Do not include a date and/or time in the filename and do not add one as you drop new files

This will ensure Gathid can find the data source that it has been configured to look for as it will not recognize files for this system if a different file name is subsequently used. The idea is that the file will be overridden within the upload portal each time a new set is uploaded daily and data history is already maintained within Gathid. 

Upload File

☑️ Upload the file to the root level of your upload portal (do not save the file in a folder)

Note: Gathid will look for the latest file in the upload portal that has the name that was configured in your Administration > Loaders > Data Sources tab for the system.