Preparing the Data File
You can input data to DataLoader either as a tab-delimited text file or in KBART format.
Tab-Delimited Text File
You can input data to DataLoader in a tab-delimited text file that contains the information to be loaded into the SFX database. The file must be created in Microsoft Excel and then saved as a tab-delimited text file. The first column should contain an identifier of the object. This identifier is called the primary key and can be the ISSN, ISBN, LCCN, Object ID, or OCLC number of the object. Only one primary key type can be used per input file.
The other columns should contain the information you want to load into SFX for the object, such as the object’s activation status (ACTIVE or INACTIVE), a threshold for the object, or a general note concerning the object. In DataLoader, you configure what kind of information each column of the file contains.
When loading object portfolio information, prepare a separate data file for each target or target service. For example, create a data file that contains all the Wiley full-text journals to which your institution subscribes. Create another data file for Springer full-text journals.
The ACTIVATION_STATUS attribute accepts the values ACTIVE or INACTIVE (case-sensitive) in the corresponding field or column in the load file. Other attributes accept the values that are valid for Edit Portfolio form fields. For more information, see Editing Object Portfolios.
The following is an example of a data file containing object portfolio information:
    0000-0001<tab>$obj->parsedDate(">=",1997,12,1)<tab>ACTIVE
    0000-0002<tab>$obj->parsedDate(">=",1999,undef,2)<tab>ACTIVE
    0000-0003<tab>$obj->parsedDate(">=",1998,79,1)<tab>ACTIVE
The first column contains the object’s ISSN, the second column contains a threshold to be loaded to the object, and the third column contains the activation status to be set in the object.
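As an illustration of the layout described above, the following Python sketch assembles lines with an ISSN, a threshold, and an activation status. The helper names `threshold` and `build_load_lines` are hypothetical and not part of DataLoader. Reading the three numeric arguments of `parsedDate` as year, volume, and issue is an assumption based on the example. The source also requires the file to be created in Microsoft Excel, so treat this only as a way to check the expected content of each line.

```python
VALID_STATUSES = {"ACTIVE", "INACTIVE"}  # case-sensitive, as stated above


def threshold(year, volume=None, issue=None, op=">="):
    """Format a threshold in the style of the example (hypothetical helper).

    Assumption: the numeric arguments are year, volume, issue; a missing
    value is written as the Perl literal undef.
    """
    def arg(value):
        return "undef" if value is None else str(value)

    return f'$obj->parsedDate("{op}",{year},{arg(volume)},{arg(issue)})'


def build_load_lines(rows):
    """Join (issn, threshold_text, status) tuples into tab-delimited text."""
    lines = []
    for issn, threshold_text, status in rows:
        if status not in VALID_STATUSES:
            raise ValueError(f"invalid activation status: {status!r}")
        fields = [issn, threshold_text, status]
        if any("\t" in field or "\n" in field for field in fields):
            raise ValueError("fields must not contain tabs or newlines")
        lines.append("\t".join(fields))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_load_lines([
        ("0000-0001", threshold(1997, 12, 1), "ACTIVE"),
        ("0000-0002", threshold(1999, None, 2), "ACTIVE"),
    ]), end="")
```

Rejecting lowercase status values up front mirrors the case-sensitivity rule for ACTIVATION_STATUS.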
KBART
You can input data to DataLoader in KBART format. This allows you to load files received from vendors without additional manipulation prior to loading. It also provides you with a way to load date coverage information without constructing Perl threshold statements.
The following conditions apply to the KBART format:
- The file must be in tab-delimited TXT format.
- All metadata is provided as plain text.
- Text should be encoded as UTF-8.
- The first row is a header row with a field label for each column that indicates the content of the field.
- There should not be a blank row between the column header and the first row of content.
- The order of the columns in the KBART format does not matter as long as the column header is correct for each column. Additional columns are ignored.
- Each publication is given a separate line of the file, with a column for each field.
- A publication is listed twice if there is a coverage gap of 12 months or more, with only the coverage fields changing. DataLoader merges the different coverage ranges into one threshold statement.
- All rows should be in the same format.
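Several of the conditions above can be checked before loading. The following Python sketch is one possible pre-flight check and is not part of DataLoader. The function name `check_kbart` is hypothetical, and "same format" is interpreted here, as an assumption, to mean that every row has as many tab-separated fields as the header.

```python
def check_kbart(raw: bytes) -> list:
    """Return a list of problems found in the raw bytes of a KBART file."""
    try:
        text = raw.decode("utf-8")
    except UnicodeDecodeError as exc:
        return [f"not valid UTF-8: {exc}"]

    lines = text.replace("\r\n", "\n").split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # ignore the terminating newline

    if not lines or not lines[0].strip():
        return ["missing header row"]

    problems = []
    header = lines[0].split("\t")
    if len(lines) > 1 and not lines[1].strip():
        problems.append("blank row between header and first content row")

    # Assumption: "same format" means the same number of fields as the header.
    for number, line in enumerate(lines[1:], start=2):
        if not line.strip():
            continue  # blank rows are skipped; only the one after the header is reported
        found = len(line.split("\t"))
        if found != len(header):
            problems.append(
                f"line {number}: expected {len(header)} fields, found {found}"
            )
    return problems


if __name__ == "__main__":
    sample = b"publication_title\tprint_identifier\nJournal A\t0000-0001\n"
    print(check_kbart(sample))
```

An empty list means that none of the checked conditions failed. It does not guarantee that DataLoader will accept the file.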
DataLoader uses the following columns in the KBART file to build the SFX Perl threshold statements:
- num_first_vol_online
- num_first_issue_online
- date_last_issue_online
- num_last_vol_online
- num_last_issue_online
- embargo_info
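To see which coverage ranges a file describes for each title, the rows can be grouped before loading. The Python sketch below does only that. It does not reproduce how DataLoader builds the Perl threshold, which is not described here. Grouping by `print_identifier` and including `date_first_issue_online` as the start date are assumptions, and the function name `coverage_by_title` is hypothetical.

```python
import csv
import io
from collections import defaultdict

FIRST = ("date_first_issue_online", "num_first_vol_online", "num_first_issue_online")
LAST = ("date_last_issue_online", "num_last_vol_online", "num_last_issue_online")


def coverage_by_title(kbart_text, key="print_identifier"):
    """Map each identifier to a list of (start, end) coverage tuples.

    Each start and end is a (date, volume, issue) tuple of strings;
    blank cells become None, for example when coverage runs to the present.
    """
    reader = csv.DictReader(
        io.StringIO(kbart_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    ranges = defaultdict(list)
    for row in reader:
        start = tuple(row.get(name) or None for name in FIRST)
        end = tuple(row.get(name) or None for name in LAST)
        ranges[row[key]].append((start, end))
    return dict(ranges)


if __name__ == "__main__":
    # Made-up sample: one title listed twice because of a coverage gap.
    sample = (
        "publication_title\tprint_identifier\tdate_first_issue_online\t"
        "num_first_vol_online\tnum_first_issue_online\tdate_last_issue_online\t"
        "num_last_vol_online\tnum_last_issue_online\n"
        "Journal A\t0000-0001\t1990-01-01\t1\t1\t1995-12-01\t6\t12\n"
        "Journal A\t0000-0001\t1998-01-01\t9\t1\t\t\t\n"
    )
    for identifier, spans in coverage_by_title(sample).items():
        print(identifier, spans)
```

A title with more than one entry in the result corresponds to a publication listed on several rows, whose ranges DataLoader merges into a single threshold statement.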
The following table describes the columns of the KBART format:
| Column | Description |
|---|---|
| publication_title | Publication title |
| print_identifier | Print-format identifier (for example, ISSN or ISBN) |
| online_identifier | Online-format identifier (for example, eISSN or eISBN) |
| date_first_issue_online | Date of first issue available online |
| num_first_vol_online | Number of first volume available online |
| num_first_issue_online | Number of first issue available online |
| date_last_issue_online | Date of last issue available online (or blank, if coverage is to present) |
| num_last_vol_online | Number of last volume available online (or blank, if coverage is to present) |
| num_last_issue_online | Number of last issue available online (or blank, if coverage is to present) |
| title_url | Title-level URL |
| first_author | First author (for monographs) |
| title_id | Title-level JKEY |
| embargo_info | Embargo information. Consists of three parts: type, length, and units. The units field also indicates the granularity of the embargo, that is, how frequently the moving wall moves. |
| coverage_depth | Coverage depth |
| coverage_notes | Coverage notes |
| publisher_name | Publisher name (if not given in the file's title) |
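The three-part structure of embargo_info can be illustrated with a short parser. The Python sketch below assumes the KBART convention of a type letter (R or P), an integer length, and a unit letter (D, M, or Y), with several embargoes separated by semicolons. The names `Embargo` and `parse_embargo` are hypothetical, and the sketch does not show how DataLoader itself interprets the field.

```python
import re
from typing import NamedTuple


class Embargo(NamedTuple):
    kind: str    # type letter, assumed to be "R" or "P"
    length: int  # number of units
    unit: str    # "D", "M", or "Y"; also the granularity of the moving wall


# Assumed syntax: one type letter, digits, one unit letter, for example "P1Y".
_PATTERN = re.compile(r"([RP])(\d+)([DMY])")


def parse_embargo(value):
    """Split an embargo_info value into a list of Embargo tuples."""
    parts = []
    for piece in value.split(";"):
        piece = piece.strip()
        if not piece:
            continue  # an empty value means no embargo is recorded
        match = _PATTERN.fullmatch(piece)
        if match is None:
            raise ValueError(f"unrecognized embargo value: {piece!r}")
        parts.append(Embargo(match.group(1), int(match.group(2)), match.group(3)))
    return parts


if __name__ == "__main__":
    print(parse_embargo("P1Y"))
    print(parse_embargo("R10Y;P30D"))
```

Keeping the unit as its own field preserves the granularity information that the units part carries.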