Generic UNIMARC

This information is not applicable to Primo VE environments. For more details on Primo VE mapping, see Mapping to the Display, Facets, and Search Sections in the Primo VE Record.

Return to menu

Control Section

Control Section
Normalized Record Field	Source/Content	Note
Source ID	From data source definitions
Original Source ID	From data source definitions
Source Record-ID	From header of source file
Record ID	Source ID + Source Record-ID
Additional Record-ID
Source Type		Not in use.
Source Format	From data source definitions
SourceSystem

Display Section

General Notes

String multiple occurrences with a semicolon unless indicated otherwise. If the source data has a period at the end and it is not the final occurrence, remove the

Notes Regarding Subfields And Indicators

If no subfields are listed explicitly, data from all non-numeric subfields will be displayed.
If a field or a subfield is repeated, all instances should be displayed.
Subfields are listed in alphabetical order for the sake of clarity, but should be displayed in the order they are recorded in the source record.
If all the subfields or specified non-numeric subfields are taken, numeric subfields are not considered.
If a numeric subfield is specifically included, no other numeric subfield will be included.
If a numeric subfield is excluded, the mapping will take other numeric subfields.
If no indicators are defined, all indicators will be taken.

Display Section
Display Element	Source	Note
Source	Source from the data source definition
Resource Type	See mapping below
Title	200 acdehi	Punctuation : If several a, the second and following are preceded by ,^ chi preceded by .^ e preceded by ^:^ d preceded by ^=^
Uniform title	500 abhiklmqrsu OR 501 abejkmrsuwxyz OR 503 abdefijkmn	500 b preceded by ^[ followed by ]^ hiklqrsu preceded by .^ m preceded by ^( followed by )^ 501 bejkmrsuw preceded by .^ xyz preceded by ^:^ 503 bdeijkmn preceded by .^ f preceded by ,^
Vernacular title
Creator	700 abcdfg 7100 abcghp 720 af 730 a 7101 a	700 b preceded by ,^ f preceded by ^( followed by )^ other subfields preceded by .^ 7100 All subfields preceded by .^ except a 720 f preceded by ^( followed by )^
Contributor	702 abcdfg 7120 abcghp 722 af 716 acf 701 abcdfg 722 af 7111 a 7110 a 7121 a	702 b preceded by ,^ f preceded by ^( followed by )^ other subfields preceded by .^ 7120 All subfields preceded by .^ except a 722 f preceded by ^( followed by )^ 716 f preceded by ^( followed by )^ c preceded by .^
Description	328 b 330 a 327 abcdefghi	Every field is a separate occurrence. 327 every subfield is a title
Edition	205 a
Publisher	210 ac	c preceded by ^:^ if several a, followed a are preceded by ^;^
Subject	Strip all numeric subfields. 600 abcdfjtxyz 601 abcdefghjtxyz 602 afjtxyz 604 atjxyz 605 ahijkmnxyz 606 ajxyz 607 ajxyz 608 ajxyz 610 a 615 axyz 616 acfjx	Strip all numeric subfields. 600 b preceded by ,^ f preceded by ( and followed by ) xyz preceded by ^:^ other subfields preceded by .^ 601 xyz preceded by ^:^ other subfields preceded by .^ 602 f preceded by ( and followed by ) xyz preceded by ^:^ other subfields preceded by .^ 604 xyz preceded by ^:^ other subfields preceded by .^ 605 nm preceded by ( and followed by ) xyz preceded by ^:^ other subfields preceded by .^ 606 - 607 - 608 - 615 jxyz preceded by ^:^ 610 If several a, second and following preceded by ^:^ 616 f preceded by ( and followed by ) jxy preceded by ^:^ c preceded by .^
Language	101 a	Validate code against list of ISO 639-2 codes, section B. If the code cannot be translated, leave it as is.
Physical Format	337 a 215 acde	Punctuation = a : c ; d + e.
Identifier	010 $$a – prefix the value with ISBN 011 $$a – prefix the value with ISSN 013 $$a – prefix the value with ISMN	This mapping is disabled in the out-of-the-box template since the identifiers by default do not display in the Front End.
Relation	410, 411: Prefix the value with Series 430, 431, 432, 433, 434, 435, 436, 437: Prefix the value with Earlier Title 440, 441, 442, 443, 444, 445, 446, 447, 448: Prefix the value with Later Title	Every field should be a separate occurrence. The prefix should be added to $$C and the value to $$V. Display constant codes are used: series earlier_title later_title
Is Part Of	422 tfghi	f preceded by ^/^ g preceded by ^;^ hi preceded by .^
Creation Date	210 $d OR 100 $a/09-12	For a date created from 008, create a date only if it starts with a digit that is not zero and replace missing digits with a question mark. For example: 19-- > 19?? 19uu > 19??

Mapping to Resource Type

The mapping is based on the format type derived either from LDR positions 6 and 7 or tag and position. Use the following tables to determine the mapping.

LDR Positions
Leader pos. 6/7	Record type	Format
a Language material + pos.7= m	Books	BK
a Language material + pos.7= s	Continuing Resources	SE
c Notated music	Music	MU
d Manuscript notated music	Music	MU
e Cartographic material	Maps	MP
f Manuscript cartographic material	Maps	MP
g Projected medium	Visual materials	VM
i Nonmusical sound recording	Audio materials	AM
j Musical sound recording	Audio materials	AM
k Two-dimensional non-projectable graphic	Visual materials	VM
l Electronic Resource	Electronic resource	CF
m Mixed material	Mixed materials	MX
r Three-dimensional artifact or naturally occurring object	Visual materials	VM
b Manuscript language material	Books	BK
Default		BK

Record Type Derived from Tag and Position
Format	Based on (tag and position)	TYPE	Note
Any	328 a,b,c,d,e,t,z – is present or 105 a pos.4, 5, 6,7 = m or v	dissertation
BK	LDR pos.7 = a	article
BK		book	The catch-all for BK if no further information is available is Book
CF	110 $a pos.0 = f	database
CF	LDR pos.7 = s	electronic journal
CF		other
MP		Map
AM		audio
MU	LDR 06 = c or d	Score
SE	110 $a pos.0 = b	serial
SE	110 $a pos.0 = a or c or e or z	journal
VM	LDR pos.6 = k	image
VM	LDR pos.6 = g	video
VM		other
MX		other

Links

Links Section
Type of Link	Source	Note
OpenURL	Based on resource type from display: If type = article, then: $$Topenurl_article Else: $$Topenurl_journal	SFX has two sources for Primo: one for articles in which case the data is used and one for journals in which case the date is ignored. There is a different template per source.
OpenURL_fulltext	Based on resource type from display: If type = article, then: $$Topenurlfull_article Else: $$Topenurlfull_journal	SFX has two sources for Primo: one for articles in which case the data is used and one for journals in which case the date is ignored. There is a different template per source.
OpenURL_servicetext
Backlink
LinktoHoldings
Linkto Holdings_available
Linkto Holdings_unavailable
Linkto Holdings_doesnotexist
LinktoRequest
LinktoResource	8564 $u Add display text ($$D) from $2 + $z. If not available then use code: "Online version" 856 1# Add display text ($$D) from $2 + $z. If not available then use code: "Online versionValidate that the link is to the resource by checking the content of subfield 3.
Additional links	316 u $$Dlink to extract 317 u $$Dlink to extract 320 u $$D Link to bibliographical or index information 321 u $$D Link to extract 334 u $$D Link to awards information 337 u $$D Link to system details 345 u $$D Link to acquisition information
Thumbnail	010 a: $$Tamazon_thumb $$Tsyndetics_thumb $$Tgoogle_thumb
LinktoTOC	327 u $$U $$DTable of Contents 010 a $$Tsyndetics_toc$$DTable of Contents	Create Amazon and Syndetic links only if there is an ISBN.
LinktoAbstract	010 a $$Tsyndetics_abstract$$D Abstract
LinktoExcerpt	010 a $$Tsyndetics_excerpt$$D Excerpt from item
LinktoReview
LinktoFindingAid
LinktoPrice
Linktouc	$$Tamazon_uc – add if there is ISBN $$Tworldcat_isbn – add if there is ISBN ELSE add $$Tworldcat_oclc – if there is OCLC number

Search

Search Section
Index	Source tag	Notes
Creator/contributor	200 f 700 abcdfg 701 abcdfg 702 abcdfg 7100 abcghp 7120 abcghp 716 acf 700 1 or 2 ab 722 af 730 a 711 abcdef 712 abcdef
Title	200 acehi
Additional title	410 t, 411 t, 423 t, 225 aei, 410 aei, 421 ae, 422 ae, 423 ae, 430 ae, 432 ae, 433 ae, 434 ae, 435 ae, 436 ae, 437 ae, 440 ae, 441 ae, 442 ae, 443 ae, 444 ae, 445 ae, 446 ae, 447 ae, 448 ae, 451 ae, 452 ae, 453 ae, 454 ae, 455 ae, 456 ae, 470 ae, 481 ae, 482 ae, 488 ae, 501 a, 510 ae, 512 ae, 517 ae, 518 ae, 531 ae, 532 ae, 545 ae,
Alternative Title	200 d 454 thio 503 abdefijkmn 500 abhikl 510 aehi 512 aehi 513 aehij 514 aehij 515 aehij 516 aehij 517 aehij 530 a 531 a 532 a 540 aehi 541 aehi 545 a 7101 abcdefghp
Description	330 a 327 abcdefghi
Subject	Strip all numeric subfields. 600 abcdfjtxyz 601 abcdefghjtxyz 602 afjtxyz 604 atjxyz 605 ahijkmnxyz 606 ajxyz 607 ajxyz 608 ajxyz 610 a 615 axyz 616 acfjx 675 a 676 a
ISBN	010 az
ISSN	011 ayz
Resource type	Resource type from display
Creation date	100 $a/09-12 and 100 $a/13-16 are digits and not 9999 210 $d
Full Text
TOC	327 abcdefghi
RecordSource	Source ID from the control section	Note: Required to filter out certain sources.
RecordID	Record ID from the control section	Note: Required to retrieve record based on system number.
General	101 a 210 c 322 a 323 a 326 a 328 abcdetz 333 a 334 abcd
Search scope	From PNX: delivery/institution control/sourceid (for example the data source is added as a scope)
Restricted search scope
Scope	Copies from the Search scope and Restricted search scope from the sections above

Sort

Sort Section
Sort type	DC field
Creation Date	100 $a/09-12 OR 210 $d
author	Single sort key created from (subfields same as display/author): 700 710 720 730
title	Single sort key created from (subfields same as display/title): 200
Popularity

Facets

Facet Section
Facet	Source	Note
Resource type	Create this based on the Resource type field from display section as follows: Book -> books Journal -> journals Article -> articles Text Resource -> books Image -> images Audio -> media Video -> media Score -> Scores Map -> Maps Other -> other	In some cases, two values should be created, each as a separate field.
Language	101 subfields a, d, h.	If the language is not a valid ISO 639 code it should not be created.
Creator/Contributor_	700 abcdf 701 abcdf 702 abcdf 710 abcd 711 abcd 712 abcd 720 af 721 af 722 af 730 a 710 1# a 711 1# a 712 1# a	Punctuation 700, 701 and 702 b preceded by ,^ f preceded by ^( followed by )^ other subfields preceded by .^ 7100, 7110 and 7120 All subfields preceded by .^ except a 720, 721 and 722 f preceded by ^( followed by )^
Topic	600 abcdfjtxyz 601 abcdefghjtxyz 602 afjtxyz 604 atjxyz 605 ahijkmnxyz 606 ajxyz 607 ajxyz 610 a 616 acfjxy First facet level is all data up to the first occurrence of subfield $$j, x, y or z. Each subfield division (j, x, y or z) constitutes the next level. Note: The first facet level might have multiple occurrences in one record. These multiple occurrences should be de-duplicated.	Strip all numeric subfields. 600 b preceded by ,^ f preceded by ( and followed by ) other subfields preceded by .^ 601 other subfields preceded by .^ 602 f preceded by ( and followed by ) other subfields preceded by .^ 604 other subfields preceded by .^ 605 nm preceded by ( and followed by ) other subfields preceded by .^ 610 If several a, second and following preceded by ^:^ 616 f preceded by ( and followed by ) c preceded by .^
Genre	608 $a 600 $j 601 $j 602 $j 604 $j 605 $j 606 $j 607 $j
classification.lcc	Added by enrichment 680 $a
Creation Date	100 $a /09-12 OR 210 $d	Truncate 210 $d so that it has only 4 digits. If the date cannot be normalized to 4 digits, do not create the facet.
File size	Not in use
Collection
Physical format	Not in use	Not in use.
Top-level	online_resources -- assign if the delivery category is Online Resource, SFX Resource, or MetaLib Resource. new -- as tagged before load. Available in Library map is based on the availability information in the source record.
Pre-filter	Based on Resource Type from the display section: Book -> books Journal -> journals Article -> articles Text Resource -> books Image -> images Video -> audio_video Audio -> audio_video Maps -> maps Score -> scores
Related record

Duplicate Record Detection Vector

Currently two types of record matching vectors exist:

T1 – for non-serials
T2 – for serials

The mapping of record to T1 or T2 is based on the format type. The format type is based on the extraction procedure that creates the format (FMT) field from pos. 6 and 7 in the leader.

T1 – All formats except for SE
T2 – E

Vector for T1 - "non-serials"

Vector for T1
Field ID	Nature of field	Content of Field/Source Tag + Subfield	Note
T	Type	1	Created if the format is not SE.
The following fields are for the candidate selection:
C1	UnivID, UnivID_invalid	020 $b $z	Take prefix and number and remove any suffixes. Multiple occurrences are delimited by a semicolon.
C2	ISBN, Invalid_ISBN	010 $a $z	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
C3	Short title	200 $aehi Use normalization routine #1 Exact match on first 20 and last 10 characters.	The result is a single string of 30 characters.
C4	Year	100 $a 09-12
The following fields are for the matching program:
F1	UnivID	020 $a	Take prefix and numberand remove any suffixes
F2	UnivID_Invalid	020 $z	Take prefix and number and remove any suffixes Multiple occurrences are delimited by a semicolon.
F3	ISBN	010 $a	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
F4	ISBN_Invalid	010 $z	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
F5	Short title	200 $aehi	Same as C3.
F6	Year	100 $a 09-12
F7	Full title	200 $acdehi Use routine #2 from
F8	Country of publication	102 a
F9	Pagination	215 $$a
F10	Publisher	210 $$c Use filing routine #3 to normalize	Take only first occurrence of 260 tag and first occurrence of subfield c.
F11	Main entry (author, corporate body, meeting)	700 $abf OR 710 0# $abcghp OR 710 1# $abcdefghp Use normalization routine #3 to normalize

Vector for T2 - "serials"

Vector for T2
Field ID	Nature of field	Content of Field/Source Tag + Subfield	Note
T	Type	2	Created if the format is SE.
The following fields are for the candidate selection:
C1	UnivID, UnivID_invalid	020 $a $z	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
C2	ISSN, Invalid_ISSN, cancelled_ISSN	011 $a $y $z	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
C3	Short title	200 $aehi Use filing procedure #1 Exact match on first 25 char.	The result is a single string of 25 characters.
C4	Place of publication	210 $$a normalized using routine 75 After applying routine #3 then take only the first string (up to first blank).	Take only first occurrence of 210 and first occurrence of subfield a.
The following fields are for the matching program:
F1	UnivID	020 $a	Use data until a blank character or the end of subfield.
F2	UnivID_Invalid	020 $z	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
F3	ISSN	011 $a	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
F4	ISSN_Invalid	011 $y	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
F5	ISSN_Cancelled	011 $z	Use data until a blank character or the end of subfield. Multiple occurrences are delimited by a semicolon.
F6	Year	100 $a 09-12
F7	Full title	200 $aehi Use filing routine #2
F8	Truncated title	200 $a Use normalization routine #2
F9	Country of publication	102 a
F10	Place of publication	210 $$a normalized using routine #3 After applying routine, take only the first string (up to first blank).	Take only first occurrence of 210 and first occurrence of subfield a.
F11	Main entry (author, corporate body, meeting)	7100 $abcghp OR 7101 $abcdefghp Use filing routine #3

FRBRization

Refer to Normalization Routines for Duplicate Record Detection for the normalization routines for the author and title parts.

The key field has two subfields:

$$K key part
$$A key part type that determines the algorithm

FRBRization
Field ID	Source (value of $$K for K fields)	Key part type (value of $$A for K fields)	Note
T	Always 1		MARC 21 algorithm
K1	700, or 710, or 720, or 701, and 711, and 721	A	Single occurrence of 700, 710, 720 ; Multiple occurrences of 701, 711, 721, 730. 70X Take subfields a, b, f 71X Take subfields abc 72X Take subfields af 730 take subfield a
K2			Not defined. Unimarc does not have equivalent for title only key.
Kn	If format is not SE: 500, and 200, or 540, or 515, or 518, or 545, or 503 If format is SE: 200, and 520, or 530, or 531, or 532, or 545	T	200 – Subfields aedhi 500 – ahik 540 – aehi 515 – a 518 – a 545 – a 503 – abefijm 520 – aehi 530 – a 531 – a 532 – a

Delivery and Scoping

Delivery and Scoping Section
Delivery Field	Source	Additional Normalization Notes
Institution
Delivery category		Based on algorithm in Defining the Delivery Category Algorithm.
Restricted delivery scope

Ranking

Local mapping required as relevant.

Ranking Section
Booster Field	Source	Additional Normalization Notes
booster1	1 or as added by enrichment program
booster2		Not in use.

Enrichment

Local mapping required as relevant.

Enrichment Section
Enrichment Field	Source	Additional Normalization Notes
classification.lcc	680 a	All occurrences added to separate fields.
fulltext
TOC
Abstract
Review
Rank-parent-child
Rank-Number of copies
Rank-Date first copy
Rank-Number of loans

Additional Data

This includes multiple occurrences in separate fields.

Additional Data
Additional Data Field	Source	Additional Normalization Notes
Author Last	70X#1 a
Author First	70X#1 b
Author initials
Author first initial
Author middle initial
Author suffix
Author	abcdf abcdf
Corporate Author	7100 abcgh 7110 abcgh
Additional author	702 abcdf 712 abcgh 730 a 7121 adef
Series author	225 f
Book Title	If resource type is not an article or a journal: 200 aehi
Article title	200 aehi
Journal title	If resource type is Journal: 200 aehi
Short title	231 a
Additional title	5XX	Strip numeric subfields
Series title	225 410 411	Strip subfield x and numeric subfileds
Date	100 $a 09-12 or 210 $d	Normalize to 4 characters.
RISDate	210 $d or 100 $a/09-12
Additional Date
Volume
Issue
Part
Season
Quarter
Start page
End page
Pages
Article number
ISSN	011 a	Use data up to a blank character or end of subfield.
eISSN	452 x	Use data up to a blank character or end of subfield.
ISBN	010 a	Use data up to a blank character or end of subfield.
CODEN	040 a	Use data up to a blank character or end of subfield.
SICI
Metadata Format	If there is a 328 ->dissertation Else based on Resource type from display: book ->book journal ->journal article ->journal conference_proceeding & record has ISSN ->journal Else ->book
Genre	The Genre mapping table maps the resource type from the display section of the PNX to the genre that is required by the OpenURL.	Use Genre mapping table.
RISType	Based on Resource type from display: If there is a 328 then -> THES book -> BOOK journal -> JOUR map -> MAP video -> VIDEO audio -> SOUND music -> MUSIC article -> JOUR Else -> GEN
City of Publication	210 a
Publisher	210 c
Abstract	330 a
Miscellaneous1
Miscellaneous2
Miscellaneous3
OCLC ID	035 $$a - if text (OCoLC) is present in 035.	Take all digits following the text OCLC and until space. Example: 035 $$a(OC0LC)814782
LCCN	020 $$a	Take prefix and number.
DOI
URL
Local fields 1-25

Normalization Routines for Duplicate Record Detection

Certain characters are translated in XML:

Normalization Routines for Dedup Record Detection
Special Character	Special Meaning	Entity Encoding
>	Begins a tag.	>
<	Ends a tag.	<
Quotation mark.	"
	'Apostrophe.	'
&	Ampersand.	&

The publishing platform removes all leading and trailing spaces and packs double spaces.

Normalization Routine #1

Remove non-filing characters

Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089.
Delete the following characters: '
Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`
Convert characters using the FILING-KEY-01 character conversion table.
Change characters to lower case.
Remove all spaces.
Take first 10 and last 10 characters.

Normalization Routine #2

No non-filing characters in Unimarc.
Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089. For example:

<datafield ind1="1" ind2="0" tag="200"> <subfield code="a"/></datafield>

<datafield tag="200" ind1="1" ind2="0"> <subfield code="a"> <<The>> book : its history in England in the middle ages! </subfield> </datafield>

Should become:

book: its history in England in the middle ages
Delete the following characters: '
Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`
Convert characters using the FILING-KEY-01 character conversion table.
Change characters to lower case.

Normalization Routine #3

Delete the following characters: '
Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`
Convert characters using the FILING-KEY-01 character conversion table.
Change characters to lower case.

Normalization Routines for FRBR

The publishing platform will delete leading and trailing blanks and remove double spaces.

Author Part Normalization

Delete characters: | [ ] '
Change characters to space: $~'^%*/\?@.:;<>{}}-()"!¿¡,
Convert characters using the NACO_diacritics character conversion table.
Change characters to lower case.

Title Part Normalization

Remove non-filing characters.
Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089.
Delete characters: | [ ] '
Change characters to space: $~'^%*/\?@.:;<>{}}-()"!¿¡
Convert characters using the NACO_diacritics character conversion table.
Change characters to lower case.

Defining the Delivery Category Algorithm

The following "out of the box" algorithm will be used for UNIMARC. It should be possible to distinguish between the following resource types:

Physical items (except for microfolm)
Microform
SFX resources
Online resources

The algorithm is read from top to bottom. Once a record is assigned a category the algorithm stops.

When there are several definitions for the same category the priority is given to the "safest" option.

In the algorithm priority has been given to online resources based on the assumption that users most often prefer this option. Primo will include a display of the location and availability status of physical items.

The format is based on the definitions used for each resource type. For more information on these definitions, see LDR Positions.

Defining the Delivery Category Algorithm
Condition	Delivery Category	Note
If 035=SFX	SFX Resources
LDR pos.6 = l
or 106 $a pos.0 = s	Online Resource
If there is a 8564-	Online Resource
If 106 $a pos.0 = t	Microform
If 200 $$b includes the string micro	Microform
If not any of the above	Physical Item