Generic UNIMARC

This information is not applicable to Primo VE environments. For more details on Primo VE mapping, see Mapping to the Display, Facets, and Search Sections in the Primo VE Record.

Return to menu

Control Section

Control Section
Normalized Record Field Source/Content Note

Source ID

From data source definitions

 

Original Source ID

From data source definitions

 

Source Record-ID

From header of source file

 

Record ID

Source ID + Source Record-ID

 

Additional Record-ID

   

Source Type

 

Not in use.

Source Format

From data source definitions

 

SourceSystem

   

Display Section

General Notes

  • String multiple occurrences with a semicolon unless indicated otherwise. If the source data has a period at the end and it is not the final occurrence, remove the

Notes Regarding Subfields And Indicators

  • If no subfields are listed explicitly, data from all non-numeric subfields will be displayed.

  • If a field or a subfield is repeated, all instances should be displayed.

  • Subfields are listed in alphabetical order for the sake of clarity, but should be displayed in the order they are recorded in the source record.

  • If all the subfields or specified non-numeric subfields are taken, numeric subfields are not considered.

  • If a numeric subfield is specifically included, no other numeric subfield will be included.

  • If a numeric subfield is excluded, the mapping will take other numeric subfields.

  • If no indicators are defined, all indicators will be taken.

Display Section
Display Element Source Note

Source

Source from the data source definition

 

Resource Type

See mapping below

 

Title

200 acdehi

Punctuation :

If several a, the second and following are preceded by ,^

chi preceded by .^

e preceded by ^:^

d preceded by ^=^

Uniform title

500 abhiklmqrsu

OR

501 abejkmrsuwxyz

OR

503 abdefijkmn

500

b preceded by ^[ followed by ]^

hiklqrsu preceded by .^

m preceded by ^( followed by )^

501

bejkmrsuw preceded by .^

xyz preceded by ^:^

503

bdeijkmn preceded by .^

f preceded by ,^

Vernacular title

   

Creator

700 abcdfg

7100 abcghp

720 af

730 a

7101 a

700

b preceded by ,^

f preceded by ^( followed by )^

other subfields preceded by .^

7100

All subfields preceded by .^

except a

720

f preceded by ^( followed by )^

Contributor

702 abcdfg

7120 abcghp

722 af

716 acf

701 abcdfg

722 af

7111 a

7110 a

7121 a

702

b preceded by ,^

f preceded by ^( followed by )^

other subfields preceded by .^

7120

All subfields preceded by .^

except a

722

f preceded by ^( followed by )^

716

f preceded by ^( followed by )^

c preceded by .^

Description

328 b

330 a

327 abcdefghi

Every field is a separate occurrence.

327 every subfield is a title

Edition

205 a

 

Publisher

210 ac

c preceded by ^:^

if several a, followed a are preceded by ^;^

Subject

Strip all numeric subfields.

600 abcdfjtxyz

601 abcdefghjtxyz

602 afjtxyz

604 atjxyz

605 ahijkmnxyz

606 ajxyz

607 ajxyz

608 ajxyz

610 a

615 axyz

616 acfjx

Strip all numeric subfields.

600

b preceded by ,^

f preceded by ( and followed by )

xyz preceded by ^:^

other subfields preceded by .^

601

xyz preceded by ^:^

other subfields preceded by .^

602

f preceded by ( and followed by )

xyz preceded by ^:^

other subfields preceded by .^

604

xyz preceded by ^:^

other subfields preceded by .^

605

nm preceded by ( and followed by )

xyz preceded by ^:^

other subfields preceded by .^

606 - 607 - 608 - 615

jxyz preceded by ^:^

610

If several a, second and following preceded by ^:^

616

f preceded by ( and followed by )

jxy preceded by ^:^

c preceded by .^

Language

101 a

Validate code against list of ISO 639-2 codes, section B. If the code cannot be translated, leave it as is.

Physical Format

337 a

215 acde

Punctuation = a : c ; d + e.

Identifier

010 $$a – prefix the value with ISBN

011 $$a – prefix the value with ISSN

013 $$a – prefix the value with ISMN

This mapping is disabled in the out-of-the-box template since the identifiers by default do not display in the Front End.

Relation

410, 411: Prefix the value with Series

430, 431, 432, 433, 434, 435, 436, 437: Prefix the value with Earlier Title

440, 441, 442, 443, 444, 445, 446, 447, 448: Prefix the value with Later Title

Every field should be a separate occurrence.

The prefix should be added to $$C and the value to $$V.

Display constant codes are used:

series

earlier_title

later_title

Is Part Of

422 tfghi

f preceded by ^/^

g preceded by ^;^

hi preceded by .^

Creation Date

210 $d

OR

100 $a/09-12

For a date created from 008, create a date only if it starts with a digit that is not zero and replace missing digits with a question mark. For example:

19-- > 19??

19uu > 19??

Mapping to Resource Type

The mapping is based on the format type derived either from LDR positions 6 and 7 or tag and position. Use the following tables to determine the mapping.

LDR Positions
Leader pos. 6/7 Record type Format

a Language material + pos.7= m

Books

BK

a Language material + pos.7= s

Continuing Resources

SE

c Notated music

Music

MU

d Manuscript notated music

Music

MU

e Cartographic material

Maps

MP

f Manuscript cartographic material

Maps

MP

g Projected medium

Visual materials

VM

i Nonmusical sound recording

Audio materials

AM

j Musical sound recording

Audio materials

AM

k Two-dimensional non-projectable graphic

Visual materials

VM

l Electronic Resource

Electronic resource

CF

m Mixed material

Mixed materials

MX

r Three-dimensional artifact or naturally occurring object

Visual materials

VM

b Manuscript language material

Books

BK

Default

 

BK

Record Type Derived from Tag and Position
Format Based on (tag and position) TYPE Note

Any

328 a,b,c,d,e,t,z – is present

or

105 a pos.4, 5, 6,7 = m or v

dissertation

 

BK

LDR pos.7 = a

article

 

BK

 

book

The catch-all for BK if no further information is available is Book

CF

110 $a pos.0 = f

database

 

CF

LDR pos.7 = s

electronic journal

 

CF

 

other

 

MP

 

Map

 

AM

 

audio

 

MU

LDR 06 = c or d

Score

 

SE

110 $a pos.0 = b

serial

 

SE

110 $a pos.0 = a or c or e or z

journal

 

VM

LDR pos.6 = k

image

 

VM

LDR pos.6 = g

video

 

VM

 

other

 

MX

 

other

 

Links

Links Section
Type of Link Source Note

OpenURL

Based on resource type from display:

If type = article, then:

$$Topenurl_article

Else:

$$Topenurl_journal

SFX has two sources for Primo: one for articles in which case the data is used and one for journals in which case the date is ignored. There is a different template per source.

OpenURL_fulltext

Based on resource type from display:

If type = article, then:

$$Topenurlfull_article

Else:

$$Topenurlfull_journal

SFX has two sources for Primo: one for articles in which case the data is used and one for journals in which case the date is ignored. There is a different template per source.

OpenURL_servicetext

   

Backlink

   

LinktoHoldings

   

Linkto Holdings_available

   

Linkto Holdings_unavailable

   

Linkto Holdings_doesnotexist

   

LinktoRequest

   

LinktoResource

8564 $u

Add display text ($$D) from $2 + $z. If not available then use code: "Online version"

856 1#

Add display text ($$D) from $2 + $z. If not available then use code: "Online versionValidate that the link is to the resource by checking the content of subfield 3.

 

Additional links

316 u $$Dlink to extract

317 u $$Dlink to extract

320 u $$D Link to bibliographical or index information

321 u $$D Link to extract

334 u $$D Link to awards information

337 u $$D Link to system details

345 u $$D Link to acquisition information

 

Thumbnail

010 a:

$$Tamazon_thumb

$$Tsyndetics_thumb

$$Tgoogle_thumb

 

LinktoTOC

327 u $$U $$DTable of Contents

010 a $$Tsyndetics_toc$$DTable of Contents

Create Amazon and Syndetic links only if there is an ISBN.

LinktoAbstract

010 a $$Tsyndetics_abstract$$D Abstract

 

LinktoExcerpt

010 a $$Tsyndetics_excerpt$$D Excerpt from item

 

LinktoReview

   

LinktoFindingAid

   

LinktoPrice

   

Linktouc

$$Tamazon_uc – add if there is ISBN

$$Tworldcat_isbn – add if there is ISBN ELSE add

$$Tworldcat_oclc – if there is OCLC number

 

Search

Search Section
Index Source tag Notes

Creator/contributor

200 f

700 abcdfg

701 abcdfg

702 abcdfg

7100 abcghp

7120 abcghp

716 acf

700 1 or 2 ab

722 af

730 a

711 abcdef

712 abcdef

 

Title

200 acehi

 

Additional title

410 t, 411 t, 423 t, 225 aei, 410 aei, 421 ae, 422 ae, 423 ae, 430 ae, 432 ae, 433 ae, 434 ae, 435 ae, 436 ae, 437 ae, 440 ae, 441 ae, 442 ae, 443 ae, 444 ae, 445 ae, 446 ae, 447 ae, 448 ae, 451 ae, 452 ae, 453 ae, 454 ae, 455 ae, 456 ae, 470 ae, 481 ae, 482 ae, 488 ae, 501 a, 510 ae, 512 ae, 517 ae, 518 ae, 531 ae, 532 ae, 545 ae,

 

Alternative Title

200 d

454 thio

503 abdefijkmn

500 abhikl

510 aehi

512 aehi

513 aehij

514 aehij

515 aehij

516 aehij

517 aehij

530 a

531 a

532 a

540 aehi

541 aehi

545 a

7101 abcdefghp

 

Description

330 a

327 abcdefghi

 

Subject

Strip all numeric subfields.

600 abcdfjtxyz

601 abcdefghjtxyz

602 afjtxyz

604 atjxyz

605 ahijkmnxyz

606 ajxyz

607 ajxyz

608 ajxyz

610 a

615 axyz

616 acfjx

675 a

676 a

 

ISBN

010 az

 

ISSN

011 ayz

 

Resource type

Resource type from display

 

Creation date

100 $a/09-12 and 100 $a/13-16 are digits and not 9999

210 $d

 

Full Text

   

TOC

327 abcdefghi

 

RecordSource

Source ID from the control section

Note: Required to filter out certain sources.

RecordID

Record ID from the control section

Note: Required to retrieve record based on system number.

General

101 a

210 c

322 a

323 a

326 a

328 abcdetz

333 a

334 abcd

 

Search scope

From PNX:

delivery/institution

control/sourceid (for example the data source is added as a scope)

 

Restricted search scope

   

Scope

Copies from the Search scope and Restricted search scope from the sections above

 

Sort

Sort Section
Sort type DC field

Creation Date

100 $a/09-12 OR 210 $d

author

Single sort key created from (subfields same as display/author):

700

710

720

730

title

Single sort key created from (subfields same as display/title):

200

Popularity

 

Facets

Facet Section
Facet Source Note

Resource type

Create this based on the Resource type field from display section as follows:

Book -> books

Journal -> journals

Article -> articles

Text Resource -> books

Image -> images

Audio -> media

Video -> media

Score -> Scores

Map -> Maps

Other -> other

In some cases, two values should be created, each as a separate field.

Language

101 subfields a, d, h.

If the language is not a valid ISO 639 code it should not be created.

Creator/Contributor_

700 abcdf

701 abcdf

702 abcdf

710 abcd

711 abcd

712 abcd

720 af

721 af

722 af

730 a

710 1# a

711 1# a

712 1# a

Punctuation

700, 701 and 702

b preceded by ,^

f preceded by ^( followed by )^

other subfields preceded by .^

7100, 7110 and 7120

All subfields preceded by .^

except a

720, 721 and 722

f preceded by ^( followed by )^

Topic

600 abcdfjtxyz

601 abcdefghjtxyz

602 afjtxyz

604 atjxyz

605 ahijkmnxyz

606 ajxyz

607 ajxyz

610 a

616 acfjxy

First facet level is all data up to the first occurrence of subfield $$j, x, y or z. Each subfield division (j, x, y or z) constitutes the next level.

Note: The first facet level might have multiple occurrences in one record. These multiple occurrences should be de-duplicated.

Strip all numeric subfields.

600

b preceded by ,^

f preceded by ( and followed by )

other subfields preceded by .^

601

other subfields preceded by .^

602

f preceded by ( and followed by )

other subfields preceded by .^

604

other subfields preceded by .^

605

nm preceded by ( and followed by )

other subfields preceded by .^

610

If several a, second and following preceded by ^:^

616

f preceded by ( and followed by )

c preceded by .^

Genre

608 $a

600 $j

601 $j

602 $j

604 $j

605 $j

606 $j

607 $j

 

classification.lcc

Added by enrichment

680 $a

 

Creation Date

100 $a /09-12 OR 210 $d

Truncate 210 $d so that it has only 4 digits. If the date cannot be normalized to 4 digits, do not create the facet.

File size

Not in use

 

Collection

   

Physical format

Not in use

Not in use.

Top-level

online_resources -- assign if the delivery category is Online Resource, SFX Resource, or MetaLib Resource.

new -- as tagged before load.

Available in Library map is based on the availability information in the source record.

 

Pre-filter

Based on Resource Type from the display section:

Book -> books

Journal -> journals

Article -> articles

Text Resource -> books

Image -> images

Video -> audio_video

Audio -> audio_video

Maps -> maps

Score -> scores

 

Related record

   

Duplicate Record Detection Vector

Currently two types of record matching vectors exist:

  • T1 – for non-serials

  • T2 – for serials

The mapping of record to T1 or T2 is based on the format type. The format type is based on the extraction procedure that creates the format (FMT) field from pos. 6 and 7 in the leader.

  • T1 – All formats except for SE

  • T2 – E

Vector for T1 - "non-serials"

Vector for T1
Field ID Nature of field Content of Field/Source Tag + Subfield Note

T

Type

1

Created if the format is not SE.

The following fields are for the candidate selection:

C1

UnivID, UnivID_invalid

020 $b $z

Take prefix and number and remove any suffixes.

Multiple occurrences are delimited by a semicolon.

C2

ISBN, Invalid_ISBN

010 $a $z

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

C3

Short title

200 $aehi

Use normalization routine #1

Exact match on first 20 and last 10 characters.

The result is a single string of 30 characters.

C4

Year

100 $a 09-12

 

The following fields are for the matching program:

F1

UnivID

020 $a

Take prefix and numberand remove any suffixes

F2

UnivID_Invalid

020 $z

Take prefix and number and remove any suffixes

Multiple occurrences are delimited by a semicolon.

F3

ISBN

010 $a

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

F4

ISBN_Invalid

010 $z

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

F5

Short title

200 $aehi

Same as C3.

F6

Year

100 $a 09-12

 

F7

Full title

200 $acdehi

Use routine #2 from

 

F8

Country of publication

102 a

 

F9

Pagination

215 $$a

 

F10

Publisher

210 $$c

Use filing routine #3 to normalize

Take only first occurrence of 260 tag and first occurrence of subfield c.

F11

Main entry (author, corporate body, meeting)

700 $abf

OR

710 0# $abcghp

OR

710 1# $abcdefghp

Use normalization routine #3 to normalize

 

Vector for T2 - "serials"

Vector for T2
Field ID Nature of field Content of Field/Source Tag + Subfield Note

T

Type

2

Created if the format is SE.

The following fields are for the candidate selection:

C1

UnivID, UnivID_invalid

020 $a $z

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

C2

ISSN, Invalid_ISSN, cancelled_ISSN

011 $a $y $z

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

C3

Short title

200 $aehi

Use filing procedure #1

Exact match on first 25 char.

The result is a single string of 25 characters.

C4

Place of publication

210 $$a normalized using routine 75

After applying routine #3 then take only the first string (up to first blank).

Take only first occurrence of 210 and first occurrence of subfield a.

The following fields are for the matching program:

F1

UnivID

020 $a

Use data until a blank character or the end of subfield.

F2

UnivID_Invalid

020 $z

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

F3

ISSN

011 $a

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

F4

ISSN_Invalid

011 $y

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

F5

ISSN_Cancelled

011 $z

Use data until a blank character or the end of subfield.

Multiple occurrences are delimited by a semicolon.

F6

Year

100 $a 09-12

 

F7

Full title

200 $aehi

Use filing routine #2

 

F8

Truncated title

200 $a

Use normalization routine #2

 

F9

Country of publication

102 a

 

F10

Place of publication

210 $$a normalized using routine #3

After applying routine, take only the first string (up to first blank).

Take only first occurrence of 210 and first occurrence of subfield a.

F11

Main entry (author, corporate body, meeting)

7100 $abcghp

OR

7101 $abcdefghp

Use filing routine #3

 

FRBRization

Refer to Normalization Routines for Duplicate Record Detection for the normalization routines for the author and title parts.

The key field has two subfields:

  • $$K key part

  • $$A key part type that determines the algorithm

FRBRization
Field ID Source (value of $$K for K fields) Key part type (value of $$A for K fields) Note

T

Always 1

 

MARC 21 algorithm

K1

700, or 710, or 720, or 701, and 711, and 721

A

Single occurrence of 700, 710, 720 ;

Multiple occurrences of 701, 711, 721, 730.

70X Take subfields a, b, f

71X Take subfields abc

72X Take subfields af

730 take subfield a

K2

   

Not defined. Unimarc does not have equivalent for title only key.

Kn

If format is not SE:

500, and

200, or

540, or

515, or

518, or

545, or

503

If format is SE:

200, and

520, or

530, or

531, or

532, or

545

T

200 – Subfields aedhi

500 – ahik

540 – aehi

515 – a

518 – a

545 – a

503 – abefijm

520 – aehi

530 – a

531 – a

532 – a

Delivery and Scoping

Delivery and Scoping Section
Delivery Field Source Additional Normalization Notes

Institution

   

Delivery category

 

Based on algorithm in Defining the Delivery Category Algorithm.

Restricted delivery scope

   

Ranking

Local mapping required as relevant.

Ranking Section
Booster Field Source Additional Normalization Notes

booster1

1 or as added by enrichment program

 

booster2

 

Not in use.

Enrichment

Local mapping required as relevant.

Enrichment Section
Enrichment Field Source Additional Normalization Notes

classification.lcc

680 a

All occurrences added to separate fields.

fulltext

   

TOC

   

Abstract

   

Review

   

Rank-parent-child

   

Rank-Number of copies

   

Rank-Date first copy

   

Rank-Number of loans

   

Additional Data

This includes multiple occurrences in separate fields.

Additional Data
Additional Data Field Source Additional Normalization Notes

Author Last

70X#1 a

 

Author First

70X#1 b

 

Author initials

   

Author first initial

   

Author middle initial

   

Author suffix

   

Author

abcdf

abcdf

 

Corporate Author

7100 abcgh

7110 abcgh

 

Additional author

702 abcdf

712 abcgh

730 a

7121 adef

 

Series author

225 f

 

Book Title

If resource type is not an article or a journal:

200 aehi

 

Article title

200 aehi

 

Journal title

If resource type is Journal:

200 aehi

 

Short title

231 a

 

Additional title

5XX

Strip numeric subfields

Series title

225

410

411

Strip subfield x and numeric subfileds

Date

100 $a 09-12 or 210 $d

Normalize to 4 characters.

RISDate

210 $d or 100 $a/09-12

 

Additional Date

   

Volume

   

Issue

   

Part

   

Season

   

Quarter

   

Start page

   

End page

   

Pages

   

Article number

   

ISSN

011 a

Use data up to a blank character or end of subfield.

eISSN

452 x

Use data up to a blank character or end of subfield.

ISBN

010 a

Use data up to a blank character or end of subfield.

CODEN

040 a

Use data up to a blank character or end of subfield.

SICI

   

Metadata Format

If there is a 328 ->dissertation

Else based on Resource type from display:

  • book ->book

  • journal ->journal

  • article ->journal

  • conference_proceeding & record has ISSN ->journal

Else ->book

 

Genre

The Genre mapping table maps the resource type from the display section of the PNX to the genre that is required by the OpenURL.

Use Genre mapping table.

RISType

Based on Resource type from display:

If there is a 328 then -> THES

book -> BOOK

journal -> JOUR

map -> MAP

video -> VIDEO

audio -> SOUND

music -> MUSIC

article -> JOUR

Else -> GEN

 

City of Publication

210 a

 

Publisher

210 c

 

Abstract

330 a

 

Miscellaneous1

   

Miscellaneous2

   

Miscellaneous3

   

OCLC ID

035 $$a - if text (OCoLC) is present in 035.

Take all digits following the text OCLC and until space.

Example:

035 $$a(OC0LC)814782

LCCN

020 $$a

Take prefix and number.

DOI

   

URL

   

Local fields 1-25

   

Normalization Routines for Duplicate Record Detection

Certain characters are translated in XML:
Normalization Routines for Dedup Record Detection
Special Character Special Meaning Entity Encoding

>

Begins a tag.

>

<

Ends a tag.

<

Quotation mark.

"

 
 

'Apostrophe.

'

&

Ampersand.

&

The publishing platform removes all leading and trailing spaces and packs double spaces.

Normalization Routine #1

  1. Remove non-filing characters

    Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089.
  2. Delete the following characters: '

  3. Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`

  4. Convert characters using the FILING-KEY-01 character conversion table.

  5. Change characters to lower case.

  6. Remove all spaces.

  7. Take first 10 and last 10 characters.

Normalization Routine #2

  1. No non-filing characters in Unimarc.

  2. Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089. For example:

    <datafield ind1="1" ind2="0" tag="200"> <subfield code="a"/></datafield>

    <datafield tag="200" ind1="1" ind2="0"> <subfield code="a"> <<The>> book : its history in England in the middle ages! </subfield> </datafield>

    Should become:

    book: its history in England in the middle ages

  3. Delete the following characters: '

  4. Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`

  5. Convert characters using the FILING-KEY-01 character conversion table.

  6. Change characters to lower case.

Normalization Routine #3

  1. Delete the following characters: '

  2. Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`

  3. Convert characters using the FILING-KEY-01 character conversion table.

  4. Change characters to lower case.

Normalization Routines for FRBR

The publishing platform will delete leading and trailing blanks and remove double spaces.

Author Part Normalization

  1. Delete characters: | [ ] '

  2. Change characters to space: $~'^%*/\?@.:;<>{}}-()"!¿¡,

  3. Convert characters using the NACO_diacritics character conversion table.

  4. Change characters to lower case.

Title Part Normalization

  1. Remove non-filing characters.

  2. Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089.

  3. Delete characters: | [ ] '

  4. Change characters to space: $~'^%*/\?@.:;<>{}}-()"!¿¡

  5. Convert characters using the NACO_diacritics character conversion table.

  6. Change characters to lower case.

Defining the Delivery Category Algorithm

The following "out of the box" algorithm will be used for UNIMARC. It should be possible to distinguish between the following resource types:

  • Physical items (except for microfolm)

  • Microform

  • SFX resources

  • Online resources

The algorithm is read from top to bottom. Once a record is assigned a category the algorithm stops.

When there are several definitions for the same category the priority is given to the "safest" option.

In the algorithm priority has been given to online resources based on the assumption that users most often prefer this option. Primo will include a display of the location and availability status of physical items.

The format is based on the definitions used for each resource type. For more information on these definitions, see LDR Positions.

Defining the Delivery Category Algorithm
Condition Delivery Category Note

If 035=SFX

SFX Resources

 

LDR pos.6 = l

   

or 106 $a pos.0 = s

Online Resource

 

If there is a 8564-

Online Resource

 

If 106 $a pos.0 = t

Microform

 

If 200 $$b includes the string micro

Microform

 

If not any of the above

Physical Item