This repository has been archived by the owner on Dec 19, 2018. It is now read-only.

culturegraph/culturegraph-record-aggregator


CultureGraph Record Aggregator


A library that uses marc4j to aggregate MARC21 records into a single record.

Introduction

The Culturegraph project produces groups of records that a rule-based matching procedure has identified as belonging together.

This library exports each such group of records as a single aggregated record.

Procedure

Assuming a collection of records as input:

  • For each record r

    1. Add the data field 035 (contains a field link fl)

    2. Mark each data field in r (except fields 035 and 883) with the field link fl

  • Collect all data fields from all records

    1. Sort all data fields

    2. Re-enumerate all field links

    3. Merge duplicate fields

    4. Create a blank record b

      • Use the build number as the control number (control field 001)

      • Use your ISIL as the cataloging agency (control field 003)

    5. Add all data fields to b

    6. Return b
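
The procedure above can be illustrated with a minimal Java sketch. It deliberately does not use marc4j and is not the library's actual implementation: records are modelled as plain string arrays, and the class AggregateSketch, the Field type, and the aggregate method are hypothetical names. Several details are assumptions, since the README does not specify them: the added 035 field holds the source record's identifier, fields are sorted by tag and then by content, field links are renumbered in order of first appearance after sorting, and two fields count as duplicates when tag and content are equal, in which case their field links are combined.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

final class AggregateSketch {

    /** Simplified data field: tag, flattened content, one field link (0 = unlinked). */
    static final class Field {
        final String tag;
        final String content;
        final int link;

        Field(String tag, String content, int link) {
            this.tag = tag;
            this.content = content;
            this.link = link;
        }
    }

    /**
     * Each input record is a String[]: element 0 is the record identifier,
     * every further element is "TAG content" (hypothetical input model).
     * Returns the fields of the aggregate record b as printable lines.
     */
    static List<String> aggregate(List<String[]> records, String buildNumber, String isil) {
        List<Field> all = new ArrayList<>();
        int fl = 0;
        for (String[] r : records) {
            fl++;
            // Add a 035 field carrying the field link (content is an assumption).
            all.add(new Field("035", r[0], fl));
            // Mark every data field with fl, except existing 035 and 883 fields.
            for (int i = 1; i < r.length; i++) {
                String tag = r[i].substring(0, 3);
                String content = r[i].substring(4);
                boolean excluded = tag.equals("035") || tag.equals("883");
                all.add(new Field(tag, content, excluded ? 0 : fl));
            }
        }

        // Sort all data fields (assumed order: tag, then content).
        all.sort(Comparator.comparing((Field f) -> f.tag).thenComparing(f -> f.content));

        // Re-enumerate links in order of first appearance and merge duplicates.
        Map<Integer, Integer> renumbered = new HashMap<>();
        Map<String, TreeSet<Integer>> merged = new LinkedHashMap<>();
        for (Field f : all) {
            TreeSet<Integer> links =
                    merged.computeIfAbsent(f.tag + " " + f.content, k -> new TreeSet<>());
            if (f.link != 0) {
                Integer n = renumbered.get(f.link);
                if (n == null) {
                    n = renumbered.size() + 1;
                    renumbered.put(f.link, n);
                }
                links.add(n);
            }
        }

        // Create the blank record b with control fields 001 and 003, then add all fields.
        List<String> b = new ArrayList<>();
        b.add("001 " + buildNumber);
        b.add("003 " + isil);
        for (Map.Entry<String, TreeSet<Integer>> e : merged.entrySet()) {
            b.add(e.getKey() + " links=" + e.getValue());
        }
        return b;
    }
}
```

In a real MARC 21 record the field link would typically live in subfield $8 (field link and sequence number); whether this library stores it there is not stated here, so the sketch keeps the links as a separate list per field.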

Appendix

Example

The example folder contains three MARCXML records and their aggregate.
