This document is a major revision of File Procedure 1 dated 1993-03-10, and is offered as a guide for CERL members and others submitting files for inclusion in the Heritage of the Printed Book Database. The most important changes relate to (a) the cut-off date for files containing records for items published after 1830, (b) the inclusion of materials printed by lithography, engraving, etc., © the optional inclusion of non-textual materials (cartographic materials, music and graphics) and (d) the conditional inclusion of records for microforms and other surrogates. Approved by the Executive Committee, June 2005.
The Database consists primarily of records of materials printed in Europe in the period c. 1450 to c. 1830.
The choice of 1830 as the cut-off point for the Database is necessarily arbitrary. It was chosen as being an approximate date when machine printing began to supersede hand-printing. Many hand-printed items were published after 1830, and many files containing records of old printed materials continue beyond that date.
The core of the Database therefore remains first and foremost the record of the European printed heritage of the hand-press period, but it is not rigidly defined by date.
The following notes offer guidelines for CERL members and others submitting files. In all cases of doubt, please consult the Secretariat who will be glad to give advice and assistance.
The Heritage of the Printed Book Database shall consist primarily of records of books; that is, monographic and serial publications printed by hand. (This definition includes broadsheets). Usually these are solely or mainly textual materials. In order to record the full range of European printing of the hand press period, the database should also include books and other publications printed from lithography and engraving.
1)
Non-textual materials (cartographic materials, printed music and graphics) shall also be admissible – their inclusion may be encouraged on the grounds that they form part of the printed heritage – but file suppliers must decide initially whether or not to submit records for them. It may be considered that they are still better served by separate, specialist databases.
If the file contains a substantial number of records for other materials:
In all cases of doubt, please consult the Secretariat.
Summary of revisions: This document is new. There was no corresponding document in the former UNAD series.
Pica+ is the HPB's internal format at VZG.
Conversion tables for MARC21 and UNIMARC are available.
CERL will accept data in MARC/UTF-8. The Data Conversion Group (DCG) in Göttingen processes the data for CERL, and they have created UNIMARC and MARC21 conversions which aim to retain as much of the data as possible - in particular elements concerning antiquarian materials.
The level of cataloguing can vary.
We prefer it if the file is delivered to CERL by FTP. Full details will be provided by the Executive Manager upon request (contact the CERL Secretariat).
CERL sends all file providers a Letter of Agreement to be signed and returned. The text of the Letter of Agreement may be found here.
The file provider is also required to write a short description of the file it contributed to the HPB. For examples of these file descriptions, see http://www.cerl.org/web/en/resources/hpb/content. Clicking on any of the libraries listed will take you to their file description. Images are welcome!
The description of the file is included on the CERL-website to assist users of the HPB database to determine the contents of the database.
The process described here is to ensure that the data submitted for inclusion in the HPB is in the best possible condition. This will prevent the actual file loading process from being a long drawn out and cumbersome procedure.
Once the file has been submitted it is passed on to the Data Conversion Group in Göttingen where the file will be analysed. Contents as well as format issues will be validated. The findings will be compiled in an Analysis Report and passed on to the file provider.
The Analysis Report can include recommendations for changes to be carried out by the file provider in time for the next HPB update. It also documents changes to the file that will be applied by the Data Conversion Group in order to optimise the records for the HPB environment and increase the data quality as much as possible (pre-processing). This can also include corrections which can be algorithmically applied to absorb cataloguing deficiencies. This does of course not prevent the file provider from performing such corrections in the local system as well.
The file provider will be asked to agree to the suggested pre-processing and once the file provider, the Data Conversion Group and CERL are agreed that no matters are outstanding with regard to the file vettings, the complete file will be processed by VZG.
If a retroconverted source file does contain excessively long fields, the file supplier must seek the advice of the CERL Secretariat before submitting the file for evaluation.
CERL is happy to received updates - i.e. new additions or amended records. It is important that your Record IDs are the same from one file version to the next.
If you file format (e.g. use of field and subfields) has not changed and your Records IDs have remained stable, the updates will be processed automatically. In that case, you may determine the frequency of delivering updates to CERL.
[ Under construction ]
The Manual (section 4.6 and Appendix J) authorises the use of two codes to delimit strings of one or more non-sorting characters. These codes form part of ISO 6630 : Bibliographic Control Set (UNIMARC C1 set) :
8/8 Non-sorting character(s), Beginning, represented here as *NSB*
8/9 Non-sorting character(s), End, represented here as *NSE*
In the Manual and other UNIMARC documentation these codes are conventionally represented by NSB and NSE, both placed between an equal sign that has been crossed through.
The codes are most commonly used to mark non-sorting articles at the beginning of title fields, but can also be used to enclose much longer strings and even strings occurring in the middle or at the end of subfields. The Manual places no restrictions on their use (they can be used in any non-coded data field) and offers no guidance : usage is governed by the cataloguing rules employed by the various bibliographic agencies.
MARC21 uses indicator values to denote the number of non-filing characters at the beginning of title fields (up to 9 characters, inclusive of spaces), but has no other provisions for longer strings or non-filing characters in other positions.
Other formats (for example INTERMARC, MAB) do not use indicators for this purpose and indicate non-filing characters by other means.
'n | De | Eener | Eit | I | O | Une |
's | Dei | Eens | El | Il | Os | Uno |
't | Dem | Egy | En | L' | The | Unos |
A | Den | Ei | Ene | La | Um | Y |
Ain | Der | Ein | Et | Las | Uma | Yr |
An | Des | Eine | Ett | Le | Umas | |
As | Det | Einem | Eyn | Les | Un | |
Az | Die | Einen | Gl' | Li | Un' | |
D' | Een | Einer | Gli | Lo | Una | |
Das | Eene | Eines | Het | Los | Unas |
under construction