Popis: |
Scientific literature and its bibliographic metadata are typically available through online digital repositories: • INSPIRE, the High Energy Physics (HEP) information system, is the source of information about the whole HEP literature; • The CERN Document Server (CDS) is the CERN institutional library, containing all documents produced at CERN; • arXiv is a preprint server hosting preprints from several scientific fields; • SCOAP3 is an initiative to convert key journals in the HEP field to open access, and it comes with its own digital repository. The content of these four repositories overlaps substantially, and keeping the corresponding bibliographic metadata consistent is an open challenge. The proposed thesis aims to model and implement a possible solution that automates the propagation of updates, reducing the necessary manual data manipulation to a minimum. |