A record will experience many major changes over a lifetime of decades (say, 50 years or more). The record (or at least the information conveyed by the record) must be expected to live longer (meaning also being useful) than any other component in the system; be that technology, actors, or organizations, and must be able to survive transitions related to any of these other components. This imposes the following challenges:
|
|
Limited lifetime of technology:
|
||||||||
|
|
Limited lifetime of organizations and organizational units. This includes sell-outs and mergers and even termination, as well as changes in ownership and authorisations. |
||||||||
|
|
Limited lifetime of ownership. When an object (say, a ship) is sold, the accompanying documentation must follow the object, change ownership, and possibly be migrated to new systems and new technology. |
||||||||
|
|
Limited lifetime of actors, roles and processes. Work procedures, responsibilities and persons change over time. This has particular relevance to security (ownership, authorisations and more), but also to semantic interpretation of documents. Outsourcing of (parts of) processes and operations may pose particular challenges. |
||||||||
|
|
Limited lifetime of external reference data and services from external actors. If a piece of (contextual) information is maintained by an external actor, continuation of this actor’s support may be needed. Moreover, not only individual records need to survive transitions, but also contextual information, history information, and relationships between records. |
The transition survival challenge can only be solved by proper maintenance procedures for records and their storage. Since any technology (and organizations, processes etc.) in use at the time of record creation will be obsolete long before the expiry of the lifetime of the record, and one cannot predict the nature of the technology (organizations, processes etc.) that will constitute the replacement (say, 15 years from now), specifications must be sufficiently independent of technology. At the same time, specifications must be supported by existing technology. Note that in particular conversion and versioning of context information (metadata etc.) is a fairly new research area where the partners of LongRec are in a good position to contribute. LongRec will address all aspects listed above, with particular focus on formats, organizational issues, and processes. Research in records transition survival will partly be carried out by one Ph.D. student localised at NTNU.
LongRec focuses on information that is accessed, used and changed during its entire lifetime. Thus, availability and accessibility is very important. All aspects, such as search, retrieval, presentation, verification (see research challenge 4), and change, need to be addressed. An important aspect is the ability to find all relevant information. Partly, this can be achieved by structuring (and maintaining over time) relationships between documents and their context. However, a general trend today is that search functionality becomes increasingly important. Thus, structuring the documents and their context information in a way that facilitates reliable searching will become an important measure. Searches in old documents brings on challenges related to terms and relevance; which terms are used in a 30 years old document, and which of today's terms cannot be used or have to be translated into their “old” more or less equivalent terms? LongRec will address these topics with particular focus on search. Research in long-term usage will partly be carried out by one Ph.D. student localised at NTNU.
When committing to digital collaboration it is a prerequisite that semantics of documents used as input and output to business processes are agreed. This particularly applies for processes where documents are exchanged between systems. However, semantics are important also for processes where documents are exchanged digitally between human users. Semantic information is partly stored in the records themselves and partly as separate objects in an archive (typical for information common to many records). Additionally, referral may be made to information maintained by external parties. Examples of distributed information sources in Norway are
www.lovdata.no (laws and regulations), Enhetsregisteret (public company registry), and sources of relevant metadata (e.g. Oppgaveregisteret for reporting to the public sector). Thus, one may need to LongRec rely on the continued support and trustworthiness of external parties. Not all contexts are storable (e.g. a person’s explanation of why the answer to a particular question is as indicated), and a document and its context may over time make less sense due to changes in terminology and advances in knowledge in the field of the document. The inevitable reduction in semantic value will imply decreased trustworthiness and usefulness of the document. Updates to the document may mitigate this. Methodologies for knowledge engineering have been around for some years (UML was defined in the mid nineties; OWL is a few years old). In a long term view, the methodologies used for describing semantics are immature and the best of breed are rapidly changing. On this background the following research challenges are important for the semantic aspects of LongRec:|
|
Transition of ontologies between generations of ontology methodologies, |
|
|
Maintaining distributed ontologies, |
|
|
Requirements to the initial ontology used, |
|
|
Identify areas of ontology engineering that possibly will be exposed to significant changes. The research area must be regarded as relatively immature, but the LongRec consortium is in a good position to significantly advance state of the art in the field. Research in preservation of semantic value will partly be carried out by one Ph.D. student inaugurated at NTNU but hosted by DNV IQM. |
In the context of LongRec, digital original documents must be trusted over time. Additional properties with respect to the items in research challenges 1-3 are confidence in preservation of: Availability (to authorised actors), integrity (correctness), confidentiality (to unauthorised actors), IPR protection (of ownership), and accountability (traceability of actions and events related to the document). These properties are threatened by errors, mistakes and failures, but also by security events (i.e. intentional attacks). LongRec proposes to use the evidential value of a record as an index for the degree of trust, and to derive guidelines for preservation of evidential value over time. With respect to a record, the following questions must be answered:
|
|
What are the requirements for evidential value of the record? E.g. which compliance requirements (legal, privacy, etc.) must be addressed? |
|
|
How are these requirements met at the time of creation, and how do they change over time? E.g.requirements for confidentiality and availability can typically be relaxed over time. |
|
|
How can evidential value best be preserved over time according to the (partly changing) requirements of the record? |
|
|
How can evidential value be verified when required? E.g. requirements for trust in the verificationprocess and related software, and requirements for use of independent actors. |
As a rule, it is inevitable that the evidential value decreases over time. Any operation on a document, such as format conversion, implies a risk and involves actors and components that must be trusted. LongRec will develop theory and methods to as far as possible preserve evidential value over time, and to express, measure, and verify evidential value. As all security related work, analysis of threats, risks and vulnerabilities must be applied. A general risk analysis may be augmented by specific analysis related to case studies. Two particular topics that will be addressed are signatures and IPR (Intellectual Property Rights). Digital signature as a mechanism for integrity and accountability receives particular attention due to legal and security requirements. As a digital signature binds also to the formats used, a digital signature by necessity cannot survive format conversion. Long-term aspects of digital signatures will be studiedby LongRec, including methods to cope with removal of signatures without too much loss of evidential value (use of neutral actors and their services may be necessary). IPR issues, in a broad sense, can if not properly addressed impede long-term storage and access activities. Ownership and IPR necessarily change over time, must comply with relevant laws and regulations, and dictate management of authorizations given to other actors. IPR can be considered as parts of the context information of a record. Both customary access management solutions, and a DRM (Digital Rights Management) approach will be studied by LongRec.
Research in preservation of evidential value will partly be carried out by one Ph.D. student inaugurated at NTNU but hosted by NR.
LongRec will primarily address issues raised by a changing and evolving environment (with respect to the actors) during the exploratory phase of the project. Deep research in this area will require other competence and possibly other partners, such as experts in law, economics, society, culture, and politics. However, existing issues in areas such as compliance requirements will be explored by LongRec. Since legal, social and cultural framework must be viewed in an international perspective, the partners of LongRec will consider an application for a project under the 7
th Research Framework Programme of the EU particularly addressing research challenge 5.
Contact: |
|||
|
Inger Mette
Gustavsen, DNV
Research & Innovation Inger.mette.gustavsen@dnv.com +47 6757 7049 / +47 917 08 230 |
Heidi Brovold,
DNV Research &
Innovation heidi.brovold@dnv.com +47 6757 9240 / +47 970 68 323 |
Jon Ølnes,
DNV Research &
Innovation jon.olnes@dnv.com +47 478 46 094 |
|