• If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • You already know Dokkio is an AI-powered assistant to organize & manage your digital files & messages. Very soon, Dokkio will support Outlook as well as One Drive. Check it out today!


Ingest - selected issues

Page history last edited by n.jacobs@... 12 years, 11 months ago

)Map version


Initiatives to link articles and datasets


Project StORe (Source To Output Repositories)

JISC-funded, multi-partner project investigating the ability to move seamlessly from source (data) repositories to output (publications) repositories. The project completed mid-2008.


Repository for the Laboratory (R4L) Project


JISC-funded project applying repository technology to experimental data capture, analysis and reporting processes in chemistry to enable linking between datasets and articles and between related datasets. Project partners: University of Southampton and ALPSP (Association of Learned and Professional Society Publishers).


ESCAPE ( Enhanced Scientific Communication by Aggregated Publications Environment)


SURF-funded project allowing repositories to connect groups of related deposits through OAI-ORE resource maps.


Descriptive bibliographic metadata schema


Dublin Core


Managed by the Dublin Core Metadata Initiative. The Dublin Core metadata set consists of 15 standard elements that are repeatable and freely orderable. Qualified Dublin Core includes additional levels of detail to the basic set.


MARC (MAchine Readable Cataloguing)


Library of Congress-maintained standard for bibliographic metadata. Some regional variants exist, though these are tending to become standardised onto USMARC, the US version.


METS (Metadata Encoding and Transmission Standard)


The METS schema is a standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library, expressed using the XML schema language of the World Wide Web Consortium. The standard is maintained in the Network Development and MARC Standards Office of the Library of Congress, and is being developed as an initiative of the Digital Library Federation.


MODS (Metadata Object Description Schema)


Metadata Object Description Schema (MODS) is a schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications. The standard is maintained by the Library of Congress.


SWAP (Scholarly Works Application Profile)


JISC-funded project to develop an application profile for Dublin Core. Article on SWAP: http://www.ariadne.ac.uk/issue50/allinson-et-al/



Complex object metadata schema


DIDL (Digital Item Declaration Language)

Developed by the Moving Picture Experts Group to accurately describe multimedia objects. ISO Standard. Used by LANL


IMS Global Learning Consortium Content Packaging XML Binding



SCORM (Shareable Content Object Reference Model)



CCSDS (Consultative Committee for Space Data Systems) XML Formatted Data Unit (XFDU) Structure and Construction Rules


Published September 2008



Deposit interface




Enhancement of the SWORD app, developed by Stuart Lewis.


SWORD (Simple Web Service Offering Repository Deposit)


Lightweight protocol for depositing content from one location to another. A profile of the Atom Publishing Protocol (ATOMPUB). Enables batch deposit. JISC-funded project managed by UKOLN.



A new JISC-funded project running from July 2010 to June 2011. Partners are the University of Southampton and the University of Edinburgh, in close association with Microsoft. The overall aim of the project is to develop better ways of depositing content into repositories, and one particular objective is to create ways of depositing directly from Office documents.




a collaboration between Mendeley, Symplectic and CARET and the Library at the University of Cambridge.  "DURA essentially results in two possible ways of depositing content into a university’s institutional repository: (2) Universities can either directly link their repository to the Mendeley API which should be developed during this project, or (2) Where available, the integration can take place via Symplectic’s software, meaning that the data is already clean and disambiguated and full copyright data sourced from RoMEO is available to the repository and the subtle nature of “gearing” between the dynamic data in the researcher’s publication list and the more static repository are automatically handled by an existing technological solution."





Repository-to-repository exchange (work to date is mostly, but not exclusively, for preservation purposes)


AIHT (Archive Ingest and Handling test)

A project of the NDIIPP. An experiment to provide better understanding of digital archiving experiences at institutional level. Drew many institution-pertinent conclusions and two general ones: preservation is an outcome and a variety of measures to preserve are better than one (ii) data-centric approach is preferable to tool-centric or process-centric. D-Lib article on the project: http://www.dlib.org/dlib/december05/shirky/12shirky.html


MathArc: Ensuring Access to Mathematics Over Time


Projects partners Cornell University and Goettingen University. Created a system for preserving and disseminating mathematics and statistics literature by interchanging information packages between geographically and administratively separated databases using OAI-PMH.




Tool developed under DFG funding to enable metadata to to be streamed from institutional repositories to disciplinary repositories and vice versa.


PRESTA (PREMIS Requirements Statement)


Australian project undertaken by APSR. Studied how PREMIS could be used to transfer content from one repository to another.


ECHO DEPository


NDIIPP project at the University of Illinois Urbana-Champaign, 2004-9. Implemented instances of four different repository software applications and designed an interoperability architecture. The architecture consists of a METS interoperability profile and a 'hub' service that translates to and fro from the profile.


'Mining for ORE': Repository Developers' Challenge at Open Repositories 08 (supported by the JISC CRIG [Common Repository Interface group])

The developer team from the PRESERV2 Project won this competition with a deposition of an entire Fedora repository (metadata plus content) into EPrints and vice versa, implementing the OAI-ORE standard. Beta release: http://www.openarchives.org/ore/0.9/primer.html    ; Video at http://blip.tv/file/866653




JISC-funded project, extended to a second phase, that investigated the provision of preservations services available for institutional repositories. The Arts & Humanities Data Service (AHDS) developed a high-level model and shared preservation environment for use by these institutions. The second phase extended the study to cover repositories that archive additional, wide-ranging content types and implement different organisational models. Project partners SHERPA and the AHDS.


TIPR (Towards Interoperable Preservation Repositories)



Two year (beginning October 2008) project funded by the Institute of Museum and Library Services, carried out at Florida Center for Library Automation at Cornell University and New York University Libraries. The project will test the exchange of information between three repositories (based on DAITSS, aDORe and DSpace). It will use the ECHO DEP profile and build upon MathArc work.


CRIS-repository interoperability

CRIS are (generally relational) databases recording the context of research within an organisation. Recently, CRIS have moved towards web-based system, based on defined data models. The main standard is CERIF (Current European Research Information Format), developed by EuroCRIS (www.eurocris.org) and recommended by the EU. There are some exploratory initiatives currently exploring the issues involved in creating interoperability between CRIS and Open Access (literature) repositories and an increasing dialogue between the CRIS and OA communities on this. One is the Knowledge Exchange CRIS/OAR Interoperability Project, http://www.knowledge-exchange.info/Default.aspx?ID=340, focused on "defining a CERIF-based metadata exchange format for publication information with an associated common vocabulary".


Deposit Plait project


JISC-funded project that sought to investigate:

  • Metadata requirements of repositories and service providers of deposited items
  • What metadata can be easily extracted from the deposited document, and if useful, whether a web service can be built to provide such metadata from a file
  • What metadata can be obtained from online metadata sources and personal bibliographic management software
  • Possibility of building a ‘Meta-API’ using the open document format web service and bibliographic data sources
  • What metadata needs to be verified or provided by the depositor

The final report has been published (May 2009) and is available from the project website at the URL above.

Comments (2)

pcastro@bib.csic.es said

at 4:39 am on Mar 25, 2010

Hi Alma,

CRIS-OAR interoperability being the one 'Deposit interface' item lacking a reference URL, I'd suggest including a link to the Knowledge Exchange CRIS/OAR Interoperability Project, http://www.knowledge-exchange.info/Default.aspx?ID=340, focused on "defining a CERIF-based metadata exchange format for publication information with an associated common vocabulary". Sonex workgroup is also scoping the subject and we may shortly supply some extra URL related to institutional interoperability initiatives.


pcastro@bib.csic.es said

at 4:25 pm on Apr 19, 2010

Hi Alma,

This is to report recent release of the SONEX blog, http://sonexworkgroup.blogspot.com/, SONEX (Scholarly Output Notification and Exchange) being the new name for the strand formerly known as Repository Handshake. Besides initial analysis of deposit opportunities, the SONEX workgroup is presently scoping ongoing projects dealing with selected usecases and promoting synergies among them. Might be worth including a reference to this blog at the Repository-to-repository exchange section. Thanks.


You don't have permission to comment on this page.