Dataset

A simple and intuitive way to organize and describe your neuroimaging and behavioral data.

The CESSDA Metadata Model contains metadata elements, their definitions and information on other requirements, such as repeatability. The Model is built from the viewpoint of quantitative (social science) data and is based on DDI Lifecycle 3.2 metadata standard.

The profiles specify the metadata requirements of the CESSDA Data Catalogue, based on the CESSDA Metadata Model and the DDI specifications.
CIDOC Conceptual Reference Model (CRM) is a conceptual model used in cultural heritage to enable information integration. It provides definitions and a structured framework to describe concepts, relationships, and data used in cultural heritage documentation. The CIDOC CRM, developed and maintained over a period of more than 20 years, was initially proposed by the CIDOC Documentation Standards Working Group and is currently managed by the CIDOC CRM SIG, both of which are working groups of the CIDOC Council. In December 2006, it received official recognition as an ISO standard, and this status was reaffirmed in 2014 under the ISO 21127:2014 designation.
The Component Metadata Infrastructure provides a framework to create and use self-defined metadata formats. It relies on a modular model of so-called metadata components, which can be assembled together, to improve reuse, interoperability and cooperation among metadata modelers. The model is standardised in ISO 24622-1 and ISO 24622-2. The serialization is typically in XML. Metadata in this format are often distributed via OAI-PMH. The definition of data categories is provided externally, for example by linking to schema.org or the Clarin Concept Registry.
CRMarchaeo is an extension designed to address the documentation and management of archaeological data and excavation processes. CRMarchaeo expands upon the CIDOC CRM base ontology by introducing additional classes, properties, and relationships that enable the representation of archaeological entities and their contextual information. It also provides a structured framework for capturing and describing archaeological sites, artefacts, stratigraphy, excavation processes, and related archaeological concepts. It supports the documentation of archaeological contexts, including the spatial and temporal dimensions of excavations, stratigraphic relationships, and the association of artefacts with specific layers or features.
CRMdig, for digital preservation, is an extension tailored for the domain of digital preservation. It provides a conceptual framework for documenting and managing the lifecycle of digital objects, ensuring their long-term preservation and accessibility. CRMdig incorporates additional classes and relationships to capture the unique characteristics and requirements of digital preservation, including concepts such as file formats, metadata, migration, authenticity, and provenance. It can be used to model and manage digital collections, establish preservation policies, and implement strategies for the sustainable preservation of digital materials.
CRMsci is the extension for scientific observation and is specifically designed to support the documentation and integration of scientific data. It provides a structured framework for capturing and describing scientific observations, measurements, and experiments in various domains such as environmental studies, natural sciences, and biodiversity research.
CRMtex is the extension defined to address the documentation and representation of text-related cultural heritage objects. It provides a structured framework for describing various types of textual objects, including inscriptions, papyri, manuscripts, and other ancient texts. It aims to facilitate the organization and retrieval of text-related cultural heritage objects by providing a standardised approach to their documentation and description. CRMtex allows cultural heritage institutions, libraries, and research projects to manage and share textual resources effectively. It also enhances the interoperability of text-related data, enabling the integration of textual information from diverse sources, to facilitate scholarly research, textual analysis, and the preservation of textual cultural entities for future generations.
CSV on the Web is a W3C Recommendation for describing CSV files on the Web, ensuring their proper description, including mapping to JSON and RDF formats.

By using DCAT to describe datasets in data catalogs, publishers are using a standard model and vocabulary that facilitates the consumption and aggregation of metadata from multiple catalogs, and in doing so can increase the discoverability of datasets. It also makes it possible to have a decentralized approach to publishing data catalogs and makes federated search for datasets across catalogs in multiple sites possible using the same query mechanism and structure. Aggregated DCAT metadata can serve as a manifest file as part of the digital preservation process.

DCAT-AP is an application profile of DCAT (Data Catalog Vocabulary W3C Recommendation) to be used in European data portals. It is a universal metadata scheme based on RDF, ready to be further profiled for specific domain needs.
LIDO is an XML schema intended for delivering metadata, for use in a variety of online services, from an organization’s online collections database to portals of aggregated resources, as well as exposing, sharing, and connecting data on the web. Its strength lies in its ability to support the typical range of descriptive information about objects of material culture. It can be used for all kinds of objects, e.g., art, architecture, cultural history, history of technology, and natural history. LIDO supports multilingual application environments. Being an application of the CIDOC Conceptual Reference Model (CRM), LIDO is the result of a collaborative effort of international stakeholders in the museum sector, starting in 2008, to create a common solution for contributing cultural heritage content to portals and other repositories of aggregated resources. LIDO is maintained under the patronage of CIDOC - ICOM International Committee for Documentation.
LRMOO, formerly known as FRBoo, is an extension, developed in collaboration with IFLA, for bibliographic information and library cataloguing. It provides a conceptual framework for describing bibliographic entities and their relationships in a structured manner by introducing classes, properties, and relationships that allow for the representation of bibliographic resources, such as books, articles, and other library materials, along with their authors, editions, and related works. LRMoo facilitates the organization and retrieval of bibliographic information by providing a consistent and standardized approach to cataloguing. It enables the modelling of complex relationships between different versions, translations, and editions of a work, as well as the association of works with their creators and subjects.
The MEDIN Discovery Metadata Standard is a marine profile of the UK government Standard GEMINI2 and also complies with other international conventions such as INSPIRE and ISO19115.
MIxS currently consists of three separate checklists; MIGS for genomes, MIMS for metagenomes, and MIMARKS for marker genes. To create a single entry point to all minimum information checklists from the GSC and to the environmental packages, we created an overarching framework, the MIxS standard (publication in Nature Biotechnology). MIxS includes the technology-specific checklists from the previous MIGS and MIMS standards, provides a way of introducing additional checklists such as MIMARKS, and also allows annotation of sample data using environmental packages.
Open Data for Access and Mining (ODAM) Structural Metadata is a format describing how the metadata should be formatted and what should be included to ensure ODAM compliance for a data set. To comply with this format, two metadata files in TSV format are required in addition to the data file(s). These two files describe the metadata of the dataset, which includes descriptions of measures and structural metadata like references between tables. The metadata lets non-expert users explore and visualize your data. By making data interoperable and reusable by both humans and machines, it also encourages data dissemination according to FAIR principles. The structural metadata is specified in section 'Data collection and preparation' on the website.
ODM-XML is a data exchange standard, vendor-neutral, platform-independent suited for exchanging and archiving clinical and translational research data, along with their associated metadata, administrative data, reference data, and audit information. ODM-XML facilitates the regulatory-compliant acquisition, archival and exchange of metadata and data.
Plasma-MDS is used to provide structured disciplinary metadata to data sets in the field of plasma science and technology. Its main aim is to facilitate the discovery and exchange of research data in this field.
RDA: Resource Description and Access is a package of data elements, guidelines, and instructions for creating library and cultural heritage resource metadata that are well-formed according to international models for user-focused linked data applications.
Recommended Metadata for Biological Images (REMBI) provides guidelines for metadata for biological images to enable the FAIR sharing of scientific data. REMBI is the result of the bioimaging community coming together to develop metadata standards that describe the imaging data itself, together with supporting metadata such as those describing the biological study and sample.

The Registry Interchange Format – Collections and Services (RIF-CS) schema was developed as a data interchange format for supporting the electronic exchange of collection and service descriptions. It is a profile of ISO 2146, an information model for registry services for libraries and related organisations, adapting it for use in the research data context.

It was originally developed by the Australian National Data Service for use with Research Data Australia and the Research Data Australia Registry.

RO-Crate is a community effort to establish a lightweight approach to packaging research data with their metadata. It is based on schema.org annotations in JSON-LD, and aims to make best-practice in formal metadata description accessible and practical for use in a wider variety of situations, from an individual researcher working with a folder of data, to large data-intensive computational research environments.
TEI/EPIDOC is a collaborative effort that combines the expertise of EpiDoc and the Text Encoding Initiative (TEI). It establishes standardised guidelines and tools for encoding scholarly and educational editions of ancient documents, embracing inscriptions, papyri, manuscripts, and other text-bearing objects. By leveraging a subset of TEI's standard, TEI/EPIDOC enables the representation of texts in a digital form while also addressing the historical context and materiality of the objects. This comprehensive approach allows scholars to publish digital editions that not only encompass the transcription and editorial treatment of the texts but also provide insights into the objects themselves. As a result, TEI/EPIDOC enriches our understanding of ancient civilizations and facilitates the dissemination of knowledge about their tangible heritage. TEI/EPIDOC is currently employed by the EAGLE Project and Epigraphy.info.