Ethical Open Science for Past Global Change Data

Outreach & Education

Training for the next generation of Quaternary informaticians

org-chart

Overview

Our goal is to help scholars across career levels navigate an increasingly complex data ecosystem and to practice ethical science in their work.

Activities



Quaternary science often yields heterogeneous datasets. These complex records can include physical specimens (e.g., faunal remains, sediment cores, human artifacts), assemblage-level occurrence data, trait measurements (e.g., stable isotopic ratios, osteometrics), and chronologies. A number of repositories that specialize in the curation of each of these distinct data elements has arisen, ranging from individual researcher databases to museums to online data aggregators. While many of these resources enable high-quality Quaternary data preservation, they also present disciplinary practitioners with the daunting task of navigating an increasingly opaque data ecosystem.

To help researchers navigate this complex data ecosystem, we are developing database recommender tools. These tools will allow researchers to input some basic datasets parameters (e.g., time and region, kind of specimen, kind of measurement) and output data curation recommendations.

Our aim is to introduce future scientists to principles of ethical open data and Indigenous data sovereignty. We are therefore developing modular curricula, which can be inserted into introductory undergraduate courses in data science or ecology, to familiarize students with the FAIR principles and CARE principles. Students will explore case studies focused on Quaternary data that cause them to consider the oftentimes invisible infrastructures and practices that enable FAIR and CARE-aligned data curation, as well as explore what gaps there are in Quaternary data with regard to FAIR and CARE.

Disparate data types are already being combined by data practitioners via linked open data (LOD). These linkages rely on knowing the spatial region and temporal span of an observation or data record. These workflows adhere to the FAIR data principles.

Similar to FAIR principles, CARE principles have become widely adopted - but seemingly only in the communities most obviously affected in the Natural Sciences: Indigenous, Archaeological, Anthropological, Zooarchaeological, and Ethnography disciplines. Much of the adherence to CARE has been through new, ground-up development of data resources. The incorporation of CARE into biological and paleontological data use is lacking, partly due to historic reasons and partly due to lack of awareness by communities.

Our goal is to incorporate CARE data principles and practices into already established data resources that adhere to FAIR and Open Science principles. While some of this work requires infrastructure building, much of this work can be done by influencing the practices of data users to effect change in the ethos of the next generation of data practitioners.

This project will have three outcomes: 1) the development of a working methodology for incorporating CARE practices into research that spans time and disciplines; 2) the use of this methodology for a scientific research endeavor; and 3) the teaching of this methodology to early career researchers, in order to effect change in our communities.