Tying It All Together

Learning Objectives

After completing this module, you will be able to:

Identify the three primary “products” that come out of synthesis groups.
Understand the metadata and other features that make published datasets useful.
Evaluate the reach and reproducibility of an ecological synthesis project’s outputs.
Create a plan for your synthesis team’s research products that applies contribution, publishing, and citation practices that will benefit the team.

Introduction

So far in the course, we’ve made the point that ecological synthesis research benefits from an inclusive, team-based approach to science, and that teams should use effective, collaborative methods to integrate and analyze their data. Synthesis research is also intended to be influential and useful. There are many definitions of “influential and useful” to consider here, but successful ecological synthesis teams typically aim to expand our understanding of ecological systems, and to improve human lives and the environment. Three circles labeled 'data', 'results' and 'analytical workflow' with arrows connecting each pair pointing in both directions Their ability to accomplish this frequently depends on what research products are created, and how those are communicated and shared with the outside world.

“Research products” can be defined broadly, but there are three interconnected, publishable products that are the most common outputs from a synthesis project (or any research project, really): the data, analytical workflows (code for data cleaning or statistics, for example) and research results. Each of these is a valuable product of synthesis science, and each one should reference the others. In this module we’ll discuss the mechanics of publishing each one, and how they can be made accessible and useful for the long-term.

Publishing a synthesis dataset

In Module 2 we discussed some considerations for creating and formatting harmonized data files in synthesis research. We also introduced the importance of metadata for describing data and making it more usable. Publishing harmonized data files and descriptive metadata together as a dataset helps ensure that the data products produced by a synthesis team are findable, accessible, interoperable, and reusable (FAIR). FAIR data are an important outcome for most ecological synthesis projects.

More about Findable, Accessible, Interoperable, Reusable (FAIR) data

The FAIR principles, standing for Findability, Accessibility, Interoperability, and Reusability, are a community-standard set of guidelines for evaluating the quality and utility of published research data. Making an effort to meet the FAIR criteria promotes both human and machine usability of data, and is a worthy objective when preparing to publish data from a synthesis research project.

The FAIR principles were first defined in the paper by Wilkinson et al.(2016). Since this time, many resources have arisen to guide the implementation of the FAIR principles¹ and to quantify FAIR data successes and failures in the research and publishing communities (Bahim et al. 2020; Gries et al. 2023).

Activity 1: Evaluate published datasets

Lets start our journey to publishing datasets by looking at some that are already published. Form breakout groups and course instructors will assign each group a dataset for evaluation. With your group, answer these questions about the dataset:

Where were the data collected?
What variables were measured and in what units?
What is the origin of the data and how have they been altered since collection?
Were the first three questions easy to answer? Why or why not?

Example dataset: Jarzyna, M.A., K.E. Norman, J.M. LaMontagne, M.R. Helmus, D. Li, S.M. Parker, M. Perez Rocha, S. Record, E.R. Sokol, P. Zarnetske, and T.D. Surasinghe. 2021. temporalNEON: Repository containing raw and cleaned-up organismal data from the National Ecological Observatory Network (NEON) useful for evaluating the links between change in biodiversity and ecosystem stability ver 1. Environmental Data Initiative. https://doi.org/10.6073/pasta/7f0e0598132e3fea1bfd36a4257af643.