flowchart LR A[Nuclei detection] --> B[Quantification] B --> C[Spatial analysis]
A primer for researchers
Agenda
Understanding BioImaging
Planning and adquiring images
Gathering image metadata
Organizing and sharing image data
Sharing reproducible workflows
Bioimaging has entered the realm of big data comprising increasingly complex datasets. We face numerous challenges, specifically, proper data handling and management and the creation and sharing of reproducible image analysis workflows.
BioImages have the potential for scientific discovery beyond its original acquisition purpose when handled according to FAIR principles (Schmidt, et al. 2024; see Wilkinson, et al. 2016).
Agenda
Understanding BioImaging
Planning and adquiring images
Gathering image metadata
Organizing and sharing image data
Sharing reproducible workflows
Conventional file-system-based storage is quickly reaching its limits. Before the data are generated, researchers must consider how it will be stored, moved, documented and analyzed during (and after) the project’s lifetime.
Current directions:
How will you handle your images?
While panning an imaging experiment, consider the most suitable approach given your capabilities for data handling and analysis.
Adquisition parameters are key
During imaging it is necessary to consider a compromise between the parameters needed to answer the research question (magnification, size, bit depth) and the available processing power (storage, computing power).
After acquiring original images in proprietary formats (i.e .CZI or .LIF), researchers can use different tools to open and transform the images to open formats (.TIFF).
Is a format developed by the Open Microscopy Environment (OME) based on TIFF specification. OME-TIFF incorporates:
Strategic planing and data management, cloud-ready platforms and formats, and institutional support ensure scalable, accessible, and reusable imaging data.
Agenda
Understanding BioImaging
Planning and adquiring images
Gathering image metadata
Organizing and sharing image data
Sharing reproducible workflows
High-quality metadata is crucial for making imaging data FAIR.
Current challenges:
REMBI advice metadata guidelines to address the needs of diverse biomedical imaging communities.
A summary template is available here.
Visit the MicroMeta App and the associated research article
The MethodsJ2 Fiji plugIn generates text for microscopy materials and methods by extracting information from metadata (MicroMeta App file). Visit here the GitHub repository or the associated research article.
OMERO incorporates MDEmic (MetaData Editor for microscopy) an tool that provides an easy way to explore and edit metadata from images.
Agenda
Understanding BioImaging
Planning and adquiring images
Gathering image metadata
Organizing and sharing image data
Sharing reproducible workflows
Eventually, biomedical images (big data) can reach terabytes or petabytes in size, exceeding most standard file-sharing solutions.
Tip
Effective image storage requires infrastructure, optimization of processing workflows, and standardized sharing protocols.
For selecting the storage modality, think that, as researchers, we do not want simply to store the dataset somewhere, we want to make it accessible and usable.
We want our images to be
These instances are installed in dedicated network (core facility) space for long-term storage and sharing.
Tip
“We strongly discourage author statements that images ‘are available upon request’, as this has been shown to be inefficient” (Schmied et al. 2023)
Agenda
Understanding BioImaging
Planning and adquiring images
Gathering image metadata
Organizing and sharing image data
Sharing reproducible workflows
In any research workflow, the analysis of images must be:
From an RDM perspective, analysis of biomedical images ideally entails:
Tip
Accurate, descriptive naming conventions and README files with metadata, or codebooks are vital to assure the integrity of analysis pipelines.
A modular pipeline establishes the main image analysis tasks into independent sub tasks.
flowchart LR A[Nuclei detection] --> B[Quantification] B --> C[Spatial analysis]
Tip
Modularity allows to construct complex analysis pipelines from independent components that can function together. This promotes the reuse of independent modules.
Low-level (technical knowledge)
Transform images in other images or data:
High-level (disciplinary knowledge)
Transform outputs from low-level tasks into information with biological meaning:
There are dozens of open/free options to analyze research images and share reproducible workflows:
BioImage.IO is a community-driven AI model repository that provides access to pretrained AI models with a plethora of open/free software partners.
“The mouse is antisocial. The GUI is antisocial. So, what’s that mean? You have a problem to solve, and you solve it with a GUI. What do you have? A problem solved. But when you solve it with a command line interface in a scripting environment, you have an artifact. And all of the sudden, that artifact can be shared with someone” Jeffrey Snover
Use CODE not the mouse!
Tip
Sharing of research objects in public, active research management repositories, like the Open Science Framework (OSF) is an excellent strategy to promote open, reproducible research. Please consider sharing illustrative images, figures and tables used in publication.
Tip
In general, these are lower resolution images/figures (.png) not used for analysis but for illustration in research reports (thesis, articles).
‘Data is available on request’ statements in publications are found to be often unreliable in practice (Schmidt et al., 2024)
Open Science Principle
Share the data as openly as possible and keep it only as closed as necessary.
Depositing a dataset in a repository is NOT ONLY an exercise in meeting the requirements of funding agencies and journals. It is an ethical and professional responsibility of researchers to ensure reproducible research, and the access and reuse of research data.
Contact us to ensure that your data is well prepared and can be effectively shared with the research community.
Handling biomedical images and sharing reproducible workflows - FRDR curation team