Introduction
METASPACE is the cloud computing engine for annotating metabolites, lipids, and glycans in imaging mass spectrometry data, and hosts a knowledgebase of over 13.000 public annotated datasets from over 200 labs.
Most METASPACE users access the platform through the web app, which offers core functionalities for uploading, processing, and visualizing results.
However, bioinformaticians looking to integrate METASPACE into their workflows must develop custom scripts to interact with the METASPACE Python client and submit datasets.
Developing nf-core modules will streamline this process, enabling scalable and reproducible integration of METASPACE into existing bioinformatics pipelines.
This will also support METASPACE in-the-loop applications, particularly in spatial multi-omics integration workflows.
Goal
-
Create the following nf-core modules using Cardinal:
- Data import (creating MSImagingExperiment object).
- Preprocessing (Primarily peak picking) to generate a centroided dataset -> input for METASPACE.
-
Create an API module to submit datasets to METASPACE using METASPACE python client
-
Create a module to convert METASPACE datasets to AnnData and SpatialData objects using METASPACE Converter
-
Create a module to perform enrichment analysis using S2IsoMEr from AnnData object. (I’ll personally work on this module)
-
If time permits, combine the above modules in one pipeline
We welcome contributors of all experience levels. All of the above goals are suitable for beginners.