The PRIDE database at 20 years: 2025 update 论文

2024Nucleic Acids Research引用 1512顶会

Research Data Management PracticesScientific Computing and Data ManagementGenetics, Bioinformatics, and Biomedical Research

Scientific Computing and Data Management Genetics, Bioinformatics, and Biomedical Research Research Data Management Practices

关系图谱

作者

摘要

The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's leading mass spectrometry (MS)-based proteomics data repository and one of the founding members of the ProteomeXchange consortium. This manuscript summarizes the developments in PRIDE resources and related tools for the last three years. The number of submitted datasets to PRIDE Archive (the archival component of PRIDE) has reached on average around 534 datasets per month. This has been possible thanks to continuous improvements in infrastructure such as a new file transfer protocol for very large datasets (Globus), a new data resubmission pipeline and an automatic dataset validation process. Additionally, we will highlight novel activities such as the availability of the PRIDE chatbot (based on the use of open-source Large Language Models), and our work to improve support for MS crosslinking datasets. Furthermore, we will describe how we have increased our efforts to reuse, reanalyze and disseminate high-quality proteomics data into added-value resources such as UniProt, Ensembl and Expression Atlas.

作者查看全部 (11)

Juan Antonio Vizcaíno

Shengbo Wang

Mathias Walzer

Ananth Prakash

The PRIDE database at 20 years: 2025 update 论文

摘要

作者查看全部 (11)

相关技术

相关事件

相关文章