AI Medical Compendium Journal:
Scientific data

Showing 41 to 50 of 148 articles

MIMIC-BP: A curated dataset for blood pressure estimation.

Scientific data
Blood pressure (BP) is one of the most prominent indicators of potential cardiovascular disorders. Traditionally, BP measurement relies on inflatable cuffs, which is inconvenient and limit the acquisition of such important health-related information ...

Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals.

Scientific data
Early identification of cognitive or physical overload is critical in fields where human decision making matters when preventing threats to safety and property. Pilots, drivers, surgeons, and operators of nuclear plants are among those affected by th...

A multi-species benchmark for training and validating mass spectrometry proteomics machine learning models.

Scientific data
Training machine learning models for tasks such as de novo sequencing or spectral clustering requires large collections of confidently identified spectra. Here we describe a dataset of 2.8 million high-confidence peptide-spectrum matches derived from...

An Experimental and Clinical Physiological Signal Dataset for Automated Pain Recognition.

Scientific data
Access to large amounts of data is essential for successful machine learning research. However, there is insufficient data for many applications, as data collection is often challenging and time-consuming. The same applies to automated pain recogniti...

High-resolution AI image dataset for diagnosing oral submucous fibrosis and squamous cell carcinoma.

Scientific data
Oral cancer is a global health challenge with a difficult histopathological diagnosis. The accurate histopathological interpretation of oral cancer tissue samples remains difficult. However, early diagnosis is very challenging due to a lack of experi...

Dataset from a human-in-the-loop approach to identify functionally important protein residues from literature.

Scientific data
We present a novel system that leverages curators in the loop to develop a dataset and model for detecting structure features and functional annotations at residue-level from standard publication text. Our approach involves the integration of data fr...

EnzChemRED, a rich enzyme chemistry relation extraction dataset.

Scientific data
Expert curation is essential to capture knowledge of enzyme functions from the scientific literature in FAIR open knowledgebases but cannot keep pace with the rate of new discoveries and new publications. In this work we present EnzChemRED, for Enzym...

GooseDetect: A Fully Annotated Dataset for Lion-head Goose Detection in Smart Farms.

Scientific data
Large datasets are required to develop Artificial Intelligence (AI) models in AI powered smart farming for reducing farmers' routine workload, this paper contributes the first large lion-head goose dataset GooseDetect, which consists of 2,660 images ...

An open dataset for oracle bone character recognition and decipherment.

Scientific data
Oracle bone script, one of the earliest known forms of ancient Chinese writing, presents invaluable research materials for scholars studying the humanities and geography of the Shang Dynasty, dating back 3,000 years. The immense historical and cultur...

Deepdive: Leveraging Pre-trained Deep Learning for Deep-Sea ROV Biota Identification in the Great Barrier Reef.

Scientific data
Understanding and preserving the deep sea ecosystems is paramount for marine conservation efforts. Automated object (deep-sea biota) classification can enable the creation of detailed habitat maps that not only aid in biodiversity assessments but als...