@camila-camacho-phd camila-camacho-phd commented Sep 15, 2025

Pull Request Template

Summary

Provide a short summary of the changes in this PR.


Data Modalities Covered

List the modalities included in this PR (tick all that apply):

  • Image
  • Text
  • Signals
  • Tabular
  • Other

Dataset & DataLoader Design

We keep the recommended structure (option 1): a BaseDataLoader that loads each modality, and a multimodal data loader that inherits from the BaseDataLoader. On top of this we add an ID_mapper, which uses patient IDs to build a dataframe: each row is a patient, each column is a modality, and each value is a tuple. The first element of the tuple is a binary flag indicating whether the patient has data in that modality; the second holds, for instance, the IDs of the individual images when several were taken. The dataframe can be extended with more modalities, and the user can filter it by the modalities required. Finally, a validation function inside the DataLoader checks that every patient has all the necessary information (for instance, as specified in the metadata).
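
The ID-mapping idea described above could look roughly like the sketch below. It is an illustration only, not the code in this PR: the function names (`build_id_map`, `filter_by_modalities`, `validate`), the input layout, and the sample IDs are all hypothetical, and a plain nested dict stands in for the dataframe so the example runs without extra dependencies.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical cell type: (has_data flag, IDs of the samples in that modality)
Cell = Tuple[bool, List[str]]
Table = Dict[str, Dict[str, Cell]]


def build_id_map(records: Dict[str, Dict[str, List[str]]]) -> Table:
    """Build a patient-by-modality table from {modality: {patient_id: [sample_ids]}}."""
    modalities = sorted(records)
    patients = sorted({pid for per_patient in records.values() for pid in per_patient})
    table: Table = {}
    for pid in patients:
        row: Dict[str, Cell] = {}
        for mod in modalities:
            samples = list(records[mod].get(pid, []))
            # First element flags presence, second keeps the sample IDs.
            row[mod] = (len(samples) > 0, samples)
        table[pid] = row
    return table


def filter_by_modalities(table: Table, required: Iterable[str]) -> Table:
    """Keep only patients that have data in every required modality."""
    required = list(required)
    return {
        pid: row
        for pid, row in table.items()
        if all(row.get(m, (False, []))[0] for m in required)
    }


def validate(table: Table, required: Iterable[str]) -> None:
    """Raise ValueError naming each patient that lacks a required modality."""
    required = list(required)
    missing = {}
    for pid, row in table.items():
        gaps = [m for m in required if not row.get(m, (False, []))[0]]
        if gaps:
            missing[pid] = gaps
    if missing:
        raise ValueError(f"patients missing required modalities: {missing}")


# Made-up example data: p1 has both modalities, p2 only images, p3 only text.
records = {
    "image": {"p1": ["img_a", "img_b"], "p2": ["img_c"]},
    "text": {"p1": ["note_1"], "p3": ["note_2"]},
}
id_map = build_id_map(records)
complete = filter_by_modalities(id_map, ["image", "text"])
validate(complete, ["image", "text"])  # passes: only p1 remains
```

Keeping the sample IDs next to the presence flag means a loader can go from a patient row straight to the files it needs, and adding a modality only adds a column.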

