Skip to content

Test Multimodal Configurations #22

Description

@andrewscouten

This is for when someone (including me) can pick back up work! I am currently testing the multimodal configuration:

docker compose --profile prod-rocm-wsl run --rm prod-rocm-wsl python -m oncolearn.trainer --config data/configs/modeling/multimodal/tcga_brca_cbioportal_pam5

With cross validation using 5 folds each with a train / test set. I have ran a hyper parameter search using optuna:

  • For each trial, training for 10 epochs on the train split, then testing on the test split
  • Averaging F1 scores of the test splits for each of the 5 trials to get our maximization metric
  • Tuning to maximize this F1 score
Image

I am not sure if this is the best approach, but the goal would then be to have a proper train / val / test split and use the maximized hyper parameters for the final implementation of the model. We can then do the same thing for the cancer stage subtypes.

I have included my current trials, which can be viewed with the VSCode extension "Optuna Dashboard". The code expects the unzipped .db file to be under "outputs" in the base directory:

brca_cbioportal_pam50.zip

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    Fields

    No fields configured for Task.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions