Validity and Stability Pipeline Development FS Benchmark by schaeferbasti · Pull Request #288 · autogluon/tabarena

schaeferbasti · 2026-04-10T08:15:04Z

Issue #, if available:

Description of changes:

Edit feature_selection_benchmark_runner.py for validity and stability (fix bugs, make usable for cli, save results in csv files)
Soon: Add batch script for executing the runner with all datasets, methods and repeats

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…sted)

LennartPurucker · 2026-04-12T21:13:44Z

experimental/feature_selection_benchmark/extra_benchmark/feature_selection_benchmark_runner.py

+
+    print(result)
+    result = pd.DataFrame([result.__dict__])
+    path = f"results/{args.mode}_{args.method_name}_{args.data_foundry_task_id.split('|')[3].split('/')[0]}_{args.repeat}.csv"


Consider adding code that checks if the cache exists and then skips unless we pass --ignore-cache.

Also maybe make this logic a function so that we can easily go from argumetns to cache path

LennartPurucker · 2026-04-12T21:16:51Z

tabflow_slurm/setup_slurm_base_v2.py


+
+@dataclass
+class ExtraBenchmarkSetup2026:


Let us not have this in the TabArena TabFlow code, but in the experimental part. Also, I think this kind of setup won't work the same way, might be a good starting point, but likely you can make it much simpler and hardcode a lot of choices. In theory, you just need to generate a loop + array job with the values of this loop. For example, check out some suggestions from Claude Code or ChatGPT for how to do this

SLURM does not need all this setup loigc but most importantly the batch file. There is also submitit as a python package as an alternative

schaeferbasti added 7 commits April 10, 2026 10:12

fix: minor changes and corrections for the fs_benchmark_runner

1617453

WIP fix: make data_foundry usable for now

9cd8bae

WIP fix: save FeatureSelectionResult to csv

9712ff2

fix: use usual constants for data_foundry_cache

013c06f

maint: save results in results folder

ccf751a

maint: use args instead of download_data_foundry_dataset

bf6eb04

maint: use args instead of download_data_foundry_dataset

9f22c32

schaeferbasti marked this pull request as ready for review April 10, 2026 11:18

schaeferbasti added 2 commits April 10, 2026 13:39

maint: change path for cluster (need to check)

dc5ad57

add: script for generating script for extra benchmark (needs to be te…

5ef21ed

…sted)

LennartPurucker requested changes Apr 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validity and Stability Pipeline Development FS Benchmark#288

Validity and Stability Pipeline Development FS Benchmark#288
schaeferbasti wants to merge 9 commits intoautogluon:fe_benchmark_mainfrom
schaeferbasti:fe_benchmark_main_val_pipeline

schaeferbasti commented Apr 10, 2026 •

edited

Loading

Uh oh!

LennartPurucker Apr 12, 2026

Uh oh!

LennartPurucker Apr 12, 2026

Uh oh!

LennartPurucker Apr 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants



		@dataclass
		class ExtraBenchmarkSetup2026:

Conversation

schaeferbasti commented Apr 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LennartPurucker Apr 12, 2026

Choose a reason for hiding this comment

Uh oh!

LennartPurucker Apr 12, 2026

Choose a reason for hiding this comment

Uh oh!

LennartPurucker Apr 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

schaeferbasti commented Apr 10, 2026 •

edited

Loading