Skip to content

Experimental Tag for benchmarks#171

Open
nv-rborkar wants to merge 5 commits into
mlcommons:masterfrom
nv-rborkar:nv-rborkar-experimental-tag
Open

Experimental Tag for benchmarks#171
nv-rborkar wants to merge 5 commits into
mlcommons:masterfrom
nv-rborkar:nv-rborkar-experimental-tag

Conversation

@nv-rborkar

@nv-rborkar nv-rborkar commented Apr 2, 2024

Copy link
Copy Markdown
Contributor

Agile tag is a way for MLPerf to stay agile & make early bets on upcoming hot benchmarks.
It allows adopting viral benchmarks while also providing a way to update, refresh or tweak them as the ML landscape changes quickly instead of getting locked-in to a 2 year cadence.

Ideally all benchmarks should have the agility to refresh if landscape warrants faster change e.g. update the sequence length of existing LLM models or tweak outdated architectures but we should also balance the churn to reduce submitter burden and prolong investment !/$.

@nv-rborkar nv-rborkar requested a review from a team as a code owner April 2, 2024 21:02
@github-actions

github-actions Bot commented Apr 2, 2024

Copy link
Copy Markdown

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Also added minimum lifetime of agile to be 1 year based on discussion in WG.
@nv-rborkar

Copy link
Copy Markdown
Contributor Author

4/25: Training WG agrees with this proposal

@itayhubara

itayhubara commented May 9, 2024

Copy link
Copy Markdown

Two comments:

  1. I cannot find the "expected lifetime of 4 rounds" rule anywhere - @nv-rborkar can you please point me to that line
  2. I believe that changing the rule to allow one agile (instead of two) per round should be enough (we don't usually have two "real agile" models that we wish to add per year)

@hiwotadese

hiwotadese commented Jun 20, 2024

Copy link
Copy Markdown
Contributor

WG agreed on 06/20 to add "Benchmarks live for a minimum of 2 years. Early retirement requires formal WG vote" to the rules to handle early retirement.

ShriyaRishab added a commit to ShriyaRishab/policies that referenced this pull request Jul 12, 2024
ShriyaRishab added a commit to ShriyaRishab/policies that referenced this pull request Jul 12, 2024
@ShriyaRishab

Copy link
Copy Markdown
Contributor

#181 - PR to address benchmark lifecycle and early retirement process.

ShriyaRishab added a commit to ShriyaRishab/policies that referenced this pull request Jul 18, 2024
ShriyaRishab added a commit to ShriyaRishab/policies that referenced this pull request Oct 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants