docs(flaky-tests): document AI Investigation tab by samgutentag · Pull Request #539 · trunk-io/docs

samgutentag · 2026-04-11T11:00:25Z

Summary

Adds a new page flaky-tests/ai-investigation.md documenting the Analysis tab on the Flaky Tests test detail page (trunk-io/trunk2#3504)
Covers viewing investigation results, triggering new investigations, applying fix recommendations, and viewing investigation history
Documents all seven fact types surfaced in Key Findings
Adds the page to summary.md

Source

trunk2 PR: https://github.com/trunk-io/trunk2/pull/3504

Test plan

Preview in GitBook

Adds a new page covering the Analysis tab introduced in trunk-io/trunk2#3504. Documents how to view AI investigation results, trigger new investigations, apply fix recommendations, and browse investigation history. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

trunk-staging-io · 2026-04-11T11:00:29Z

Merging to main in this repository is managed by Trunk.

To merge this pull request, check the box to the left or comment /trunk merge below.

After your PR is submitted to the merge queue, this comment will be automatically updated with its status. If the PR fails, failure details will also be posted here

claude · 2026-04-11T11:00:53Z

Claude finished @samgutentag's task in 1m 39s —— View job

PR Review

Gather context and read files
Review flaky-tests/ai-investigation.md
Review summary.md changes
Post inline review comments

Overall the page is well-structured, follows the repo's documentation conventions, and all relative links resolve to existing files. Two issues found:

Blocking

Line 31 — Confidence score ranges overlap: "yellow for 50%+" includes the 80%+ range already labeled green. The three bands need to be mutually exclusive (e.g., green 80%+, yellow 50–79%, orange below 50%). Inline suggestion posted.

Non-blocking

Line 55 — The "CI Logs" fact type description reads as if Trunk is augmenting test failure outputs rather than analyzing CI logs. Inline suggestion posted for a clearer phrasing, but not blocking.

claude · 2026-04-11T11:02:28Z

+
+At the top, you will see:
+
+- The **overall confidence score** (color-coded green for 80%+, yellow for 50%+, orange below 50%)


The confidence score ranges overlap: "yellow for 50%+" technically includes the 80%+ range already described as green. The three bands should be mutually exclusive.

Suggested change

- The **overall confidence score** (color-coded green for 80%+, yellow for 50%+, orange below 50%)

- The **overall confidence score** (color-coded green for 80%+, yellow for 50–79%, orange below 50%)

claude · 2026-04-11T11:02:29Z

+
+| Fact type | What it analyzes |
+|---|---|
+| **CI Logs** | Supplements test failure outputs with CI workflow logs |


The phrasing "Supplements test failure outputs with CI workflow logs" reads as though the fact type is augmenting the outputs rather than analyzing the logs themselves. Suggest clarifying the direction:

Suggested change

| **CI Logs** | Supplements test failure outputs with CI workflow logs |

| **CI Logs** | CI workflow logs that provide additional context beyond test failure output |

claude Bot reviewed Apr 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(flaky-tests): document AI Investigation tab#539

docs(flaky-tests): document AI Investigation tab#539
samgutentag wants to merge 1 commit intomainfrom
sam-gutentag/flaky-tests-ai-investigation-tab

samgutentag commented Apr 11, 2026

Uh oh!

trunk-staging-io Bot commented Apr 11, 2026

Uh oh!

claude Bot commented Apr 11, 2026 •

edited

Loading

Uh oh!

claude Bot Apr 11, 2026

Uh oh!

claude Bot Apr 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants


		At the top, you will see:

		- The overall confidence score (color-coded green for 80%+, yellow for 50%+, orange below 50%)

	\| CI Logs \| Supplements test failure outputs with CI workflow logs \|
	\| CI Logs \| CI workflow logs that provide additional context beyond test failure output \|

Conversation

samgutentag commented Apr 11, 2026

Summary

Source

Test plan

Uh oh!

trunk-staging-io Bot commented Apr 11, 2026

Uh oh!

claude Bot commented Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review

Uh oh!

claude Bot Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

claude Bot commented Apr 11, 2026 •

edited

Loading