[AI] AI object mask tool #20378

Open
andriiryzhkov wants to merge 37 commits into darktable-org:master from andriiryzhkov:split/ai-object-mask

Conversation

@andriiryzhkov
Contributor

@andriiryzhkov andriiryzhkov commented Feb 21, 2026

Adds a new mask tool that lets users select objects in the image by clicking. Built on the AI subsystem from #20322.

How it works

AI object mask is an interactive single-object selection tool. The user activates the object mask tool, waits for the image to be encoded (background thread), then clicks to place foreground/background point prompts. The model segments the object in real time. Right-click finalizes the selection by vectorizing the raster mask into Bézier path forms that integrate with darktable's existing mask system.

Architecture

  • Segmentation engine (src/ai/segmentation.c): Implements the two-stage encoder/decoder pipeline. Supports both SAM2.1 (multi-mask + IoU selection + low-res refinement) and SegNext (single mask, full-res refinement). Encoder outputs are cached so multiple clicks don't re-encode.

  • Object mask tool (src/develop/masks/object.c): Runs image encoding in a background thread to keep the UI responsive. Displays a "working..." overlay during encoding. Supports foreground clicks (label 1), background clicks (label 0), and box prompts (SAM only).

  • Raster-to-vector (src/common/ras2vect.c): Extended with cleanup (turdsize), smoothing (alphamax), and boundary sign output for hole detection.
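
The encoder-output caching mentioned above can be sketched as follows. This is a minimal illustrative version, not the actual code from src/ai/segmentation.c; `embedding_cache_t`, `get_embedding`, and `encode` are hypothetical names. The idea is that the expensive encoder runs once per image, and each point-prompt click only re-runs the cheap decoder against the cached embedding:

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical sketch of an encoder-output cache: the embedding is keyed
   on the image identity, so repeated decoder calls reuse it instead of
   re-running the two-stage pipeline's encoder on every click. */
typedef struct embedding_cache_t
{
  uint64_t image_hash;   /* identity of the encoded image */
  float *embedding;      /* encoder output tensor */
  size_t embedding_size; /* number of floats */
} embedding_cache_t;

/* Returns the cached embedding if it matches, otherwise encodes and caches.
   encode() stands in for the real model inference call. */
static const float *get_embedding(embedding_cache_t *cache, uint64_t image_hash,
                                  float *(*encode)(uint64_t, size_t *))
{
  if(cache->embedding && cache->image_hash == image_hash)
    return cache->embedding; /* cache hit: no re-encode on later clicks */

  free(cache->embedding);
  cache->embedding = encode(image_hash, &cache->embedding_size);
  cache->image_hash = image_hash;
  return cache->embedding;
}
```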

Models

The segmentation engine supports both SegNext and SAM model architectures. SegNext is the default — it produces good enough results and is compliant with the Open Source AI Definition. Models are downloaded on demand by the AI subsystem from the model repository: https://github.com/andriiryzhkov/darktable-ai

Depends on #20322
Fixes #12295

@TurboGit
Member

Thanks for this new implementation. I'll test soon.

@TurboGit
Member

@andriiryzhkov : I have created the darktable-org/darktable-ai repository. You should be able to clone this repository and create a PR to initialize it. If needed I can initialize it with the current content of your darktable-ai repo.

@TurboGit TurboGit added this to the 5.6 milestone Feb 22, 2026
@TurboGit TurboGit added labels Feb 22, 2026: priority: low (core features work as expected, only secondary/optional features don't), feature: new (new features to add), difficulty: hard (big changes across different parts of the code base), scope: image processing (correcting pixels)
@andriiryzhkov
Contributor Author

@TurboGit I will create a PR with the models that are ready; I also have some models that are just for further experiments, and I will keep those separate.

@TurboGit
Member

Sounds good to me.

@andriiryzhkov
Contributor Author

@TurboGit can you initialize the darktable-org/darktable-ai repository with some empty file, so that I will be able to fork it?

@andriiryzhkov
Contributor Author

After testing, I can confirm SegNext runs a little bit slower than SAM2.1, though I was able to optimize model loading and inference to make it reasonable.

Mask quality is lower than SAM2.1. I had to tweak mask post-processing — primarily to cut off small fragments around the mask edges.

That said, SegNext works reasonably well as an interactive tool: providing both foreground (positive) and background (negative) points produces decent results.

One additional limitation: SegNext does not support box prompts, which is a minor disadvantage compared to SAM.

@andriiryzhkov
Contributor Author

Update

The AI object mask tool now uses a brush stroke for initial object selection instead of a single click. This was the best way to improve selection quality with the SegNext model: a single click didn't always produce good results, and this model doesn't support box prompts. The brush stroke is resampled into evenly spaced foreground points via arc-length parameterization, providing a much richer prompt for the segmentation decoder. From my experience, the results are significantly better.

How it works

The tool now has two stages:

Stage 1 - Brush selection:

  • Drag over the object to paint a selection stroke
  • A short click also counts as a completed stroke
  • Scroll to adjust brush size
  • Ctrl+scroll to adjust opacity

Stage 2 - Point refinement:

  • Click to add foreground points (+)
  • Shift+click to add background points (-)
  • Alt+click to clear and start over
  • Right-click to apply the final mask

All points (brush and refinement) are kept throughout the session and sent to the decoder together, so each refinement click builds on the full context.
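
A minimal sketch of how such a session-wide prompt store might look (hypothetical structure and names, not darktable's actual code); the labels follow the convention from the PR description, 1 for foreground and 0 for background:

```c
#include <stdlib.h>

/* Hypothetical prompt store: SAM-style point prompts are (x, y, label),
   label 1 = foreground, 0 = background. All points accumulate across the
   session, so each decoder call sees the full context. */
typedef struct prompt_points_t
{
  float *x, *y;
  int *label;
  size_t count, capacity;
} prompt_points_t;

static void prompt_add(prompt_points_t *p, float x, float y, int label)
{
  if(p->count == p->capacity)
  {
    p->capacity = p->capacity ? p->capacity * 2 : 16;
    p->x = realloc(p->x, p->capacity * sizeof(float));
    p->y = realloc(p->y, p->capacity * sizeof(float));
    p->label = realloc(p->label, p->capacity * sizeof(int));
  }
  p->x[p->count] = x;
  p->y[p->count] = y;
  p->label[p->count] = label; /* 1 = foreground click, 0 = background */
  p->count++;
}

/* Alt+click: clear everything and start over */
static void prompt_clear(prompt_points_t *p)
{
  free(p->x); free(p->y); free(p->label);
  p->x = p->y = NULL; p->label = NULL;
  p->count = p->capacity = 0;
}
```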

Performance note

SegNext is a larger model than SAM, so on slower computers it may be less responsive. I had to turn off CoreML and DirectML acceleration because model load and conversion took longer than CPU inference for this model.

Some samples (all with SegNext default model)

[Sample mask images: mask_man, mask_sunflower, mask_monkey, mask_bird]

Object selection is not always perfect and you don't always get a good result on the first try. But overall it already feels quite capable. And this is with a model that meets open-source AI criteria.

@TurboGit
Member

@andriiryzhkov : Where is the model to test? From dt it cannot be downloaded (not found) and I don't see it in the repo.

@andriiryzhkov
Contributor Author

It is still served from my repository:

plugins/ai/repository=andriiryzhkov/darktable-ai

To download models from https://github.com/darktable-org/darktable-ai we first need to create a release there with a version like 5.5.0.x. This will trigger a GitHub Action that converts, packs, and attaches the model packages as assets to the release.

@TurboGit
Member

@andriiryzhkov : Maybe something messed up on my side then.

I do have mask-object-segnext-b2hq:

$ ls -d ~/.local/share/darktable/models/mask-ob*
/home/obry/.local/share/darktable/models/mask-object-segnext-b2hq

But in the UI it has the "not downloaded" status:

[screenshot]

And if I try to download I get:

[screenshot]

And if I try the object mask button:

[screenshot]

So I'm stuck :) Any idea?

@andriiryzhkov
Contributor Author

andriiryzhkov commented Feb 26, 2026

@TurboGit: I should have highlighted this change better: in order to improve model discoverability I decided to limit the models location to the config folder ~/.config/darktable/models/ (instead of ~/.local/share/darktable/models/).

So you just need to move your models:

mv ~/.local/share/darktable/models/mask-object-* ~/.config/darktable/models/

For the download option to work, make sure the parameter in darktablerc looks like this:

plugins/ai/repository=andriiryzhkov/darktable-ai

That is the current default value in this PR.

@TurboGit
Member

~/.config/darktable/models/

Not a good change, because ~/.config/darktable is also used to store the DB, and people can have multiple databases for different work (the --configdir option); we don't want people to have to download the AI models for each one.

The ~/.local/share/darktable/models/ (as the name implies) is a shared directory.

@andriiryzhkov
Contributor Author

Not a good change, because ~/.config/darktable is also used to store the DB, and people can have multiple databases for different work (the --configdir option); we don't want people to have to download the AI models for each one.

I agree. I ran into a problem on Windows with the equivalent of ~/.local/share/darktable/models/, so I made this quick fix. Anyway, I need to research how to improve this.

@andriiryzhkov
Contributor Author

@TurboGit: Fixed in #20322, merged here. Models now use g_get_user_data_dir() instead of config dir.
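
For reference, on Linux g_get_user_data_dir() follows the XDG Base Directory convention: $XDG_DATA_HOME if set, otherwise $HOME/.local/share. A rough plain-C approximation of the resulting models path (illustrative only; `models_dir` is a made-up helper, and the real code should simply call the GLib function, which also resolves the platform-specific folders on Windows and macOS):

```c
#include <stdio.h>
#include <stdlib.h>

/* Approximate g_get_user_data_dir() + "/darktable/models" on Linux:
   use $XDG_DATA_HOME if set and non-empty, else $HOME/.local/share.
   Returns the formatted length, or -1 if neither variable is usable. */
static int models_dir(char *buf, size_t buflen)
{
  const char *xdg = getenv("XDG_DATA_HOME");
  if(xdg && *xdg)
    return snprintf(buf, buflen, "%s/darktable/models", xdg);

  const char *home = getenv("HOME");
  if(!home) return -1;
  return snprintf(buf, buflen, "%s/.local/share/darktable/models", home);
}
```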

@TurboGit
Member

Just tested a bit; the model is indeed quite a bit slower, and I have some issues with the refinement.

Using the integration test image mire1.cr2, brush the bottle on the right. It misses some part on the left of the bottle; clicking to add this area in fact makes the whole area shrink. See the video:

Capture.video.du.2026-02-26.18-28-06.mp4

I first selected the bottle, then clicked twice on the missed area. I was able to reproduce this on 2 of the 3 images tested.

BTW, when the masks are computed we need a busy cursor.

@andriiryzhkov
Contributor Author

SegNext and SAM 2.1 are different models with different design goals, so a direct quality comparison isn't entirely fair. In the context of AI object masking specifically, SegNext is a bit slower and less accurate than SAM 2.1 – though the difference is not huge.

In terms of model choice, we still have two options: proceed with SAM 2.1 (small), which is faster and produces better quality masks but carries Meta's data provenance concerns, or go with SegNext (base), which is cleaner in terms of training data transparency.

No model produces a perfect mask in every situation – and that's fine. After many years working with AI professionally, the question I always come back to is: does this save the user a significant amount of time to achieve the desired result? If yes, it's worth the trade-offs – even if the model isn't technically flawless. With AI object masking, the output is always a path mask, which means any inaccuracies can be refined manually. The user retains full control; the AI just removes the tedious baseline work.

Regarding the bottle selection in the demo – the missing part of the mask is near the edge where two dark bottles are close together. The model simply can't distinguish the boundary in that region. In such situations, no amount of refinement clicks will help: if the model can't perceive the edge, it won't mask it correctly. This is a known limitation of contrast-dependent segmentation, not a bug.

Personally, I would lean towards SAM 2.1 – but I don't want this choice to become a divisive topic for the community. I'm happy to go with whatever direction we reach consensus on.

BTW, when the masks are computed we need a busy cursor.

Noted.

@TurboGit
Member

SegNext and SAM 2.1 are different models with different design goals, so a direct quality comparison isn't entirely fair.

I agree with that, but here I'm not so much comparing as trying to see what could be best for Darktable. I'll continue testing. At the moment SegNext is not compelling.

the question I always come back to is: does this save the user a significant amount of time to achieve the desired result? If yes, it's worth the trade-offs – even if the model isn't technically flawless.

Exactly, and for me at the moment I'm not sure SegNext qualifies.

In terms of model choice, we still have two options: proceed with SAM 2.1 (small), which is faster and produces better quality masks but carries Meta's data provenance concerns

Agreed too; another alternative is to provide both and let the user make the choice.

Personally, I would lean towards SAM 2.1

My feeling at the moment too.

Let me do some more tests. Again thanks for the hard work put on this.

@TurboGit
Member

TurboGit commented Mar 6, 2026

Tested again, and to me SegNext is of poor quality. I would like to also be able to offer SAM 2.1 small from the AI preferences (not as the default); at least this would give a high-quality AI masking option.

Having only SegNext will make Darktable look really bad compared to the alternatives, and in the end this option will certainly not be adopted by most people. This is in the spirit of what I want for Darktable: the freedom for users to choose what they find acceptable for their work, depending on their sensitivity about AI and models.

@andriiryzhkov
Contributor Author

I will prepare a PR to add the SAM 2.1 models to the https://github.com/darktable-org/darktable-ai repository.


Development

Successfully merging this pull request may close these issues.

AI Masks
