Darktable AI Models

AI model conversion and packaging pipeline for darktable – an open-source photography workflow application and raw developer (GitHub).

Currently targets the ONNX backend. The pipeline is designed to support additional backends as darktable gains support for other AI runtimes.

Models

Model	Task	Description
`denoise-nafnet`	denoise	NAFNet denoiser trained on SIDD dataset
`denoise-nind`	denoise	UNet denoiser trained on NIND dataset
`mask-object-segnext-b2hq`	mask-object	SegNext ViT-B SAx2 HQ for masking
`upscale-bsrgan`	upscale	BSRGAN 2x and 4x blind super-resolution

Repository structure

pyproject.toml        Project configuration, dependency groups, CLI entry point
darktable_ai/         Python package (CLI + pipeline orchestration)
vendor/               Git submodules (NAFNet, nind-denoise, SegNext)
samples/<task>/        Sample images organized by task
output/               Build output: ONNX models + config.json (gitignored)
temp/                 Downloaded checkpoints (gitignored)
models/
  <model>/
    model.yaml        Model metadata, checkpoints, conversion steps
    convert.py        Model-specific conversion script
    demo.py           Demo inference script
    .skip             If present, skip this model in batch operations and CI

Requirements

Requires uv and Python 3.11–3.12.

Dependencies are managed through dependency groups in pyproject.toml. The base package only needs click and pyyaml. ML dependencies are split into groups — one per model plus a shared core group — so you only install what you need. Use uv sync --group <name> to install a specific group, or --group all-models for everything.

Setup

git clone --recurse-submodules https://github.com/<org>/darktable-ai.git
cd darktable-ai

# Install CLI + core ML dependencies
uv sync --group core

# Or install deps for a specific model
uv sync --group nind

# Or install everything
uv sync --group all-models

Usage

# List available models
uv run dtai list

# Run full pipeline for a single model
uv run dtai run denoise-nind

# Run full pipeline for all models
uv run dtai run

# Run individual steps
uv run dtai setup denoise-nind       # Download checkpoints
uv run dtai convert denoise-nind     # Convert to ONNX + generate config.json
uv run dtai validate denoise-nind    # Validate ONNX output
uv run dtai package denoise-nind     # Create .dtmodel archive
uv run dtai demo denoise-nind        # Run demo on sample images

# Evaluate a model
uv sync --group eval
uv run dtai eval mask mask-object-segnext-b2hq --limit 5

Demos

Each model includes a demo.py script that runs inference on sample images from samples/<task>/. Models that require per-image input (e.g. point prompts for object segmentation) define image_args in their model.yaml.

Output images are saved to models/<model>/output/.

Model selection criteria

Darktable is free software licensed under GPL-3.0. All AI models included in this repository are selected with the following principles in mind.

Open source compliance

Each model card documents the following and must meet the stated requirements:

GPL-3.0-compatible license. Model weights must be released under a license compatible with GPL-3.0 (e.g. Apache-2.0, MIT, BSD, GPL-3.0). Proprietary or non-commercial-only models are not accepted.
OSAID v1.0 classification. Open Source AI, Open Weights, or Open Model.
MOF classification. Class I (Open Science), Class II (Open Tooling), or Class III (Open Model).
Training data license. Specific license(s) for each training dataset.
Training data provenance. Where data came from and how it was collected. Models trained on undisclosed or scraped personal data without consent are not accepted.
Training code availability. Link to public training code under an open-source license.
Known limitations. What cannot be audited or verified (e.g. non-releasable pre-training data, non-OSI training data licenses).

Published research

Peer-reviewed or public report. Models should have an accompanying peer-reviewed paper or public technical report describing the architecture and training methodology.

Responsible use

Privacy by design. All inference runs locally on the user's machine. No data is sent to external services. No telemetry, no cloud dependencies.
Purpose-limited scope. Models are selected for photo editing tasks: denoising, masking, depth estimation, and object removal (inpainting), etc. We do not include models designed for generating, manipulating, or synthesizing human likenesses.
Reproducibility. Conversion scripts, model configurations, and source references are fully documented so that any user can verify and rebuild the ONNX models from the original checkpoints.

Adding a new model

Create models/<model>/model.yaml with model metadata, checkpoint URLs, and conversion steps
Create models/<model>/convert.py with model-specific conversion logic
Create models/<model>/demo.py with inference script
Create models/<model>/README.md with the model card (see below)
Add sample images to samples/<task>/
If the model depends on an external repo, add it as a git submodule under vendor/
Add a dependency group to pyproject.toml if the model needs extra packages
Run uv run dtai run <model> to build and test

convert.py

The conversion script must expose a convert() function that the pipeline calls directly. It receives keyword arguments matching the args dict in model.yaml (with template variables resolved). Keep main() with argparse for standalone use.

def convert(checkpoint, output, opset=17, fp16=False):
    """Entry point called by the pipeline."""
    model = load_model(checkpoint)
    export_to_onnx(model, output, opset_version=opset, fp16=fp16)

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--checkpoint", required=True)
    parser.add_argument("--output", required=True)
    parser.add_argument("--opset", type=int, default=17)
    parser.add_argument("--fp16", action="store_true")
    args = parser.parse_args()
    convert(args.checkpoint, args.output, args.opset, args.fp16)

if __name__ == "__main__":
    main()

The corresponding model.yaml args use Python keyword names (not CLI flags):

convert:
  - script: convert.py
    args:
      checkpoint: "{temp}/model.pth"
      output: "{output}/model.onnx"
      opset: 17
      fp16: true

Available template variables: {root}, {model_dir}, {temp}, {output}, {repo}.

demo.py

The demo script must expose a demo() function. The first arguments depend on the model type:

single (type: single): demo(model, image, output, **kwargs)
split (type: split): demo(encoder, decoder, image, output, **kwargs)
multi (type: multi): demo(model_dir, image, output, **kwargs)

The pipeline passes image and output paths automatically. Any per-image arguments defined in model.yaml under demo.image_args are passed as extra **kwargs.

def demo(model, image, output, **kwargs):
    """Entry point called by the pipeline."""
    run_inference(model, image, output)

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--model", required=True)
    parser.add_argument("--image", required=True)
    parser.add_argument("--output", required=True)
    args = parser.parse_args()
    demo(args.model, args.image, args.output)

if __name__ == "__main__":
    main()

Model card (README.md)

Each model directory must include a README.md documenting:

Source – repository URL, paper reference, license
Architecture – brief description of the model architecture
ONNX Models – input/output tensor names, shapes, data types, normalization, tiling support
Selection Criteria – a table covering all items from the model selection criteria:

Property	Value
Model license	(e.g. Apache-2.0)
OSAID v1.0	(e.g. Open Source AI)
MOF	(e.g. Class II)
Training data license	...
Training data provenance	...
Training code	(link)
Known limitations	...
Published research	(link to paper)
Inference	Local only, no cloud dependencies
Scope	(e.g. Image denoising)
Reproducibility	Full pipeline

See existing model READMEs for examples.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
darktable_ai		darktable_ai
models		models
samples		samples
vendor		vendor
.gitignore		.gitignore
.gitmodules		.gitmodules
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Darktable AI Models

Models

Repository structure

Requirements

Setup

Usage

Demos

Model selection criteria

Open source compliance

Published research

Responsible use

Adding a new model

convert.py

demo.py

Model card (README.md)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Darktable AI Models

Models

Repository structure

Requirements

Setup

Usage

Demos

Model selection criteria

Open source compliance

Published research

Responsible use

Adding a new model

convert.py

demo.py

Model card (README.md)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages