feat(TaskProcessing): Add OCR TaskType #56908

marcelklehr · 2025-12-08T11:40:29Z

Summary

Adds a task processing task type for optical character recognition (OCR).

TODO

Ideas for more inputs?

Checklist

Code is properly formatted
Sign-off message is added to all commits
Tests (unit, integration, api and/or acceptance) are included
Screenshots before/after for front-end changes
Documentation (manuals or wiki) will be updated once the PR is merged
Backports requested where applicable (ex: critical bugfixes)
Labels added where applicable (ex: bug/enhancement, 3. to review, feature component)
Milestone added for target branch/version (ex: 32.x for stable32)

janepie · 2025-12-08T12:47:47Z

Looks good! We could add an input for the language of the text to be extracted and have it default to automatic detection, or leave it as an optional input that only the providers that make use of it declare. Both are fine for me, wdyt @julien-nc @kyteinsky?

julien-nc · 2025-12-08T12:50:18Z

Not sure the OCR libraries take a "language" param to help them perform an optimal extraction. @marcelklehr Do they?
If so, I'm ok with adding an input field. It's also fine to let the providers add an optional one as not all the providers might support the param.

marcelklehr · 2025-12-08T12:55:44Z

> Not sure the OCR libraries take a "language" param to help them perform an optimal extraction. @marcelklehr Do they?

The latest models don't require a language input, but older libraries like Tesseract may require one. I think an optional input is fine.
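
As a rough illustration of the optional-input idea discussed here, a provider could declare a `language` entry alongside the required image input. This is a minimal sketch under stated assumptions, not the merged code: the `EShapeType` and `ShapeDescriptor` declarations are simplified stand-ins for the OCP classes so that the snippet runs on its own (PHP 8.1+), and the `language` key, its wording, and the `resolveOcrLanguage()` helper are hypothetical.

```php
<?php
declare(strict_types=1);

// Simplified stand-ins for OCP\TaskProcessing\EShapeType and
// OCP\TaskProcessing\ShapeDescriptor, declared here only so the sketch
// runs outside a Nextcloud server checkout.
enum EShapeType {
	case Text;
	case Image;
}

final class ShapeDescriptor {
	public function __construct(
		private string $name,
		private string $description,
		private EShapeType $shapeType,
	) {
	}

	public function getName(): string {
		return $this->name;
	}

	public function getShapeType(): EShapeType {
		return $this->shapeType;
	}
}

// Hypothetical optional input shape: the 'language' key is an assumption
// made for illustration. Translation via $this->l->t() is omitted.
function ocrOptionalInputShape(): array {
	return [
		'language' => new ShapeDescriptor(
			'Language',
			'Language of the text in the image; leave empty for automatic detection',
			EShapeType::Text,
		),
	];
}

// Hypothetical helper: returns the requested language, or null when the
// input is missing or blank, meaning the provider should auto-detect.
function resolveOcrLanguage(array $input): ?string {
	$language = trim((string)($input['language'] ?? ''));
	return $language === '' ? null : $language;
}
```

A provider wrapping an older library could pass the resolved value on to it, while a provider built on a newer model could simply ignore the field.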

Signed-off-by: Marcel Klehr <[email protected]>

kyteinsky · 2025-12-09T06:31:21Z

lib/public/TaskProcessing/TaskTypes/ImageToTextOpticalCharacterRecognition.php

+	public function getInputShape(): array {
+		return [
+			'input' => new ShapeDescriptor(
+				$this->l->t('Input Image'),
+				$this->l->t('The image to extract text from'),
+				EShapeType::Image
+			),
+		];
+	}


It would be nice if it were a ListOfFiles, so that it can accept both images and PDFs, and several of them in a single task instead of just one, which also keeps the task list in the DB shorter.
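
To make the suggestion concrete, the input shape could be declared with a list-of-files type instead of a single image. The following is a minimal sketch, not a proposed patch: `EShapeType` and `ShapeDescriptor` are simplified stand-ins for the OCP classes so that the snippet runs on its own (PHP 8.1+), the function name `ocrInputShape()` and the label texts are hypothetical, and translation via `$this->l->t()` is omitted.

```php
<?php
declare(strict_types=1);

// Simplified stand-ins for OCP\TaskProcessing\EShapeType and
// OCP\TaskProcessing\ShapeDescriptor, declared here only so the sketch
// runs outside a Nextcloud server checkout.
enum EShapeType {
	case Image;
	case ListOfFiles;
}

final class ShapeDescriptor {
	public function __construct(
		private string $name,
		private string $description,
		private EShapeType $shapeType,
	) {
	}

	public function getShapeType(): EShapeType {
		return $this->shapeType;
	}
}

// Hypothetical variant of getInputShape(): one 'input' entry that takes
// several files (images or PDFs) per task instead of a single image.
function ocrInputShape(): array {
	return [
		'input' => new ShapeDescriptor(
			'Input files',
			'The images or PDF documents to extract text from',
			EShapeType::ListOfFiles,
		),
	];
}
```

With a shape like this, one task would carry a whole batch of files, so the output shape would presumably need a matching list type as well; that part is left open here.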

marcelklehr added this to the Nextcloud 33 milestone Dec 8, 2025

marcelklehr requested a review from a team as a code owner December 8, 2025 11:40

marcelklehr added the "3. to review" (Waiting for reviews) label Dec 8, 2025

marcelklehr requested review from ArtificialOwl, CarlSchwan, icewind1991, julien-nc, kyteinsky and leftybournes and removed request for a team December 8, 2025 11:40

julien-nc approved these changes Dec 8, 2025


marcelklehr force-pushed the feat/tasktype-ocr branch from bca2b42 to e339591 December 8, 2025 11:48

marcelklehr added the "enhancement" and "feature: TaskProcessing" labels Dec 8, 2025

marcelklehr requested a review from janepie December 8, 2025 12:34

janepie approved these changes Dec 8, 2025


marcelklehr force-pushed the feat/tasktype-ocr branch from e339591 to 483a4b2 December 8, 2025 13:49

marcelklehr enabled auto-merge December 8, 2025 13:49

marcelklehr force-pushed the feat/tasktype-ocr branch from 483a4b2 to 42bf379 December 8, 2025 16:41

feat(TaskProcessing): Add OCR TaskType

3355e6a

Signed-off-by: Marcel Klehr <[email protected]>

marcelklehr force-pushed the feat/tasktype-ocr branch from 42bf379 to 3355e6a December 8, 2025 16:44

kesselb disabled auto-merge December 8, 2025 16:45

kesselb merged commit b7b4a3a into master Dec 8, 2025
173 of 179 checks passed

kesselb deleted the feat/tasktype-ocr branch December 8, 2025 16:53

kyteinsky reviewed Dec 9, 2025


6 participants
