Conversation
Pull Request Overview
This PR adds support for the Ming model in LLaMA-Factory, a popular fine-tuning framework. The integration enables users to fine-tune the Ming-Lite-Omni multimodal model using LLaMA-Factory's tools and configuration system.
Key changes:
- Adds a comprehensive patch file that integrates Ming model support into LLaMA-Factory
- Provides a YAML configuration file for LoRA-based supervised fine-tuning
- Updates documentation with detailed setup and usage instructions
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| ming.patch | Comprehensive patch file adding Ming model support to LLaMA-Factory including multimodal processing, template registration, and model configuration |
| ming_lora_sft.yaml | Configuration file for LoRA-based supervised fine-tuning of Ming model with training parameters and dataset settings |
| README.md | Documentation updates with LLaMA-Factory setup instructions and usage examples, plus a Docker command fix |
Comments suppressed due to low confidence (1)
ming.patch:1
- Large blocks of commented-out code should be removed rather than left in the codebase. If this audio processing functionality is planned for future implementation, consider adding a TODO comment with context instead.
```diff
diff --git a/src/llamafactory/chat/hf_engine.py b/src/llamafactory/chat/hf_engine.py
```
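One way to act on this suggestion, sketched as a standalone function (the name `maybe_process_audios` is a hypothetical stand-in, and the intended preprocessing is inferred from the commented-out patch code):

```python
def maybe_process_audios(audios):
    """Placeholder for Ming audio preprocessing (hypothetical helper)."""
    # TODO(audio): implement audio preprocessing per the commented-out patch
    # code: regularize waveforms to 16 kHz, then pair each waveform tensor
    # with its sampling rate.
    if audios:
        raise NotImplementedError("Audio inputs are not supported yet.")
    return []
```

Failing loudly on unsupported inputs is clearer than leaving dead code: the TODO documents the intent, and the guard prevents silent misbehavior if audio data is ever passed in.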
```yaml
plot_loss: true
overwrite_output_dir: true
save_only_model: false
report_to: none  # choices: [none, wandb, tensorboard, swanlab, mlflow]
```
The inline comment lists `swanlab` as a choice, which appears to be a typo: it should likely be `wandb` (Weights & Biases), or be removed if it is not a valid option.
Suggested change:

```yaml
report_to: none  # choices: [none, wandb, tensorboard, mlflow]
```
```diff
+ # if len(audios) != 0:
+ #     sampling_rate = 16000
+ #     audios = self._regularize_audios(audios, sampling_rate=sampling_rate)["audios"]
+ #     audios = [(torch.tensor(audio), sampling_rate)for audio in audios]
```
Missing space before `for` in the list comprehension; it should read `[(torch.tensor(audio), sampling_rate) for audio in audios]`.
```diff
+ image_grid_thw = image_inputs["image_grid_thw"]
+
+ if len(videos):
+     # assert len(videos) <= 1, "Video count must be at most 1!"
```
The commented-out assertion should either be removed or, if the constraint is still valid, enforced through proper validation.
Suggested change (replacing the commented-out assertion):

```diff
+ if len(videos) > 1:
+     raise ValueError("Video count must be at most 1!")
```
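A reason to prefer the raised exception over an `assert`: assertions are stripped when Python runs with the `-O` flag, so they cannot be relied on for input validation. A standalone sketch (the function name is illustrative, not from the patch):

```python
def check_video_count(videos):
    # Unlike `assert`, this check survives `python -O`, which removes
    # assert statements at compile time.
    if len(videos) > 1:
        raise ValueError("Video count must be at most 1!")
    return videos
```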
```diff
+ # if len(audios):
+ #     audio_inputs = self._get_mm_inputs([], [], audios, processor)
+ #     audio_feats_lengths = audio_inputs["encoder_feats_lengths"]
```
Another block of commented-out code that should be removed or properly implemented. This creates inconsistency with the audio processing logic above.
Suggested change (uncommenting the block):

```diff
+ if len(audios):
+     audio_inputs = self._get_mm_inputs([], [], audios, processor)
+     audio_feats_lengths = audio_inputs["encoder_feats_lengths"]
```
```text
├── am.mvn
├── audio_detokenizer
├── ...
├── inclutionAI
```
Typo in directory name - should be 'inclusionAI' to match the actual organization name referenced elsewhere in the documentation.
Suggested change:

```text
├── inclusionAI
```