[Feature Request] Support for MiniCPM-V #92

DarioPTWR · 2024-11-30T17:44:26Z

Required prerequisites

I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
Consider asking first in a Discussion.

Motivation

Hi! Love the work done in this repo. I see that DPO currently supports various Text+Image -> Text models, and was wondering if you could extend this support to the MiniCPM-V MLLM as well? That would be greatly appreciated!

MiniCPM-V (i.e., OmniLMM-3B) is an efficient version with promising performance for deployment. The model is built based on SigLip-400M and MiniCPM-2.4B.

Thank you!

Solution

No response

Alternatives

No response

Additional context

No response

Gaiejj · 2024-12-01T08:32:29Z

Thank you very much for your support and recognition of our work! We are preparing to refactor the chat_template, which will facilitate the integration of new models. In this PR, we will also update the support for MiniCPM-V. Stay tuned!

DarioPTWR · 2024-12-01T10:12:35Z

Thank you! Can't wait to see the update soon for MiniCPM-V 1.0! In the meantime, I was just wondering if it would still be possible to apply DPO retraining to MiniCPM-V with the current state of the repo? Or would it be completely impossible? Thanks!

Gaiejj · 2024-12-02T14:40:42Z

We have implemented DPO fine-tuning for MiniCPM-V in #93 (SFT and RLHF are on the way). We cordially invite you to experience this change as a beta user.

You can clone this repository using the following command.

git clone https://github.com/Gaiejj/align-anything.git -b dev-refactor

Then, install the necessary dependencies using the command provided below, with conda installed with CUDA.

conda install nvidia/label/cuda-12.2.0::cuda
export CUDA_HOME=$CONDA_PREFIX

pip install -e .[train]
pip install -e .[minicpmv]

After that, you only need to run the following script under the scripts folder:

./minicpmv_dpo.sh

If you feel that there is any anomaly or are not satisfied with the training, we sincerely hope that you can provide us with feedback, as it is very important to us.

DarioPTWR · 2024-12-02T15:36:26Z

Thank you so much! Will test it out on my end, and comment in this PR again if there are any anomalies spotted.

DarioPTWR added the enhancement New feature or request label Nov 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Support for MiniCPM-V #92

[Feature Request] Support for MiniCPM-V #92

DarioPTWR commented Nov 30, 2024

Gaiejj commented Dec 1, 2024

DarioPTWR commented Dec 1, 2024

Gaiejj commented Dec 2, 2024 •

edited

Loading

DarioPTWR commented Dec 2, 2024

[Feature Request] Support for MiniCPM-V #92

[Feature Request] Support for MiniCPM-V #92

Comments

DarioPTWR commented Nov 30, 2024

Required prerequisites

Motivation

Solution

Alternatives

Additional context

Gaiejj commented Dec 1, 2024

DarioPTWR commented Dec 1, 2024

Gaiejj commented Dec 2, 2024 • edited Loading

DarioPTWR commented Dec 2, 2024

Gaiejj commented Dec 2, 2024 •

edited

Loading