
SimCTG BART training #27

Open
rahulseetharaman opened this issue May 6, 2023 · 0 comments
rahulseetharaman commented May 6, 2023

Hi @yxuansu, thanks for the wonderful library.

I am trying to use the SimCTG framework to train a BART model for a question-generation task, and I am running into the following error:

  File "experiment-5/simctg_train.py", line 224, in <module>
    train()
  File "experiment-5/simctg_train.py", line 122, in train
    mle_loss, cl_loss = simctgloss(last_hidden_states=last_hidden_states, logits=logits,
  File "/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "simctg/lossfunction.py", line 93, in forward
    assert labels.size() == input_ids.size()
AssertionError

Looking at the loss function, I can see why this happens. Is the loss function designed to support only decoder-only models such as GPT? How can it be adapted for BART and T5? For encoder-decoder models like BART and T5, the assertion that input_ids and labels have the same dimensions need not hold, since the labels align with the decoder sequence rather than the encoder input.
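For reference, here is a minimal sketch of how the loss might be adapted for a seq2seq model. This is my own hypothetical rewrite, not the library's actual code: `simctg_seq2seq_loss` and `contrastive_loss` are made-up names, and I am assuming the SimCTG recipe of an MLE term plus a hinge-style contrastive term over pairwise cosine similarities of token representations. The key change is that the shape check and the contrastive term use the *decoder* hidden states and `labels`, never `input_ids`:

```python
import torch
import torch.nn.functional as F


def contrastive_loss(margin, score_matrix):
    # score_matrix: [bsz, tgt_len, tgt_len] cosine similarities between
    # decoder token representations.
    bsz, seqlen, _ = score_matrix.size()
    # Self-similarity of each token (the diagonal) acts as the "gold" score.
    gold_score = torch.diagonal(score_matrix, dim1=1, dim2=2).unsqueeze(-1)
    # Hinge: penalize negatives that come within `margin` of the gold score.
    loss_matrix = torch.clamp(margin - (gold_score - score_matrix), min=0.0)
    # Mask the diagonal: a token is not its own negative.
    eye = torch.eye(seqlen, device=score_matrix.device).unsqueeze(0)
    loss_matrix = loss_matrix * (1.0 - eye)
    return loss_matrix.mean()


def simctg_seq2seq_loss(last_hidden_states, logits, labels,
                        margin=0.5, pad_id=-100):
    # last_hidden_states: decoder outputs, [bsz, tgt_len, hidden]
    # logits: [bsz, tgt_len, vocab]; labels: [bsz, tgt_len]
    # Compare labels against the decoder length, NOT input_ids.
    assert labels.size() == logits.size()[:2]
    mle_loss = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        labels.reshape(-1),
        ignore_index=pad_id,
    )
    # Cosine similarity matrix over decoder token representations.
    norm_states = F.normalize(last_hidden_states, dim=-1)
    score_matrix = torch.matmul(norm_states, norm_states.transpose(1, 2))
    cl_loss = contrastive_loss(margin, score_matrix)
    return mle_loss, cl_loss
```

With BART/T5 you would pass `outputs.decoder_hidden_states[-1]` (or `outputs.last_hidden_state` of the decoder) together with `outputs.logits` and the `labels` tensor, so all three share the target-sequence length.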
