Replies: 5 comments
-
Any model that is based on HF-Transformers CausalLM should be usable without problems. If you have the compute, you could try fine-tuning a GPT JT model on the OA dataset.
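For reference, a minimal sketch of what fine-tuning a CausalLM checkpoint with HF Transformers could look like. The checkpoint name (`togethercomputer/GPT-JT-6B-v1`), the placeholder data file, and the `text` field are assumptions for illustration, not the project's actual training setup:

```python
# Minimal sketch: load a CausalLM checkpoint and run a short causal-LM
# fine-tuning loop with the HF Trainer. Checkpoint name, data file, and
# field names below are assumptions, not the actual OA training config.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "togethercomputer/GPT-JT-6B-v1"  # assumed GPT-JT checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Placeholder dataset; the real OA data and its format are not specified here.
dataset = load_dataset("json", data_files="oa_conversations.json")["train"]

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="gpt-jt-oa",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=16,
        num_train_epochs=1,
        bf16=True,
        logging_steps=10,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

Since any CausalLM checkpoint is loaded the same way, swapping `model_name` should in principle be all that is needed to try other base models (compute permitting).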
-
Would GPT JT be considered as a pretrained base model to fine-tune into the final model?
-
Yes, that's definitely something we should try. Also probably the larger togethercomputer/GPT-NeoXT-Chat-Base-20B.
-
Yeah, that sounds good. Maybe an Open Assistant model trained from GPT JT can be Open Assistant Large, and one trained from GPT-NeoXT-Chat-Base can be Open Assistant Extra Large.
-
@sanagno what are your thoughts on this?
-
According to Yannic Kilcher’s video about Open Assistant, the MVP is a model like InstructGPT. However, a model similar to InstructGPT already exists, called GPT JT. Can’t we skip training an InstructGPT-like model and use GPT JT instead?