The best method for comparing two GPT-2 models

Jack_im

I have two GPT-2 345M models that were fine-tuned with different learning rates. Is there a method to compare them and find out which one is better?

Answers

Mike1191

It’s hard to answer as it depends on the task. Can you provide more details?

Jack_im

The models were fine-tuned with learning rates of 1e-4 and 1e-6, respectively. Does that help?

Mike1191

You can use a metric such as BLEU to measure the quality of the generated samples. For sample diversity, you need to measure the predicted probability distribution against the true distribution.
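As a rough illustration of both ideas, here is a minimal sketch in plain Python. It assumes you already have tokenized samples with matching references, and per-token log-probabilities (natural log) that each model assigns to the same held-out text. The helper names `sentence_bleu` and `perplexity` are made up for this example and are not from any library. The BLEU here is a simplified single-reference version without smoothing, and perplexity is one common way to score predicted probabilities against real text, which may or may not be exactly what was meant above.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams of order n in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=2):
    """Simplified single-reference BLEU (hypothetical helper).

    Geometric mean of clipped n-gram precisions for orders 1..max_n,
    multiplied by the brevity penalty. No smoothing is applied, so a
    zero precision at any order gives a score of 0.
    """
    if not candidate:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        cand = ngram_counts(candidate, n)
        ref = ngram_counts(reference, n)
        total = sum(cand.values())
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
        if total == 0 or overlap == 0:
            return 0.0
        log_precision_sum += math.log(overlap / total)
    # Brevity penalty: penalize candidates shorter than the reference.
    c, r = len(candidate), len(reference)
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(log_precision_sum / max_n)


def perplexity(token_log_probs):
    """exp of the negative mean log-probability over held-out tokens."""
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Made-up inputs, for illustration only.
reference = "the cat sat down".split()
sample = "the cat sat".split()
print("BLEU:", sentence_bleu(sample, reference))

# Hypothetical per-token log-probs from one model on held-out text.
print("Perplexity:", perplexity([math.log(0.25)] * 4))
```

To compare the two fine-tuned models, you would compute the same scores for each on the same held-out data: higher BLEU and lower perplexity are usually read as better, though which one matters more depends on the task.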
