Tracy Teal
10/09/2024, 4:42 PMDoes it slow down inference if someone uses fine-tuning? If so, how much slower? What's best practice if they need fast inference for multiple calls in bursts?
Marco
10/09/2024, 4:57 PMfinetune_steps
. The more steps we finetune, the longer it takes, but it can also lead to overfitting, so we don't want to finetune too much either.
My advice would be to test different finetuning steps, see how it impacts speed and performance.
Not sure how fast they want inference to be, but even with finetuning, it will be relatively faster than training a model from scratch.Tracy Teal
10/09/2024, 6:45 PM