Models

Models are fine-tuned LLMs trained on one or more versions of datasets.

How long do models take to train?

Models typically take 20-30 minutes to train but this all depends on how busy our GPUs are and it can take much longer.

How large are the model weights?

With Mistral 7B the weights (in a .zip file) are usually around 15GB. This varies between base models.

Last updated