tutorial

Tutorials • Fine-tuning Tutorials

Mistral Fine-tuning

This tutorial guides you through fine-tuning the open-source Mistral 7B model on the MoAI Platform.

1. Preparing for Fine-tuning

Preparing the PyTorch script execution environment on the MoAI Platform is similar to doing so on a typical GPU server.

2. Understanding Training Code

Once you have prepared all the training data, let's look at the train_mistral.py script, which runs the actual fine-tuning process.

3. Model Fine-tuning

Now, we will train the model through the following process.

4. Checking Training Results

Running the train_mistral.py script, as in the previous section, will save the resulting model in the

5. Changing the Number of GPUs

Let's rerun the fine-tuning task with a different number of GPUs.

6. Conclusion

From this tutorial, we have seen how to fine-tune the Mistral 7B model on the MoAI Platform.

Tutorials • Fine-tuning Tutorials

Llama3 70B Fine-tuning

This tutorial shows how to fine-tune the open-source Llama3-70B model using the MoAI Platform.

1. MoAI Platform’s Parallelization - Llama3 70B

To fine-tune the Llama3 70B model, you must use multiple GPUs and implement parallelization techniques such as Tensor Parallelism, Pipeline...

2. Preparing for Fine-tuning

For a smooth tutorial experience, the following specifications are recommended:

3. Model Fine-tuning

Now, we will actually execute the fine-tuning process.

4. Conclusion

We have now explored the process of fine-tuning the Llama3-70B model on the MoAI Platform.

Tutorials • Fine-tuning Tutorials

Baichuan2 Fine-tuning

This tutorial introduces an example of fine-tuning the open-source Baichuan2-13B model on the MoAI Platform.

1. Preparing for Fine-tuning

Preparing the PyTorch script execution environment on the MoAI Platform is similar to doing so on a typical GPU server.

2. Understanding Training Code

If you've prepared all the training data, let's now take a look at the contents of train_baichuan2.py

3. Model Fine-tuning

Now, we will train the model through the following process.

4. Checking Training Results

Similar to the previous chapter, when you execute the train_baichuan2_13b.py script, the resulting model will be saved in the

5. Changing the Number of GPUs

Let's rerun the fine-tuning task with a different number of GPUs.

6. Conclusion

So far, we've explored the process of fine-tuning the Baichuan2 13B model, which is publicly available on Hugging Face, using the MoAI Platform.

Tutorials • Fine-tuning Tutorials

Qwen Fine-tuning

This tutorial introduces an example of fine-tuning the open-source Qwen2.5 7B model on the MoAI Platform.

1. Preparing for Fine-tuning

Preparing the PyTorch script execution environment on the MoAI Platform is similar to doing so on a typical GPU server.

2. Understanding Training Code

If you've prepared all the training data, let's now take a look at the train_qwen.py script to actually run the fine-tuning process.

3. Model Fine-tuning

Now, we will train the model through the following process.

4. Checking Training Results

Running the train_qwen.py script, as in the previous chapter, will save the resulting model in the qwen_code_generation

5. Changing the Number of GPUs

Let's rerun the fine-tuning task with a different number of GPUs.

6. Conclusion

So far, we've explored the process of fine-tuning the Qwen2.5 7B model on the MoAI Platform.

Quickstart

Please obtain a container or virtual machine on the MoAI Platform from your infrastructure provider and follow the instructions to connect via SSH.

Tutorials

MNIST on MoAI Platform

Tutorials

Fine-tuning on MoAI Platform

This tutorial introduces an example of fine-tuning the open-source Llama3-8B model on the MoAI Platform.

1. Preparing for Fine-tuning

Setting up the PyTorch execution environment on the MoAI Platform is similar to setting it up on a typical GPU server.

2. Understanding Training Code

Once you have prepared all the training data, let's take a look at the contents of the train_llama3.py

3. Model Fine-tuning

Now, we will actually execute the fine-tuning process.

4. Checking Training Results

When you execute the train_llama3.py script as in the previous section, the resulting model will be saved in the

5. Changing the Number of GPUs

Let's rerun the fine-tuning task with a different number of GPUs.

6. Conclusion

From this tutorial, we have seen how to fine-tune Llama3 8B on the MoAI Platform.

# Tag: tutorial