I am a developer on the Internet Computer Protocol blockchain, and I want to fine-tune GPT to learn our native language, Motoko. We are seeing exponential growth in new developers, and last month we ranked No. 1 in GitHub commits among Layer 1 blockchains, so I am confident our existing example code is enough for fine-tuning.
My initial approach is to fetch GitHub repos tagged with the Motoko language and then use the self-instruct technique (like Alpaca did with LLaMA) to generate an instruction prompt for each piece of code. I would like some critique before I dive in. What do you think of this approach to dataset generation?
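Concretely, the step I have in mind looks roughly like this. The search URL uses GitHub's real `language:` qualifier, but the prompt wording and the `build_instruction_prompt` helper are my own illustrative choices, not a tested pipeline:

```python
# Sketch of the dataset-generation idea. GitHub's search API can filter
# repositories by language, e.g.:
SEARCH_URL = "https://api.github.com/search/repositories?q=language:motoko"

def build_instruction_prompt(code: str) -> str:
    """Wrap a scraped Motoko snippet in an Alpaca-style meta-prompt that
    asks the model to invent the instruction a developer might have given
    for it (inverting code -> instruction, as in self-instruct)."""
    return (
        "Below is a Motoko program. Write the instruction a developer "
        "might have given for which this program is the answer.\n\n"
        "### Code:\n"
        f"{code}\n\n"
        "### Instruction:"
    )

snippet = 'actor { public func greet(name : Text) : async Text { "Hello, " # name } }'
print(build_instruction_prompt(snippet))
```

Each generated (instruction, code) pair would then become one fine-tuning example.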
Additionally, I am open to suggestions for building a plug-in rather than fine-tuning. If a plug-in is the better route, what responses should my backend server return to users' inquiries made through GPT?
Fine-tuning is currently only available for the following base models: davinci, curie, babbage, and ada. These are the original models that do not have any instruction-following training (like text-davinci-003 does, for example).
I would try a few different approaches.
Upload the documentation to a vector database and create a LangChain tool for the model to query it.
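A toy sketch of that retrieval idea: chunk the docs, embed each chunk (here with a trivial bag-of-words vector; a real setup would use an embedding model and a vector store), and return the chunk nearest the query by cosine similarity. LangChain would wrap a lookup like this as a tool the model can call; the example docs and names below are made up:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Stand-in for a real embedding model: bag-of-words term counts.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

docs = [
    "actors in Motoko encapsulate state and communicate via async messages",
    "the # operator concatenates Text values in Motoko",
]
index = [(doc, embed(doc)) for doc in docs]

def retrieve(question: str) -> str:
    # Return the doc chunk most similar to the question.
    qv = embed(question)
    return max(index, key=lambda pair: cosine(qv, pair[1]))[0]

print(retrieve("how do I concatenate text"))
```

The retrieved chunk would then be pasted into the prompt as context for the model's answer.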
Give it the documentation directly in its system message. I'm unsure how many tokens that takes up.
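You can sanity-check the token cost with OpenAI's rough rule of thumb of about 4 characters per token for English text (a real check would use the tiktoken library; the threshold below is an arbitrary example budget):

```python
def estimate_tokens(text: str) -> int:
    # Coarse heuristic: ~4 characters per token for English text.
    return max(1, len(text) // 4)

docs = "Motoko is a language designed for the Internet Computer. " * 200
tokens = estimate_tokens(docs)

# gpt-3.5-turbo's context window is 4,096 tokens; leave headroom for the
# conversation itself, so budget the docs at (say) 3,000 tokens.
fits = tokens < 3000
print(tokens, fits)
```

If the docs blow past the budget, that points back at the vector-database approach instead.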