admin管理员组

文章数量:1399944

My goal is to create a chat bot specialized in answering questions related to diabetes.

I am new to fine tuning and have a couple questions before I begin. My question is about the dataset format and the underlying model I should use.

I want to fine tune the LLM on the following dataset - I am thinking of using the Alpaca format - make a prompt with ##[Instruction] ##[Input] ##[Output] - (). However, I want to use the rationale and explanation column from the database. How should I incorporate it? If not rationale, I want to incorporate explanation column while fine tuning.

I plan to use the base model rather than the instruct model. Will that be the right choice?

Seeking guidance, Thanks!

本文标签: large language modelHow to incorporate additional data in fine tuning LLMStack Overflow