How to choose the best LLMs for merging?
EXPLORING MODEL KINSHIP FOR MERGING LARGE LANGUAGE MODELS
October 17, 2024
https://arxiv.org/pdf/2410.12613
This paper introduces "model kinship", a metric for the degree of similarity between large language models (LLMs), and uses it to guide model merging. The researchers found that repeatedly merging high-performing LLMs leads to performance stagnation because the merged models become too similar to one another. They propose a new merging strategy that uses model kinship to identify and merge more diverse LLMs, resulting in better performance and faster convergence.
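The summary above does not say how kinship is computed, so the following is only a minimal sketch under stated assumptions: kinship is taken to be the Pearson correlation between two models' weight deltas relative to a shared base model, and models are represented as flat lists of floats. The function names (`delta`, `kinship`, `least_related_pair`) and the toy weights are hypothetical, chosen for illustration, and are not the paper's code.

```python
import math
from itertools import combinations


def delta(finetuned, base):
    """Weight change of a fine-tuned model relative to the shared base."""
    return [f - b for f, b in zip(finetuned, base)]


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        return 0.0
    return cov / (norm_x * norm_y)


def kinship(model_a, model_b, base):
    """ASSUMPTION: kinship = correlation of the two models' deltas from the base."""
    return pearson(delta(model_a, base), delta(model_b, base))


def least_related_pair(models, base):
    """Return the pair of model names with the lowest kinship (most diverse pair)."""
    return min(
        combinations(sorted(models), 2),
        key=lambda pair: kinship(models[pair[0]], models[pair[1]], base),
    )


# Toy weights (hypothetical): "a" and "b" moved in nearly the same direction,
# while "c" moved in the opposite direction from "a".
base = [0.0, 0.0, 0.0, 0.0]
models = {
    "a": [1.0, 2.0, 3.0, 4.0],
    "b": [1.1, 2.1, 2.9, 4.2],
    "c": [4.0, 3.0, 2.0, 1.0],
}

if __name__ == "__main__":
    for left, right in combinations(sorted(models), 2):
        print(left, right, round(kinship(models[left], models[right], base), 3))
    print("most diverse pair:", least_related_pair(models, base))
```

In this toy setup, "a" and "b" have high kinship, so merging them would add little diversity, while the pair selected by `least_related_pair` is the kind of candidate a kinship-aware strategy would consider. A real pipeline would compute the deltas from actual model checkpoints and would also weigh each model's task performance, which this sketch leaves out.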