How to align LLMs with rules without human annotations?
Iterative Graph Alignment
August 30, 2024
https://arxiv.org/pdf/2408.16667This research paper introduces Iterative Graph Alignment (IGA), a new method for aligning LLMs with rules without human intervention.
IGA uses a "teacher" VLM to create logical graphs representing the reasoning behind correct answers. A "student" LLM learns from these graphs and compares its own responses to identify and address gaps in its understanding. This multi-agent approach enables the LLM to self-improve and become more adept at following specific rules in open-ended conversations.