Training an AI Assistant for STEM Education
Developed an AI assistant for STEM students together with a team of colleagues. The work covered data gathering and preparation, training a reward model (DeBERTa), and fine-tuning a large language model (distilled GPT-2). Used an Anthropic-style Constitutional AI approach to bias the model towards clear, correct, complete, and rigorous answers.