Project AIxandria
28 Feb 2025
I recently went to a physics teachers’ conference and found out that Colin Hopkins, a retired but still prominent and very active member of the community, has been working on something that I very much care about and want to see succeed. It is not only a fabulous idea that should have been done a long time ago, but also something that I would like to see implemented in every other subject.
For some background, Colin has been collecting past exam papers for longer than the department has access to, and he has been sorting all of the questions from each of those papers out based on concepts/skills, for example Newton’s Laws (specifically F=ma with constant acceleration), and coding each question for the question type (whether it is multiple choice, a calculation or an explanation question).
Colin is now at the stage where all the questions and answers are ready, and the next step he is working on is trying to find an easy way to let teachers convert their class results into a Guttman chart, analyse it for areas of weakness, and then link students to the relevant similar questions in his database for additional practice materials. I want to help him find an AI-assisted solution that can potentially automate part of this process.
I started by simply following his original idea, which was to give all of the data and instructions to ChatGPT to see what happens. Despite how many ideas I thought I had about what could work better, of course it was not so easy. Sometimes questions were missing and some students were not marked, or questions were marked but done incorrectly. By the end it even marked and provided results for students that didn't exist at all. Rubbish in, rubbish out.
The next step was to explore a few more advanced options, such as creating my own 'GPT' or using the API 'Assistants' in the hope of being able to create consistently useful outputs. I watched this video to get myself familiar with the general idea of what to do:
The idea was simple enough, and I had everything I needed to get started. I'll spare you the details, but in the end my long winding adventure into API keys, system instructions and rendering markdown formatting did not result in a useful solution. However, I did discover through the process that LLMs in general were still not reliable enough to handle large amounts of data accurately... and that it is very cheap to use if you know what to ask for. With the right setup, the current (2025) pricing of GPT-4o mini output is USD$0.60, measured by the millions of tokens, which will last for quite some time for most purposes. For comparison, the latest GPT-4.5 costs USD$150 per 1 million output tokens (250 times the price).
Eventually I decided to go back to basics and pursue the path of good old excel. With the help of AI again, I discovered that there was a way to excel formula everything I needed into the sheet, such that you only need to provide student responses and answers and it would take care of the rest. Neat!
Interestingly, the hurdle that took me the longest time to get over was the fact that Google Sheets does not allow for horizontal sorting with a formula. The Microsoft version of Excel in Office is slightly different and it has that function, so everything worked well after the migration.