LLMs, reliability & the scientific process
Are we becoming too lenient?
Are we becoming too lenient?
Passing long lists of texts to categorise
Looking at the differences between predicted and effective realisation
Some quick notes - itβs a tool, not a oracle
Looking at the differences between predicted and effective realisation