AI summary: A recent paper suggesting that GPT-4’s performance has degraded over time is being challenged. The paper’s methodology and evaluation criteria are under scrutiny, with critics arguing that the perceived degradation is more about changes in the AI’s behavior, not its capabilities. The debate highlights the complexity of evaluating language models and the potential for fine-tuning to unintentionally alter AI behavior.
Read more…