Recent large language models (LLMs) have shown impressive performance across a diverse array of tasks. However, their use in high-stakes or computationally constrained environments has highlighted the ...