Comment on “Comparing the Performance of College Chemistry Students with ChatGPT for Calculations Involving Acids and Bases”

Journal of Chemical Education (2024)

Abstract
In a recent paper in this Journal, Clark et al. evaluated the performance of the GPT-3.5 large language model (LLM) on ten undergraduate pH calculation problems. They reported that GPT-3.5 gave especially poor results for salt and titration problems, returning correct results only 10% and 0% of the time, respectively, and that, despite correctly applying heuristics, the LLM made mathematical errors and used flawed strategies. However, these problems are partially mitigated by the more advanced GPT-4 model and entirely corrected by the simple prompting and calculator tool use patterns demonstrated herein.
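As an illustration of the kind of arithmetic a calculator tool can take over from the model, the following is a minimal sketch of a salt pH calculation. It is not taken from the paper: the function name `salt_ph_weak_acid_anion`, the 0.10 M sodium acetate example, and the constants (Ka = 1.8e-5 for acetic acid, Kw = 1.0e-14 at 25 °C) are assumptions chosen for demonstration only.

```python
import math

KW = 1.0e-14  # assumed ion product of water at 25 °C


def salt_ph_weak_acid_anion(concentration: float, ka: float) -> float:
    """Return the pH of a solution of the salt of a weak acid (e.g. sodium acetate).

    The anion A- hydrolyzes: A- + H2O <-> HA + OH-, with Kb = Kw / Ka.
    Solving x^2 + Kb*x - Kb*C = 0 for x = [OH-] avoids the small-x approximation.
    """
    if concentration <= 0 or ka <= 0:
        raise ValueError("concentration and ka must be positive")
    kb = KW / ka
    # Positive root of the quadratic gives the equilibrium hydroxide concentration.
    hydroxide = (-kb + math.sqrt(kb * kb + 4.0 * kb * concentration)) / 2.0
    poh = -math.log10(hydroxide)
    return 14.0 - poh  # pH + pOH = 14 under the assumed Kw


if __name__ == "__main__":
    # Hypothetical example: 0.10 M sodium acetate, Ka(acetic acid) = 1.8e-5
    print(round(salt_ph_weak_acid_anion(0.10, 1.8e-5), 2))
```

In a tool-use pattern of this kind, the model would choose the strategy (hydrolysis of the conjugate base) and delegate the numerical steps to code like this instead of computing the logarithm and square root itself.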
Keywords
General Public, High-School/Introductory Chemistry, First-Year Undergraduate/General, Second-Year Undergraduate, Internet/Web-Based Learning, Misconceptions, Problem-Solving/Decision Making, Testing/Assessment, Acids/Bases, pH, Generative AI, Large Language Models, ChatGPT, GPT-4