Exclusive Content:

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

“Revealing Weak Infosec Practices that Open the Door for Cyber Criminals in Your Organization” • The Register

Warning: Stolen ChatGPT Credentials a Hot Commodity on the...

Evaluating ChatGPT’s Ability to Answer Questions in Natural Science and Engineering through Empirical Testing

Evaluation of ChatGPT’s Answering Capabilities in Natural Science and Engineering Domains: A Study at Delft University of Technology

In our recent study, we delved into the capabilities of ChatGPT within the natural science and engineering domains. The study involved a diverse group of participants from different faculties at Delft University of Technology, including assistant professors, associate professors, full professors, lecturers, Ph.D. students, postdoctoral researchers, and others.

Our evaluation focused on assessing ChatGPT’s answering capabilities across various skill categories and educational levels. The results, as depicted in Figure 1, highlighted several key findings. Firstly, ChatGPT received higher scores for basic and scientific skills compared to skills beyond scientific knowledge. Participants rated the question relatedness of the answers and the level of English highly. However, the model’s critical attitude scored lowest among the assessment criteria, suggesting the need for further verification of results.

Moreover, the assessment of scientific correctness revealed that ChatGPT can provide mostly correct answers for Bachelor level questions and partly correct answers for Master and Ph.D. level questions. It was interesting to note the impact of the answers generated by ChatGPT, with participants mentioning various potential impacts ranging from environmental to safety concerns.

Further analysis of the study variables, including skill categories and educational levels, showed significant influences on the assessment scores. Scientific skills were rated higher than skills beyond scientific knowledge, and answers for lower educational levels received better ratings. Faculty, however, did not show a significant influence on the assessment rating.

The study also included free text comments from participants, providing additional insights into the perceived quality of ChatGPT’s answers. Comments ranged from critiques about lack of detail to comparisons with student answers. Some participants raised concerns about the sources of training data used by ChatGPT and its implications on the generated answers. Emotional reactions were also observed, with a mix of neutral, positive, and negative sentiments expressed in the comments.

Overall, our study sheds light on the strengths and weaknesses of ChatGPT in answering questions related to natural science and engineering. While the model demonstrates competence in certain areas, further improvements are needed, especially in critical thinking and ensuring scientific correctness. As AI continues to shape the future of education and research, studies like ours provide valuable insights for enhancing the capabilities of AI-powered tools in academic settings.

Latest

Creating Age-Responsive, Context-Aware AI Using Amazon Bedrock Guardrails

Ensuring Safe and Reliable AI Responses: A Guardrail-First Approach...

Sephora Introduces ChatGPT App to Enhance AI-Driven Beauty Shopping体验

Sephora Launches AI-Powered App in ChatGPT for Personalized Beauty...

European Robotics Forum Set for Birmingham in 2027

European Robotics Forum 2027: Shaping the Future of Robotics...

Google Launches Global AI Camera Assistant for Live Search

Google Unveils Search Live: Revolutionizing Visual AI and Conversational...

Don't miss

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

Running Your ML Notebook on Databricks: A Step-by-Step Guide

A Step-by-Step Guide to Hosting Machine Learning Notebooks in...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Investing in digital infrastructure key to realizing generative AI’s potential for driving economic growth | articles

Challenges Hindering the Widescale Deployment of Generative AI: Legal,...

Sephora Introduces ChatGPT App to Enhance AI-Driven Beauty Shopping体验

Sephora Launches AI-Powered App in ChatGPT for Personalized Beauty Shopping Experience Sephora Launches ChatGPT App: Revolutionizing Beauty Retail with AI In a groundbreaking move, Sephora has...

OpenAI Introduces ChatGPT Library for Long-Term User Storage Solutions

OpenAI Introduces File Storage and Retrieval Feature for ChatGPT Users ChatGPT users can now store, browse, and retrieve files, enhancing their experience with the AI...

OpenAI Shifts ChatGPT Shopping Strategy Following Disappointing Instant Checkout Performance

OpenAI Repositions ChatGPT as a Product Discovery Tool Amid Checkout Challenges OpenAI Shifts Focus: ChatGPT as a Product Discovery Tool In a notable pivot, OpenAI has...