Exclusive Content:

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

“Revealing Weak Infosec Practices that Open the Door for Cyber Criminals in Your Organization” • The Register

Warning: Stolen ChatGPT Credentials a Hot Commodity on the...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Evaluating ChatGPT’s Ability to Answer Questions in Natural Science and Engineering through Empirical Testing

Evaluation of ChatGPT’s Answering Capabilities in Natural Science and Engineering Domains: A Study at Delft University of Technology

In our recent study, we delved into the capabilities of ChatGPT within the natural science and engineering domains. The study involved a diverse group of participants from different faculties at Delft University of Technology, including assistant professors, associate professors, full professors, lecturers, Ph.D. students, postdoctoral researchers, and others.

Our evaluation focused on assessing ChatGPT’s answering capabilities across various skill categories and educational levels. The results, as depicted in Figure 1, highlighted several key findings. Firstly, ChatGPT received higher scores for basic and scientific skills compared to skills beyond scientific knowledge. Participants rated the question relatedness of the answers and the level of English highly. However, the model’s critical attitude scored lowest among the assessment criteria, suggesting the need for further verification of results.

Moreover, the assessment of scientific correctness revealed that ChatGPT can provide mostly correct answers for Bachelor level questions and partly correct answers for Master and Ph.D. level questions. It was interesting to note the impact of the answers generated by ChatGPT, with participants mentioning various potential impacts ranging from environmental to safety concerns.

Further analysis of the study variables, including skill categories and educational levels, showed significant influences on the assessment scores. Scientific skills were rated higher than skills beyond scientific knowledge, and answers for lower educational levels received better ratings. Faculty, however, did not show a significant influence on the assessment rating.

The study also included free text comments from participants, providing additional insights into the perceived quality of ChatGPT’s answers. Comments ranged from critiques about lack of detail to comparisons with student answers. Some participants raised concerns about the sources of training data used by ChatGPT and its implications on the generated answers. Emotional reactions were also observed, with a mix of neutral, positive, and negative sentiments expressed in the comments.

Overall, our study sheds light on the strengths and weaknesses of ChatGPT in answering questions related to natural science and engineering. While the model demonstrates competence in certain areas, further improvements are needed, especially in critical thinking and ensuring scientific correctness. As AI continues to shape the future of education and research, studies like ours provide valuable insights for enhancing the capabilities of AI-powered tools in academic settings.

Latest

Dashboard for Analyzing Medical Reports with Amazon Bedrock, LangChain, and Streamlit

Enhanced Medical Reports Analysis Dashboard: Leveraging AI for Streamlined...

Broadcom and OpenAI Collaborating on a Custom Chip for ChatGPT

Powering the Future: OpenAI's Custom Chip Collaboration with Broadcom Revolutionizing...

Xborg Robotics Introduces Advanced Whole-Body Collaborative Industrial Solutions at the Hong Kong Electronics Fair (Autumn Edition)

Xborg Robotics Unveils Revolutionary Humanoid Solutions for High-Risk Industrial...

How AI is Revolutionizing Data, Decision-Making, and Risk Management

Transforming Finance: The Impact of AI and Machine Learning...

Don't miss

Haiper steps out of stealth mode, secures $13.8 million seed funding for video-generative AI

Haiper Emerges from Stealth Mode with $13.8 Million Seed...

VOXI UK Launches First AI Chatbot to Support Customers

VOXI Launches AI Chatbot to Revolutionize Customer Services in...

Investing in digital infrastructure key to realizing generative AI’s potential for driving economic growth | articles

Challenges Hindering the Widescale Deployment of Generative AI: Legal,...

Microsoft launches new AI tool to assist finance teams with generative tasks

Microsoft Launches AI Copilot for Finance Teams in Microsoft...

Broadcom and OpenAI Collaborating on a Custom Chip for ChatGPT

Powering the Future: OpenAI's Custom Chip Collaboration with Broadcom Revolutionizing AI Inferencing and Efficiency Breaking Ground in AI: OpenAI's Custom Chip Collaboration with Broadcom The world of...

‘I Realized I’d Been ChatGPT-ed into Bed’: The Bizarre Effects of...

The Rise of AI in Modern Dating: Navigating the Love Landscape in a Digital Age The AI Dilemma in Dating: Are We Chatfishing Ourselves? As the...

I Asked ChatGPT About the Worst Money Mistakes You Can Make...

Insights from ChatGPT: The Worst Financial Mistakes You Can Make The Worst Financial Mistakes You Can Make: Insights from ChatGPT In today’s fast-paced financial landscape, it’s...