Noémi Fuller1, Mónika Ferenczy2, Szilvia Szunomár3, Orsolya Máté4, Zsuzsanna Germán5, Annamária Pakai6, Sziládiné Katalin Dr Fusz7, Miklós Zrínyi8*, András Oláh9
1Director, Faculty of Health, University of Pécs, Hungary
2,3,4,5,6,7 Assistant Professor, Faculty of Health, University of Pécs, Hungary
9 Dean, Faculty of Health, University of Pécs, Hungary
*Corresponding author: Miklós Zrínyi, Assistant Professor, Faculty of Health, University of Pécs, Hungary
Submission: October 18, 2019; Published: November 07, 2019
ISSN: 2577-2007; Volume 5, Issue 4
Aim: To evaluate the difference/bias rate between open-ended (OE) vs multiple-choice questions (MCQs) for a critical nursing skill.
Method: Three hundred and seventy-six nurses from three nursing schools responded to a 20-item MCQ and OE instrument. Questions concerned nursing knowledge of maintaining clear airways and interventions related to tracheal suction. Subjects first responded to the OE instrument, followed by the MCQs. Both tests were paper based. Statistical analyses included paired t-tests and one-way ANOVA.
Result: Outcomes showed significant differences between OE and MCQs, with MCQs scored higher. On the MCQs, Licensed Vocational Nurse (LVN) and Associate Degree in Nursing (ADN) group scores did not differ. On the OE instrument, both ADNs and BSNs did significantly better than LVNs, and BSNs increased their lead markedly over both groups.
Conclusion: MCQs overestimated knowledge levels when respondents’ knowledge base was weaker or their professional qualification was lower. When the knowledge base was solid, the difference between OE and MCQs disappeared.
Keywords: Open-ended; Multiple-choice; Test; Test bias; Critical skill
Valid, reliable and standardized knowledge assessment has always been a challenge for educators and continues to pose new dilemmas. With the adoption of computerized systems and technological innovations, knowledge assessment has gradually shifted towards electronic exams and choice-based response alternatives, in line with the expectations of millennial medical students, whose preference is for speed and efficacy [1]. Reid and colleagues [2] also documented that nurses in general thought that computer-based testing was more user friendly and would rather take a multiple-choice question exam. As argued by Morrison & Walsh [3], measuring nursing students’ capacity for critical thinking remains a constant challenge in which carefully crafted multiple-choice questions (MCQs) may provide a reliable platform.
The debate over whether to use MCQs or open-ended (OE) techniques, however, is ongoing. Friewald et al. [4] pointed out that easy administration, scoring and evaluation make MCQs a tempting option that is widely utilized by faculty. Research, however, has confirmed that OE items and MCQs measure different characteristics of the thought process [5]. It has also been questioned whether so-called ‘higher order’ MCQs are able to test the clinical reasoning skills of medical students [4]. MCQs have likewise been criticized for exposing students to false statements which they later recall as being true [6].
Tweed and colleagues [7] raised concerns that many students performing well on MCQs give incorrect responses to items that should be considered hazardous, and that collecting a sufficient number of correct answers should not offset incorrect responses. Electronic exams in general also tend to be biased upwards in scoring compared to paper-based examinations [8]. OE methods, by contrast, require a higher level of thinking from students and demand knowledge construction rather than simple knowledge recall. Why OE is less popular has to do with the faculty time needed to find and score the correct text [9].
When OE was applied to mathematical testing, which requires higher-order thinking, and was compared against MCQs on the number of errors and misconceptions, OE was found to produce more favorable results [10]. A similar test repeated years later found no significant difference between OE and MCQs [11]. When OE was applied to test whether nurses were able to calculate correct drug dosages (a critical skill that may lead to a fatal outcome if performed incorrectly), the authors found that dosage dilution was answered correctly by 87% of respondents [12]. They also reported a success rate of 69.6% for finding the safe dose and 79% for identifying the right ratio/proportion. Had this been performed as an MCQ test, the proportion of correct responses could have been higher. Recall the argument by Tweed et al. [7] that correct answers should not offset incorrect responses in critical skills assessment. Therefore, the aim of our research was to assess a critical care nursing skill (respiratory pathway management) in a large nursing cohort to evaluate the difference between OE and MCQs.
We hypothesized that: A. subjects would score higher on average on the MCQ instrument; and B. nurses with a BSN degree would score higher on both instruments than nurses with other qualifications.
This investigation utilized a cross-sectional, non-experimental, survey-based research design, with respondents serving as their own controls (pairwise comparison). Subjects were recruited from three different locations: the main campus of the University in X [placeholder for institution] and its two satellite schools X [placeholder for institution]. Participants were all nurses attending continuing education credit classes. They were approached on the day they attended class and were asked to participate in a research project that involved knowledge testing. A total of 500 nurses were contacted. Sampling was by convenience: whoever volunteered to sit for the test on the day of testing was included. No specific exclusion criteria were set for this research. Data were collected over a period of six months, in the Spring and Summer classes of 2017. Participants were asked to sit for two sets of tests after finishing their continuing education classes. The first test administered was the open-ended (OE) instrument, followed by a second test taken with a multiple-choice instrument containing questions identical to the OE assessment tool. Each test lasted 45 minutes.
Participants were seated in a large auditorium, leaving enough space between them to prevent copying from each other. Only a short break was allowed between the two tests, and test takers were asked to stay in place to minimize discussion of answers. To reduce stress-induced testing bias, the principal investigator clarified to participants before the test that they were part of a research study and that their answers would not influence their continuing education credits in any way. The research was submitted for local IRB approval. Participation was voluntary and anonymous. There was no specific external funding received to support study implementation.
The actual research instrument was a paper-based, 20-item, multiple-choice assessment tool developed by a panel of six advanced practice nurses on the topic of securing open airways and nurse management of the trachea. All six nurses worked in critical care units and were considered expert nurses on the topic. As the purpose of the research was not instrument development but testing, we focused on establishing content validity of the instrument by asking another expert panel of eight nurses, drawn from both academia and practice, to agree on the set of questions. For each item wording, an agreement rate of 90% was required to make sure items were clear and represented the topic tested. A final number of 20 items was included in order to allow test takers a sufficient amount of time to complete the task. Sample items on this instrument included “What catheter lubrication do you use before introducing it into the airways to apply tracheal suction?” and “The diameter of the catheter should not be more than 20% of the inner diameter of the artificial airways”. Open-ended items comprised the same questions, asking respondents to describe the answer in their own words.
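As an illustration of this content validation step, the sketch below shows one way the 90% item-level agreement threshold could be computed from binary expert endorsements (1 = item judged clear and representative of the topic, 0 = not). This is a hypothetical sketch, not the panel’s actual procedure, and the ratings matrix is placeholder data.

```python
# Hypothetical sketch of the 90% agreement screen, not the panel's actual procedure.
import numpy as np

rng = np.random.default_rng(1)
ratings = rng.integers(0, 2, size=(8, 20))    # placeholder: 8 experts x 20 candidate items
agreement = ratings.mean(axis=0)              # proportion of experts endorsing each item
retained = np.flatnonzero(agreement >= 0.90)  # items meeting the 90% threshold
print("Items meeting the 90% agreement threshold:", retained + 1)
```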
Each correct answer on the multiple-choice instrument was assigned one (1) point and each incorrect answer zero (0) points. Similar coding was used for the OE instrument: when the answer included the correct wording/description, one point was assigned; if the wording was incorrect, zero points were recorded. Tests were checked by the same panel of six advanced practice nurses who developed the instrument. For the OE instrument, in cases of ambiguity, two panel members were asked to agree on the final score of the unclear item. Final scores for both instruments were calculated by adding all items with a score of 1; the possible range of scores was between 0 and 40. Besides these instruments, a demographic survey was also distributed to record age, gender and highest qualification of nurses, along with a few other indicators. The final, English version of the instrument is available from the authors.
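The scoring rule can be summarized in a minimal sketch, assuming string answers compared against a key; the answer key and responses below are hypothetical placeholders, and blank answers score zero, in line with the missing-data policy noted in the next paragraph.

```python
# Minimal sketch of dichotomous scoring: 1 point per correct item, 0 otherwise.
from typing import List, Optional

def total_score(answers: List[Optional[str]], key: List[str]) -> int:
    """Sum one point for each answer matching the key; blanks count as zero."""
    return sum(
        1 for a, k in zip(answers, key)
        if a is not None and a.strip().lower() == k.strip().lower()
    )

key = ["b"] * 20                               # placeholder answer key
answers = ["b"] * 13 + ["a"] * 4 + [None] * 3  # 13 correct, 4 wrong, 3 blank
print(total_score(answers, key))               # -> 13
```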
Statistical analyses included descriptive statistics of sample characteristics and scores achieved on the main tests. A one-sample Kolmogorov-Smirnov test was used to check the normality assumption of our data. Paired-sample t-tests as well as Wilcoxon tests were used to establish the difference between OE and multiple-choice test results. One-way ANOVA with Bonferroni post-hoc tests was used to check whether outcomes of the OE and multiple-choice tests differed across nurse qualifications. An a priori sample size calculation (level of significance 5%, statistical power 0.85 [type II error 15%], medium effect size [0.25]) showed that a total of 180 subjects had to be recruited to ensure adequate statistical power [13]. All analyses were done in SPSS for Windows, version 23.0. No policy was developed for the replacement of missing data; answers and spaces left blank were treated as lack of knowledge in this research.
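For readers who wish to see the analysis pipeline in code, the following is a hedged sketch of the tests named above; the authors ran them in SPSS 23.0, and this Python/scipy equivalent is illustrative only. The arrays oe, mcq and qualification are placeholder data, not the study dataset.

```python
# Illustrative Python equivalent of the analyses (the study itself used SPSS 23.0).
from itertools import combinations
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
oe = rng.integers(0, 21, size=376).astype(float)       # placeholder OE totals
mcq = np.clip(oe + rng.normal(3, 3, size=376), 0, 20)  # placeholder MCQ totals
qualification = rng.choice(["LVN", "ADN", "BSN"], size=376)

# Normality check: one-sample Kolmogorov-Smirnov test against a fitted normal
print(stats.kstest(oe, "norm", args=(oe.mean(), oe.std(ddof=1))))

# Paired-sample t-test and Wilcoxon signed-rank test between MCQ and OE totals
print(stats.ttest_rel(mcq, oe))
print(stats.wilcoxon(mcq, oe))

# One-way ANOVA of MCQ totals across qualifications, followed by
# Bonferroni-corrected pairwise t-tests as a simple post-hoc comparison
groups = {q: mcq[qualification == q] for q in ("LVN", "ADN", "BSN")}
print(stats.f_oneway(*groups.values()))
pairs = list(combinations(groups, 2))
for a, b in pairs:
    t, p = stats.ttest_ind(groups[a], groups[b])
    print(f"{a} vs {b}: t = {t:.2f}, Bonferroni-adjusted p = {min(p * len(pairs), 1.0):.3f}")
```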
Our final sample included 376 nurses who decided to take both tests. The average age of the sample was 41.5 (SD 9.85) years, and nurses had been working in healthcare for an average of 12.1 (SD 10.19) years. Of the total sample, 22.6% were RNs with a BSN, 43% Associate Degree Nurses (ADN), and the rest held a Licensed Vocational Nurse (LVN) degree. As for when nurses had been awarded their degrees, 87% of our sample graduated before 2010. The sample comprised 4% male and 96% female nurses.
Table 1: Descriptive Statistics, Test Scores.
Table 1 shows descriptive data for both instruments for the full sample and for subsamples of nurses who identified themselves as more or less frequently involved in the clinical management of respiratory pathways. One-sample Kolmogorov-Smirnov tests for the OE and multiple-choice scales showed significance levels (p) < 0.001; that is, scores were not normally distributed.
Results indicate that scores in the full sample as well as in the subsamples clustered toward the lower end of the scale; that is, average scores on both instruments fell below the scale midpoint (below 20 points). This means that nurses on average displayed limited knowledge of the topic tested. Table 1, however, also shows that the scores of those with more frequent involvement in respiratory management were much closer across the two instruments than the scores of those with less frequent practice.
To test whether total scores achieved on the two instruments differed, we employed paired-sample t-tests as well as Wilcoxon tests (due to the non-normal distribution). Both tests indicated the same differences between the two instruments; therefore, we report the results of the paired t-tests here (Table 2).
Table 2: Paired t-tests.
Outcomes of the analyses showed significant differences between OE and multiple-choice total scores, favoring multiple-choice, for the full sample as well as for the less frequently involved subsample. However, there was no statistically significant difference between OE and multiple-choice test scores for nurses who reported more frequent involvement in respiratory management.
Finally, we looked at whether there were underlying differences across nurse qualifications in both OE and multiple-choice total scores. Due to the non-normal data distribution, we ran both one-way ANOVA and Kruskal-Wallis tests. Since both tests yielded significant results, we report the ANOVA outcomes here. Differences for both models were significant (F(MCQ) = 10.04, p < 0.001; F(OE) = 50.49, p < 0.001). For the multiple-choice instrument, the LVN and ADN groups did not differ on total test scores, whereas BSN nurses achieved significantly better scores than both groups (Table 3). On the OE instrument, however, both ADNs and BSNs did significantly better than LVNs, and BSN nurses increased their lead markedly over both groups compared to multiple-choice testing.
Table 3: ANOVA Post hoc Multiple comparisons.
*Mean difference is significant at the 0.05 level.
This research aimed to investigate whether there were differences in the knowledge assessment of nurses when open-ended versus multiple-choice instruments were used. In general, the sample achieved lower average total scores on both scales (13.38 multiple-choice and 6.48 open-ended) than expected. This may have been due to a few reasons. First, test takers, knowing they were part of an experiment, were probably less motivated than in real-life testing to give as many correct answers as possible.
However, the amount of missing data (which would have lowered average scores) was very low on all instruments, suggesting that nurses completed the task as if taking a real-life test. Another potential explanation may be that the majority of respondents (81%) earned their nursing degree before 2010 (hence relevant knowledge had not been refreshed) and did not carry out everyday nursing tasks involving respiratory management skills. Had this been a real-life pass/fail test, 23.3% of our sample would have passed the OE test and 45.2% the MCQ test. Our success rate was much lower than that of Ozyazicioglu et al. [12].
While our test was also generic, it was more complex than that reported by Ozyazicioglu et al. [12] and hence less easy to answer. Note that when examining maximum scores achieved (Table 1), OE responses outperformed MCQs on all counts.
As for the difference between OE and multiple-choice test scores, our results conflicted with Stepankova & Emanovsky [11], who reported no differences between the two test modalities in their study, and with Birenbaum & Tatsuoka [10], who found OE more favorable. Our results showed that MCQs returned higher average scores than OE; however, OE maximum scores were higher than those of the MCQs. Importantly, when respondents’ knowledge base was strongly rooted, we saw no significant difference between the two test modalities. We also confirm an upward scoring bias for MCQs, as argued by Washburn et al. [8], despite both tests being administered on paper.
We know that more advanced nurse qualifications are related to a higher order of knowledge, which was clearly supported by the ANOVA analysis. BSN nurses consistently outperformed ADN and LVN nurses on both OE and MCQs. However, the distance between BSNs and the other two cadres increased markedly when OE scores were compared. We also demonstrated the differentiating power of OE versus MCQs for ADNs and LVNs. While these two groups achieved statistically identical results on the MCQs, ADNs emerged ahead of LVNs in the OE testing phase. Therefore, both null hypotheses were rejected in this research.
The authors of this paper therefore conclude that MCQs, in general, have a tendency to overrate knowledge levels when respondents’ knowledge base is weaker or when professional qualification levels are close to each other. When the knowledge base was strong, the difference between OE and MCQs disappeared. However, we likewise observed that OE responses outperformed MCQs when maximum achievable scores were considered. Why that happened remains a matter of speculation at this point, and we recommend that future research investigate the cause of this difference. The authors therefore argue that both assessment methods appear valid for distinguishing various knowledge and professional levels and should perhaps be used interchangeably, or, better, in combination within the same test, when assessing critical knowledge and skills.
The authors acknowledge that the knowledge tested in this paper was part of standard nursing education; however, subjects did not frequently refresh and utilize this particular set of knowledge in practice, possibly skewing results. The authors are also aware that informing subjects that they were being investigated may have changed their test-taking behavior. However, this was a condition of the ethical approval.
© 2019 Miklós Zrínyi. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and building upon the work non-commercially.