HYPOTHESIS PAPER
DO CHATBOTS PROVIDE RELIABLE INFORMATION ABOUT MOBILE APPS IN AUDIOLOGY?
 
 
More details
Hide details
1
Department of Experimental Audiology, World Hearing Center, Institute of Physiology and Pathology of Hearing, Warsaw/Kajetany, Poland
 
 
A - Research concept and design; B - Collection and/or assembly of data; C - Data analysis and interpretation; D - Writing the article; E - Critical revision of the article; F - Final approval of article;
 
 
Submission date: 2024-07-24
 
 
Final revision date: 2024-08-23
 
 
Acceptance date: 2024-08-23
 
 
Publication date: 2024-09-05
 
 
Corresponding author
Małgorzata Pastucha   

Department of Experimental Audiology, Institute of Physiology and Pathology of Hearing, Mochnackiego 10, 02-042, Warsaw, Poland
 
 
J Hear Sci 2024;14(3):9-15
 
KEYWORDS
TOPICS
ABSTRACT
Introduction:
In light of the growing interest in utilizing AI for information retrieval, assessing the accuracy and reliability of tools such as chatbots is essential. This study aims to evaluate the efficacy of chatbots in providing accurate information about mobile applications (apps) in the field of audiology.

Material and methods:
The responses of the Gemini and ChatGPT chatbots to eight open-ended questions posed in Polish and English were compared. Each answer was assessed for correctness.

Results:
Gemini_ENG achieved the highest correctness with a score of 5 points (62.5%), while ChatGPT_PL scored 2 points (25%), and both Gemini_PL and ChatGPT_ENG scored 1 point (12.5%). Chatbots were most effective in recommending apps for older adults, with three of the four chatbots providing accurate recommendations. However, they struggled when asked to recommend apps for non-English speakers, to describe apps, or to provide direct links, with none of them scoring points in these areas.

Conclusions:
Chatbots are currently unreliable sources of information about audiology apps. Depending on the language, there is significant variability in response accuracy. A good example is that Gemini_ENG performed far better than Gemini_PL. A major issue for all of them was the frequent fabrication of data, including the creation of nonexistent app names and incorrect links.

 
REFERENCES (20)
1.
Yang HS, Wang F, Greenblatt MB, Huang SX, Zhang Y. AI chatbots in clinical laboratory medicine: foundations and trends. Clin Chem, 2023; 69(11): 1238–46. https://doi.org/10.1093/clinch....
 
2.
Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW. Large language models in medicine. Nat Med, 2023; 29(8): 1930–40. https://doi.org/10.1038/s41591....
 
3.
Biswas S, Logan NS, Davies LN, Sheppard AL, Wolffsohn JS. Assessing the utility of ChatGPT as an artificial intelligence-based large language model for information to answer questions on myopia. Ophthalmic Physiol Opt, 2023; 43(6): 1562–70. https://doi.org/10.1111/opo.13....
 
4.
Eggmann F, Weiger R, Zitzmann NU, Blatz MB. Implications of large language models such as ChatGPT for dental medicine. J Esthet Restor Dent, 2023; 35(7): 1098–102. https://doi.org/10.1111/jerd.1....
 
5.
Valentini M, Szkandera J, Smolle MA, Scheipl S, Leithner A, Andreou D. Artificial intelligence large language model ChatGPT: is it a trustworthy and reliable source of information for sarcoma patients? Front Public Health, 2024; 12: 1303319. https://doi.org/10.3389/fpubh.....
 
6.
Okolo CA, Babawarun O, Arowoogun JO, Adeniyi AO, Chidi R. The role of mobile health applications in improving patient engagement and health outcomes: a critical review. Int J Sci Res Arch, 2024; 11(1): 2566–74. https://doi.org/10.30574/ijsra....
 
7.
Galetsi P, Katsaliaki K, Kumar S. Exploring benefits and ethical challenges in the rise of mHealth (mobile healthcare) technology for the common good: an analysis of mobile applications for health specialists. Technovation, 2023; 121: 102598. https://doi.org/10.1016/j.tech....
 
8.
Almufarrij I, Dillon H, Dawes P, Moore DR, Yeung W, Charalambous A-P, et al. Web- and app-based tools for remote hearing assessment: a scoping review. Int J Audiol, 2023; 62(8): 699–712. https://doi.org/10.1080/149920....
 
9.
Masalski M, Grysiński T, Kręcicki T. Hearing tests based on biologically calibrated mobile devices: comparison with pure-tone audiometry. JMIR Mhealth Uhealth, 2018; 6(1): e10. https://doi.org/10.2196/mhealt....
 
10.
Potgieter J-M, Swanepoel DW, Smits C. Evaluating a smartphone digits-in-noise test as part of the audiometric test battery. S Afr J Commun Disord, 2018; 65(1): 574. https://doi.org/10.4102/sajcd.....
 
11.
Dillon H, Mee C, Moreno JC, Seymour J. Hearing tests are just child’s play: the sound scouts game for children entering school. Int J Audiol, 2018; 57(7): 529–37. https://doi.org/10.1080/149920....
 
12.
Zuhair V, Babar A, Ali R, et al. Exploring the Impact of artificial intelligence on global health and enhancing healthcare in developing nations. J Prim Care Community Health, 2024; 15: 21501319241245847. https://doi.org/10.1177/215013....
 
13.
Jedrzejczak WW, Skarzynski PH, Raj-Koziak D, Sanfins MD, Hatzopoulos S, Kochanek K. ChatGPT for tinnitus information and support: response accuracy and retest after three and six months. Brain Sci, 2024; 14(5): 465. https://doi.org/10.3390/brains....
 
14.
Jedrzejczak WW, Kochanek K. Comparison of the audiological knowledge of three chatbots: ChatGPT, Bing Chat, and Bard. Audiol Neurootol, 2024; 1–7. https://doi.org/10.1159/000538....
 
15.
Kochanek K, Skarzynski H, Jedrzejczak WW. Accuracy and repeatability of ChatGPT based on a set of multiple-choice questions on objective tests of hearing. Cureus, 2024; 16(5): e59857. https://doi.org/10.7759/cureus....
 
16.
Wang S, Mo C, Chen Y, Dai X, Wang H, Shen X. Exploring the performance of ChatGPT-4 in the Taiwan Audiologist Qualification Examination: preliminary observational study highlighting the potential of AI chatbots in hearing care. JMIR Med Educ, 2024; 10: e55595. https://doi.org/10.2196/55595.
 
17.
Chayka A, Sukhov A. Comparing chatbots trained in different languages. Commun ACM, 2023; 66(12): 6–7. https://doi.org/10.1145/362683....
 
18.
Pastucha M, Kochanek K, Piłka E, Jedrzejczak WW. [Review of mobile apps for self-testing of hearing in Polish language]. Now Audiofonol, 2024; 13(1): 21–31 [in Polish]. https://doi.org/10.17431/na/17....
 
19.
McGowan A, Gui Y, Dobbs M, Shuster S, Cotter M, Selloni A, et al. ChatGPT and Bard exhibit spontaneous citation fabrication during psychiatry literature search. Psychiatry Res, 2023; 326: 115334. https://doi.org/10.1016/j.psyc....
 
20.
Lee TJ, Campbell DJ, Patel S, Hossain A, Radfar N, Siddiqui E, et al. Unlocking health literacy: the ultimate guide to hypertension education from ChatGPT versus Google Gemini. Cureus, 2024; 16(5): e59898. https://doi.org/10.7759/cureus....
 
Journals System - logo
Scroll to top