Date of Award
Fall 2023
Thesis Type
Open Access
Degree Name
Honors Bachelor of Arts
Department
Computer Science
Sponsor
Daniel Myers
Committee Member
Rochelle Elva
Abstract
This paper presents a comparative analysis of human and AI performance on a sentiment analysis task: coding qualitative data from community program transcripts into the categories of the Community Capitals framework. The results show promising but imperfect agreement between two AI models, Claude and Bing, and a group of three human annotators plus one expert annotator. While both models achieved fair alignment with human judgment, confusion patterns emerged around metaphorical language and text that overlaps multiple categories. The findings provide a case study in benchmarking conversational AI systems against human baselines to reveal limitations and target improvements. Key gaps center on distinguishing social from human elements and on handling cultural references. Expanded testing on more diverse datasets could further quantify differences in classification capabilities. Overall, the analysis exposes definable areas where machines still struggle compared to humans and highlights productive research directions toward eventually matching human performance across diverse language inputs. As AI systems enter real-world applications, human-AI comparative studies can help define the boundaries between robust statistics-based learning and adaptive human cognition.
Recommended Citation
Shah, Aakriti, "Evaluating AI Sentiment Analysis" (2023). Honors Program Theses. 218.
https://scholarship.rollins.edu/honors/218
Rights Holder
Aakriti Shah
Included in
Computer Engineering Commons, Computer Sciences Commons, Statistics and Probability Commons