TY - JOUR
T1 - Predicting Credibility of Online Reviews
T2 - An Integrated Approach
AU - Abedin, Ehsan
AU - Mendoza, Antonette
AU - Akbarighatar, Pouria
AU - Karunasekera, Shanika
PY - 2024
Y1 - 2024
N2 - In the digital age, online reviews significantly influence consumer purchasing decisions. However, there is a growing concern about fake reviews undermining their credibility. While existing research underscores the significance of online reviews for consumers, there is a gap in the literature on developing a predictive model for assessing the credibility of online reviews that takes into account the consumer perspective as the end user of such information. To address this gap, this study combines human-driven and data-driven approaches by using two datasets with different labeling methods to develop a predictive model. The first dataset is labeled using the Yelp filtering algorithm, while the second dataset is labeled based on the perception of the crowd toward credibility. The results show that the predictive model can differentiate fake and credible reviews with an 82% AUC (Area Under the Curve) performance, using review attributes such as length, subjectivity, readability, extremity, external and internal consistency. Additionally, SHAP (SHapley Additive exPlanations) has been conducted to further explain important characteristics that differentiate fake and credible reviews.
AB - In the digital age, online reviews significantly influence consumer purchasing decisions. However, there is a growing concern about fake reviews undermining their credibility. While existing research underscores the significance of online reviews for consumers, there is a gap in the literature on developing a predictive model for assessing the credibility of online reviews that takes into account the consumer perspective as the end user of such information. To address this gap, this study combines human-driven and data-driven approaches by using two datasets with different labeling methods to develop a predictive model. The first dataset is labeled using the Yelp filtering algorithm, while the second dataset is labeled based on the perception of the crowd toward credibility. The results show that the predictive model can differentiate fake and credible reviews with an 82% AUC (Area Under the Curve) performance, using review attributes such as length, subjectivity, readability, extremity, external and internal consistency. Additionally, SHAP (SHapley Additive exPlanations) has been conducted to further explain important characteristics that differentiate fake and credible reviews.
KW - deception detection
KW - fake reviews
KW - Online consumer reviews
KW - review credibility
UR - http://www.scopus.com/inward/record.url?scp=85189647562&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3383846
DO - 10.1109/ACCESS.2024.3383846
M3 - Article
AN - SCOPUS:85189647562
SN - 2169-3536
VL - 12
SP - 49050
EP - 49061
JO - IEEE Access
JF - IEEE Access
ER -