Spotting Fake Reviews using Positive-Unlabeled Learning

Li, Huayi; Liu, Bing; Mukherjee, Arjun; Shao, Jidong


Document title:	Spotting Fake Reviews using Positive-Unlabeled Learning
Journal:	Computación y sistemas
Database:	PERIÓDICA
System number:	000379428
ISSN:	1405-5546
Authors:	Li, Huayi¹ Liu, Bing¹ Mukherjee, Arjun² Shao, Jidong³
Institutions:	¹University of Illinois, Department of Computer Science, Chicago, Illinois. Estados Unidos de América ²University of Houston, Department of Computer Science, Houston, Texas. Estados Unidos de América ³Dianping Inc., Shanghai. China
Year:	2014
Season:	Jul-Sep
Volumen:	18
Number:	3
Pages:	467-475
Country:	México
Language:	Inglés
Document type:	Artículo
Approach:	Analítico, descriptivo
English abstract	Fake review detection has been studied by researchers for several years. However, so far all reported studies are based on English reviews. This paper reports a study of detecting fake reviews in Chinese. Our review dataset is from the Chinese review hosting site Dianping, which has built a fake review detection system. They are confident that their algorithm has a very high precision, but they don't know the recall. This means that all fake reviews detected by the system are almost certainly fake but the remaining reviews may not be all genuine. This paper first reports a supervised learning study of two classes, fake and unknown. However, since the unknown set may contain many fake reviews, it is more appropriate to treat it as an unlabeled set. This calls for the model of learning from positive and unlabeled examples (or PU-learning). Experimental results show that PU learning not only outperforms supervised learning significantly, but also detects a large number of potentially fake reviews hidden in the unlabeled set that Dianping fails to detect
Disciplines:	Ciencias de la computación, Literatura y lingüística
Keyword:	Inteligencia artificial, Aprendizaje de máquinas, Revisiones falsas
Keyword:	Computer science, Literature and linguistics, Artificial intelligence, Machine learning, Fake reviews
Full text:	Texto completo (Ver HTML)

Spotting Fake Reviews using Positive-Unlabeled Learning

Wait a moment...