Hate Speech Detection in Social Media Surveillance: A Review of Related Literature

Usman Ahmed; Rahmet Bibi; Obaid Ullah; Rahat Bano

doi:10.32350/icr.0101.01

Usman Ahmed Foundation University Islmabad https://orcid.org/0000-0002-9319-7345
Rahmet Bibi Superior University Lahore
Obaid Ullah Superior University Lahore
Rahat Bano Superior University Lahore

DOI: https://doi.org/10.32350/icr.0101.01

Keywords: data mining,, hate speech, NLP, semantics, sementic analysis, social media, surveillance, text mining

Abstract

Abstract Views: 264

Social media surveillance is a requirement for governments and intelligence agencies around the world to detect and prevent hate crimes. The dynamic and unstructured nature of the textual content available on social media platforms makes it very complex to extract hate related speech patterns from this content. It also creates ambiguities in the data and therefore, data mining techniques become difficult to apply in this scenario. Several alternative techniques were adopted by different researchers in the past to cope with this problem and to capture and analyze such unstructured text for the purpose of hate speech detection. In this paper, we reviewed, categorized and presented a state-of-the-art of these techniques which were divided in to three categories namely text mining, sentiment analysis and semantics. The challenges in the application of the existing techniques were also discussed and these can be taken up as future directions

Downloads

Download data is not yet available.

References

D. Roark,''The end of privacy for the populace, the personof interest and the persecuted,''HealthandTechnology, vol. 7,no. 4,pp.501–517, 2017.

R. Irfan, et al.,''A survey ontext mining in social networks,''The Knowledge Engineering Review, vol.30,no, 2, pp.157–170, 2015. https://doi.org/10.1017/S0269888914000277

R. Feldman,and J. Sanger,The text mining handbook: advanced approaches in analyzing unstructured data. Cambridge university press, 2007.

N. Guarino,Formal ontology in information systems. IOS Press, 1998.

A. Hotho,R. Jäschke,K. Lerman,''Mining social semantics on the social web,''Semantic Web, vol. 8,no. 5, pp.623–624, 2017.http://dx.doi.org/10.3233/SW-170272

L. Sorensen,''User managed trust in social networking -ComparingFacebook, MySpace and Linkedin,''In: 2009 1st International Conference on Wireless Communication, Vehicular Technology, Information Theory and Aerospace Electronic Systems Technology, 2009, pp.427–431.

F. Del Vigna, A. Cimino, F. Dell’Orletta,M. Petrocchi, M. Tesconi,''Hateme, hate me not: Hate speech detection on Facebook,'' InProceedings of the First Italian Conference on Cybersecurity (ITASEC17),2017, pp.86-95.

W. Warner,andJ. Hirschberg,''Detecting Hate Speech on the World Wide Web,''In: Proceedings of the second workshop on language in social media,Association for Computational Linguistics, Stroudsburg, PA, USA, 2012, pp. 19–26.

B. Parekh,''Hate Speech,''Public Policy Research, vol.12,no. 4, pp.213–223, 2006.https://doi.org/10.1111/j.1070-535.2005.00405.x

R. M. Simpson,''Dignity, harm, and hate speech,''Law and Philosophy, vol.32,pp.701–728, 2013. https://doi.org/10.1007/s10982-012-9164-z

X. Hu,andH. Liu,''Text analytics in social media,''In: Mining text data, C. C. Aggarwal, and C. Zhai, eds.Springer US, Boston, MA, 2012, pp. 385–414.https://link.springer.com/chapter/10.1007/978-1-4614-3223-4_12

M. M. Rahman,''Mining social data to extract intellectual knowledge,''International Journal of Intelligent Systems and Applications, vol.4,no. 10, 2012. https://doi.org/10.5815/ijisa.2012.10.02

A. Sarker,et al.,''Social media mining for toxicovigilance: automatic monitoring of prescription medication abuse fromTwitter,''Drug Safety, vol.39,no. 3, pp.231–240, 2016. https://doi.org/10.1007/s40264-015-0379-4

P. Mika,''Ontologies are us: A unified model of social networks and semantics,''Web Semantics: Science, Services and Agents on the World Wide Web, vol.5,no. 1, pp.5–1, 2007. https://doi.org/10.1016/j.websem.2006.11.002

D. Maynard, I. Roberts,M. A. Greenwood,D. Rout,K. Bontcheva,''A framework for real-timesemantic social media analysis,''Web Semantics: Science, Services and Agents on the World Wide Web, vol. 44,pp.75–88, 2017. https://doi.org/10.1016/j.wasman.2017.08.009

N. Djuric,J. Zhou,R. Morris,M. Grbovic,V. Radosavljevic,N. Bhamidipati, ''Hate speech detection with comment embeddings,''In: Proceedings of the 24th International Conference on World Wide Web, ACM, New York, NY, USA,2015, pp. 29–30. https://doi.org/10.1145/2740908.2742760

G. Paltoglou, andM. Thelwall,''Twitter, MySpace, and Digg: unsupervised sentiment analysis in social media,''ACM Transactions on Intelligent Systems and Technology, vol.3,no. 4, pp. 1–66, 2012. https://doi.org/10.1145/2337542.2337551

Y. Senarath, andH. Purohit,''Evaluating Semantic feature representations to efficiently detect hate intent on social media,''In: 2020 IEEE 14th International Conference on Semantic Computing (ICSC), Irvine, CA, USA,2020,pp. 199–202. https://doi.org/10.1109/ICSC.2020.00041

F. Sahito,A. Latif,W. Slany,''Weaving Twitter stream into Linked Data a proof of concept framework,''In: 2011 7th International Conference on Emerging Technologies, Islamabad, Pakistan,2011,pp. 1–6. https://doi.org/10.1109/ICET.2011.6048497