Publications

Samba: Identifying inappropriate videos for young children on YouTube

Abstract

YouTube is one of the most effective platforms for disseminating creative material and ideas, and it appeals to a diverse audience. Along with adults and older children, young children are avid consumers of YouTube content. Children often lack the means to evaluate whether given content is appropriate for their age, and parents have very limited options to enforce content restrictions on YouTube. Young children can thus be exposed to inappropriate content, such as violent, scary, or disturbing videos. Previous studies demonstrated that YouTube videos can be classified as appropriate or inappropriate for young viewers using video metadata, such as thumbnails, titles, and comments. Metadata-based approaches achieve high accuracy but still produce significant misclassifications, due to the limited reliability of the input features. In this paper, we propose a fusion model, called Samba, which uses both metadata and video subtitles for content classification. Using subtitles in the model helps better infer the true nature of a video, improving classification accuracy. On a large-scale, comprehensive dataset of 70K videos, we show that Samba achieves 95% accuracy, outperforming other state-of-the-art classifiers by at least 7%. We also publicly release our dataset.
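The idea of fusing metadata with subtitles can be illustrated with a toy sketch. The abstract does not describe Samba's architecture, so this is not the authors' model: the keyword list, the scoring function, the weight, and the threshold are all hypothetical and chosen only to show how a subtitle signal can flag a video whose title looks harmless.

```python
from dataclasses import dataclass

# Hypothetical keyword list, for illustration only; not taken from the paper.
FLAGGED_WORDS = {"blood", "kill", "scary", "monster"}


@dataclass
class Video:
    title: str       # stands in for the metadata features
    subtitles: str   # stands in for the subtitle text


def flagged_fraction(text: str) -> float:
    """Fraction of whitespace-separated tokens found in FLAGGED_WORDS."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    return sum(token in FLAGGED_WORDS for token in tokens) / len(tokens)


def fused_score(video: Video, subtitle_weight: float = 0.5) -> float:
    """Weighted average of the metadata score and the subtitle score."""
    metadata_score = flagged_fraction(video.title)
    subtitle_score = flagged_fraction(video.subtitles)
    return (1.0 - subtitle_weight) * metadata_score + subtitle_weight * subtitle_score


def is_inappropriate(video: Video, threshold: float = 0.1) -> bool:
    """Flag the video when the fused score reaches the (assumed) threshold."""
    return fused_score(video) >= threshold


# A harmless-looking title paired with disturbing subtitles.
misleading = Video(
    title="Fun nursery songs",
    subtitles="the monster will kill you blood everywhere",
)
print(fused_score(misleading, subtitle_weight=0.0))  # metadata only
print(is_inappropriate(misleading))                  # metadata + subtitles
```

With the subtitle weight set to zero, the score depends on the title alone and the misleading video passes unflagged; with subtitles included, it is flagged. A real classifier would learn its features and fusion weights from labeled data rather than rely on a fixed word list.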

Metadata

publication
Proceedings of the ACM International Conference on Information and Knowledge …, 2022
year
2022
publication date
2022
authors
Binh M Le, Rajat Tandon, Chingis Oinar, Jeffrey Liu, Uma Durairaj, Jiani Guo, Spencer Zahabizadeh, Sanjana Ilango, Jeremy Tang, Fred Morstatter, Simon S Woo, Jelena Mirkovic
link
https://www.researchgate.net/profile/Rajat-Tandon-2/publication/363133906_Samba_Identifying_Inappropriate_Videos_for_Young_Children_on_YouTube/links/6336d7ae9cb4fe44f3ed3acc/Samba-Identifying-Inappropriate-Videos-for-Young-Children-on-YouTube.pdf
resource_link
https://www.researchgate.net/profile/Rajat-Tandon-2/publication/363133906_Samba_Identifying_Inappropriate_Videos_for_Young_Children_on_YouTube/links/6336d7ae9cb4fe44f3ed3acc/Samba-Identifying-Inappropriate-Videos-for-Young-Children-on-YouTube.pdf
journal
Proceedings of the ACM International Conference on Information and Knowledge Management