Paper Title

Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language

Authors

Nikolay Babakov, Varvara Logacheva, Alexander Panchenko

Abstract

Toxicity on the Internet, such as hate speech, offenses towards particular users or groups of people, or the use of obscene words, is an acknowledged problem. However, there also exist other types of inappropriate messages which are usually not viewed as toxic, e.g. because they do not contain explicit offenses. Such messages can contain veiled toxicity or generalizations, incite harmful actions (crime, suicide, drug use), or provoke "heated" discussions. They are often related to particular sensitive topics, e.g. politics, sexual minorities, or social injustice, which yield toxic emotional reactions more often than other topics, e.g. cars or computing. At the same time, clearly not all messages within such flammable topics are inappropriate. To this end, in this work we present two text collections labelled according to a binary notion of inappropriateness and a multinomial notion of sensitive topic. Assuming that the notion of inappropriateness is common among people of the same culture, we base our approach on the intuitive human understanding of what is unacceptable and harmful. To objectivise the notion of inappropriateness, we define it in a data-driven way through crowdsourcing. Namely, we run a large-scale annotation study asking workers whether a given chatbot textual statement could harm the reputation of the company that created it. Acceptably high values of inter-annotator agreement suggest that the notion of inappropriateness exists and can be uniformly understood by different people. To define the notion of sensitive topics in an objective way, we use guidelines commonly suggested by specialists of the legal and PR departments of a large public company, which identify such topics as potentially harmful.
