论文标题
YZR-NET:自我监督的隐藏表示形式不变亵渎检测的转变
YZR-net : Self-supervised Hidden representations Invariant to Transformations for profanity detection
论文作者
论文摘要
在当前{\ it e-}学习平台上,实时课程是一个重要的工具,可为学生提供在学习新概念时获得更多参与的机会。在这样的课程中,与教师和同伴的互动元素有助于消除学习筒仓,并使每个学生有机会在这个虚拟班级时代体验与离线学习相关的某些方面。班上的一种常见互动方式是通过聊天 /消息框架,老师可以在其中广播消息并获得直播学生的即时反馈。这种互动自由是任何学生学习成长的关键方面,但滥用它可能会产生严重的影响。一些不法行为使用此框架发送亵渎消息,这可能会对其他学生以及班级的老师产生负面影响。这些罕见但高影响力的情况消除了需要自动检测机制,以阻止在任何平台上发布此类聊天。在这项工作中,我们开发了YZR-NET,这是一个自我监督的框架,即使学生试图添加巧妙的修改以欺骗系统,也能够稳健地检测聊天中使用的亵渎单词。令牌 /单词级别上的匹配机制使我们能够保持紧凑和动态亵渎词汇,而无需重新底层模型就可以更新。我们的亵渎检测框架是独立语言的,并且可以在英语中处理滥用及其音译对应物hinglish(用英语写的印地语语言单词)。
On current {\it e-}learning platforms, live classes are an important tool that provides students with an opportunity to get more involved while learning new concepts. In such classes, the element of interaction with teachers and fellow peers helps in removing learning silos and gives each student a chance to experience some aspects relevant to offline learning in this era of virtual classes. One common way of interaction in a class is through the chats / messaging framework, where the teacher can broadcast messages as well as get instant feedback from the students in the live class. This freedom of interaction is a crucial aspect for any student's learning growth but misuse of it can have serious repercussions. Some miscreants use this framework to send profane messages which can have a negative impact on other students as well as the teacher of the class. These rare but high impact situations obviate the need for automatic detection mechanisms that prevent the posting of such chats on any platform. In this work we develop YZR-Net which is a self-supervised framework that is able to robustly detect profane words used in a chat even if the student tries to add clever modifications to fool the system. The matching mechanism on token / word level allows us to maintain a compact as well as dynamic profane vocabulary which can be updated without retraining the underlying model. Our profanity detection framework is language independent and can handle abuses in both English as well as its transliterated counterpart Hinglish (Hindi language words written in English).