论文标题
使用听不清的音频和语音助手通过电话传输敏感数据
Using Inaudible Audio and Voice Assistants to Transmit Sensitive Data over Telephony
论文作者
论文摘要
由于语音助理(VA)在家庭和企业网络中的部署日益普及,因此出现了新的安全性和隐私问题。过去的许多研究结果表明,即使一个人可能在附近,恶意演员如何使用隐藏的命令来使VAS进行某些操作。但是,此类工作尚未探讨与VAS接近的计算机如何利用电话通道在VAS的帮助下渗透数据。在通过指挥VA来拨打电话的通信频道表征了通信通道后,我们演示了恶意软件如何将数据编码到音频中并通过电话频道发送。这样的攻击可以按大规模和低成本远程制作,可用于绕过可能因敏感数据泄漏而部署的网络防御。我们使用双色调多频调将任意二进制数据编码到音频中,这些数据可以通过计算机扬声器播放,并通过VA介导的电话通道发送到远程系统。我们表明,持续几分钟的短电话可以以高精度传输适度的数据。这可以在使大多数人使用频率接近人类听力范围的频率的携带者调制载体中,从而使音频几乎听不清。几个因素影响数据传输速率,包括计算机和VA之间的距离,可能存在的环境噪声以及调节载体的频率。借助我们建造的原型,我们通过实验评估这些因素对数据传输速率和传输准确性的影响。我们的结果表明,计算机附近的语音助手可以对存储在此类计算机上的数据构成新的威胁。这些威胁并未由传统的主机和网络防御解决。我们简要讨论可能的缓解方法。
New security and privacy concerns arise due to the growing popularity of voice assistant (VA) deployments in home and enterprise networks. A number of past research results have demonstrated how malicious actors can use hidden commands to get VAs to perform certain operations even when a person may be in their vicinity. However, such work has not explored how compromised computers that are close to VAs can leverage the phone channel to exfiltrate data with the help of VAs. After characterizing the communication channel that is set up by commanding a VA to make a call to a phone number, we demonstrate how malware can encode data into audio and send it via the phone channel. Such an attack, which can be crafted remotely, at scale and at low cost, can be used to bypass network defenses that may be deployed against leakage of sensitive data. We use Dual-Tone Multi-Frequency tones to encode arbitrary binary data into audio that can be played over computer speakers and sent through a VA mediated phone channel to a remote system. We show that modest amounts of data can be transmitted with high accuracy with a short phone call lasting a few minutes. This can be done while making the audio nearly inaudible for most people by modulating it with a carrier with frequencies that are near the higher end of the human hearing range. Several factors influence the data transfer rate, including the distance between the computer and the VA, the ambient noise that may be present and the frequency of modulating carrier. With the help of a prototype built by us, we experimentally assess the impact of these factors on data transfer rates and transmission accuracy. Our results show that voice assistants in the vicinity of computers can pose new threats to data stored on such computers. These threats are not addressed by traditional host and network defenses. We briefly discuss possible mitigation ways.