Traditional VAD usually plays a fundamental role in speech processing systems because of its ability to delimit speech segments. Network-level VAD, on the other hand, can be quite helpful in network management, which is the motivation for our study. We propose the first real-time network-level VAD algorithm that can extract voice activity from encrypted and non-silence-suppressed Skype traffic. We evaluate the speech detection accuracy of the proposed algorithm with extensive reallife traces. The results show that our scheme achieve reasonably good performance even high degree of randomness has been injected into the network traffic.
Related paper:
- Quantifying Skype User Satisfaction
- Detecting VoIP Traffic Based on Human Conversation Patterns
- Peer-to-Peer Application Recognition Based on Signaling Activity
AUTHOR = {Yu-Chun Chang and Kuan-Ta Chen and Chen-Chi Wu and Chin-Laung Lei},
TITLE = {Inferring Speech Activities from Encrypted Skype Traffic},
BOOKTITLE = {Proceedings of IEEE Globecom 2008},
YEAR = {2008}
}
