Open Journal Systems

An Improved NID Steganography Method for Low Bitrate Speech on VoIP

Liu Jin(College of Computer Science and Technology, National Huaqiao University)
Zhang Yiwen(College of Computer Science and Technology, National Huaqiao University)

Abstract

In the procedure of encoding process on low bitrate speech, fixed codebook division is an efficient and promising embedding method for steganography. An improved neighbor index division (NID) steganography method based on the high bitrate frame of G.723.1 codec (6.3kbit/s) is proposed, which employs the parity and low distortion of neighbor indices for G.723.1 fixed codebooks. Differing from previously NID method which performs quantized index modulation (QIM) beforehand, the proposed method divides codeword indices into separate sub-codebooks according to the secret message bits dynamically in the original G.723.1 codec quantization period. Compared with existing NID method, our proposed method doesn’t need to divide the codebook before the encoding starts. The embedding and codebook dividing happen simultaneously, which utilizes the characteristics of specific secret message bits. The experiment results show that the proposed method has a much lower quality degradation for the decoding speech and still fulfills the low latency requirement for communication.

Keywords

Low Bitrate Speech; Steganography; Voice Over IP; QIM

Full Text:

PDF

References

Peng J, Tang S. Covert Communication over VoIP Streaming Media with Dynamic Key Distribution and Authentication[J]. IEEE Transactions on Industrial Electronics, 2020.

Yang H, Yang Z, Huang Y. Steganalysis of voip streams with cnn-lstm network[C]//Proceedings of the ACM Workshop on Information Hiding and Multimedia Security. 2019: 204-209.

Azadmanesh M, Mahdavi M, Ghahfarokhi B S. A reliable and efficient micro-protocol for data transmission over an RTP-based covert channel[J]. Multimedia Systems, 2019: 1-18.

Zhang X, Tan Y A, Liang C, et al. A covert channel over volte via adjusting silence periods[J]. IEEE Access, 2018, 6: 9292-9302.

Kabal P. ITU-T G. 723.1 speech coder: A matlab implementation[J]. McGill Univ, 2004.

Yang W, Tang S, Li M, et al. Markov bidirectional transfer matrix for detecting LSB speech steganography with low embedding rates[J]. Multimedia Tools and Applications, 2018, 77(14): 17937-17952.

Liu J, Tian H, Zhou K. Frame-bitrate-change based steganography for voice-over-IP[J]. Journal of Central South University, 2014, 21(12): 4544-4552.

Liu J, Tian H, Lu J, et al. Neighbor-index-division steganography based on QIM method for G. 723.1 speech streams[J]. Journal of Ambient Intelligence and Humanized Computing, 2016, 7(1): 139-147.

ITU-T Rec. G.723.1. Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s[S], 2006.

Tian H, Liu J, Li S. Improving security of quantization-index-modulation steganography in low bit-rate speech streams[J]. Multimedia systems, 2014, 20(2): 143-154.

ITU-T Rec. P.862. Perceptual evaluation of speech quality (PESQ): an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs[S]. 2001.



DOI: http://dx.doi.org/10.26549/met.v5i1.6167

Refbacks

  • There are currently no refbacks.
Copyright © 2021 Jin Liu, Yiwen Zhang Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
  • :+65-62233778 QQ:2249355960 :contact@s-p.sg