DATA HIDING ALGORITHM IN DNA SEQUENCE BASED ON SUBSTITUTION METHOD
Abstract
In recent years, data hiding, including watermarking, is an interested field of researcher for concealing data into a host, such as database, video, image, audio, QR code, DNA sequence… With the strong development of Bioinformatics, the DNA sequece is also considered for data hiding problems. Many studies proposed methods to conceal data in DNA and RNA sequence, but they could not increase the amount of hidden data. In this study, we propose a data hiding algoritm for improving the embedded capacity in the DNA sequence. The algorithm uses a Complementary table with four values corresponding with data bits ‘00’, ‘01’, ‘10’, and ‘11’, so that, we can embed two bits for each nucleotide. Moreover, with four levels of Complementary values, the algorithm shows that the proposed method also improve the security for hidden data.
References
T. S. Nguyen, C. C. Chang, M. C. Lin, “Adaptive lossless data-hiding and compression scheme for SMVQ indices using SOC,” Smart Comput. Review, 2014, vol. 4, no. 3, pp. 230-245.
J. Mielikainen, “LSB matching revisited,” IEEE Signal Process. Letts., 2006, vol. 13, pp. 285–287.
C. C. Chang, T. S. Nguyen, “A reversible data hiding scheme for SMVQ indices,” Informatica, 2014, vol. 25, no. 4, pp. 523-540.
C. V. Nguyen, D. Tay, and G. Deng, “A fast watermarking system for H.264/AVC video,” in Proc. IEEE APCCAS, Dec. 2006, pp. 81–84.
M. Fallahpour, M. David, “Reversible data hiding based on H. 264/AVC intra prediction.” Digital Watermarking. Springer Berlin Heidelberg, 2008, pp. 52-60.
Chien. N. D, Son N. T, & Hsu F. R, “An algorithm for DNA sequence hiding in H. 264/AVC video.” In Proceedings of the Seventh Symposium on Information and Communication Technology ACM, December 2016, pp. 229-234.
J.D. Watson, F.H.C. Crick, “Molecular structure of Nucleic acids: A structure for deoxyribose nucleic acid,” Nature 171, 1953, pp. 737, 738.
Church, G. M., Gao, Y., & Kosuri, S., “Next-generation digital information storage in DNA,” Science, 2012, 337(6102), pp. 1628-1628.
National Center for Biotechnology Information, https://www.ncbi.nlm.nih.gov/
Ensembl , http://www.ensembl.org/downloads.html
Shimanovsky, B., Feng, J., & Potkonjak, M., “Hiding data in DNA” International Workshop on Information Hiding, 2002, Springer Berlin Heidelberg, pp. 373-386.
Shiu, H. J., Ng, K. L., Fang, J. F., Lee, R. C., & Huang, C. H., “Data hiding methods based upon DNA sequences,” Information Sciences, 2010, 180(11), pp. 2196-2208.
Haughton, D., & Balado, F., “BioCode: Two biologically compatible Algorithms for embedding data in non-coding and coding regions of DNA,” BMC bioinformatics, 2013, 14(1), 121. (DH_DNA_7)
Wang, Z., Zhao, X., Wang, H., & Cui, G., “Information hiding based on DNA steganography,” , 2013 4th IEEE International Conference on Software Engineering and Service Science (ICSESS), 2013, IEEE, pp. 946-949. (DH_DNA_8).
Najaftorkaman, M., & Kazazi, N. S., “A method to encrypt information with DNA-based cryptography,” International Journal of Cyber-Security and Digital Forensics, 2015, 4(3), pp. 417-427. (DH_DNA_2).
UbaidurRahman, N. H., Balamurugan, C., & Mariappan, R., “A novel string matrix data structure for DNA encoding algorithm,” Procedia Computer Science, 2015, 46, pp. 820-832. (DH_DNA_4)
Huang, Y. H., Chang, C. C., & Wu, C. Y., “A DNA-based data hiding technique with low modification rates,” Multimedia tools and applications, 2014, 70(3), pp. 1439-1451. (CCC_1)
Liu, H., Lin, D., & Kadir, A., “A novel data hiding method based on deoxyribonucleic acid coding,” Computers & Electrical Engineering, 2013, 39(4), pp. 1164-1173.
Al-Harbi, O. A., Alahmadi, W. E., & Aljahdali, A. O. “Security analysis of DNA based steganography techniques.” SN Applied Sciences, 2020, 2(2), pp. 1-10.
S. Singh and Y. Sharma, “A Review on DNA based Cryptography for Data hiding,” 2019 International Conference on Intelligent Sustainable Systems (ICISS), 2019, Palladam, Tamilnadu, India, pp. 282-285.