Applying data mining steps to explore different small RNAs from buffalo milk transcriptome



RNAseq, Data mining, RNA, Next Generation Sequencing, RNA databases


This study is a first attempt to find different types of RNA in lactating buffalo's milk somatic cells. The molecular factors that regulate lactation need to be identified and understood in order to help milk production. By using data mining techniques, patterns and information hidden within a dataset can be identified. In order to detect the RNA, data of 12 samples of buffalo milk somatic cells were analyzed. For extraction of diverse RNAs COMPSRA (COMprehensive Platform for Small RNA Analysis) pipeline was used. We were able to identify several miRNAs, piRNAs, snRNAs, snoRNAs, circRNAs and tRNAs in buffalo milk somatic cells. circRNAs ranked highest among all the samples in our dataset, followed by piRNAs and then miRNAs. Understanding the RNA regulators of lactation will improve and facilitate management of buffalo milk production. Furthermore, our study contributes towards a complete annotation of the buffalo genome.


Download data is not yet available.


Bartel, D. P. (2004) . MicroRNAs: Genomics, biogenesis,mechanism, and function. Cell. 116(2): 281-297.

Siomi, M. C. et al. (2011). PIWI-interacting small RNAs:The vanguard of genome defence. Nature Reviews Molecular Cell Biology. 12(4) : 246-258.

O'Donoghue, P., Ling, J., & Soll, D. (2018). Transfer RNA function and evolution. RNA biology. 15(4-5): 423–426.

Adachi, H. & Yu, Y. (2014). Insight into the mechanisms and functions of spliceosomal nRNA pseudouridylation. World J. Biol. Chem.5: 398–408.

Maxwell, E. S. & Fournier, M. J. (1995). The small nucleolar RNAs. Annu Rev Biochem. 64: 897–934.

Lu, M. (2020). Circular RNA: functions, applications and prospects. ExRNA 2:1.

Li, J. et al. (2020). COMPSRA: a COMprehensive Platform for Small RNA-Seq data Analysis. Sci Rep. 10: 4552.

Andrews, S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data [Online].web resource for piRNA producing loci”. Nucleic acids research. 44(D1):223–230.

Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 17(1): 10-12.

Dobin, A. & Gingeras, T.R. (2015). Mapping RNA-seq Reads with STAR. Curr Protoc Bioinformatics. 51(1): 11-14.

Kozomara, A., Birgaoanu, M. & Griffiths-Jones, S. (2019). miRBase: from microRNA sequences to function. Nucleic Acids Research.47:155-162.

Frankish, A. et al. (2019). GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Research.47: 766–773.

Chan, P.P & Lowe, T.M. (2009). GtRNAdb: a database of transfer RNA genes detected in genomic sequence. Nucleic acids research. 37: 93–97.

Rosenkranz, D. (2016). piRNA cluster database: a web resource for piRNA producing loci. Nucleic acids research. 44: 223–230.

Lakshmi, S.S. & Agrawal, S. (2018). piRNABank: a web resource on classified and clustered Piwi-interacting RNAs. Nucleic Acids Res. . 36:173-177.

Wang, J. et al. (2019) . piRBase: a comprehensive database of piRNA sequences. Nucleic Acids Research. 47:175-180.

Glazar, P., Papavasileiou, P. & Rajewsky, N. (2014). circBase: a database for circular RNAs. RNA. 20(11):1666–1670.

Rubio, M. et al. (2018). Circulating miRNAs, isomiRs and small RNA clusters in human plasma and breast milk. PloS ONE. 13(3).

Li, R. et al. (2016),.Comparative Analysis of the miRNome of Bovine Milk Fat,Whey and Cells. PLoS ONE. 11(4).

Lai, Y. C. et al. (2020). Bovine milk transcriptome analysis reveals microRNAs and RNU2 involved in mastitis. FEBS J. 287(9): 1899-1918.



How to Cite

Chhabra, P., & Brij Mohan Goel. (2022). Applying data mining steps to explore different small RNAs from buffalo milk transcriptome . Journal of Information Technology and Computing, 3(1), 32–36.