<i>De novo</i> transcriptome analysis of <i>Spodoptera exigua</i> multiple nucleopolyhedrovirus (SeMNPV) genes in latently infected Se301 cells

Zheng Fang; Jingxu Shao; Qingbei Weng

doi:10.1007/s12250-016-3791-8

October 2016

Zheng Fang, Jingxu Shao and Qingbei Weng. De novo transcriptome analysis of Spodoptera exigua multiple nucleopolyhedrovirus (SeMNPV) genes in latently infected Se301 cells[J]. Virologica Sinica, 2016, 31(5): 425-436. doi: 10.1007/s12250-016-3791-8

Citation: Zheng Fang, Jingxu Shao, Qingbei Weng. De novo transcriptome analysis of Spodoptera exigua multiple nucleopolyhedrovirus (SeMNPV) genes in latently infected Se301 cells .VIROLOGICA SINICA, 2016, 31(5) : 425-436. http://dx.doi.org/10.1007/s12250-016-3791-8

潜伏感染Se301细胞中甜菜夜蛾核多角体病毒（SeMNPV）基因的转录组分析

贵州师范大学生命科学学院，贵阳，550001，中国

通讯作者： 翁庆北, wengqb@126.com, ORCID: 0000-0003-4244-573X
收稿日期： 2016-04-22
录用日期： 2016-09-22
出版日期： 2016-10-18

摘要

P8-Se301-C1细胞是一株含有部分甜菜夜蛾核多角体病毒（Spodoptera exigua multiple nucleopolyhedrovirus，SeMNPV）基因组的甜菜夜蛾Se301细胞克隆株。该细胞不产生子代病毒粒子，对同源病毒SeMNPV的感染具抑制作用，是SeMNPV潜伏样（latent-like）感染的Se301细胞。为明确P8-Se301-C1细胞中所含的SeMNPV基因，本研究对P8-Se301-C1细胞和Se301细胞的转录组进行了重头组装并比较分析。从P8-Se301-C1细胞中共获得54,569,296条读长（read），并最终得到112,565条平均长度为1,093 nt的独立基因（unigene）。从Se301细胞中共获得56,865,504条读长，并最终获得102,996条平均长度为1,082 nt的独立基因。通过RNA-Seq分析以及RT-PCR进一步验证表明，P8-Se301-C1细胞中含有se5, se7, se8, se12, se43, se45, se89, se90, se124和se126等10个SeMNPV病毒基因转录本，且这些转录本在Se301细胞中未检测到。3'/5' RACE分析表明，这些病毒基因转录本的3'-或5'-末端序列与宿主细胞基因序列一致，暗示在P8-Se301-C1细胞中，SeMNPV基因可能与宿主基因整合并转录。另外，通过RACE分析在P8-Se301-C1细胞中还检测到了se11, se42, se44, se88, se91和se127等6个病毒基因转录本，它们与其他SeMNPV病毒基因嵌合在一起形成融合基因转录本。总体地，在P8-Se301-C1细胞中共检测到了16个SeMNPV病毒基因转录本。研究结果为进一步深入研究杆状病毒潜伏感染和超感染排斥的机理提供信息。
- RNA-Seq
- , SeMNPV
- , 杆状病毒
- , 潜伏感染
- , 甜菜夜蛾

De novo transcriptome analysis of Spodoptera exigua multiple nucleopolyhedrovirus (SeMNPV) genes in latently infected Se301 cells

School of Life Sciences, Guizhou Normal University, Guiyang, 550001, China

Corresponding author: Qingbei Weng, wengqb@126.com
ORCID: 0000-0003-4244-573X
Received Date: 22 April 2016
Accepted Date: 22 September 2016
Published Date: 18 October 2016

Abstract

Cells of the P8-Se301-C1 strain are Spodoptera exigua cell clones that each harbor a partial version of the S. exigua multiple nucleopolyhedrovirus (SeMNPV) genome and which are resistant to homologous SeMNPV infections. The cells produce no viral progeny, suggesting that the infection is a latent-like viral infection. To investigate the SeMNPV genes harbored in the P8-Se301-C1 cells, the de novo transcriptomes of P8-Se301-C1 cells and S. exigua Se301 cells were analyzed and compared. A total of 54,569,296 reads were obtained from the P8-Se301-C1 cells that yielded 112,565 final unigenes with a mean length of 1,093 nt. A total of 56,865,504 reads were obtained from the Se301 cells that yielded 102,996 final unigenes with a mean length of 1,082 nt. Ten SeMNPV gene transcripts (se5, se7, se8, se12, se43, se45, se89, se90, se124, and se126) were detected in the P8-Se301-C1 cells by RNA-Seq but not in the Se301 cells, which was verified by RTPCR. 5'/3' RACE analyses showed that the 3'- or 5'-end sequences of the viral transcripts are aligned to the host gene sequences in P8-Se301-C1 cells, suggesting that the SeMNPV genes may integrate into and be transcribed with the host genes in the P8-Se301-C1 cells. Furthermore, six additional viral gene transcripts, se11, se42, se44, se88, se91, and se127 (incorporated into chimeric fusion transcripts in the P8-Se301-C1 cells), were detected in the RACE analyses. Taken together, sixteen SeMNPV transcripts were identified in the P8-Se301-C1 cell strain. This study provides information to develop the understanding of baculovirus latent infections and superinfection exclusion.
- RNA-Seq
- , SeMNPV
- , baculovirus
- , latent infection
- , Spodoptera exigua

References
1. Bonilla GR, Roberts LR. 2005. The role of hepatitis B virus integrations in the pathogenesis of human hepatocellular carcinoma. J Hepatol, 42: 760-777.
  doi: 10.1016/j.jhep.2005.02.005
2. Cai Y, Long Z, Qiu J, Yuan M, Li G, Yang K. 2012. An ac34 deletion mutant of Autographa californica nucleopolyhedrovirus exhibits delayed late gene expression and a lack of virulence in vivo. J Virol, 86: 10432-10443.
  doi: 10.1128/JVI.00779-12
3. Chao YC, Wood HA, Chang CY, Lee HJ, Shen WC, Lee HT. 1992. Differential expression of Hz-1 baculovirus genes during productive and persistent viral infections. J Virol, 66: 1442-1448.
4. Chen B, Zhang YJ, He Z, Li W, Si F, Tang Y, He Q, Qiao L, Yan Z, Fu W, Che Y. 2014. De novo transcriptome sequencing and sequence analysis of the malaria vector Anopheles sinensis (Diptera: Culicidae). Parasit Vectors, 7: 314.
  doi: 10.1186/1756-3305-7-314
5. Choi JY, Roh JY, Wang Y, Zhen Z, Tao XY, Lee JH, Liu Q, Kim JS, Shin SW, Je YH. 2012. Analysis of genes expression of Spodoptera exigua larvae upon AcMNPV infection. PLoS One, 7: e42462.
  doi: 10.1371/journal.pone.0042462
6. de Jong J, Arif BM, Theilmann DA, Krell PJ. 2009. Autographa californica multiple nucleopolyhedrovirus me53 (ac140) is a nonessential gene required for efficient budded-virus production. J Virol, 83: 7440-7448.
  doi: 10.1128/JVI.02390-08
7. He W, You M, Vasseur L, Yang G, Xie M, Cui K, Bai J, Liu C, Li X, Xu X, Huang S. 2012. Developmental and insecticide-resistant insights from the de novo assembled transcriptome of the diamondback moth, Plutella xylostella. Genomics, 99: 169-177.
  doi: 10.1016/j.ygeno.2011.12.009
8. IJkel W, van Strien EA, Heldens JG, Broer R, Zuidema D, Goldbach RW, Vlak JM. 1999. Sequence and organization of the Spodoptera exigua multicapsid nucleopolyhedrovirus genome. J Gen Virol, 80: 3289-3304.
  doi: 10.1099/0022-1317-80-12-3289
9. Lambirth KC, Whaley AM, Blakley IC, Schlueter JA, Bost KL, Loraine AE, Piller KJ. 2015. A Comparison of transgenic and wild type soybean seeds: analysis of transcriptome profiles using RNA-Seq. BMC Biotechnol, 15: 89.
  doi: 10.1186/s12896-015-0207-z
10. Lee SS, Lehman IR. 1999. The interaction of herpes simplex type 1 virus origin-binding protein (UL9 protein) with Box I, the high affinity element of the viral origin of DNA replication. J Biol Chem, 274: 18613-18617.
  doi: 10.1074/jbc.274.26.18613
11. Li L, Harwood SH, Rohrmann GF. 1999. Identification of additional genes that influence baculovirus late gene expression. Virology, 255: 9-19.
  doi: 10.1006/viro.1998.9546
12. Li H, Jiang W, Zhang Z, Xing Y, Li F. 2013. Transcriptome Analysis and Screening for Potential Target Genes for RNAi-Mediated Pest Control of the Beet Armyworm, Spodoptera exigua. PLoS One, 8: e65931.
  doi: 10.1371/journal.pone.0065931
13. Lin G, Blissard GW. 2002. Analysis of an Autographa californica multicapsid nucleopolyhedrovirus lef-6-null virus: LEF-6 is not essential for viral replication but appears to accelerate late gene transcription. J Virol, 76: 5503-5514.
  doi: 10.1128/JVI.76.11.5503-5514.2002
14. Liu H, Wu W, Hou K, Chen J, Zhao Z. 2016. Deep sequencing reveals transcriptome re-programming of Polygonum multiflorum thunb. roots to the elicitation with methyl jasmonate. Mol Genet Genomics, 291: 337-348.
15. Mikhailov VS, Mikhailova AL, Iwanaga M, Gomi S, Maeda S. 1998. Bombyx mori nucleopolyhedrovirus encodes a DNA-binding protein capable of destabilizing duplex DNA. J Virol, 72: 3107-3116.
16. Minami M, Daimon Y, Mori K, Takashima H, Nakajima T, Itoh Y, Okanoue T. 2005. Hepatitis B virus-related insertional mutagenesis in chronic hepatitis B patients as an early drastic genetic change leading to hepatocarcinogenesis. Oncogene, 24: 4340-4348.
  doi: 10.1038/sj.onc.1208628
17. Murillo R, Hussey MS, Possee RD. 2011. Evidence for covert baculovirus infections in a Spodoptera exigua laboratory culture. J Gen Virol, 92: 1061-1070.
  doi: 10.1099/vir.0.028027-0
18. Perng GC, Chokephaibulkit K, Thompson RL, Sawtell NM, Slanina SM, Ghiasi H, Nesburn AB, Wechsler SL. 1996. The region of the herpes simplex virus type 1 LAT gene that is colinear with the ICP34.5 gene is not involved in spontaneous reactivation. J Virol, 70: 282-291.
19. Qiu L, Hou L, Zhang B, Liu L, Li B, Deng P, Ma W, Wang X, Fabrick JA, Chen L, Lei C. 2015. Cadherin is involved in the action of Bacillus thuringiensis toxins Cry1Ac and Cry2Aa in the beet armyworm, Spodoptera exigua. J Invertebr Pathol, 127: 47-53.
  doi: 10.1016/j.jip.2015.02.009
20. Saffert RT, Kalejta RF. 2007. Human cytomegalovirus gene expression is silenced by Daxx-mediated intrinsic immune defense in model latent infections established in vitro. J Virol, 81: 9109-9120.
  doi: 10.1128/JVI.00827-07
21. Slavokhotova AA, Shelenkov AA, Odintsova TI. 2015. Prediction of Leymus arenarius (L.) antimicrobial peptides based on de novo transcriptome assembly. Plant Mol Biol, 89: 203-214.
22. Sriram S, Gopinathan KP. 1998. The potential role of a late gene expression factor, lef2, from Bombyx mori nuclear polyhedrosis virus in very late gene transcription and DNA replication. Virology, 251: 108-122.
  doi: 10.1006/viro.1998.9404
23. Sun L, Qiu G, Cui L, Ma C, Yuan H. 2015. Molecular characterization of a ryanodine receptor gene from Spodoptera exigua and its upregulation by chlorantraniliprole. Pestic Biochem Physiol, 123: 56-63.
  doi: 10.1016/j.pestbp.2015.03.002
24. Tamori A, Yamanishi Y, Kawashima S, Kanehisa M, Enomoto M, Tanaka H, Kubo S, Shiomi S, Nishiguchi S. 2005. Alteration of gene expression in human hepatocellular carcinoma with integrated hepatitis B virus DNA. Clin Cancer Res, 11: 5821-5826.
  doi: 10.1158/1078-0432.CCR-04-2055
25. Tao X, Gu YH, Wang HY, Zheng W, Li X, Zhao CW, Zhang YZ. 2012. Digital gene expression analysis based on integrated de novo transcriptome assembly of sweet potato [Ipomoea batatas (L.) Lam]. PLoS One, 7: e36234.
  doi: 10.1371/journal.pone.0036234
26. The UniProt Consortiums. 2011. Ongoing and future developments at the Universal Protein Resource. Nucleic Acids Res, 39: D214-219.
  doi: 10.1093/nar/gkq1020
27. Thompson RL, Sawtell NM. 1997. The herpes simplex virus type 1 latency-associated transcript gene regulates the establishment of latency. J Virol, 71: 5432-5440.
28. Virto C, Navarro D, Tellez MM, Herrero S, Williams T, Murillo R, Caballero P. 2014. Natural populations of Spodoptera exigua are infected by multiple viruses that are transmitted to their offspring. J Invertebr Pathol, 122: 22-27.
  doi: 10.1016/j.jip.2014.07.007
29. Vogel H, Badapanda C, Knorr E, Vilcinskas A. 2014. RNA-sequencing analysis reveals abundant developmental stage-specific and immunity-related genes in the pollen beetle Meligethes aeneus. Insect Mol Biol, 23: 98-112.
  doi: 10.1111/imb.2014.23.issue-1
30. Wang Y, Wu W, Li Z, Yuan M, Feng G, Yu Q, Yang K, Pang Y. 2007. ac18 is not essential for the propagation of Autographa californica multiple nucleopolyhedrovirus. Virology, 367: 71-81.
  doi: 10.1016/j.virol.2007.05.017
31. Weng Q, Yang K, Xiao W, Yuan M, Zhang W, Pang Y. 2009. Establishment of an insect cell clone that harbours a partial baculoviral genome and is resistant to homologous virus infection. J Gen Virol, 90: 2871-2876.
  doi: 10.1099/vir.0.013334-0
32. Westenberg M, Uijtdewilligen P, Vlak JM. 2007. Baculovirus envelope fusion proteins F and GP64 exploit distinct receptors to gain entry into cultured insect cells. J Gen Virol, 88: 3302-3306.
  doi: 10.1099/vir.0.83240-0
33. Xiao CL, Mai ZB, Lian XL, Zhong J, Jin JJ, He QY, G. Z. 2014. FANSe2: a robust and cost-efficient alignment tool for quantitative next-generation sequencing applications, p. e94250, PLoS One, vol. 9.
34. Xu K, Sun F, Chai G, Wang Y, Shi L, Liu S, Xi Y. 2015. De novo assembly and transcriptome analysis of two contrary tillering mutants to learn the mechanisms of tillers outgrowth in switchgrass (Panicum virgatum L.). Front Plant Sci, 6: 749.
35. Xu HJ, Yang ZN, Zhao JF, Tian CH, Ge JQ, Tang XD, Bao YY, Zhang CX. 2008. Bombyx mori nucleopolyhedrovirus ORF56 encodes an occlusion-derived virus protein and is not essential for budded virus production. J Gen Virol, 89: 1212-1219.
  doi: 10.1099/vir.0.83633-0
36. Yang Y, Smith. SA. 2013. Optimizing de novo assembly of short-read RNA-seq data for phylogenomics. BioMed Central, 14: 328.
37. Yu M, Carstens EB. 2012. Choristoneura fumiferana multiple nucleopolyhedrovirus LEF-3-P143 complex can complement DNA replication and budded virus in an AcMNPV LEF-3-P143 double knockout bacmid. J Gen Virol, 93: 383-388.
  doi: 10.1099/vir.0.036699-0
38. Zhang G, Fedyunin I, Kirchner S, Xiao C, Valleriani A, Ignatova Z. 2012. FANSe: an accurate algorithm for quantitative mapping of large scale sequencing reads. Nucleic Acids Res, 40: e83.
  doi: 10.1093/nar/gks196
39. Zhao YJ, Zeng Y, Chen L, Dong Y, Wang W. 2014. Analysis of transcriptomes of three orb-web spider species reveals gene profiles involved in silk and toxin. Insect Sci, 21: 687-698.
  doi: 10.1111/ins.2014.21.issue-6
40. Zheng XL, Wang P, Cheng WJ, Wang XP, Lei CL. 2012. Projecting overwintering regions of the beet armyworm, Spodoptera exigua in China using the CLIMEX model. J Insect Sci, 12: 13.
Proportional views
10.1007s12250-016-3791-8.pdf

Figures(8) / Tables(5)

PDF

Article Metrics

Article views(7613) PDF downloads(23) Cited by(0)

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

INTRODUCTION

The beet armyworm—Spodoptera exigua—is a major migratory insect pest that damages numerous vegetables and is causing increasing economic losses in the agriculture sector, including the industries related to cotton, food crops, and timber (Zheng et al., 2012; Virto et al., 2014; Qiu et al., 2015; Sun et al., 2015). The baculovirus S. exigua multiple nucleopolyhedrovirus (SeMNPV) is a very specific pathogen of S. exigua, hence, it has been developed for use as a bioinsecticide (Virto et al., 2014). Several bioinsecticides, including SPEXIT^®(Andermatt Biocontrol, Switzerland), VIR-EX^®(Biocolor, Spain), and SPOD-X^®(Certis, US) contain SeMNPV.

Viral infections can be divided into acute, persistent, and latent infections. The vast majority viral genes are expressed during the acute infection and, because of the production of progeny virions, the infection spreads within a host and to new hosts (Saffert and Kalejta, 2007). During persistent infections, some viral genes are downregulated by viral or cellular regulatory gene products (Mayer and Ebbesen, 1994). A latent infection is defined as a reversible nonproductive infection of a cell in which the viral genome is present but infectious viruses are not produced except during intermittent episodes of reactivation (Stevens, 1989). During latent infections, viral genomes are maintained in host cells for a long periods with very little or no gene expression, which allows the virus to evade detection by the host immune system (Murillo et al., 2011).

P8-Se301 cells are S. exigua Se301 that are infected with an attenuated version of SeMNPV and the P8-Se301-C1 cell strain is cloned from these persistently infected P8-Se301 cells. P8-Se301-C1 cells harbor a partial SeMNPV genome and they are morphologically similar to Se301 cells but they do not produce viral progeny. The cells are resistant to SeMNPV infection but not to infection by the heterologous Autographa californica multiple nucleopolyhedrovirus (AcMNPV), which is also a baculovirus. It has been suggested that SeMNPV resides in P8-Se301-C1 cells as a latent-like infection, which means that these cells provide a promising experimental system to investigate the mechanisms of baculovirus persistence in insects (Weng et al., 2009).

Herpes simplex virus 1 (HSV-1) can establish lifelong latency in the trigeminal sensory neurons of humans, and the expression of the viral RNA known as latency-associated transcript (LAT) in the absence of the production of viral proteins is believed to play a role in establishing latency (Perng et al., 1996; Thompson and Sawtell, 1997). Therefore, exploring the transcription of SeMNPV genes in P8-Se301-C1 cells may aid the understanding of the molecular mechanism of latent infections.

Recently, Illumina strand-specific RNA sequencing (RNA-Seq)—a newly developed, large-scale, and genome-wide process—has been used for transcriptome analysis and gene discovery. This highlights the potential to use RNA-Seq to cost-effectively obtain large amounts of transcriptome data and then compare the evolution of the genomes of non-model species (Zhao et al., 2014; Slavokhotova et al., 2015; Xu et al., 2015). RNA-Seq has several obvious advantages, such as its cost-effectiveness, its high resolution, and the fact that it has a wide dynamic range of expression levels over which transcripts can be detected (Vogel et al., 2014; Lambirth et al., 2015). In this study, Illumina paired-end sequencing was used to analyze the de novo transcriptomes of S. exigua cells. We compared the transcriptome sequences of P8-Se301-C1 cells and Se301 cells using RNA-Seq and identified ten SeMNPV gene transcripts in the P8-Se301-C1 cells. Moreover, six additional SeMNPV gene transcripts were detected in P8-Se301-C1 cells using 5'/3' rapid amplification of cDNA ends (RACE) analyses. These novel findings provide useful information on the mechanisms of latent infections and superinfection exclusion.

MATERIALS AND METHODS

Cell culture

Se301 and P8-Se301-C1 cells were cultured at 27 ℃ in Grace's Insect Medium (Invitrogen, Carlsbad, US) supplemented with 10% fetal bovine serum (FBS), penicillin, (100 μg/mL) and streptomycin (30 μg/mL).

RNA extraction, cDNA synthesis, and Illumina sequencing

The total RNA was extracted from the two cell lines using a TaKaRa MiniBEST Universal RNA Extraction Kit (TaKaRa, Dalian, China) according to manufacturer's protocol. The RNA integrity number (RIN) of the total RNA was verified using an Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, US), and the quantity was measured using a NanoDrop 2000 Spectrophotometer.

The total RNA was purified further using an RNeasy Micro Kit (QIAGEN GmBH, Germany) and an RNase-Free DNase Set (QIAGEN). After removing the ribosomal RNA, the remaining RNA was split into short fragments using an RNA fragmentation buffer. The RNA fragments were used as templates to amplify first-strand cDNA using random hexamerprimers, and then the second cDNA strand was synthesized. cDNA libraries of both the Se301 cells and the P8-Se301-C1 cells were created using the double-stranded cDNA. Paired-end sequencing was carried out using the PE125 strategy of the Illumina HiSeq 2500 Sequencing System (Illumina, San Diego, CA, US) at Shanghai Biotechnology Corporation.

Sequence statistics, de novo assembly, and mapping

Before assembly and mapping, the raw RNA-Seq reads obtained from the Se301 and P8-Se301-C1 cDNA libraries were processed using the ShortRead package to filter out low-quality nucleotide sequences, adapters, and PCR primer sequences. Reads with a length shorter than 35 nt or with at least 2 ambiguous nucleotides (i.e., those which could be any type of nucleotide) were removed. The resulting cleaned reads were mapped to the SeMNPV genome to screen out the SeMNPV gene transcripts. The cleaned reads were assembled as primary unigenes using the Trinity package with an optimized k-mer length of 25 (Tao et al., 2012; Chen et al., 2014). The primary unigenes were cleaned by removing redundant genes and they were then assembled into a final set of unigenes using CD-HIT software (Yang and Smith, 2013). The cleaned reads were aligned to the final unigenes using the mapping algorithm, FANSe2, and allowing up to 7 mismatched nucleotides (Zhang et al., 2012; Xiao et al., 2014). The final set of unigenes (with at least 10 mapped reads) were considered to be reliably assembled unigenes.

Annotation

Universal Protein (UniProt) is the most comprehensive catalog of protein sequences and functional annotations (Uniprot Consortiums, 2011). In addition, the Gene Ontology (GO) database, the Clusters of Orthologous Groups of proteins (COG) database, and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database are major databases of putative gene functions (He et al., 2012). BLASTX was used to search the UniProt, GO, COG, and KEGG databases for the final unigenes, with an e-value cut-off of 1e-5.

Reverse transcription PCR (RT-PCR)

RT-PCR was used to verify the SeMNPV gene transcripts from the P8-Se301-C1 cells. A cDNA First Strand Synthesis Kit (SiDanSai, Shanghai, China) was used to reverse transcribe 1 μg of total RNA extracted from the P8-Se301-C1 cells and from the Se301 cells into cDNA. The transcripts of the SeMNPV genes that were detected in the P8-Se301-C1 cells by RNA-Seq were amplified using RT-PCR and Se301 cells were used as a negative control. The specific primers are shown in Supplementary Table 1. Agarose gel electrophoresis was carried out to separate the resulting DNA products, which were then sequenced by Beijing Ruibo (Guangzhou, China).

5'/3' RACE analysis

To sequence the full-length of the SeMNPV gene transcripts, the total RNA was extracted from the P8-Se301-C1 cells and RACE analyses were carried out with a SMARTer RACE 5'/3' Kit (Clontech, US). A set of gene-specific primers were designed (Supplementary Table 1) and labelled either GSP (gene-specific primer) or NGSP (nested gene-specific primer) according to the sequence of the RT-PCR products. PCR amplification was carried using the following protocol. Briefly, reaction volumes of 20 μL were prepared containing 5'/3'-RACE-Ready cDNA as the template, primers (0.5 μL of the 10 μM GSP and 2.0 μL of the universal primer mix provided in the kit) and 2×PrimerSTAR Max DNA Polymerase (TaKaRa). The conditions were as follows: 2 min at 94 ℃; 25 cycles of 30 s at 94 ℃, 30 s at 68 ℃, 3 min at 72 ℃; and a final extension at 72 ℃ for 5 min. The products were diluted with water (using a ratio of 1:10) and used as template cDNA for the nested PCR. The nested PCR conditions were as follows: 2 min at 94 ℃; 30 cycles of 30 s at 94 ℃, 30 s at 55 C-60 ℃, 3 min at 72 ℃; and a final extension at 72 ℃ for 5 min. The PCR products were visualized by electrophoresis on a 1% agarose gel and sequenced. Subsequently, the 5' and 3' fragment sequences were assembled to obtain the full-length cDNA sequences of each of the SeMNPV genes.

DISCUSSION

Although bioinformatics tools for sequence assembly and data analysis have been developed, the de novo assembly of short reads continues to be challenging in cases where there is no reference genome (Chen et al., 2014). In this study, we obtained 56, 865, 504 raw reads from Se301 cells and 54, 569, 296 raw reads from P8-Se301-C1 cells using RNA-Seq. Previous studies have obtained more than 34, 809, 334 raw reads associated with the S. exigua transcriptome and representing all the stages of the lifecycle of S. exigua comprising the egg, larval, pupal, and adult stages (Li et al., 2013) and 74, 928 raw reads associated with the gene expression of S. exigua larvae infected with AcMNPV (Choi et al., 2012). After assembly of the de novo transcripts from the P8-Se301-C1 and Se301 samples using Trinity software, a total of 112, 565 and 102, 996 final unigenes were generated, respectively. The set of final unigenes from the P8-Se301-C1 cells contained 26, 553 (23.6%) unigenes that were annotated and the final set of unigenes from the Se301 cells contained 24, 906 (24.2%) unigenes that were annotated. This additional data provides more genetic information for further studies of S. exigua.

In the P8-Se301-C1 cells, ten SeMNPV gene transcripts were detected by RNA-Seq. RACE analyses were carried out to analyze the full length of each of the ten gene transcripts, and six additional SeMNPV genes were detected during the RACE analyses. Thus, taken together, sixteen SeMNPV gene transcripts were identified in P8-Se301-C1 cells, comprising the full-length sequences of nine genes (se5, se7, se8, se12, se43, se45, se89, se124 and se126), and the partial sequences of seven genes (se11, se42, se44, se88, se90, se91 and se127).

se5 is a unique SeMNPV gene with unknown function. A bioinformatics analysis showed that it has five protein kinase C phosphorylation sites and nine casein kinase Ⅱ phosphorylation sites (Sun et al., 2015). Compared to the wild-type control, the deletion of se5 resulted in a decrease in the pathogenicity of occlusion bodies (OBs) and a significantly extended the viral life cycle. se7 is an early gene that is unnecessary for viral replication but required for efficient production of baculoviruses(de Jong et al., 2009). se8, which encodes F protein, has been found in group Ⅱ nucleopolyhedroviruses (NPVs). It is a functional homolog of the GP64 protein of group Ⅰ NPVs of alphabaculoviruses but has a higher rate of activitythan GP64 (Westenberg et al., 2007).

se11 is a closely related homolog of ld137a (orf4 PE) from Lymantria dispar multiple nucleopolyhedrovirus (LdMNPV), which encodes a protein that appears to be a major component of the polyhedron envelope (Gombart et al., 1989a; Russell & Rohrmann, 1990). se12 is a late gene expression factor that plays a direct role in very late gene transcription (Sriram and Gopinathan, 1998). se42 is the closely related homolog of ac19 from AcMNPV (IJkel et al., 1999). se44 is a unique SeMNPV gene with unknown function (IJkel et al., 1999). The homolog of se43 in AcMNPV is ac18, which is a highly conserved gene in lepidopteran nucleopolyhedroviruses. se43 may play a role in the efficiency of S. exigua infections (Wang et al., 2007).

In terms of protein identity and genomic location, the homolog of se45 in LdMNPV is ld120 (rr2b) (IJkel et al., 1999). se88 is a closely related homolog of ac71 (iap-2) from AcMNPV, which functions as an apoptosis inhibitor and is a potential host range factor (Li et al., 1999). The homolog of se89 in AcMNPV, ac69, has a stimulatory effect on late gene expression and has been implicated in cell division (Li et al., 1999). se90 is a baculovirus core gene that has been shown to be highly conserved in all baculoviruses that have been fully sequenced (Nie et al., 2012). It is homologous to ORF 56 (bm56) of B. mori nuclear polyhedrosis virus (BmNPV), which facilitate efficient virus production in vivo (Xu et al., 2008). se91 is a closely related homolog of ac67 (lef-3) from AcMNPV, which is essential for AcMNPV DNA replication in vivo (Yu and Carstens, 2012).

The se124 homologs are highly conserved in all sequenced alphabaculoviruses and its homolog in AcMNPV, Ac34, is an activator protein that promotes late gene expression and is essential for the pathogenicity of AcMNPV (Cai et al., 2012). se126 is a homolog of ORF16 (bm16) of BmNPV, which functions as a single-stranded DNA binding protein that plays a role in virus replication (Mikhailov et al., 1998). se127 is a closely related homolog of ac28 (lef-6) of AcMNPV, which is not essential for viral replication but the infection cycle is substantially delayed in its absence (Lin and Blissard, 2002).

Previously, a study demonstrated the presence of a SeMNPV polyhedrin transcript in P8-Se301-C1 cells (Weng et al., 2009). In this study, before RNA-Seq was carried out, the presence of the polyhedrin transcript was confirmed by nested RT-PCR using the total RNA extracted from the P8-Se301-C1 cells. However, the transcript was not identified from the RNA-Seq results. Six more transcripts (se11, se42, se44, se88, se91 and se127) were detected using RACE analyses, but not by RNA-Seq, which implies that more SeMNPV gene transcripts may be present in P8-Se301-C1 cells but were not identified due to their low abundance.

The RACE analyses of the full-length transcripts containing SeMNPV genes showed that the 3′ and 5′ end sequences of the transcripts map to B. mori genes or to unknown genes that cannot be aligned to sequences in the NCBI's Nucleotide Database. We also attempted to align the 3′ and 5′ end sequences to the previous sequence data on S. exigua (NCBI accession numbers: SRX110132, SRX110248, GAFU00000000 and SRA056289) (Choi et al., 2012; Li et al., 2013), but there were no matches. As the entire SeMNPV genome has been sequenced (IJkel et al., 1999), the 3′ and 5′ end must be from the host genome and not from the SeMNPV genome, suggesting that the SeMNPV genes integrate into host genome and are transcribed with the host genome in P8-Se301-C1 cells.

Latent infections of insect cells with nudivirus 1 (HzNV-1) is characterized by the expression of only one viral transcript (persistence associated transcript 1, PAT1) (Chao et al., 1992), and the viral DNA exists as either a circular or an inserted form (Lee and Lehman, 1999). In chronic hepatitis B virus (HBV) infections of humans, subgenomic HBV DNA fragments were found to integrate into random sites of the host genome (Bonilla and Roberts, 2005; Minami et al., 2005). Due to the integration, the host genome was altered (a "cis" effect) and the HBV genome was altered (a "trans" effect) (Bonilla and Roberts, 2005; Minami et al., 2005), which may led to the modulation of the expression of the human genes near the integration sites, followed by integration site-specific expression of the genes during hepatocarcinogenesis (Tamori et al., 2005). In our study, we found that SeMNPV genes were integrated into the host genome in the P8-Se301-C1 cells, which led to the production of novel fusion transcripts. The results support our previous findings (Weng et al., 2009) and further demonstrate that SeMNPV resides in P8-Se301-C1 cells as a latent infection.

In summary, we obtained fully assembled transcripts from S. exigua cells and identified sixteen transcripts of SeMNPV genes in P8-Se301-C1 cells. Among the SeMNPV gene transcripts, ten transcripts were detected using RNA-Seq and six additional transcripts were detected using RACE analyses. The findings suggest that, in Se301 cells that are latently infected with SeMNPV, some baculoviral genomic DNA fragments fuse together and integrate into the host genome, which leads to the production of novel chimeric fusion transcripts. The role of these novel fusion transcripts in latency in P8-Se301-C1 cells remains unclear. Future DNA sequencing studies would be useful to further explain the form of the viral DNA in the latently infected insect cells and the locations of the viral gene integration sites in the host genome. The findings provide important insights into the molecular mechanisms of baculovirus latent infections and superinfection exclusion.

ACKNOWLEDGMENTS

The authors appreciate valuable thoughts and suggestions from Prof. Kai Yang and Dr. Wenbi Wu at State Key Laboratory of Biocontrol of Sun Yet-sen University. We also thank Hanquan Liang at Sun Yet-sen University for revisions. This research was funded by the National Natural Science Foundation of China (No. 31160378), the Science and Technology Foundation of Guizhou Province (No. LKS[2010]21), and the Startup Foundation for Doctors of Guizhou Normal University.

COMPLIANCE WITH ETHICS GUIDELINES

The authors declared that they have no conflict of interest. This article does not contain any studies with human or animal subjects performed by any of the authors.

AUTHOR CONTRIBUTIONS

QBW conceived and supervised the study, and finalized the manuscript. ZF performed most experiments and wrote the paper. JXS helped with the RACE experiments and manuscript drafting. All authors read and approved the final manuscript.

Supplementary figures/tables are available on the websites of Virologica Sinica: www.virosin.org; link.springer.com/journal/12250.

Figure (8) Table (5) Reference (40) Relative (20)

Transcript size (bp)	Harbored SeMNPV genes		5' end sequence		3' end sequence
Transcript size (bp)	Position in the transcript	SeMNPV gene (position in the SeMNPV genome)	Position in the transcript	Aligned host gene mRNA (position in the host gene)	Position in the transcript	Aligned host gene mRNA (position in the gene)
1847	nt 106-1634	se5(nt 6190-7713)	nt 3-48	B. mori akh2(nt 26-71)	nt 1707-1847	No hit^a
1530	nt 138-1310	se7(nt 9261-10433)	nt 1-130	No hit	nt 1373-1448	B. mori dh40(nt 3116-3192)
2136	nt 104-2103	se8(nt 12498-14495)	nt 12-60	B. mori dh40(nt 1035-998)	—^b	—
978	nt 300-933 nt 55-91 nt 162-229	se12(nt 16064-16693) se11(nt 15817-15852) se11(nt 15923-16101)	nt 5-48	B. mori akh2(nt 27-71)	nt 946-978	No hit
1691	nt 348-1511 nt 46-261	se43(nt 42696-43856) se42(nt 42392-42606)	nt 1-46	No hit	nt 1520-1691	No hit
1118	nt 161-1107 nt 62-70	se45(nt 44408-45394) se44(nt 44305-44312)	nt 1-61	No hit	—	—
1370	nt 488-671 nt 35-489	se89(nt 85499-86398) se88(nt 85088-85799)	nt 1-35	No hit	—	—
1035	nt 407-991 nt 57-323 nt 322-396	se124(nt 118809-119391) se90(nt 86521-86789) se91(nt 86788-86860)	nt 3-50	B. mori adh40(nt 25-71)	nt 1007-1035	—
1421	nt 102-1122 nt 1164-1354	se126(nt 120802-121788) se127(nt 121816-122307)	nt 1-62	No hit	nt 1360-1413	B. mori sifa(nt 462-564)
Note: a: The end sequence cannot be aligned to any known sequence in the NCBI's Nucleotide Database. b: The end sequence is matched to SeMNPV gene.

Primer	Sequence (5'-3')	Genome site (nt)	Product size (bp)
RT-PCR
se5-F	GCCTCTGCTATCGTTGCT	6832-6849	858
se5-R	CTGATCGGTGGTTTCTCC	7689-7672	858
se7-F	GAGGAGATACGAGGTGATG	9674-9692	753
se7-R	TTTCCAAACTTTAGTGCC	10426-10409	753
se8-F	CGCCAAAGACATAGTCCA	12557-12574	1736
se8-R	GCGTCAACATTGCCATTA	14292-14275	1736
se12-F	TATAGCGTTCTGTTTAGCG	16124-16142	557
se12-R	ATTGGATTGGTGCCTTTG	16680-16663	557
se43-F	TCAGCGTCAATAGACTCAT	43066-43084	651
se43-R	CGAAGCGATTCATAAAGTA	43716-43698	651
se45-F	ACGACGACTTTACCCAGAA	44523-44541	656
se45-R	ATCGGCGACAAACTCAAT	45178-45161	656
se89-F	ACCAACGCCGATTGTCTG	86155-86138	566
se89-R	GTGCGGTGGGCATCTTCA	85590-85607	566
se90-F	AGGGACCGTGTCGAAGTA	86765-86748	301
se90-R	CTGCCACCGTCAATAGGA	86465-86482	301
se124-F	GGTTGGGTGACGTGATAC	118937-118954	413
se124-R	GTCGCTACATTCGTAGTTGT	119349-119330	413
se126-F	TGGGATACTCAAGCCTAAA	121203-121221	533
se126-R	TCTCGCTCACCTTCTTATT	121737-121756	533
RACE-PCR		-	-
GSP-se5-F	CAAGAGGAGCCCTGGAAC	-	-
GSP-se5-R	GTGGAGGTAGAATACGGC	-	-
GSP-se7-F	GAGGAGATACGAGGTGATG	-	-
GSP-se7-R	CCGTGATTTCAAACCTTT	-	-
GSP-se8-F	CGCCAAAGACATAGTCCA	-	-
GSP-se8-R	ATTACCGTTACAACTGCG	-	-
GSP-se12-F	AGCGACATACCGTTGCAAGT	-	-
GSP-se12-R	GGCGATGTACGCGTTGAAAA	-	-
GSP-se43-F	CACCTTCGCCCTCAACAGAT	-	-
GSP-se43-R	AGCGCGTACAGAATGCTCTT	-	-
GSP-se45-F	TGTCTGCCGCACGAAAAGTA	-	-
GSP-se45-R	CTAAACGTCAGTCCGGGCAT	-	-
GSP-se89-F	CCGGCCAGTTTGCCAAATAC	-	-
GSP-se89-R	TCGTTTCCGCTAACGTCGAA	-	-
GSP-se124-F	CCAAAATCCCGACGACAACG	-	-
GSP-se124-R	GGTCGCGCATCATCATCAAC	-	-
GSP-se126-F	CGTTTCTGCGCGAATCTCTG	-	-
GSP-se126-R	TTGATCGCGAGCGAATACGA	-	-
NGSP-se5-F	TGATTTCGATGGCCTACCCG	-	-
NGSP-se5-R	TTTCCGGTCTGTCATCAGGC	-	-
NGSP-se7-F	AGCGCAGAACCGTATGTCAA	-	-
NGSP-se7-R	TCTCGTCTCGGTGACCGTAT	-	-
NGSP-se8-F	GTGCCGCATGAGCGATAAAG	-	-
NGSP-se8-R	TCGCTTTCGAACCTACCCAC	-	-
NGSP-se12-F	GAAGCGTTGACGCCGAAAAA	-	-
NGSP-se12-R	GCGCGGACGAACTTGAAAAT	-	-
NGSP-se43-F	GATTGCAGCCGTTCAAGAGC	-	-
NGSP-se43-R	CCCAAGGTGTACGTGTCGAT	-	-
NGSP-se45-F	CAGGCGTTGGATTGCATGTG	-	-
NGSP-se45-R	TTGACTATACTGTCGGCGGC	-	-
NGSP-se89-F	GGCGTCACCTTACGAGACAA	-	-
NGSP-se89-R	TGTCGAAGCAGCCGTACATT	-	-
NGSP-se124-F	CCGCCGTTAAACAACCATCG	-	-
NGSP-se124-R	TCTCCAAGACGACACTCCCA	-	-
NGSP-se126-F	GAGCATTCGTTGGTCGAAGC	-	-
NGSP-se126-R	AGAACTTGCGCACAAACGTC	-	-

Cell line	Raw reads	Quality trimmed	Adaptor trimmed	rRNA trimmed	Clean ratio
P8-Se301-C1	54, 569, 296	54, 292, 372	53, 666, 292	52, 784, 058	96.7%
Se301	56, 865, 504	56, 639, 360	55, 941, 000	55, 033, 958	96.8%

Cell line	Counts	Total length (nt)	N50 (nt)	Mean length	N%	GC%
P8-Se301-C1	116, 048	136, 121, 302	2, 137	1, 173	0.0	37.5
Se301	104, 600	122, 303, 958	2, 145	1, 169	0.0	37.6

Cell line	Counts	Total length (nt)	N50 (nt)	Mean length	N%	GC%
P8-Se301-C1	112, 565	121, 964, 401	1, 824	1, 093	0.0	37.3
Se301	102, 996	109, 124, 144	1, 803	1, 082	0.0	37.4

潜伏感染Se301细胞中甜菜夜蛾核多角体病毒（SeMNPV）基因的转录组分析

摘要

De novo transcriptome analysis of Spodoptera exigua multiple nucleopolyhedrovirus (SeMNPV) genes in latently infected Se301 cells

Abstract

References

Proportional views

Article Metrics

Related

Proportional views

通讯作者: 陈斌, bchen63@163.com

De novo transcriptome analysis of Spodoptera exigua multiple nucleopolyhedrovirus (SeMNPV) genes in latently infected Se301 cells

Corresponding author: Qingbei Weng, wengqb@126.com

HTML

Cell culture

RNA extraction, cDNA synthesis, and Illumina sequencing

Sequence statistics, de novo assembly, and mapping

Annotation

Reverse transcription PCR (RT-PCR)

5'/3' RACE analysis

Sequence trimming

De novo assembly

Functional annotation

Identification of SeMNPV gene transcripts in P8-Se301-C1 cells by RNA-Seq

3′/5′ RACE analyses of the SeMNPV gene transcripts

目录