Get Permission Procopio: In-silico design of epitopes against SARS-CoV-2 based on membrane protein


Introduction

Advances in bioinformatics have facilitated the selection of possible epitopes without the need for cultivation and the risks inherent in the manipulation of pathogens. This type of method is a great advantage over conventional vaccinology techniques, including faster results and lower costs (Soria-Guerra et al., 2015).1 SARS-CoV-2 is a single-stranded positive sense RNA virus (+ ssRNA) belonging to the genus Betacoronavirus of the Coronaviridae family, with a genome of approximately 30 kilobases and that codes for several structural and non-structural proteins. The four main structural proteins of CoVs are the spike glycoprotein (S), small envelope protein (E), membrane protein (M) and nucleocapsid protein (N) (Ujike and Taguchi, 2015, Schoeman and Fielding 2019, Rahman et al., 2020), 2, 3, 4 all of which are encoded at the end of the genome. The focus of this work was thus the in-silico design and construction of epitopes for generation of antibodies against SARS-CoV-2, based on membrane protein. This viral proteins interact with receptors on the host cell membrane and are required for penetration of the viral genome into cells by fusion of the viral envelope with the plasma membrane or endosomal membranes (Holmes and Williams 1990). 5

The SARS-CoV-2 sequence used for analysis was NC_045512.2, deposited with the NCBI (www.ncbi.nlm.nih.gov). The protein chosen was membrane protein ORF5, located in region 26,523 - 27,191 with 669 nt forming 222 aa (Figure 1). The similarity analysis of the membrane glycoprotein gene with other sequences deposited in the NCBI was 100% for MT585079, MT479226, LC553259, MT598178, MT577621, MT601295, MT577636 and MT577643, making it a relatively conserved gene. The analyses were performed in the Topology Data Bank of Transmembrane Proteins (TOPDB), Constrained Consensus TOPology Prediction Server (CCTOP) and Predict Secondary Structure (PSIPRED 4.0). The analysis of the interaction of the protein with the membrane showed that most of the protein is inside the viral envelope, as can be seen in Figure 1. The protein showed three regions inserted in the envelope membrane (underlined) and two regions outside the envelope (blue). In all, three analyses found the same topology, with approximately 120 amino acids inside the envelope (red) and 25 amino acids outside, as shown in blue in Figure 1.

Figure 1

Amino acid sequence of the SARS-CoV-2 membrane protein: in the blue region on the surface of the membrane that forms the envelope, in the red region on the inside of the envelope and in the underlined region, indicating interaction with the membrane. The blue highlighted region was used for epitope synthesis.

https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/741d18b4-5e12-4751-9a9b-18ad8c3ccb82/image/26264fbf-2edd-48f6-9ef6-38fe8324a7cd-uimage.png

The first step is to discriminate the epitopes that are potentially immunoprotective from epitopes that are not. The methods used to predict continuous antibody epitopes from the protein sequence were: Sequential B-Cell Epitope Predictor (BepiPred-2.0), and Antibody Epitope Prediction (IEDB), both of which showed that the start and end regions can be used to generate epitopes, as shown in Figure 2Figure 2 .

Figure 2

Predicted B-cell epitopes using BepiPred 2.0 epitope predictor. Yellow areas above the thres hold (red line) are proposed as part of B cell epitopes in membrane proteins.

https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/0d8abc35-6a05-4964-9e9f-766f6e4fda64image1.png

BepiPred-2.0 showed that the first 20 and the last 20 amino acids are the best regions to generate epitopes. The first 20 amino acids were chosen because they are on the surface of the viral envelope. The evaluation in CTLPred is a direct method for predicting CTL epitopes of T cells, where epitopes with 9 amino acids starting at amino acid 5 (score 0.99), 7 (score 0.79) and 8 (score 0.80) are excellent for generating epitopes. The analysis in AlgPred, based on amino acid composition, showed that the amino acid sequence has allergenic potential (score 0.5).

The DNA sequence used for synthesis was 60 bp (ATG GCA GAT TCC AAC GGT ACT ATT ACC GTT GAA GAG CTT AAA AAG CTC CTT GAA CAA TGG), generating a sequence of 20 amino acids (Met Ala Asp Ser Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Lys Leu Leu Glu Gln Trp), with the first 20 amino acids in Table 1. For production of the heterologous protein in Escherichia coli, two expression systems were used in pET-28a (Novagen), with His tag for purification and pFLAG-ATS (Sigma) for purification and detection of the FLAG fusion protein. The synthesis was performed by Twist Bioscience. One of the first steps in epitope validation is to track immunoreactivity through serological analysis. This requires expression of the recombinant protein in a suitable host. Escherichia coli expression systems are the preferred choice for the production of recombinant proteins. The potential advantages of epitope-based vaccines also include increased safety, the opportunity to rationally design epitopes to increase potency and amplitude, and the ability to focus immune responses on conserved epitopes. Therefore, in vivo it is expected that this protein will interact with antigen-presenting cells, eliciting a potentially protective immune response (Shey et al., 2019)6, in which isolated epitopes, capable of stimulating a specific immune response, can be identified and used to obtain advanced vaccine formulations, replacing those consisting of complete formulations of pathogens. The next step currently planned is to express this peptide in a bacterial system and carry out the various immunological tests necessary to validate the results obtained here through immuno-computer analysis.

Source of Funding

None.

Conflict of interest

None.

References

1 

R E Soria-Guerra R Nieto-Gomez D O Govea-Alonso S Rosales-Mendoza An overview of bioinformatics tools for epitope prediction: Implications on vaccine developmentJ Biomed Inf20155340514

2 

M Ujike F Taguchi Incorporation of Spike and Membrane Glycoproteins into Coronavirus VirionsViruses201574170025

3 

D Schoeman B C Fielding Coronavirus envelope protein: current knowledgeVirol J201916169

4 

M S Rahman M N Hoque M R Islam S Akter Asm Rubayet-Ul-Alam M A Siddique Epitope-based chimeric peptide vaccine design against S, M and E proteins of SARS-CoV-2 etiologic agent of global pandemic COVID-19: an in silico approachbioRxiv2020

5 

K V Holmes R K Williams Background Paper Functions of Coronavirus GlycoproteinsAdv Exp Med Biol1990276Springer

6 

R A Shey S M Ghogomu K K Esoh N D Nebangwa C M Shintouo N F Nongley In-silico design of a multi-epitope vaccine candidate against onchocerciasis and related filarial diseasesScientific Rep2019914409



jats-html.xsl


This is an Open Access (OA) journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.

  • Article highlights
  • Article tables
  • Article images

View Article

PDF File   Full Text Article


Copyright permission

Get article permission for commercial use

Downlaod

PDF File   XML File   ePub File


Digital Object Identifier (DOI)

Article DOI

https://doi.org/10.18231/j.ijmmtd.2020.043


Article Metrics






Article Access statistics

Viewed: 1381

PDF Downloaded: 577



Medical Abbreviation List