We got the idea of ORF7a deletion (accession no. OP711842) from our
earlier work on ORF8 inactivation with creation of TAA termination mutation
(Figure 2). BlastN search with deletion boundary gave ten sequences with 100%
homology and 100% cover (Figure 3). The oligonucleotides of few sequences at
the deletion boundaries were given in Table 1. The sequences with accession
nos. MW309830, MW309829 and MW309831 were obtained from earlier studies in 2020
whereas we found here only one sequences belonged to omicron variant (Table 1).
Table 1 showed that only ORF7a deletion mutants with accession numbers
OK249834, OL447329, MT628273, ON926042 and MW309829 were spread into omicron
variants. The figure 4 showed the Blast-2 search between oligonucleotides with
mutant genomes to confirm the authenticity of oligonucleotides (Figure 4). Few
ORF7a deletion abolished ATG codon of ORF7b gene (accession numbers OL522556,
OK249824, OK192887, MZ727798, OP334687, OW352845) while in few ORF7a mutants
ATG codon of ORF7b was retained (accession numbers OP711837, OL447329,
OU556481, OP398207, OK439931, ON926042) (Figure 5). In (Figure 6), we
demonstrated that few ORF7a deletion mutants had consecutive three deletions
(accession nos. OP174404 and OP490708; Omicron BA.4 variant). The first
deletion was 85nt at 27471 (5’-aca aca gta ctt tta aaa gaa cct tgc tct tct gga
aca tac gag ggc aat tca cca ttt cat cct cta gct gat aac aaa ttt g-3’), followed
by second 45nt deletion at 27608 (5’-aca cgt cta tca gtt acg tgc cag atc agt
ttc acc taa act gtt-3’) and a third deletion of 63nt at 27687 (5’-cca att ttt
ctt att gtt gcg gca ata gtg ttt ata aca ctt tgc ttc aca ctc aaa aga aag-3’) but
no change in ORF7b and ORF8 genes. Such truncation also created TGA and TAA two
termination codons but nullifying its own TGA termination codon facilitating no
ORF7a protein synthesis (Figure 6). We demonstrated the BlastX data of
ORF7a/b-ORF8 locus of two mutants. In mutant -1 (accession no. ON940286) ORF7a
gene fusion protein may be obtained but ORF7b and ORF8 protein could made
normal (Table 2). Whereas in mutant-2 (accession no. OK249834), no ORF7a and
also ORF7b due to deletion of ATG codon as well as ORF8 due to creation of TAA
termination codon. Thus, we clearly demonstrated the hotspot deletion site of
the ORF7a/b-ORF8 locus. In figure-7, we presented few ORF7a 96nt deletion
mutants belong to omicron BA.2 and BA.4 corona viruses as well as B.1.1.7
lineage (Table 3) (Figure 7). We made primers to amplify such deletion
sequences and made restriction analysis to characterize the mutant sequences
(Figure 9). We showed the multi-alignment of spike protein of ORF7a deletion
mutants to demonstrate their variant status. The OM966459 sequence was omicron
BA.1 as it had 143VYY deletion and 215EPE insertion in the spike. The MT628273,
ON577132, ON577132 sequences were early lineages as no ~30 mutations in the RBD
domain of spike.
The
ON940286, ON608606, and ON926042 sequences were omicron BA.2 with spike 24LPP
deletion but no 69HV deletion. The OP174404 and OP490708 sequence were omicron
BA.4 variant having 141KSF ORF1ab protein deletion (data not shown). The
OP791425, OP721043, ON852734, OP753941, OP543868 and OP405203 sequences were
likely omicron BA.5 variants. These findings clearly demonstrated the spread of
ORF7a deletion mutants into omicron corona virus. Note that all omicron
variants had 31ERS N-protein deletion (Figure 10). The accession numbers
beginning with OX, OV, OU and OD (UK origin) had no protein expression data.
Thus, our protein multi-alignment data was limited. We showed ORF1ab
polyprotein deletions that frequently found in different corona virus isolates.
The GHVMV, 141KSG and 3675SGF were important deletions and we found ORF7a
deletion mutants also had such genetic imprint. Never-the-less, we have no idea
regarding the functional consequence in nsp1 transactivator protein for GHVMV
and KSF deletion and in nsp6 accessory protein for SGF deletion. Finally, we
BlastN searched the full length ORF7a sequence which had three deletions to
trap more diverse ORF7a deleted sequences (Figure 11). The accession number
OP174404 had first 85nt deletion at 27468, followed by second 45nt deletion at
27608 and a third deletion of 63nt at 27713 position as described before. The
ORF7a mutant (accession no. ON482431) had 7nt first deletion at 27468 followed
by second 83nt deletion at 27509. Whereas, the ORF7a mutant (accession no.
ON611111) had 85nt one deletion at 27468 position and in another mutant
(accession no. OP077998) had also one 100nt deletion at the same position.
Thus, we showed the dynamics of ORF7a gene deletions with time since 2020 to
2022. It should be noted that few sequences (OV426369, OM356217, and ON482431)
were Delta variant with 157FR deletion in the spike whereas one sequence was
B.1.1.7 origin with 69HV and 145Y amino acids deletions in the spike. On the
other hand, 24LPP deletion only suggested that ON611111 and OX031117 sequences
were omicron BA.2 variant which had also 3675SGF ORF1ab deletion. Whereas with
141KSF deletion in nsp1 domain of ORF1ab polyprotein and 24LPP plus 69HV spike
deletions suggested BA.4 omicron corona virus (see, accession numbers OP077998
and OP174404). The accession number OK235146 was classified as omicron BA.5
variant with no 141KSF mutation but spike 24LPP plus 69HV spike deletions. But
the expression of ORF8 gene was hampered in this virus due to creation of
CAA=TAA termination codon and only 26AA length ORF8 might be produced. Anyway,
a R52I mutation in the ORF8 gene was also identified in this virus as we had
reported earlier (in press). The ORF7a
deletion mutants belong to accession numbers OV426369, OM356217, ON482431 and
OL835581 were Delta variants with 157FR deletion in the spike but we also found
119DF deletion in the ORF8 gene (data not shown here). Thus, we showed the
transmission and propagation of ORF7a deletion mutants across early B.0 and B.1
lineages to B.1.1.7 and B.1.617.2 most deadly variants and then also to mild
pathogenic but highly transmissible omicron corona virus variants. We
classified omicron variant into sub-variants BA.1, BA.2, BA.4 and BA.5 and
found ORF7a deletions in all omicron sub-variants (Figure 12). We showed the
hairpin structure of ORF7a gene locus which was changed in three deletion
mutants (hairpin loop 6 was abolished) but not in one deletion mutant. We
postulated that stiff hairpin loop-2 may have role in driving deletion in the
ORF7a gene. We performed a protein multi-alignment of 25 sequences from
different corona virus variants and only one I88S point mutation detected.
Overall, ORF7a gene deletions found consistently in B.1.1.7 and B.1.617.2 and
then B.1.1.529 variants. But point mutation in the ORF7a protein was very
limited (Figure 13).