-
Tracing influenza viruses' evolution has been the subject of much research relevant to influenza viruses and frequent reassor-tment events for both avian and human influenza viruses have been detected using phylogenetic methods (6, 13). For instance, several genotypes of H5N1 avian influenza viruses have been detected in the past five years, and these have been designated A, B, C, D, E, G, V, W, X, Y, Z, Z+ and so on (4, 5, 11, 13). Viruses of genotype A, B, C, D, E and F were also identified for the H9N2 subtype (1). There is no denying that these studies revealed differences among viruses. However, all of the differences were principally at the DNA level and genetic information at the protein level was not fully utilized. Therefore, phylogenetic trees sometimes could not provide subtle differences among viral genes.
To better make use of the information at the protein level, molecular characterization analyses and reverse genetics techniques have been performed to help find the key sites relevent to pathogenicity, virulence and even host selection of influenza viruses (15, 16) etc. Up to date, some positions playing important roles in viral genomes have been found, such as the con-necting peptide sites in HA, Lys-627 in the PB2 fragment (7, 10) and so on. Thus, molecular characteri-zation analysis does have advantages in seeking single amino acid and short peptide mutations. However, it is difficult for it to integrate all these genetic information as a whole to find genes that co-reassort or proteins displaying compensatory mutations. To this end, Obenauer et al. introduced a proteotyping method to visualize unique amino acid signatures (proteotypes) (17). This method was able to identify co-reassorting genes, 50+ protein-protein pairs, virus "families" that share specific combination of genes and proteins exhibiting compensatory mutations (8).
Neuraminidase (NA) is a surface protein that cleaves sialic acid from virus and host cell glycocon-jugates at the end of the virus life cycle to allow mature virions to be released (25). Phylogenetic studies have revealed that the H5N1 avian influenza viruses of China were divided into three lineages according to the NA gene tree, with one lineage (Ⅰ) possessing a 19-aa deletion in the stalk of NA, one lineage (Ⅱ) without deletion, and one lineage (Ⅲ) with a 20-aa deletion (9, 24). Viruses of genotypes A, G, X, Y, Z and ShanTou3-like (ST3-like) belonged to group Ⅲ, while B, C, D, E, W, Z+, ST1-like and ST2-like isolates belonged to group Ⅱ and HK/156/97 was placed into group Ⅰ.
In this paper, we took NA gene fragments of some H5N1 influenza viruses isolated from mainland China, Hong Kong Special Administration Region (SAR) and Southern Asia as an example to illustrate how the proteotyping method worked.
HTML
-
The NA gene tree was mostly divided into two major lineages with a small branch out of them (Fig. 1). One major lineage involved viruses of genotype A, G, X, Y, Z, while the other involved genotype B, C, D, E, W, Z+.
Figure 1. Proteotypes for NA genes/proteins of some H5N1 avian and human influenza viruses. Phylogenetic analysis was based on nucleotides 20-1426 ( 1,407 bp) of the NA gene and the tree was rooted to K02252 (A/Parrot/Ulstcer/73. H7N1). Following the GenBank accession numbers there was the corresponding genotype or host information of these viruses. Scale bar. 0.02 nucleotide change per site. The left column was the GenBank accession numbers of the representative H5N1 avian and human influenza viruses. The protein alignment was adjusted according to the sequence orders of the viruses in the NA gene tree. The right column was the serial numbers designated to the NA proteotypes respectively.
-
Proteotypes of NA proteins supported the phy-logeny revealed by the NA gene tree (Fig. 1). However, there were some differences between the results of the phylogenetic and proteotyping analyses. First of all, the proteotyping analysis displayed protein differences within the lineage and even within the genotype. For example, the differences of NA proteins among the viruses of the X genotype were observed. Likewise, our results indicated that the Z genotype viruses might be further divided into more proteotypes (Fig. 1). Secondly, some proteotypes might involve more than one genotype. For instance, some viruses of genotype X, A, Y and Z were defined as the same proteotype -p1.2 (Fig. 1, Table 1). Finally, some co-variable amino acids that might be potentially important to maintain the advanced structures and functions of the proteins were found. Particularly, it was possible that Thr17, Lys64, Asn75, His233 and Ser320 co-evolved in the NA proteins of some Southeast Asia isolates of Genotype Z (Fig.1). It also suggested some sites of viruses of genotype W might evolve with each other (Fig. 1).
Table 1. Some representative H5N1 avian and human influenza viruses and their corresponding NA proteotypes