![]() |
Swine flu PB2 gene phylogenetic tree |
This post will be updated through the day.
Update 1 (near the bottom): I don’t think the Ohio/07 virus is particularly closely related to the current swine flu
Update 2 (at the bottom): A helpful comment from The Virology Blog about the ancestry of the virus
As I noted yesterday, the CDC has released a number of the Swine Flu genome sequences; they’re available here. (There are more sequences released yesterday.) In my quick and primitive glance at them yesterday, I didn’t see anything very remarkable about the sequences — they looked like more or less straightforward swine flu things.
Sandra, at Discovering Biology in a Digital World, has run some more sophisticated analyses on the sequences and suggests that these viruses may be “the same strain that caused an outbreak in 2007 at an Ohio country fair.” She is refering to the virus A/SW/OH/511445/2007, which is described here:
Vincent, A., Swenson, S., Lager, K., Gauger, P., Loiacono, C., & Zhang, Y. (2009). Characterization of an influenza A virus isolated from pigs during an outbreak of respiratory disease in swine and people during a county fair in the United States Veterinary Microbiology DOI: 10.1016/j.vetmic.2009.01.003
In August 2007, pigs and people became clinically affected by an influenza-like illness during attendance at an Ohio county fair. … Approximately 26 people in close association with the fair pigs were affected by an influenza-like illness. Viruses from at least two individuals were isolated, sequenced and analyzed at the Centers for Disease Control and determined to be nearly identical to the swine virus studied here (A. Klimov, personal communication).
The accession numbers for the Ohio virus are EU604689-EU604696.
Please see Sandra’s page for her explanation of what she did. However, I am not yet convinced. I ran my own analysis, selecting sequences in what I think is a diffferent, and what I think is a more appropriate, way, and I did not see the present swine flu clustering with the Ohio strain. (See the snippet of a figure at the top left here; click for a larger version.) I am far from an expert in phylogenetic analysis or on influenza virus, so I’m not saying I’m right by any means — but I would like to see an explanation of why my approach is wrong.
Basically — as far as I can tell; Sandra hasn’t posted her complete methods as I write this, though that will come later today — the difference between our conclusions may be the way we collected sequences for comparison. She “used H1N1 (and a couple of H1N2) protein sequences from Swine and Humans between Jan 1 2006 and today.” I didn’t try to restrict the sequences in any way — rather, I ran a BLAST search on the non-redundant nucleotide collection in GenBank and selected the top 100 (or 1000) most similar sequences for comparison. To me, it makes sense that we should not artificially restrict our sample, but rather should look at all the closest matches we can find.
In my PB2 tree, the nearest neighbours are another California isolate of the new strain, then the closest cluster includes A/duck/NC/91347/01(H1N2), A/mallard duck/South Dakota/Sg-00125/2007(H3N2), A/pintail duck/South Dakota/Sg-00126/2007(H3N2), and A/swine/Korea/CAS05/2004(H3N2). (The avian PB2 is expected, because current swine flu strains have acquired an avian PB2.)
One way to look more closely at this might be to look at some of the gene sequences more specifically. Vincent et al. point to some unusual features of the Ohio virus’s HA gene. Though it’s not likely to be unique it may help sort out relationships here:
Unique changes at antigenic determinant sites were identified in the OH07 HA at positions 71 and 162 and may play a role in the loss of cross-reactivity. The NA gene was shown to be related to the swine N1 phylogenetic cluster (Fig. 1B). The internal genes (Fig. 2A–F) were shown to be of the triple reassortant SIV lineage and group with those of the cluster IV H3N2 viruses reported by Olsen, et al. (Olsen et al., 2006). The PB2 gene was determined to contain the conserved avian amino acid residue glutamic acid at position 627, reported to be important in avian and human host specifcity (Subbarao et al., 1993).
Right now I need to get my kids ready for school, but I’ll look at those genes later today.
Update 1. OK, I’ve looked at the sites in the Ohio virus that Vincent et al flagged as being unsual. The current swine flu viruses do not match all those unusual sites. They do match some but so do many (dozens of) other swine flus. I am not really seeing evidence that the A/SW/OH/511445/2007 swine flu isolate is “the same strain that caused an outbreak in 2007 at an Ohio country fair.”
Vincent et al (their Table 3) flagged the following sites as unusual; here are comparisons. I ran several from the current outbreak — also a couple of HAs from swine flus that a simple BLAST search identified as highly related to the current outbreak (much more similar than is the A/SW/OH/511445/2007). The figure at bottom shows it graphically (current outbreak in blue — A/SW/OH/511445/2007 in green — unusual residues boxed in red) and the table breaks it down.
71 | 73 | 74 | 142 | 156 | 162 | |
A/California/04/2009_H1N1 | S | A | S | K | N | S |
A/California/07/2009(H1N1) | S | A | S | K | N | S |
A/Texas/05/2009(H1N1) | S | A | S | K | N | S |
A/Swine/Ohio/891/01(H1N2) | F | A | S | K | N | S |
A/SW/OH/511445/2007 | S | A | S | N | N | N |
A/swine/Guangxi/17/2005(H1N2) | F | A | S | K | N | S |
Update 2. Vincent at The Virology Blog says:
According to ProMED-mail, the NA and MP genes are related to those of influenza viruses from Asian-European swine, and the other genes appear to originate from swine flu viruses from pigs in North America. The data are in accord with the original assertion of the CDC that all genes of the new isolate were derived from swine viruses.
[…] More Swine Flu genome […]
I’ve compared the California and New York HA sequences, and they differ by only a few amino acids. If we assume that the NY strain came from Mexico, this would suggest that the Mexican strains are going to be similar. The increased severity (if that is really true) in Mexico is likely to be due to host factors. I notice a German sequence just came in – from a tourist who had traveled to Mexico. I’m going to do a quick alignment of that now. I wish I had more time to spend on this!
I still think the most likely reason for the apparent difference in Mexico and the US is the missing denominator, rather than any host or environmental factor. That is, I suspect that the virus is much more widespread in Mexico City than the authorities know. Because they almost entirely tested severely ill hospitalized patients, it’s not surprising that they found high mortality rates; if they were also testing mild cases (as is happening in the US and elsewhere) I think they would likely find a very widespread infection with a low mortality rate.
But who knows.
Hi,
I’ve tried a similar analysis to Sandra’s, except using BLAST and maximum likelihood:
http://koppology.blogspot.com/2009/04/swine-flu-phylogeny-part-ii.html
I agree that there is not a particularly strong argument for the Ohio strain being particularly close. However, I have made other observations that I’ve detailed in the post — I think there is some tentative evidence for antigenic shift.
This is very much a first attempt, feedback would be appreciated.
See also
http://www.wired.com/wiredscience/2009/04/swinefluupdate/
David
[…] However, it should take a long time for three viruses to combine into one coat. Long enough for a precursor to have spread widely, causing animals to get sick so that they get tested, perhaps to the point that the new genome is detected in a laboratory, before a third virus is incorporated into the new strain. It should also be possible to find a virus having a combination of two of the three RNA strains somewhere else in the wild. This may have occurred already in pigs in Ohio (2007). There are doubts, however, that the Ohio outbreak is not a red herring, as is covered well here. […]
what puzzle me is position 185, 331 & 391
does Mexico only deposit her Sequences at GISAID ? Is it ethical to withhold them ?
Do they claim for some sort of “copyright” on
“their” viruses like Indonesia does for H5N1 ?
So we will always know who was the originator
of this mess when we get swine flu ….
A/Mexico/4482/2009 (H1N1)
A/Mexico/4486/2009 (H1N1)
A/Mexico/4108/2009 (H1N1)
A/Mexico/4115/2009 (H1N1)
A/Mexico/4603/2009 (H1N1)
A/Mexico/4604/2009 (H1N1)
everything seems to go back to one ancestor
virus from this year. Presumably the one where
the reassortment happened
Nice Work, But there a fundamental question that recombination will occur in gene level and not the protein (aminoacid) therefore It make more sense to use phylogenic tree or blast based on the genome sequence since the results of comparison by these two method are not completely identical, here are the closest match for canadian SW H1N1 that I found by simple blast. I look forward for your comments.
1 PB2 AF251434.1 (A/Swine/Minnesota/593/99 (H3N2))
AF251426.1 (A/Swine/Iowa/569/99 (H3N2))
AF251418.1 (A/Swine/Iowa/533/99 (H3N2))
AF251410.1 (A/Swine/Nebraska/209/98 (H3N2))
AY233387.1 (A/duck/NC/91347/01(H1N2))
2 PB1 AF342823.1 (A/Wisconsin/10/98 (H1N1))
AF250130.1 (A/Swine/Indiana/9K035/99 (H1N2))
AY233388.1 (A/duck/NC/91347/01(H1N2))
AF251429.1 (A/Swine/Minnesota/593/99 (H3N2))
AF251421.1 (A/Swine/Iowa/569/99 (H3N2))
3 PA CY033792.1 (A/pintail duck/South Dakota/Sg-00126/2007(H3N2))
AY233389.1 (A/duck/NC/91347/01(H1N2))
EF551052.1 (A/swine/North Carolina/2003(H3N2))
AF455722.1 (A/Swine/Illinois/100084/01 (H1N2))
AF251433.1 (A/Swine/Minnesota/593/99 (H3N2))
4 HA AF250124.1 (A/Swine/Indiana/9K035/99 (H1N2))
EF556201.1 (A/swine/Guangxi/17/2005(H1N2))
AY060047.1 (A/SW/MN/23124-T/01(H1N2))
AF455680.1 (A/Swine/Indiana/P12439/00(H1N2))
AY060050.1 (A/SW/MN/16419/01(H1N2))
5 NP AF251415.2 (A/Swine/Iowa/533/99 (H3N2))
AF251423.2 (A/Swine/Iowa/569/99 (H3N2))
EU798854.1 (A/swine/Korea/CY05/2007(H3N2))
EU798853.1 (A/swine/Korea/CY04/2007(H3N2))
AF251431.2 (A/Swine/Minnesota/593/99 (H3N2))
6 NA AF250366.2 (A/Swine/England/195852/92 (H1N1))
CY038009.1 (A/swine/England/WVL7/1992(H1N1))
CY038001.1 (A/swine/Spain/WVL6/1991(H1N1))
CY037908.1 (A/swine/England/WVL10/1993(H1N1))
AM920729.1 (A/swine/Germany/Vi5698/95(H1N1))
7 M AJ293925.1 (A/Hong Kong/1774/99(H3N2))
AY363575.1 (A/swine/Hong Kong/5212/99(H3N2))
AY363574.1 (A/swine/Hong Kong/5200/99(H3N2))
AY363573.1 (A/swine/Hong Kong/5190/99(H3N2))
DQ186981.1 (A/swine/Leipzig/145/92(H3N2))
8 NS EU735822.1 (A/turkey/OH/313053/2004(H3N2))
DQ150437.1 (A/swine/IN/PU542/04 (H3N1))
AF342817.1 (A/Wisconsin/10/98 (H1N1))
AF153262.1 (A/Swine/Minnesota/9088-2/98 (H3N2))
AF153261.1 (A/Swine/Texas/4199-2/98 (H3N2))