Here are the results of a quick BLAST lineup of some of the segments from the new H1N1 influenza viruses.
gsgs said in comments that “we had the reassortment event from 1998 (or some years earlier ?) when this type of virus emerged. The ancestor of modern American Swine-flu.
Several other progenies from USA of this reassortment are at genbank.
But the new virus (in 123458) is more similar to the 1998 virus than to any of these, suggesting that it evolved separately (in Mexico ?) since ~1998. 511445 is more similar to several other viruses than to 1998.”
But as far as I can see that’s simply not true. Homology lineups — even just looking at H1N1 strains — pull out recent viruses long before the 1998. The closest H1N1 I find matching the HA is A/swine/OH/511445/2007(H1N1); for polymerase (which is probably better for lineups) it’s A/Iowa/CEID23/2005, just three years before the new H1N1. Only a NP search finds a 98 H1N1 as a best match, and that’s only a fraction of a percent better match than a 2005 Korean strain — ten more residues out of 1500.
What’s more, it’s wrong-headed. The differences in homology between different viruses here represent only a tiny handful of nucleic acid changes. And that kind of difference represents a week or two in the replication of a single virus. Look at the HA from isolates from the current outbreak! There are half a dozen changes between two different California isolates! That’s just about the difference that makes gsgs say the parent must be a 98 virus and not a 2005 virus, and yet that’s the same virus. There are a bunch of H3N2 viruses that are closer than any H1N1– are these therefore the parental viruses?
I’m sorry, it’s craziness to look at these kinds of changes and make the claim that it’s somehow informative of parental lines. The virus changes too fast to put any parent/progeny relationship on a handful of residues, because that takes like a week in the virus cycle.
| Hemagglutinin | ||
|---|---|---|
| Strain | Identities/total length | Percent identity |
| A/California/04/2009(H1N1) | 1701/1701 | 100.00 |
| A/California/07/2009(H1N1) | 1700/1701 | 99.94 |
| A/California/08/2009(H1N1) | 1700/1701 | 99.94 |
| A/California/07/2009(H1N1) | 1700/1701 | 99.94 |
| A/California/07/2009(H1N1) | 1698/1701 | 99.82 |
| A/California/10/2009(H1N1) | 1696/1701 | 99.71 |
| A/California/05/2009(H1N1) | 1696/1701 | 99.71 |
| A/Regensburg/Germany/01/2009(H1N1) | 1441/1446 | 99.65 |
| A/Texas/04/2009(H1N1) | 1695/1701 | 99.65 |
| A/Texas/04/2009(H1N1) | 1695/1701 | 99.65 |
| A/New York/19/2009(H1N1) | 1695/1701 | 99.65 |
| A/Texas/04/2009(H1N1) | 1695/1701 | 99.65 |
| A/California/06/2009(H1N1) | 1695/1701 | 99.65 |
| A/Texas/05/2009(H1N1) | 1695/1701 | 99.65 |
| A/Auckland/1/2009(H1N1) | 1571/1576 | 99.68 |
| A/Swine/Indiana/P12439/00 (H1N2) | 1624/1704 | 95.31 |
| A/Swine/Indiana/9K035/99 (H1N2) | 1624/1706 | 95.19 |
| A/Turkey/MO/24093/99(H1N2) | 1619/1704 | 95.01 |
| A/swine/Minnesota/1192/2001(H1N2) | 1618/1704 | 94.95 |
| A/Swine/Ohio/891/01(H1N2) | 1617/1704 | 94.89 |
| A/swine/Guangxi/17/2005(H1N2) | 1617/1705 | 94.84 |
| A/SW/MN/23124-T/01(H1N2) | 1617/1705 | 94.84 |
| A/SW/MN/16419/01(H1N2) | 1615/1705 | 94.72 |
| A/SW/MN/23124-S/01(H1N2) | 1615/1705 | 94.72 |
| A/Swine/Illinois/100085A/01 (H1N2) | 1614/1705 | 94.66 |
| A/Swine/Illinois/100084/01 (H1N2) | 1612/1706 | 94.49 |
| A/swine/Guangxi/13/2006(H1N2) | 1608/1704 | 94.37 |
| A/swine/Minnesota/00194/2003(H1N2) | 1609/1706 | 94.31 |
| A/swine/Kansas/00246/2004(H1N2) | 1603/1706 | 93.96 |
| A/swine/OH/511445/2007(H1N1) | 1598/1704 | 93.78 |
| A/Swine/North Carolina/93523/01 (H1N2) | 1597/1706 | 93.61 |
| Polymerase | ||
| Strain | Identities/total length | Percent identity |
| A/California/07/2009(H1N1) | 2151/2151 | 100.00 |
| A/California/04/2009(H1N1) | 2151/2151 | 100.00 |
| A/California/07/2009(H1N1) | 2151/2151 | 100.00 |
| A/California/04/2009(H1N1) | 2151/2151 | 100.00 |
| A/California/07/2009(H1N1) | 2150/2151 | 99.95 |
| A/Texas/04/2009(H1N1) | 2148/2151 | 99.86 |
| A/Texas/05/2009(H1N1) | 2148/2151 | 99.86 |
| A/Texas/04/2009(H1N1) | 2148/2151 | 99.86 |
| A/Texas/05/2009(H1N1) | 2148/2151 | 99.86 |
| A/California/06/2009(H1N1) | 2147/2151 | 99.81 |
| A/California/05/2009(H1N1) | 2147/2151 | 99.81 |
| A/Swine/Illinois/100084/01 (H1N2) | 2070/2154 | 96.10 |
| A/Swine/Minnesota/593/99 (H3N2) | 2069/2153 | 96.10 |
| A/Swine/Iowa/569/99 (H3N2) | 2069/2153 | 96.10 |
| A/Swine/Iowa/533/99 (H3N2) | 2069/2153 | 96.10 |
| A/Swine/Nebraska/209/98 (H3N2) | 2069/2153 | 96.10 |
| A/pintail duck/South Dakota/Sg-00126/2007(H3N2) | 2052/2129 | 96.38 |
| A/Swine/Minnesota/55551/00 (H1N2) | 2067/2153 | 96.01 |
| A/Swine/North Carolina/93523/01 (H1N2) | 2064/2153 | 95.87 |
| A/mallard duck/South Dakota/Sg-00125/2007(H3N2) | 2038/2115 | 96.36 |
| A/Swine/Korea/CY02/02(H1N2) | 2063/2153 | 95.82 |
| A/Swine/Indiana/P12439/00 (H1N2) | 2063/2153 | 95.82 |
| A/duck/NC/91347/01(H1N2) | 2062/2153 | 95.77 |
| A/Swine/North Carolina/98225/01(H1N2) | 2061/2153 | 95.73 |
| A/swine/Korea/CY05/2007(H3N2) | 2061/2154 | 95.68 |
| A/swine/Korea/CY04/2007(H3N2) | 2061/2154 | 95.68 |
| A/Swine/Illinois/100085A/01 (H1N2) | 2060/2154 | 95.64 |
| A/swine/Korea/CY09/2007(H3N2) | 2059/2154 | 95.59 |
| A/swine/North Carolina/2003(H3N2) | 2059/2154 | 95.59 |
| A/swine/Korea/CAS05/2004(H3N2) | 2056/2153 | 95.49 |
| A/swine/Korea/Hongsong2/2004(H1N2) | 2056/2153 | 95.49 |
| A/swine/Korea/JNS06/2004(H3N2) | 2055/2153 | 95.45 |
| A/swine/Korea/CAS09/2006(H3N2) | 2055/2153 | 95.45 |
| A/swine/Korea/PZ14/2006(H1N2) | 2055/2153 | 95.45 |
| A/swine/Korea/PZ7/2006(H1N2) | 2055/2153 | 95.45 |
| A/swine/Korea/PZ4/2006(H1N2) | 2055/2153 | 95.45 |
| A/swine/Korea/JL02/2005(H1N2) | 2055/2153 | 95.45 |
| A/swine/Korea/CAS07/2005(H3N2) | 2053/2153 | 95.36 |
| A/swine/Korea/CAN04/2005(H3N2) | 2053/2153 | 95.36 |
| A/Iowa/CEID23/2005(H1N1) | 2054/2154 | 95.36 |
| Nucleoprotein | ||
| Strain | Identities/total length | Percent identity |
| A/Texas/04/2009(H1N1) | 1497/1497 | 100.00 |
| A/Texas/05/2009(H1N1) | 1497/1497 | 100.00 |
| A/California/04/2009(H1N1) | 1497/1497 | 100.00 |
| A/Texas/04/2009(H1N1) | 1497/1497 | 100.00 |
| A/Texas/05/2009(H1N1) | 1497/1497 | 100.00 |
| A/California/04/2009(H1N1) | 1497/1497 | 100.00 |
| A/California/06/2009(H1N1) | 1496/1497 | 99.93 |
| A/California/07/2009(H1N1) | 1495/1497 | 99.87 |
| A/California/05/2009(H1N1) | 1494/1497 | 99.80 |
| A/Swine/Iowa/533/99 (H3N2) | 1450/1498 | 96.80 |
| A/swine/Korea/CY05/2007(H3N2) | 1449/1498 | 96.73 |
| A/swine/Korea/CY04/2007(H3N2) | 1449/1498 | 96.73 |
| A/swine/Korea/CY09/2007(H3N2) | 1448/1498 | 96.66 |
| A/Swine/Indiana/P12439/00 (H1N2) | 1447/1497 | 96.66 |
| A/Swine/Iowa/569/99 (H3N2) | 1447/1498 | 96.60 |
| A/swine/Korea/JNS06/2004(H3N2) | 1445/1497 | 96.53 |
| A/Swine/Ohio/891/01(H1N2) | 1446/1498 | 96.53 |
| A/Swine/Minnesota/593/99 (H3N2) | 1445/1498 | 96.46 |
| A/swine/Korea/CAS09/2006(H3N2) | 1443/1497 | 96.39 |
| A/swine/Korea/CAN04/2005(H3N2) | 1443/1497 | 96.39 |
| A/Swine/Indiana/9K035/99 (H1N2) | 1445/1499 | 96.40 |
| A/Swine/Minnesota/55551/00 (H1N2) | 1442/1497 | 96.33 |
| A/Wisconsin/10/98 (H1N1) | 1445/1500 | 96.33 |
| A/swine/Korea/CAS07/2005(H3N2) | 1441/1497 | 96.26 |
| A/Swine/Illinois/100084/01 (H1N2) | 1442/1498 | 96.26 |
| A/Swine/Illinois/100085A/01 (H1N2) | 1442/1498 | 96.26 |
| A/swine/Korea/CAS05/2004(H3N2) | 1440/1497 | 96.19 |
| A/swine/Shanghai/1/2007(H1N2) | 1440/1498 | 96.13 |
| A/Swine/North Carolina/93523/01 (H1N2) | 1437/1497 | 95.99 |
| A/Swine/North Carolina/98225/01(H1N2) | 1432/1495 | 95.79 |
| A/swine/Guangxi/17/2005(H1N2) | 1436/1498 | 95.86 |
| A/swine/Korea/CAS08/2005(H1N1) | 1435/1498 | 95.79 |
| A/turkey/IA/21089-3/1992(H1N1) | 1434/1497 | 95.79 |
| A/swine/Guangxi/13/2006(H1N2) | 1435/1498 | 95.79 |
| A/duck/NC/91347/01(H1N2) | 1433/1496 | 95.79 |
| A/swine/Korea/CAN01/2004(H1N1) | 1434/1498 | 95.73 |
| A/turkey/Ohio/313053/04(H3N2) | 1434/1498 | 95.73 |
| A/turkey/OH/313053/2004(H3N2) | 1433/1498 | 95.66 |
| A/swine/California/T9001707/1991(H1N1) | 1432/1497 | 95.66 |
| A/swine/North Carolina/2003(H3N2) | 1433/1498 | 95.66 |
| A/swine/MI/PU243/04 (H3N1) | 1433/1498 | 95.66 |
Again, many thanx for this work. Perhaps at some point in the near future you could educate us uninformed folks about nomenclature. Assuming there is some underlying logic. What makes any given H1N1 genome a ‘swine’-designated sequence instead of a place-designated sequence? ie A/Brisbane/yada/yada(H1N1) versus A/swine/OH/yada/yada(H1N1). Is it an antigenic determination or a sequence signature or simply proximity to an actual porcine mammal(see I have avoided the ‘s’ word LOL)
jay:
I quote from a post over on Effect Measure:
‘But each isolate of this virus also has an “official” name or designation. One isolated from a Texas case has this name: A/Texas/05/2009(H1N1). The naming system looks complicated but it’s really quite simple. The A-part means that this is a flu virus of type A (rather than flu B or flu C). Texas is a location marker, which could be broad, like Texas, or more narrow, like the name of a city (e.g., Hong Kong). The 05 is the specimen identifier in the lab where it was isolated and 2009 is the year. If the virus had been isolated from a non-human animal, that would be included, e.g., A/Chicken/Shantou/4231/2003. And the H1N1 part is the subtype.’
The post is at this link: http://scienceblogs.com/effectmeasure/2009/04/swine_flu_a_virus_by_any_other.php
seems that you are ignoring partial sequences.
Please include them.
The best matches are all from 1998/99 and not
from 2004-2007, where also some sequences are available. (segments 123458)
Except these South-Dakota ducks.
Moreover the expected number of mutations per year matches quite well with A/Wisconsin/10/1998,
so the new virus could be almost a direct descendent. (close relative)
Raul Rabadan tells me that the NA of the H1N1 is similar to Eurasian swine NAs that last circulated 10 years ago. And he also says that the swine flu database is sufficiently sparse for many countries that such comparisons aren’t valid. Plus, the flu database is full of errors (see his paper in J. Virol, 2008). So I suggest caution when using these sequences to make conclusions about the origin of influenza viruses.
Good point, especially about the flu database. That’s a fascinating article by Rabadan for several reasons.
Again, many thanx for this work. Perhaps at some point in the near future you could educate us uninformed folks about nomenclature. Assuming there is some underlying logic. What makes any given H1N1 genome a ‘swine’-designated sequence instead of a place-designated sequence? ie A/Brisbane/yada/yada(H1N1) versus A/swine/OH/yada/yada(H1N1). Is it an antigenic determination or a sequence signature or simply proximity to an actual porcine mammal(see I have avoided the ‘s’ word LOL)