forked from TomokazuKonishi/direct-PCA-for-sequences
-
Notifications
You must be signed in to change notification settings - Fork 0
/
abcd.txt
18 lines (18 loc) · 14.8 KB
/
abcd.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
>KT592526.1_Influenza_D_virus_(D/bovine/Italy/46484/2015)_segment_4_hemagglutinin-esterase_(HE)_gene_complete_cds --MFLLLATITAITACQ--------AERELI--CIVQRVNESFSLHSGFGGNVYSMKTEPMTGFTNVTKGASVINQKDWIGFGDSRTDLNNDQFPASSDVPLAVAKKFRSLSGASLMLSAFGPPGKVDYLYQGCGKEKVFYEGVNW-------------SPEAGIDCFGSNWTQTKKDFYSKIYEAARSSTCMTLVNSLDT------KISSTTATAGTASSCS-----SSW-MKSPLWY-----AESSVNPG-AKPQVCGTEQSATFTLPTSF------GIYKCNKHVVQLCYFVYENKTA-FNTFGCGDYYQNYYDGNGNLIGGMDNRVAAYRGIAN-----VGVKIECPSKILNPGTYSIRSTPRFLLVPKRSYCFD-TDGGYP-IQVVQSEWSASRRS-DNA-TEEACLQTEGCIFIKKTTPYVGEADDNHGDIEMRQLLSGLGNNDTVCVSQS-GYTKGETPFVRDYLSPPKYGRCQLKTDSGRIPTLPSGLIIPQAGTDSLMRTLTPATRIFGIDDLIFGLLFVGFVAGGVAG------GYFWGRSNGGGGGASVSSTQAGFDKIGKDIQQLRNDTNAAIEGFNGRIAHDEQA--IKNLAKEIEDARAEALVGELGIIRSLIVANLSMNLKES---LYELANQITKRG-GGIAQEAG-PGCW----YVDSENCDASCKEYI----FNFNGSATVPTLRPVDTKVVITSD-----PYYLGSTIALCLLGLVAIVASVGVIW---ICCKK*NLRKN-LLA-
>KM015494.1_Influenza_D_virus_(D/bovine/Shandong/Y125/2014)_segment_4_hemagglutinin-esterase_(HE)_gene_complete_cds --MFLLLATITAITACQ--------AERELI--CIVQRVNESFSLHSGFGGNVYSMKTEPMTGFTNVTKGASVINQKDWIGFGDSRTDLTNDQFPASSDVPLAVAKKFRSLSGASLMLSAFGPPGKVDYLYQGCGKEKVFYEGVNW-------------SPEAGIDCFGSNWTQTKKDFYSMIYEAAIGSTCMTLVNSLDI------KISSTTATAGTASSCS-----SSW-MKSPLWY-----AESSVNPG-AKPQVCGTEQSATFTLPTSF------GIYKCNKHVVQLCYFVYENKTT-FNTFGCGDYYQDYYDGNGNLIGGMDNRVAAYRGIAN-----AGVKIECPSKILNPGTYSIRSTPRFLLVPKRSYCFD-TDGGYP-IQVVQSEWSASRRS-DNA-TEEACLQTEGCIFIKKTTPYVGEADDNHGDIEMRQLLSGLSNNDTVCVSQS-GYTKGETPFVKDYLSPPKYGRCQLKTDSRRIPTLPSGLIIPQAGTDSLMRTLTPATRIFGIDDLIFGLLFVGFVTGGVAG------GYFWGRSNGGGGGASVSSTQAGFDKIGKDIQQLRNDTNAAIEGFNGRIAHDEQA--IKNLAKEIEDARAEALVGELGIIRSLIVANISMNLKES---LYELANQITKRG-GGIAQEAG-PGCW----YVDSENCDASCKEYI----FNFNGSATVPTLRPVDTKVVITSD-----PYYLGSTIALCLLGLVAIAASVGVIW---ICCKK*NLRKN-LLA-
>CAL69520.1 C_Johannesburg/66 -MFFSLLLVLGLTEAEK-----------IKI--CLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQSTWIGFGDSRTDKSNSAFPRSADVSAKTADKFRFLSGGSLMLSMFGPPGKVDYLYQGCGKHKVFYEGVNW-------------SPHAAINCYRKNWTDIKLNFQKNIYELASQSHCMSLVNALDK-------TIPLQVTAGTAGNCN-----NSF-LKNPALY-TQ-----------EVKPSENKCGKENLAF---FTLPTQFGTYECKLHLVASCYFIYDSKEV-YNKRGCDNYFQVIYDSFGKVVGGLDNRVSPYTGNSG-----DTPTMQCDMLQLKPGRYSVRSSPRFLLMPERSYCFD-MKEKGP-VTAVQSIWGKGRES-DYA-VDQACLSTPGCMLIQKQKPYIGEADDHHGDQEMRELLSGL-DYEARCISQS-GWVNETSPFTEKYLLPPKFGRCPLAAKEESIPKIPDGLL-IPTSGTDT-TVTKPKSRIFGIDDLIIGVLFVAIVETGIGGYLLG-----SRKESGGGVT--KESAEKGFEKIGNDIQILKSSINIAIEKLNDRISHDEQA--IRDLTLEIENARSEALLGELGIIRALLVGNISIGLQES---LWELASEITNRA-GDLAVEVS-PGCW----IIDNNICDQSCQNFI-------FKFNETAPVPTIPPLDTKIDL-----QSDPFYWGSSLGLAITATISLAALVISGIAIC-----RTK-----
>HM748627.1_Influenza_C_virus_(C/Catalonia/1372/2009)_segment_4_hemagglutinin-esterase_(HE)_gene_partial_cds ------------LGLTE----------AEKIKICLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQSTWIGFGDSRTDRSNSAFPRSADVSEKTADKFRSLSGGSLMLSMFGPPGKVDYLYQGCGKHKVFYEGVNW-------------SPHAAIDCYRKNWTDIKLNFQKNIYELASQSHCLSLVNALDK-------TIPLQVTKGVAGNCN-----DSF-LKNPALY-TQ-----------EVKPSENKCGEENLAS---FTLPTRFGTYECKMHLVASCYFIYDSKEV-YNKRGCGNYFQVIYDSSGKVVGGLDNRVSPYTGNSG-----DTPTMQCDMLQLKPGRYSVRSSPRFLLMPERSYCFD-MKEKGP-VTAVQSIWGKGRKS-DYA-VDQACLSTPGCMLIQKQKPYVGEADDHHGDQEMRELLSGL-DYEARCISQS-GWVNETSPFAEEYLLPPKFGRCPLAAKEESIPKIPDGLL-IPTSGTDT-TVTKPKSRIFGIDDLIIGLLFVAIVEAGIGGYLLG-----SRKESGGGVT--KESAEKGFEKIGNDIQILRSSTNTAIEKLNDRISHDEQA--IRDLTLEIENARSEALLGELGIIRALLVGNISIGLQES---LWELASEITNRA-GDLAVEVS-PGCW----IIDNNICDQSCQDFI-------FKFNETAPVPTIPPLDTKIDL-----QSDPFYWGSSLGLAITAAISLAALVISGIA-----------ICR-
>NC_006310.2_Influenza_C_virus_(C/Ann_Arbor/1/50)_HEF_gene_for_hemagglutinin-esterase-fusion_complete_cds -MFFSLLLMLGLTEAEK-----------IKI--CLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQSTWIGFGDSRTDKSNSAFPRSADVSAKTADKFRSLSGGSLMLSMFGPPGKVDYLYQGCGKHKVFYEGVNW-------------SPHAAINCYRKNWTDIKLNFQKNIYELASQSHCMSLVNALDK-------TIPLQATAGVAKNCN-----NSF-LKNPALY------TQEVNPS-VEK--CGKENLAFFTLPTQF------GTYECKLHLVASCYFIYDSKEV-YNKRGCDNYFQVIYDSSGKVVGGLDNRVSPYTGNSG-----DTPTMQCDMLQLKPGRYSVRSSPRFLLMPERSYCFD-MKEKGP-VTAVQSIWGKGRES-DHA-VDQACLSTPGCMLIQKQKPYIGEADDHHGDQEMRELLSGL-DYEARCISQS-GWVNETSPFTEEYLLPPKFGRCPLAAKEESIPKIPDGLL-IPTSGTDT-TVTKPKSRIFGIDDLIIGLLFVAIVEAGIGGYLLG-----SRKVSGGGVT--KESAEKGFEKIGNDIQILRSSTNIAIEKLNDRISHDEQA--IRDLTLEIENARSEALLGELGIIRALLVGNISIGLQES---LWELASEITNRA-GDLAVEVS-PGCW----VIDNNICDQSCQNFI-------FKFNETAPVPTIPPLDTKIDL-----QSDPFYWGSSLGLAITAAISLAALVISGIAICRTK*---------
>AAA43717.1 B/Hong_Kong/8/73 --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGVIPLT------TTPTKSHFANLKGTQTRGKLCPNCL-NCTDLDVALGRP--------KC---MGTIPSAKA------SILHEVKPVTSG--CFP-IMHD-----RTKIRQLPNLLRGYENIRLSARNVTNAETAPGGPYIVGTSGSCPNVTNGNGF-FATMAWAVPK--NKTATNPLTVEVPYICTKGEDQITV---W------GFH-SDDETQMVK--LYGDSKPQKFTSSANGVTTHYVSQ-----------IGGFPNQAEDEGLPQSGRIVVDYMVQKPGKTGTIAYQRGVLLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVDIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAGEFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMVSRDNVSCS-ICL-
>NC_002207.1_Influenza_B_virus_(B/Lee/1940)_segment_4_complete_sequence(2) --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGVIPLT------TTPTKSHFANLKGTQTRGKLCPNCF-NCTDLDVALGRP--------KC---MGNTPSAKV------SILHEVKPATSG--CFP-IMHD-----RTKIRQLPNLLRGYENIRLSTSNVINTETAPGGPYKVGTSGSCPNVANGNGF-FNTMAWVIPKDNNKTAINPVTVEVPYICSEGEDQITV---W------GFH-SDDKTQMER--LYGDSNPQKFTSSANGVTTHYVSQ-----------IGGFPNQTEDEGLKQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNYLSELEVKNLQRLSGAMNELHDE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVEIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAGDFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMVSRDNVSCS-ICL-
>NC_002207.1_Influenza_B_virus_(B/Lee/1940)_segment_4_complete_sequence ----------MVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGVIPLT------TTPTKSHFANLKGTQTRGKLCPNCF-NCTDLDVALGRP--------KC---MGNTPSAKV------SILHEVKPATSG--CFP-IMHD-----RTKIRQLPNLLRGYENIRLSTSNVINTETAPGGPYKVGTSGSCPNVANGNGF-FNTMAWVIPKDNNKTAINPVTVEVPYICSEGEDQITV---W------GFH-SDDKTQMER--LYGDSNPQKFTSSANGVTTHYVSQ-----------IGGFPNQTEDEGLKQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNYLSELEVKNLQRLSGAMNELHDE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVEIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAGDFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMVSRDNVSCS-ICL-
>AF387505.1_Influenza_B_virus_(B/Switzerland/4291/97)_hemagglutinin_mRNA_complete_cds --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGVIPLT------TTPTKSHFANLKGTKTRGKLCPTCL-NCTDLDVALGRP--------MC---VGITPSAKA------SILHEVRPVTSG--CFP-IMHD-----RTKIRQLPNLLRGYEKIRLSTQNVINAEKAPGGPYRLGTSGSCPNATSRSGF-FATMAWAVPRDNNKTATNPLTVEVPYICTKEEDQITV---W------GFH-SDNKTQMKN--LYGDSNPQKFTSSANGVTTHYVSQ-----------IGGFPDQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVDIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAREFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMISRDNVSCS-ICL-
>AF387504.1_Influenza_B_virus_(B/Switzerland/4291/97)_hemagglutinin_mRNA_complete_cds --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGVIPLT------TTPTKSHFANLKGTKTRGKLCPTCL-NCTDLDVALGRP--------MC---VGITPSAKA------SILHEVRPVTSG--CFP-IMHD-----RTKIRQLPNLLRGYEKIRLSTQNVINAEKAPGGPYRLGTSGSCPNATSRSGF-FATMAWAVPRDNNKTATNPLTVEVPYICTKEEDQITV---W------GFH-SDNKTQMKN--LYGDSNPQKFTSSANGVTTHYVSQ-----------IGGFPDQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVDIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAREFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMISRDNVSCS-ICL-
>AF387492.1_Influenza_B_virus_(B/Vienna/1/99)_hemagglutinin_mRNA_complete_cds --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGAIPLT------TTPTKSHFANLKGTKTRGKLCPTCL-NCTDLDVALGRP--------MC---VGITPSAKA------SILHEVRPVTSG--CFP-IMHD-----RTKIRQLPNLLRGYEKIRLSTQNVINAEKAPGGPYRLGTSGSCPNATSKSGF-FATMAWAVPRDNNKTATNPLTVEVPHICTKEEDQITV---W------GFH-SDNKAQMKN--LYGDSNPQKFTSSANGITTHYVSQ-----------IGGFPDQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVDIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAGEFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMISRDNVSCS-ICL-
>AF387493.1_Influenza_B_virus_(B/Vienna/1/99)_hemagglutinin_mRNA_complete_cds --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGAIPLT------TTPTKSHFANLKGTKTRGKLCPTCL-NCTDLDVALGRP--------MC---VGITPSAKA------SILHEVRPVTSG--CFP-IMHD-----RTKIRQLPNLLRGYEKIRLSTQNVINAEKAPGGPYRLGTSGSCPNATSKSGF-FATMAWAVPRDNNKTATNPLTVEVPHICTKEEDQITV---W------GFH-SDNKTQMKN--LYGDSNPQKFTSSANGITTHYVSQ-----------IGGFPDQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVDIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAGEFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMISRDNVSCS-ICL-
>AF387503.1_Influenza_B_virus_(B/Hong_Kong/157/99)_hemagglutinin_mRNA_complete_cds --MKAIIVLLMVVT-----------SNADRI--CT------------GITSSNSPHVVKTATQ-----GEVNVTGAIPLT------TTPTKSHFANLKGTKTRGKLCPTCL-NCTDLDVALGRP--------MC---VGITPSAKA------SILHEVRPVTSG--CFP-IMHD-----RTKIRQLPNLLRGYEKIRLSTQNVINAEKAPGGPYRLGTSGSCPNATSKSGF-FATMAWAVPRDNNKTATNPLTVEVPHICTKEEDQITV---W------GFH-SDNKTQMKN--LYGDSNPQKFTSSANGITTHYVSQ-----------IGGFPDQTEDGGLPQSGRIVVDYMVQKPGKTGTIVYQRGILLPQKVWCAS---GRSK-VI----------KG-SLPLIG-----------------------------------------EADCLHEKYGGLNKSKPY-YTGEHAKAIGNCPIWVK-TPL-KLANGTKYRPPA-----KLLK-ERGFFGA--------IAGFLEGGWEGMIAGWHGYTSHGAHGVAVAADLKSTQEAINKITKNLNSLSELEVKNLQRLSGAMDELHNE--ILELDEKVDDLRADTISSQIELA-VLLSNEGIINSEDE--HLLALERKLKKML-GPSAVDIG-NGCF----ETKHK-CNQTCLDRIAAGTFNAGEFSLPT-FDSLNITAASLNDDGLD-NHTILLYYSTAASSLAVTLMIAIFIV---YMISRDNVSCS-ICL-
>KX960439-1|APB91561.1|469|Influenza_A_virus_(A/blue-winged_teal/Guatemala/CIP049H108-11/2012(H14N3))_neuraminidase -------------------------MNPNQK--II------------TIGVVNTTLSTIALLI-----GVGNLVFNTVIH------EKIGDHQTVTHPTIT------TPAIPNCSDTIITYNNT---------------VINNITT------TIITEAERPFK---S-PLPLCP--------FRGFFPFHKD-NAIRLGEN--------KDVIVTREPYISCD-NDNCWSFALAQGALL-------GTKH---SNGTIKDRTPYRSLIR---F------PIGTAPVLGNYKE--IC------IAWSSSSCFDGKEWMH-----------VCMTGNDND-----ASAQIIYAGRMTDSIKSWR----KDILRTQESECQC-IDGTCV-VA----------VT-DGP-AANSA--------------------------------------DHRVYWIREGKIIKYEDV--PKTKIQHLEECSCYVDIDVY------------------C----------------------ICRDNWKGSNRPWMRINNETILETGYVCSKFHSDTPRPADPSTMS-CDSPSNVNGGPGVKGFGFKAGN---------------DVWLGRTVST-SGRSGFEIIKVTEGWINSPNHAKSITQTL-VSNNDWSGYSGSF----IVKTKDCFQPCF------------YVELIRGRPNKNDDVSWTS----------------NSIVTFCGLDNEPGS---GNWPDGSNIGF-MPK-
>NC_007366.1_Influenza_A_virus_(A/New_York/392/2004(H3N2))_segment_4_complete_sequence(2) --MKTIIALSYILCLVFAQKLPGNDNSTATL--CL------------GHHAVPNGTIVKTITN-----DQIEVTNATELV------QSSSTGGICDSPHQI-------LDGENCTLIDALLGDP--------QC---DGFQN-KKW------DLFVERSKAYSN--CYPYDVPD-----YASLRSLVASSGTLEFNNESFN-------W-TGVTQNGTSSACK-RRSNNSF-FSRLNWL-TH---LKFKYPA-LNVTMPNNEKFDKLYI---W------GVHHPGTDNDQIS--LYAQASG-RITVSTKRSQQTVIPS-----------IGSRPRIRD-----VPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKI--RSGKSS-IM----------RS-DAP-IG-KC--------------------------------------NSECITPN-GSIPNDKPF--QNVNRITYGACPRYVKQNTL-KLATGMRNVPEK-----Q----TRGIFGA--------IAGFIENGWEGMVDGWYGFRHQNSEGTGQAADLKSTQAAINQINGKLNRLIGKTNEKFHQIEKEFSEVEGR--IQDLEKYVEDTKIDLWSYNAELL-VALENQHTIDLTDS--EMNKLFERTKKQL-RENAEDMG-NGCF----KIYHK-CDNACIGSIRNGTYDHDVYRDEALNNRFQIKGVELKS---G-YKDWILWISFAISCFLLCVALLGFIM---WACQKGNIRCN-ICI-
>L43915.1_Influenza_A_virus_(A/goose/Leipzig/192/7/1979(H7N7))_hemagglutinin_mRNA_complete_cds --MNTQILILALVAIIP--------TNADKI--CL------------GHHAVSNGTKVNTLTE-----RGIEVVNATETV------ERTNIPRICS-KGKR------TIDLGQCGLLGTITGPP--------QC---DQFLE-FSA------DLIIERREGNDV--CYPGEFVN-----EEALRQILRESGGIDKETMGFT--------YSGIRTNGATNACR--RSGSSF-YAEMKWLLSN--TDNAAFPQ-TTKSYKNTRKDPALII---W------GIHHSGSTTEQTK--LYGSGSK-LITVGSSNYQQSFVPS-----------PGARPQVNG-----QSGRIDFHWLMLNPNDTVTFSFNGAFIAPDRASFLR---GKSM-GI----------QS-DVQ-VDANC--------------------------------------EGDCYHNG-GTIISNLPF--QNINSRAVGKCPRYVKQESL-LLATGMKNVPEI-----PKKRKKRGLFGA--------IAGFIENGWEGLVDGWYGFRHQNAQGEGTAADYKSTQSAIDQITGKLNRLIERTNQQFELIDNEFTEVEKQ--LGNVINWTRDSITEVWSYNAELL-VAMENQHTIDLADS--EMNKLYERVRRQL-RENAEEDG-TGCF----EIFHK-CDDDCMASIRNNTYDHSKYREEAMQNRIQIDPVKLSS---G-YKDVILWFSFGASCFILLAIAMGLIF---MCVKNGNMRCT-ICI-
>CY186284 H10N1 --MYKIVVIIALLGAVK---------GLDKI--CL------------GHHAVANGTIVKTLTN-----EQEEVTNATETV------ESTSLNRLC-MKGRN------HKDLGNCHPIGMLIGTP--------VC---DLHLTG-TW------DTLIERENAIAY--CYPGATVN-----EEALRQKIMESGGISKISTGFT-------YGSSINSAGTTKACM-RNGGNSF-YAELKWLVSK--SKGQNFPQ-TTNTYRNTDTAEHLIM---W------GIHHPSSTQEKND--LYGTQSL-SISVGSSTYQNNFVPV-----------VGARPQVNG-----QSGRIDFHWTLVQPGDNITFSHNGGLIAPSRVSKLI---GRGL-GI----------QS-DAP-IDNNC--------------------------------------ESKCFWRE-GSINTRLPF--QNLSPRTVGQCPKYVNKKSL-MLATGMRNVPELM----Q----GRGLFGA--------IAGFIENGWEGMVDGWYGFRHQNAQGTGQAADYKSTQAAIDQITGKLNRLIEKTNTEFESIESEFSEIEHQ--IGNVINWTKDSITDIWTYQAELL-VAMENQHTIDMADS--EMLNLYERVRKQL-RQNAEEDG-KGCF----EIYHA-CDDSCMESIRNNTYDHSQYREEALLNRLNINPVTLSS---G-YKDIILWFSFGASCFVLLAVVMGLVF---FCLKNGNMRCT-ICI-
>2008_B_H1N1_Miyagi -MEAKLLVLFCMFTVLK----------ADTI--CI------------GYHANNSTDTVDTVLE-----KNVTVTHSVNLL------EDNHNGKLCKLNGIA------PLQLGKCNVAGWLLGNP--------EC---DLLLTANSW------SYIIEASNSENGT-CYPGEFID-----YEELREQLSSVSSFEKFEIFPK----ASSWPNHETTKGVTAACS-YSGASSF-YRNLLWI-TK---KGTSYPK-LSKSYTNNKGKEVLVL---W------GVHHPPTTSEQQT--LYQNTDA-YVSVGSSKYNRRFTPE-----------IAARPKVRG-----QAGRMNYYWTLLDQGDTITFEATGNLIAPWYAFALN-KGSDSG-II----------TS-DAP-VY-NC--------------------------------------DTKCQTPH-GAINSTLPF--QNVHPITIGECPKYVKSTKL-RMATGLRNIPSI-----Q----SRGLFGA--------IAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQNAIDGITNKVNSIIEKMNTQFTAVGKEFNNLERR--IENLNKKVDDGFLDVWTYNAELL-ILLENERTLDFHDS--NVRNLYEKVKSQL-RNNAKEIG-NGCF----EFYHK-CDDECMESVKNGTYDYPKYSEESKLNREEIDGVKLES--MG-VYQILAIYSTVASSLVLLVSLGAISF---WMCSNGSLQCR-ICI-