
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139747.5 + phase: 0 /pseudo
(1608 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete              1167  0.0
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...    1162  0.0
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti...     236  4e-73
TC232995                                                                230  3e-60
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ...     227  3e-59
NP004897 gag-protease polyprotein                                       219  8e-57
TC213445                                                                130  2e-49
BM143109                                                                177  3e-44
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl...     167  3e-41
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro...     166  6e-41
AI959950                                                                166  1e-40
AI855982                                                                163  6e-40
CO983516                                                                160  4e-39
CF920770                                                                151  3e-36
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi...     131  2e-30
BI321712                                                                126  7e-29
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag...     123  7e-28
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei...     120  4e-27
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V...     119  1e-26
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)         118  2e-26
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 1167 bits (3019), Expect = 0.0
Identities = 666/1629 (40%), Positives = 934/1629 (56%), Gaps = 31/1629 (1%)
Frame = +1
Query: 4 EGGSSNRPPLFDGSNYYFWKGKMELFLRSQDNDMWAVITDGDFVPTTKE------GAVKA 57
EGG NRPP+ DGSNY +WK +M FL+S D+ W + G P + +K
Sbjct: 16 EGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKP 195
Query: 58 KSAWSTDEKAQVLLNSKARLFLSCALTMEESERVDECTNAKEVWDTLKIHHEGTSHVKET 117
+ W+ +E L NSKA L + ++ CT AK+ W+ LKI HEGTS VK +
Sbjct: 196 EEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMS 375
Query: 118 RIDIGVRKFEVFEMSENETIDEMYARFTTIVNEMRSLGKAYSTHDRIRKILRCLPSVWRP 177
R+ + KFE +M E E I + + I N +LG+ + +RKILR LP +
Sbjct: 376 RLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDM 555
Query: 178 MVTAITQAKDLKSMNLEDLIGSLRAHEVVLQGDKPVKKVKTLALKASQQTPSVADEDVQE 237
VTAI +A+D+ +M +++LIGSL+ E+ L D+ KK K LA ++ DE ++
Sbjct: 556 KVTAIEEAQDICNMRVDELIGSLQTFELGLS-DRAEKKSKNLAFVSN-------DEGEED 711
Query: 238 PQELEEVHEEEAEDELALISKRIQRMMLRRNQIRK--------------KFPKTNISIKT 283
+L+ +E + + L+ K+ +++ R ++ +K K+ K + +K
Sbjct: 712 EYDLDT--DEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRS-DVKP 882
Query: 284 EADKSQVTCYGCNKTGHFKNECPDIKKVQRKPPFKKKAMITWDDMEESDSQEDADTDMGL 343
K + C+GC GH ECP K RK + +ESDS D + G+
Sbjct: 883 SHSKG-IQCHGCEGYGHIIAECPTHLKKHRKG--LSVCQSDTESEQESDSDRDVNALTGI 1053
Query: 344 MAQSDDEEEVIIYKTDSLYKDLENKIDSLLYDSNFLTNRCHSLIKELSEIKEEKEILQNK 403
++D + ++ + +L L S + + L K +++++ EKE + +
Sbjct: 1054 FETAEDSSDT---DSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEE 1224
Query: 404 YDESRKTIKILQDSHFDMSEKQREINRKQKGIMSVPSEVQKENILLKKEVETLKKVLTGF 463
E + + L +M++ + +N+ S+ E +LL K + + GF
Sbjct: 1225 ISELKGEVGFLNSKLENMTKSIKMLNKG--------SDTLDEVLLLGKNAGNQRGL--GF 1374
Query: 464 IKSTE---TFQNIVGSQNESTKKSGLGFKDPSKIIGSFVPKAKIRV-KCCFCDKYGHNES 519
+ T V ++N + + S+ G K+K + +C +C KYGH +
Sbjct: 1375 NPKSAGRTTMTEFVPAKNRTGATMS---QHRSRHHGMQQKKSKRKKWRCHYCGKYGHIKP 1545
Query: 520 ICHVKKKFIKQNNLYLSSERSHLNRSESSQKAEKAKKTCFYCNKSDHKRQNVTFRKDLLE 579
C+ +L H +S +S +K + K HK ++ L
Sbjct: 1546 FCY-----------HLHGHPHHGTQSSNS------RKKMMWVPK--HKAVSLVVHTSL-- 1662
Query: 580 ELTLKDPTLHGYLKFLSCQM*VLPQGARTKPWYLDSGCSRHMTGDRNCFLTFEKKDGGLV 639
+ + + WYLDSGCSRHMTG + L E V
Sbjct: 1663 ------------------------RASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYV 1770
Query: 640 TFGNNDKGKIRGKGTIGNLNSAKIENVQYVEGLKHNLLSISQLCDSGFEVIFKPNICEVR 699
TFG+ KGKI G G + + + V V+GL NL+SISQLCD GF V F + C V
Sbjct: 1771 TFGDGSKGKIIGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVT 1950
Query: 700 QASSNKLFFSGSRRKNLYVLELNDMP-AEFCFMSLEKDKWIWHKRAGHISMKTIAKLSQL 758
S L + N Y+ + + C S E + IWH+R GH+ ++ + K+
Sbjct: 1951 NEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDK 2130
Query: 759 DLVRGLPKISFEKDKICEACVKGKQVKSSFKTIEFISTQKPLELLHIDLFAPVQTASLTG 818
VRG+P + E+ +IC C GKQVK S + ++ +T + LELLH+DL P+Q SL G
Sbjct: 2131 GAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGG 2310
Query: 819 KRYGFVIVDDFSRFTWVLFLKHKDESFEAFQNFCKRVQNEKGYNIITVRSDHGGEFENAS 878
KRY +V+VDDFSRFTWV F++ K E+FE F+ R+Q EK I +RSDHG EFEN+
Sbjct: 2311 KRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSR 2490
Query: 879 FKTFFDENGIKHNFSCARTPQQNGVVERKNRTLQEMARTMLNESNVENYFWAEAINTSCY 938
F F GI H FS A TPQQNG+VERKNRTLQE AR ML+ + WAEA+NT+CY
Sbjct: 2491 FTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACY 2670
Query: 939 ILNRVSIRKVLNKTPYELWKNIKPSISYFHIFGCYCYILNNKEKLGKFDPKSDKAIFLGY 998
I NRV++R+ T YE+WK KPS+ +FHIFG CYIL ++E+ K DPKSD IFLGY
Sbjct: 2671 IHNRVTLRRGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGY 2850
Query: 999 STTSKGYRVYNLKTQTVEISMHIIFDEYDEHSKPKENED--TEAPTLQNVPV--QNTENT 1054
ST S+ YRV+N +T+TV S++++ D+ K ED T + + +N EN+
Sbjct: 2851 STNSRAYRVFNSRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENS 3030
Query: 1055 VEKEDDQNVQDQSLQSPPRSWRMVGDHPTDQIIGSTTDGVRTRLSFQD--NNMAMISQME 1112
D+ N+ +S R +M HP + IIG GV TR + +N +S++E
Sbjct: 3031 DSATDESNINQPDKRSSTRIQKM---HPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIE 3201
Query: 1113 PKSINEAIIDDSWIEVMKEELSQFERNKVWNLVPNNQDKTIIGTRWVFRNKLDEEGKVVR 1172
PK++ EA+ D+ WI M+EEL QF+RN+VW LVP + +IGT+W+F+NK +EEG + R
Sbjct: 3202 PKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITR 3381
Query: 1173 NKARLVAQGYNQQEGIDYDETFAPVARLEAIRILLAYAAHKSIKLFQMDVKSAFLNGFLN 1232
NKARLVAQGY Q EG+D+DETFAPVARLE+IR+LL A KL+QMDVKSAFLNG+LN
Sbjct: 3382 NKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLN 3561
Query: 1233 EEVYVSQPPGFINKEKPNHVFKLTKALYGLKQAPRAWYDRLSTFLIENGFSRGKIDTTLF 1292
EEVYV QP GF + P+HV++L KALYGLKQAPRAWY+RL+ FL + G+ +G ID TLF
Sbjct: 3562 EEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLF 3741
Query: 1293 RKTHNTDLLIVQVYVDDIIFGATKIKMCEEFSNLMQSEFEMSMMGELGFFLGLQIKQHSN 1352
K +L+I Q+YVDDI+FG +M F MQSEFEMS++GEL +FLGLQ+KQ +
Sbjct: 3742 VKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED 3921
Query: 1353 GIFISQEKYIKDILKKYKMNEAKIMSTPMHPSSSLDKDESGKSISEKEYRGMIGSLLYLT 1412
IF+SQ +Y K+I+KK+ M A TP L KDE+G S+ + YR MIGSLLYLT
Sbjct: 3922 SIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLT 4101
Query: 1413 ASRPDIVFAVGLCARFQTCAKESHLTAVKRIFRYLVGTTDLGLWYRKGSSFDLVAYCDAD 1472
ASRPDI +AVG+CAR+Q K SHLT VKRI +Y+ GT+D G+ Y S+ LV YCDAD
Sbjct: 4102 ASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDAD 4281
Query: 1473 YAGDKVERKSTSGSCQFLGQALIGWSCRKQNTIELSTTEAEYVSAASCCSQILWVRNQLE 1532
+AG +RKSTSG C +LG LI W +KQN + LST EAEY++A S CSQ++W++ L+
Sbjct: 4282 WAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLK 4461
Query: 1533 DYSLRYTSVPIYCDNTSAINLSKNPIQHSRSKHIEIKHHFIRDHVQKKNIALSFVDTENQ 1592
+Y++ + +YCDN SAIN+SKNP+QHSR+KHI+I+HH+IRD V K I L VDTE Q
Sbjct: 4462 EYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQ 4641
Query: 1593 LADIFTKPL 1601
+ADIFTK L
Sbjct: 4642 IADIFTKAL 4668
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 1162 bits (3007), Expect = 0.0
Identities = 657/1632 (40%), Positives = 921/1632 (56%), Gaps = 34/1632 (2%)
Frame = +1
Query: 4 EGGSSNRPPLFDGSNYYFWKGKMELFLRSQDNDMWAVITDGDFVPTTKE------GAVKA 57
EGG NRPP+ DG+NY +WK +M FL+S D+ W + G P + +K
Sbjct: 16 EGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKP 195
Query: 58 KSAWSTDEKAQVLLNSKARLFLSCALTMEESERVDECTNAKEVWDTLKIHHEGTSHVKET 117
+ W+ +E L NSKA L + ++ CT AK+ W+ LK HEGTS VK +
Sbjct: 196 EEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMS 375
Query: 118 RIDIGVRKFEVFEMSENETIDEMYARFTTIVNEMRSLGKAYSTHDRIRKILRCLPSVWRP 177
R+ + KFE +M E E I + + I N +LG+ + +RKILR LP +
Sbjct: 376 RLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDM 555
Query: 178 MVTAITQAKDLKSMNLEDLIGSLRAHEVVLQGDKPVKKVKTLALKASQQTPSVADEDVQE 237
VTAI +A+D+ +M +++LIGSL+ E+ L D+ KK K LA ++ DE ++
Sbjct: 556 KVTAIEEAQDICNMRVDELIGSLQTFELGLS-DRTEKKSKNLAFVSN-------DEGEED 711
Query: 238 PQELEEVHEEEAEDELALISKRIQRMMLRRNQ------------IRKKFPKTNISIKTEA 285
+L+ +E + + L+ K+ +++ R ++ IRK S + +
Sbjct: 712 EYDLDT--DEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKKSDEKPS 885
Query: 286 DKSQVTCYGCNKTGHFKNECPDIKKVQRKPPFKKKAMITWDDMEESDSQEDADTDMGLMA 345
+ C+GC GH K ECP K QRK + D ES+ + D+D D+ +
Sbjct: 886 HSKGIQCHGCEGYGHIKAECPTHLKKQRKG-----LSVCRSDDTESEQESDSDRDVNALT 1050
Query: 346 QSDDEEEVIIYKTDSLYKDLENKIDSLLYDSNFLTNRCHSLIKELSEIKEEKEILQNKYD 405
+ E DS D E D L L + ++++ +++K
Sbjct: 1051 GRFESAE------DSSDTDSEITFDELAISYRELCIKSEKILQQEAQLK----------- 1179
Query: 406 ESRKTIKILQDSHFDMSEKQREINRKQKGIMSVPSEVQKENILLKKEVETLKKVLTGFIK 465
K++ + + + EI SE++ E L ++E + K + K
Sbjct: 1180 ------KVIANLEAEKEAHEEEI-----------SELKGEVGFLNSKLENMTKSIKMLNK 1308
Query: 466 STETFQNIVGSQNESTKKSGLGFKDPSKIIGSFVPKAKIRVKCCFCDKYGHNESICHVKK 525
++ ++ + GLGF H +
Sbjct: 1309 GSDMLDEVLQLGKNVGNQRGLGFN--------------------------HKSAGRTTMT 1410
Query: 526 KFIKQNNLYLSSERSHLNRSESSQ--KAEKAKKTCFYCNKSDHKRQNVTFRKDLLEELTL 583
+F+ N ++ H +R +Q K+++ K C YC K H + T
Sbjct: 1411 EFVPAKNSTGATMSQHRSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQ 1590
Query: 584 KDPTLHGYL-----KFLSCQM*VLPQGARTKPWYLDSGCSRHMTGDRNCFLTFEKKDGGL 638
+ + K +S + + + + WYLDSGCSRHMTG + + E
Sbjct: 1591 SSSSGRKMMWVPKHKIVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSY 1770
Query: 639 VTFGNNDKGKIRGKGTIGNLNSAKIENVQYVEGLKHNLLSISQLCDSGFEVIFKPNICEV 698
VTFG+ KGKI G G + + + V V+GL NL+SISQLCD GF V F + C V
Sbjct: 1771 VTFGDGSKGKITGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLV 1950
Query: 699 RQASSNKLFFSGSRRKNLYVLELNDMP-AEFCFMSLEKDKWIWHKRAGHISMKTIAKLSQ 757
S L + N Y+ + + C S E + IWH+R GH+ ++ + K+
Sbjct: 1951 TNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIID 2130
Query: 758 LDLVRGLPKISFEKDKICEACVKGKQVKSSFKTIEFISTQKPLELLHIDLFAPVQTASLT 817
VRG+P + E+ +IC C GKQVK S + ++ +T + LELLH+DL P+Q SL
Sbjct: 2131 KGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLG 2310
Query: 818 GKRYGFVIVDDFSRFTWVLFLKHKDESFEAFQNFCKRVQNEKGYNIITVRSDHGGEFENA 877
GKRY +V+VDDFSRFTWV F++ K ++FE F+ R+Q EK I +RSDHG EFEN+
Sbjct: 2311 GKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENS 2490
Query: 878 SFKTFFDENGIKHNFSCARTPQQNGVVERKNRTLQEMARTMLNESNVENYFWAEAINTSC 937
F F GI H FS A TPQQNG+VERKNRTLQE AR ML+ + WAEA+NT+C
Sbjct: 2491 KFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTAC 2670
Query: 938 YILNRVSIRKVLNKTPYELWKNIKPSISYFHIFGCYCYILNNKEKLGKFDPKSDKAIFLG 997
YI NRV++R+ T YE+WK KP++ +FHIFG CYIL ++E+ K DPKSD IFLG
Sbjct: 2671 YIHNRVTLRRGTPTTLYEIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLG 2850
Query: 998 YSTTSKGYRVYNLKTQTVEISMHIIFDEYDEHSKPKENEDTE------APTLQNVPVQNT 1051
YST S+ YRV+N +T+TV S++++ D+ K ED A T ++ +N
Sbjct: 2851 YSTNSRAYRVFNSRTRTVMESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSA--ENA 3024
Query: 1052 ENTVEKEDDQNVQDQSLQSPPRSWRMVGDHPTDQIIGSTTDGVRTRLSFQD--NNMAMIS 1109
EN+ D+ N+ + R +M HP + IIG GV TR + +N +S
Sbjct: 3025 ENSDSATDEPNINQPDKRPSIRIQKM---HPKELIIGDPNRGVTTRSREIEIVSNSCFVS 3195
Query: 1110 QMEPKSINEAIIDDSWIEVMKEELSQFERNKVWNLVPNNQDKTIIGTRWVFRNKLDEEGK 1169
++EPK++ EA+ D+ WI M+EEL QF+RN+VW LVP + +IGT+W+F+NK +EEG
Sbjct: 3196 KIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGV 3375
Query: 1170 VVRNKARLVAQGYNQQEGIDYDETFAPVARLEAIRILLAYAAHKSIKLFQMDVKSAFLNG 1229
+ RNKARLVAQGY Q EG+D+DETFAPVARLE+IR+LL A KL+QMDVKSAFLNG
Sbjct: 3376 ITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNG 3555
Query: 1230 FLNEEVYVSQPPGFINKEKPNHVFKLTKALYGLKQAPRAWYDRLSTFLIENGFSRGKIDT 1289
+LNEE YV QP GF++ P+HV++L KALYGLKQAPRAWY+RL+ FL + G+ +G ID
Sbjct: 3556 YLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDK 3735
Query: 1290 TLFRKTHNTDLLIVQVYVDDIIFGATKIKMCEEFSNLMQSEFEMSMMGELGFFLGLQIKQ 1349
TLF K +L+I Q+YVDDI+FG +M F MQSEFEMS++GEL +FLGLQ+KQ
Sbjct: 3736 TLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQ 3915
Query: 1350 HSNGIFISQEKYIKDILKKYKMNEAKIMSTPMHPSSSLDKDESGKSISEKEYRGMIGSLL 1409
+ IF+SQ KY K+I+KK+ M A TP L KDE+G S+ + YR MIGSLL
Sbjct: 3916 MEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLL 4095
Query: 1410 YLTASRPDIVFAVGLCARFQTCAKESHLTAVKRIFRYLVGTTDLGLWYRKGSSFDLVAYC 1469
YLTASRPDI +AVG+CAR+Q K SHL VKRI +Y+ GT+D G+ Y S LV YC
Sbjct: 4096 YLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYC 4275
Query: 1470 DADYAGDKVERKSTSGSCQFLGQALIGWSCRKQNTIELSTTEAEYVSAASCCSQILWVRN 1529
DAD+AG +RKSTSG C +LG LI W +KQN + LST EAEY++A S CSQ++W++
Sbjct: 4276 DADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQ 4455
Query: 1530 QLEDYSLRYTSVPIYCDNTSAINLSKNPIQHSRSKHIEIKHHFIRDHVQKKNIALSFVDT 1589
L++Y++ + +YCDN SAIN+SKNP+QHSR+KHI+I+HH+IRD V K I L VDT
Sbjct: 4456 MLKEYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDT 4635
Query: 1590 ENQLADIFTKPL 1601
E Q+ADIFTK L
Sbjct: 4636 EEQIADIFTKAL 4671
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 236 bits (602), Expect(2) = 4e-73
Identities = 121/184 (65%), Positives = 141/184 (75%)
Frame = +3
Query: 1301 LIVQVYVDDIIFGATKIKMCEEFSNLMQSEFEMSMMGELGFFLGLQIKQHSNGIFISQEK 1360
LI+ +YVDDIIFGAT +MC+EF LM+ FE SM GEL F LGLQI Q GIFI QEK
Sbjct: 489 LIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQEK 668
Query: 1361 YIKDILKKYKMNEAKIMSTPMHPSSSLDKDESGKSISEKEYRGMIGSLLYLTASRPDIVF 1420
Y K LK+++M+EAK M+TPMH S+ +DKDE G S KEY GMI SL YLT+SRPDIVF
Sbjct: 669 YTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIVF 848
Query: 1421 AVGLCARFQTCAKESHLTAVKRIFRYLVGTTDLGLWYRKGSSFDLVAYCDADYAGDKVER 1480
V LCARFQ+ K SH+TAVKRI RYLVGTT+ LW++K S FDL+ YCD +AGDKVER
Sbjct: 849 VVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVER 1028
Query: 1481 KSTS 1484
KSTS
Sbjct: 1029 KSTS 1040
Score = 59.3 bits (142), Expect(2) = 4e-73
Identities = 27/47 (57%), Positives = 33/47 (69%)
Frame = +2
Query: 1254 KLTKALYGLKQAPRAWYDRLSTFLIENGFSRGKIDTTLFRKTHNTDL 1300
K +YGLKQA RAWY+RLS+FL+ NGF+RG D LFRK +L
Sbjct: 347 KTLSCVYGLKQALRAWYERLSSFLVSNGFTRGITDPALFRKAQKGNL 487
>TC232995
Length = 1009
Score = 230 bits (587), Expect = 3e-60
Identities = 113/173 (65%), Positives = 138/173 (79%)
Frame = +2
Query: 1237 VSQPPGFINKEKPNHVFKLTKALYGLKQAPRAWYDRLSTFLIENGFSRGKIDTTLFRKTH 1296
V QPPGF +KPNHV+KL KALYGLKQAPRAWY+RLS FL+E FSRGK+DTTLF K
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 1297 NTDLLIVQVYVDDIIFGATKIKMCEEFSNLMQSEFEMSMMGELGFFLGLQIKQHSNGIFI 1356
+ D+L+VQ+YVDDIIFG+T +C+EFS MQSEFEMSMMGEL +FLGLQIKQ GIFI
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 1357 SQEKYIKDILKKYKMNEAKIMSTPMHPSSSLDKDESGKSISEKEYRGMIGSLL 1409
+Q KY K+++K++ M+ AK MSTPM + LDKDESG+SI K+YR IG ++
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 227 bits (579), Expect = 3e-59
Identities = 106/151 (70%), Positives = 128/151 (84%)
Frame = -3
Query: 1129 MKEELSQFERNKVWNLVPNNQDKTIIGTRWVFRNKLDEEGKVVRNKARLVAQGYNQQEGI 1188
M+EEL+QFERN VW LV ++ +IGT+WVFRNKLDE G ++RNKARLVA+GYNQ+EGI
Sbjct: 455 MQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEGI 276
Query: 1189 DYDETFAPVARLEAIRILLAYAAHKSIKLFQMDVKSAFLNGFLNEEVYVSQPPGFINKEK 1248
DY+ET+APVARLE IR+LLAY + + KL+QMDVKSAFLNG + EEVYV QPPGF +K
Sbjct: 275 DYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPDK 96
Query: 1249 PNHVFKLTKALYGLKQAPRAWYDRLSTFLIE 1279
P HV+KL KALYGLKQAPRAWY+R+S FL+E
Sbjct: 95 PTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>NP004897 gag-protease polyprotein
Length = 1923
Score = 219 bits (558), Expect = 8e-57
Identities = 189/688 (27%), Positives = 302/688 (43%), Gaps = 36/688 (5%)
Frame = +1
Query: 4 EGGSSNRPPLFDGSNYYFWKGKMELFLRSQDNDMW-AVITDGDFVPTTK-EG----AVKA 57
EGG NRPP+ DG+NY +WK +M FL+S D+ W AVI D + EG +K
Sbjct: 16 EGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDGLKP 195
Query: 58 KSAWSTDEKAQVLLNSKARLFLSCALTMEESERVDECTNAKEVWDTLKIHHEGTSHVKET 117
+ W+ +E L NSKA L + ++ CT AK+ W+ LK HEGTS VK +
Sbjct: 196 EEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMS 375
Query: 118 RIDIGVRKFEVFEMSENETIDEMYARFTTIVNEMRSLGKAYSTHDRIRKILRCLPSVWRP 177
R+ + KFE +M E E I + + I N +LG+ + +RKILR LP +
Sbjct: 376 RLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDM 555
Query: 178 MVTAITQAKDLKSMNLEDLIGSLRAHEVVLQGDKPVKKVKTLALKASQQTPSVADEDVQE 237
VTAI +A+D+ ++ +++LIGSL+ E+ L D+ KK K LA ++ DE ++
Sbjct: 556 KVTAIEEAQDICNLRVDELIGSLQTFELGL-SDRTEKKSKNLAFVSN-------DEGEED 711
Query: 238 PQELEEVHEEEAEDELALISKRIQRMMLRRNQ------------IRKKFPKTNISIKTEA 285
+L+ +E + + L+ K+ +++ R ++ IRK S + +
Sbjct: 712 EYDLDT--DEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKRSDEKPS 885
Query: 286 DKSQVTCYGCNKTGHFKNECPDIKKVQRKPPFKKKAMITWDDMEESDSQEDADTDMGLMA 345
C+GC GH K ECP K QR K + D ES+ + D+D D+ +
Sbjct: 886 HSKGFQCHGCEGYGHIKAECPTHLKKQR-----KGLSVCRSDDTESEQESDSDRDVNALT 1050
Query: 346 --------QSDDEEEVIIYKTDSLYKDLENKIDSLLYDSNFLTNRCHSL-------IKEL 390
SD + E+ + + Y++L K + +L L +L +E+
Sbjct: 1051GRFESAEDSSDTDSEITFDELATSYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEI 1230
Query: 391 SEIKEEKEILQNKYDESRKTIKILQDSHFDMSEKQREINR---KQKGIMSVPSEVQKENI 447
SE+K E L +K + K+IK+L DM ++ ++ + Q+G+ + I
Sbjct: 1231SELKGEVGFLNSKLENMTKSIKMLNKGS-DMLDEVLQLGKNVGNQRGLGF--NHKSAGRI 1401
Query: 448 LLKKEVETLKKVLTGFIKSTETFQNIVGSQNESTKKSGLGFKDPSKIIGSFVPKAKIRVK 507
+ + V K+ TG S ++ Q +S +K + +
Sbjct: 1402TMTEFVPA--KISTGATMSQHRSRHHGTQQKKSKRK---------------------KWR 1512
Query: 508 CCFCDKYGHNESICHVKKKFIKQNNLYLSSERSHLNRSESSQKAEKAKKTCFYCNKSDHK 567
C +C KYGH + C+ +L H +S SS ++ + K HK
Sbjct: 1513CHYCGKYGHIKPFCY-----------HLHGHPHHGTQSSSS------RRKMMWVPK--HK 1635
Query: 568 RQNVTFRKDLLEELTLKDPTLHGYLKFLSCQM*VLPQGARTKPWYLDSGCSRHMTGDRNC 627
++ L + + + WYLDSGCSRHMTG +
Sbjct: 1636IVSLVVHTSL--------------------------RASAKEDWYLDSGCSRHMTGVKEF 1737
Query: 628 FLTFEKKDGGLVTFGNNDKGKIRGKGTI 655
+ E VTFG+ KGKI G G +
Sbjct: 1738LVNIEPCSTSYVTFGDGSKGKITGMGKL 1821
>TC213445
Length = 705
Score = 130 bits (326), Expect(2) = 2e-49
Identities = 59/98 (60%), Positives = 76/98 (77%)
Frame = +1
Query: 1477 KVERKSTSGSCQFLGQALIGWSCRKQNTIELSTTEAEYVSAASCCSQILWVRNQLEDYSL 1536
K +R+STS +C F+G AL+ W +KQN++ LST EAEY+SA S +QI W+R QL DY L
Sbjct: 400 KTDRESTSDTCHFIGSALVSWHSKKQNSVVLSTAEAEYISARSYYAQIFWMRQQLFDYGL 579
Query: 1537 RYTSVPIYCDNTSAINLSKNPIQHSRSKHIEIKHHFIR 1574
+ +PI CDNTSAINLSKN I +SR+KHIEI+HHF+R
Sbjct: 580 KLDHIPIRCDNTSAINLSKNHILYSRTKHIEIRHHFLR 693
Score = 86.3 bits (212), Expect(2) = 2e-49
Identities = 40/68 (58%), Positives = 51/68 (74%)
Frame = +2
Query: 1404 MIGSLLYLTASRPDIVFAVGLCARFQTCAKESHLTAVKRIFRYLVGTTDLGLWYRKGSSF 1463
MI S LYL+ SRP I+F+V +C R+Q KESHL+ +KRI RYL+G +LGLWY K SS+
Sbjct: 197 MIESFLYLSTSRPHIMFSVCMCVRYQANPKESHLSVIKRIMRYLLGIINLGLWYPKNSSY 376
Query: 1464 DLVAYCDA 1471
+LV Y DA
Sbjct: 377 NLVGYSDA 400
>BM143109
Length = 415
Score = 177 bits (449), Expect = 3e-44
Identities = 87/137 (63%), Positives = 104/137 (75%)
Frame = +1
Query: 1239 QPPGFINKEKPNHVFKLTKALYGLKQAPRAWYDRLSTFLIENGFSRGKIDTTLFRKTHNT 1298
QPP N EKPNHVFKL K LYGLKQA RAWY+ LS FL++ GFS+GK+DT LF
Sbjct: 4 QPPVRKNSEKPNHVFKLKKVLYGLKQALRAWYELLSKFLLDKGFSKGKVDTNLFI*KKLN 183
Query: 1299 DLLIVQVYVDDIIFGATKIKMCEEFSNLMQSEFEMSMMGELGFFLGLQIKQHSNGIFISQ 1358
D+L+VQ+YVDDIIFG+T +C++FS MQ+EFEMSMM EL FFLGLQIKQ NGIFISQ
Sbjct: 184 DILLVQIYVDDIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNGIFISQ 363
Query: 1359 EKYIKDILKKYKMNEAK 1375
KY KD++ ++ M K
Sbjct: 364 SKYCKDLIHRFGMENDK 414
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 167 bits (423), Expect = 3e-41
Identities = 78/111 (70%), Positives = 97/111 (87%)
Frame = +3
Query: 1112 EPKSINEAIIDDSWIEVMKEELSQFERNKVWNLVPNNQDKTIIGTRWVFRNKLDEEGKVV 1171
EPK+I EAI+DD+WI VM+EEL+QFERN VW LV ++ +IGT+WVFRNKLDE G ++
Sbjct: 3 EPKNIKEAIVDDNWIIVMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIII 182
Query: 1172 RNKARLVAQGYNQQEGIDYDETFAPVARLEAIRILLAYAAHKSIKLFQMDV 1222
RNKARLVA+GYNQ+EGIDY+ET+APVARLEAIR+LLAYA+ + KL+QMDV
Sbjct: 183 RNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMNFKLYQMDV 335
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 166 bits (421), Expect = 6e-41
Identities = 91/212 (42%), Positives = 126/212 (58%), Gaps = 4/212 (1%)
Frame = +1
Query: 1394 KSISEKEYRGMIGSLLYLTASRPDIVFAVGLCARFQTCAKESHLTAVKRIFRYLVGTTDL 1453
+ + E+R +IGSL YL SRP+I FAV L +RF + SH+ A KR+ R + GT
Sbjct: 1 RGVDVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGS 180
Query: 1454 GLWY---RKGSSFDLVAYCDADYAGDKVERKSTSGSCQFLGQALIGWSCRKQNTIELSTT 1510
G+ + K DL+ Y D+D+ D + KST G A + S +KQ+ I LST
Sbjct: 181 GVLFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTC 360
Query: 1511 EAEYVSAASCCSQILWVRNQLEDYSLRYTS-VPIYCDNTSAINLSKNPIQHSRSKHIEIK 1569
EAEYV+A+ Q +W+ N LE+ LR V + DN SAINL+K+P H RSKHIE++
Sbjct: 361 EAEYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELR 540
Query: 1570 HHFIRDHVQKKNIALSFVDTENQLADIFTKPL 1601
H+IRD V K N+ + + E QLAD+ TKP+
Sbjct: 541 FHYIRDQVSKGNVTVEYCKAEEQLADLMTKPI 636
>AI959950
Length = 466
Score = 166 bits (419), Expect = 1e-40
Identities = 81/132 (61%), Positives = 104/132 (78%)
Frame = -1
Query: 1126 IEVMKEELSQFERNKVWNLVPNNQDKTIIGTRWVFRNKLDEEGKVVRNKARLVAQGYNQQ 1185
++ M+EEL QF++N V LV + K ++G +W+F NKLDE+GKVVR KARLVA+GY+QQ
Sbjct: 397 MKAMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQ 218
Query: 1186 EGIDYDETFAPVARLEAIRILLAYAAHKSIKLFQMDVKSAFLNGFLNEEVYVSQPPGFIN 1245
EGIDY +TFA VARLE I ILL++A + ++KL+QMDVKSAFLNG + +EVYV QPPGF N
Sbjct: 217 EGIDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFEN 38
Query: 1246 KEKPNHVFKLTK 1257
+ HVFKL K
Sbjct: 37 ETLHQHVFKLNK 2
>AI855982
Length = 484
Score = 163 bits (412), Expect = 6e-40
Identities = 84/161 (52%), Positives = 110/161 (68%), Gaps = 2/161 (1%)
Frame = +2
Query: 1082 PTDQIIGSTTDGVRTRLSFQD--NNMAMISQMEPKSINEAIIDDSWIEVMKEELSQFERN 1139
P D IIG + GV TR S +D NNMA +S +EPK+I EAI+DD+WI M+EEL+QFERN
Sbjct: 2 PLDNIIGDISKGVTTRHSLKDLCNNMAFVSMIEPKNIKEAIVDDNWIIAMQEELNQFERN 181
Query: 1140 KVWNLVPNNQDKTIIGTRWVFRNKLDEEGKVVRNKARLVAQGYNQQEGIDYDETFAPVAR 1199
VW LV + +I T+WVFRNKLDE ++ +KARLVA+GYNQ +G+DY+ T+A +AR
Sbjct: 182 NVWKLVEKPDNYPVI*TKWVFRNKLDEHRIIIIHKARLVAEGYNQVDGLDYEHTYASIAR 361
Query: 1200 LEAIRILLAYAAHKSIKLFQMDVKSAFLNGFLNEEVYVSQP 1240
L I + L+Y + L+ SA L+G L EVYV QP
Sbjct: 362 L*VIIMPLSYVYIMNSTLYHYACVSALLHGLLLHEVYVDQP 484
>CO983516
Length = 724
Score = 160 bits (405), Expect = 4e-39
Identities = 77/123 (62%), Positives = 95/123 (76%)
Frame = +2
Query: 1191 DETFAPVARLEAIRILLAYAAHKSIKLFQMDVKSAFLNGFLNEEVYVSQPPGFINKEKPN 1250
D+ F PVARLE+IR+LL A KL+QMDVKSAFLNG+LNEEVYV QP GFI+ P+
Sbjct: 356 DKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPD 535
Query: 1251 HVFKLTKALYGLKQAPRAWYDRLSTFLIENGFSRGKIDTTLFRKTHNTDLLIVQVYVDDI 1310
HV++L KALYGLKQAPRAWY+RL+ L + G+ +G ID TLF K +L+I Q+YVDDI
Sbjct: 536 HVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDI 715
Query: 1311 IFG 1313
+FG
Sbjct: 716 VFG 724
>CF920770
Length = 581
Score = 151 bits (381), Expect = 3e-36
Identities = 76/187 (40%), Positives = 113/187 (59%), Gaps = 13/187 (6%)
Frame = -2
Query: 34 DNDMWAVITDGDFVPTTKEGAV-------------KAKSAWSTDEKAQVLLNSKARLFLS 80
D ++W I G ++PTT E K + WS +++ +V N KA+ ++
Sbjct: 574 DLNIWEAIEIGPYIPTTVERVSIDGSSSSESITIEKPRDRWSEEDRKRVQYNLKAKNIIT 395
Query: 81 CALTMEESERVDECTNAKEVWDTLKIHHEGTSHVKETRIDIGVRKFEVFEMSENETIDEM 140
AL M+E RV C +AKE+WDTL++ HEGT+ VK +RI+ ++E+F M+ NE I M
Sbjct: 394 SALGMDEYFRVSNCKSAKEMWDTLRLTHEGTTDVKRSRINALTHEYELFRMNTNENIQSM 215
Query: 141 YARFTTIVNEMRSLGKAYSTHDRIRKILRCLPSVWRPMVTAITQAKDLKSMNLEDLIGSL 200
RFT IVN + +LGK + D I K+LRCL W+P VTAI++++DL +M+L L G L
Sbjct: 214 QKRFTHIVNHLAALGKEFQNEDLINKVLRCLSREWQPKVTAISESRDLSNMSLATLFGKL 35
Query: 201 RAHEVVL 207
+ HE+ L
Sbjct: 34 QEHEMEL 14
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 131 bits (330), Expect = 2e-30
Identities = 66/130 (50%), Positives = 91/130 (69%)
Frame = -2
Query: 1145 VPNNQDKTIIGTRWVFRNKLDEEGKVVRNKARLVAQGYNQQEGIDYDETFAPVARLEAIR 1204
VP KT +G RWV+ K+ G+V R KARLVA+GY Q GIDY +TF+PVA+L +R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 1205 ILLAYAAHKSIKLFQMDVKSAFLNGFLNEEVYVSQPPGFINKEKPNHVFKLTKALYGLKQ 1264
+ LA AA L Q+D+K+AFL+G L E++Y+ QPPGF+ + + V KL ++LYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1265 APRAWYDRLS 1274
+PRAW+ + S
Sbjct: 46 SPRAWFGKFS 17
>BI321712
Length = 399
Score = 126 bits (317), Expect = 7e-29
Identities = 58/124 (46%), Positives = 86/124 (68%)
Frame = -3
Query: 1305 VYVDDIIFGATKIKMCEEFSNLMQSEFEMSMMGELGFFLGLQIKQHSNGIFISQEKYIKD 1364
+YVDD+IF M EEF M +EFEM+ MG + ++LG+++KQ GIFI+QE Y K+
Sbjct: 379 LYVDDLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQEDKGIFITQEGYAKE 200
Query: 1365 ILKKYKMNEAKIMSTPMHPSSSLDKDESGKSISEKEYRGMIGSLLYLTASRPDIVFAVGL 1424
+LKK+KM++A + TPM S L K E G+++ Y+ +IGSL YLT +RPDI++ VG+
Sbjct: 199 VLKKFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLTCTRPDILYVVGV 20
Query: 1425 CARF 1428
+R+
Sbjct: 19 VSRY 8
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 123 bits (308), Expect = 7e-28
Identities = 60/146 (41%), Positives = 93/146 (63%), Gaps = 3/146 (2%)
Frame = +2
Query: 1459 KGSSFDLVAYCDADYAGDKVERKSTSGSCQFLGQALIGWSCRKQNTIELSTTEAEYVSAA 1518
KG++ L YCDAD+AG ++R+STSG C F+G L+ W +KQ + S+ EAEY S A
Sbjct: 2 KGNT-QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMA 178
Query: 1519 SCCSQILWVRNQLEDYSLRY---TSVPIYCDNTSAINLSKNPIQHSRSKHIEIKHHFIRD 1575
+++W++ L++ LR+ + +YCDN +A++++ NP+ H R+KHIEI HFIR+
Sbjct: 179 MVTCELMWIKQFLQE--LRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIRE 352
Query: 1576 HVQKKNIALSFVDTENQLADIFTKPL 1601
+ K I F+ + +Q DI TK L
Sbjct: 353 KLLSKEIVTEFIGSNDQPVDILTKSL 430
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 120 bits (302), Expect = 4e-27
Identities = 84/286 (29%), Positives = 140/286 (48%), Gaps = 1/286 (0%)
Frame = +2
Query: 639 VTFGNNDKGKIRGKGTIGNLNSAKIENVQYVEGLKHNLLSISQLCD-SGFEVIFKPNICE 697
+T + + G G + +S + +V ++ G N+ S+SQL V F N
Sbjct: 20 ITLADGSRVVATGIGHVSPTSSLSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDANSFV 199
Query: 698 VRQASSNKLFFSGSRRKNLYVLELNDMPAEFCFMSLEKDKWIWHKRAGHISMKTIAKLSQ 757
+++ + G LY L+ N + ++ K + H+R GH LS+
Sbjct: 200 IQECGTGWTIGVGIESHGLYYLKPN---LSWVCSAVTSPKLL-HERLGH------PHLSK 349
Query: 758 LDLVRGLPKISFEKDKICEACVKGKQVKSSFKTIEFISTQKPLELLHIDLFAPVQTASLT 817
L ++ +P + KD CE+C GK V+SS + +E P ++H D++ P + +S++
Sbjct: 350 LKIM--VPSLEKIKDLFCESCQLGKHVRSSXRHVES-RVDSPFLVIHXDIWGPNRVSSMS 520
Query: 818 GKRYGFVIVDDFSRFTWVLFLKHKDESFEAFQNFCKRVQNEKGYNIITVRSDHGGEFENA 877
RY +D+FS+ T V +K + E +F +++ + G I +RSD+ E+ ++
Sbjct: 521 -YRYFVTFIDEFSQCTRVFLMKERSEIL-SFLTSVNKIKTQFGKTIKILRSDNAKEYFSS 694
Query: 878 SFKTFFDENGIKHNFSCARTPQQNGVVERKNRTLQEMARTMLNESN 923
F GI H FSC TPQQN + ERKNR L E ART+L +N
Sbjct: 695 VISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHAN 832
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 119 bits (297), Expect = 1e-26
Identities = 59/128 (46%), Positives = 81/128 (63%)
Frame = +3
Query: 1113 PKSINEAIIDDSWIEVMKEELSQFERNKVWNLVPNNQDKTIIGTRWVFRNKLDEEGKVVR 1172
P +I EA+ W + M +E+ E N W LVP KT +G RWV+ K+ GKV R
Sbjct: 21 PSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVDR 200
Query: 1173 NKARLVAQGYNQQEGIDYDETFAPVARLEAIRILLAYAAHKSIKLFQMDVKSAFLNGFLN 1232
KARLVA+GY Q GI+Y +TF+PV L +R+ LA AA + L Q+D+K+AFL+G L
Sbjct: 201 LKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDLE 380
Query: 1233 EEVYVSQP 1240
E++Y+ QP
Sbjct: 381 EDIYMEQP 404
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 118 bits (296), Expect = 2e-26
Identities = 56/124 (45%), Positives = 84/124 (67%), Gaps = 1/124 (0%)
Frame = +3
Query: 1479 ERKSTSGSCQFLGQALIGWSCRKQNTIELSTTEAEYVSAASCCSQILWVRNQLEDYSL-R 1537
+RKST+G F+G W +KQ + LST EAEYV+A SC +W+RN L++ + +
Sbjct: 12 DRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMPQ 191
Query: 1538 YTSVPIYCDNTSAINLSKNPIQHSRSKHIEIKHHFIRDHVQKKNIALSFVDTENQLADIF 1597
+ I DN SA+ L+KNP+ H +SKHI+ ++HFIR+ ++KK + L +V +++Q ADIF
Sbjct: 192 EEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADIF 371
Query: 1598 TKPL 1601
TKPL
Sbjct: 372 TKPL 383
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.134 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,962,243
Number of Sequences: 63676
Number of extensions: 917526
Number of successful extensions: 5070
Number of sequences better than 10.0: 168
Number of HSP's better than 10.0 without gapping: 4879
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5013
length of query: 1608
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1498
effective length of database: 5,635,272
effective search space: 8441637456
effective search space used: 8441637456
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 66 (30.0 bits)
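The length and search-space figures in the statistics above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes that TBLASTN counts the database in translated residues (nucleotide letters divided by 3), that each effective length is the raw length minus the effective HSP length (once for the query, once per database sequence), and that bit scores follow the usual Karlin-Altschul conversion (lambda * S - ln K) / ln 2. All variable and function names are illustrative.

```python
import math

# Values copied from the statistics block of this report.
query_len = 1608            # length of query
db_letters_nt = 37_918_896  # number of letters in database (nucleotides)
num_seqs = 63_676           # number of sequences in database
hsp_len = 110               # effective HSP length

# Assumption: TBLASTN reports the database length in translated residues.
db_len_aa = db_letters_nt // 3               # 12,639,632

# Assumption: effective length = raw length minus the HSP length correction.
eff_query = query_len - hsp_len              # 1,498
eff_db = db_len_aa - num_seqs * hsp_len      # 5,635,272
search_space = eff_query * eff_db            # 8,441,637,456


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Gapped parameters from the report: Lambda = 0.267, K = 0.0410.
s2_bits = bit_score(66, 0.267, 0.0410)       # close to the reported 30.0 bits

print(db_len_aa, eff_query, eff_db, search_space, round(s2_bits, 1))
```

Under these assumptions the computed values agree with the "length of database", "effective length of query", "effective length of database" and "effective search space" lines. Because Lambda and K are printed to only three significant digits, bit scores recomputed for large raw scores can differ from the reported values by up to about one bit.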
Medicago: description of AC139747.5