
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0134.9
(1610 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 613 e-175
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 608 e-174
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 232 1e-60
TC232995 226 6e-59
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 216 6e-56
TC213445 124 4e-47
BM143109 173 6e-43
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 171 3e-42
AI959950 169 1e-41
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 160 5e-39
CO983516 152 9e-37
NP004897 gag-protease polyprotein 150 3e-36
AI855982 148 2e-35
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 140 3e-33
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 127 4e-29
BU549979 126 9e-29
CO982036 124 3e-28
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 118 2e-26
BG508993 118 2e-26
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 114 5e-25
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 613 bits (1580), Expect = e-175
Identities = 301/573 (52%), Positives = 410/573 (71%)
Frame = +1
Query: 1027 GFNVSDKGKAPEEVEPEEDEPEEEAGPSNSQTLKKSRITAAHPKELILGNKDEPVRTRSA 1086
G NV+D K+ E E D +E+ + +RI HPKELI+G+ + V TRS
Sbjct: 2980 GDNVADAAKSGENAE-NSDSATDESNINQPDKRSSTRIQKMHPKELIIGDPNRGVTTRSR 3156
Query: 1087 FRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPENVHV 1146
E ++S VS IEPK++ EAL D+ WI AM+EEL QF +N+VW LV +PE +V
Sbjct: 3157 ----EVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNV 3324
Query: 1147 IGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNH 1206
IGTKW+F+NK NE+G + RNKARLVAQGY+Q EG+D+ ETFAPVARLE+IRLL+ +
Sbjct: 3325 IGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACIL 3504
Query: 1207 NIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERL 1266
L+QMDVKSAFLNGY++EEVYV QP GF D PDHV++LKK+LYGLKQAPRAWYERL
Sbjct: 3505 KFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYERL 3684
Query: 1267 SSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEM 1326
+ FL + + +G +D TLF K ++++I QIYVDDI+FG + + + F + MQ+EFEM
Sbjct: 3685 TEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEM 3864
Query: 1327 SMMGELKYFLGIQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDKS 1386
S++GEL YFLG+QV Q + ++ QS+Y K ++KKF M ++ +TP L K++
Sbjct: 3865 SLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAG 4044
Query: 1387 GKVCQKLYRGMIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNL 1446
V Q LYR MIGSLLYLTASRPDI ++V +CAR+Q++P+ +HLT VKRIL+Y+ GT++
Sbjct: 4045 TSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDY 4224
Query: 1447 GLMYKKT*EYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAE 1506
G+MY L GYCDAD+AG +RKSTSG C +LG+NL+SW SK+Q+ ++LSTAEAE
Sbjct: 4225 GIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAE 4404
Query: 1507 YISAAICSTQMLWMKHQLEDYQILESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFI 1566
YI+A +Q++WMK L++Y + + + +YCDN +AI++SKNP+ HSR KHI++++H+I
Sbjct: 4405 YIAAGSSCSQLVWMKQMLKEYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYI 4584
Query: 1567 RDYVQKGVLLLKFVDTDHQWADIFTKPLAEDRF 1599
RD V V+ LK VDT+ Q ADIFTK L ++F
Sbjct: 4585 RDLVDDKVITLKHVDTEEQIADIFTKALDANQF 4683
Score = 155 bits (391), Expect = 2e-37
Identities = 134/510 (26%), Positives = 230/510 (44%), Gaps = 19/510 (3%)
Frame = +1
Query: 11 KPPMFDGQRFEYWKDRLESFFLGFDADLWDIIVDGYERP--VDADGKKI----PRSEMTA 64
+PP+ DG +EYWK R+ +F D+ W ++ G+E P +D +GK P + T
Sbjct: 34 RPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTK 213
Query: 65 EQKKLYSQHHKARAILLSAISYEEYQKITDREFAKGIFDSLKMSHEGNKKVKESKALSLI 124
E+ +L + KA L + + ++ I AK ++ LK++HEG KVK S+ L
Sbjct: 214 EEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLA 393
Query: 125 QKYESFIMEPNESIEEMFSRFQLLVAGIRPLNKSYTTKDHVIRVIRCLPESWMPLVTSIE 184
K+E+ M+ E I + + L + T + V +++R LP+ + VT+IE
Sbjct: 394 TKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIE 573
Query: 185 LTRDVENMSLEELISILKCHELKRSEMQDLRKKSIALKSKSEKAKVEKSKALQAEEEESE 244
+D+ NM ++ELI L+ EL S+ + + K++A S E EE+E +
Sbjct: 574 EAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDE-----------GEEDEYD 720
Query: 245 EASEDSGEDELTLISKRLNRIWKHRQSKFKGSGK------AKG-KYES-SGQKKSSIREV 296
+++ + + L+ K+ N++ + K + KG KY+ S K S + +
Sbjct: 721 LDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGI 900
Query: 297 TCFECKESGHYKSDCPKLKKDKKPKKHFKTKKSLMVTFDESESE-DVDSDGEVQGLMAIV 355
C C+ GH ++CP K K +K L V ++ESE + DSD +V L I
Sbjct: 901 QCHGCEGYGHIIAECPTHLK--------KHRKGLSVCQSDTESEQESDSDRDVNALTGIF 1056
Query: 356 KDKGAESKEAVDSDSESEGDPDSDDENEVFASFSTSELKHALSDIMDKYNSLLSKHKKLK 415
+ ++++ D+DSE D EL + + K +L + +LK
Sbjct: 1057E----TAEDSSDTDSEITFD----------------ELAASYRKLCIKSEKILQQEAQLK 1176
Query: 416 KNLSAVSKTPSEHEKIISDLKNDNHALVNSNSVLKNQIAKLEEV-IACDASDSKHESKYE 474
K ++ + H++ IS+LK + L NS L+N ++ + D D
Sbjct: 1177KVIADLEAEKEAHKEEISELKGEVGFL---NSKLENMTKSIKMLNKGSDTLDEVLLLGKN 1347
Query: 475 KSFQR---FLAKSVDRSLMASMIYGVSRNG 501
QR F KS R+ M + +R G
Sbjct: 1348AGNQRGLGFNPKSAGRTTMTEFVPAKNRTG 1437
Score = 36.2 bits (82), Expect = 0.12
Identities = 35/121 (28%), Positives = 57/121 (46%)
Frame = +3
Query: 855 SAKREGL*DCACQK*PWWRV*E*QV*ESV*FLWNCT*FLLSQNSSTKWCC*EEEQNSSGD 914
++KRE L Q+*PW R+*+ QV + + +* L S ++T+W *EE+Q+ +
Sbjct: 2418 TSKRERLCHQENQE*PWQRI*KQQVH*ILHI*RHHS*VLCSHYTTTEWDS*EEKQDFARG 2597
Query: 915 G*NHAPRNWHG*ALLGRGSKYSMLHSEQNLCETNSE*DSL*IVEEHKTQHFLFSSFWLCL 974
HA LG ++SMLH +Q+ E +* +E + W +
Sbjct: 2598 CSGHASCQRTSL*SLG*SHEHSMLHPQQSHTEKRDSNHPV*NLEREEAICQALPHLWKSM 2777
Query: 975 L 975
L
Sbjct: 2778 L 2780
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 608 bits (1567), Expect = e-174
Identities = 301/575 (52%), Positives = 412/575 (71%), Gaps = 2/575 (0%)
Frame = +1
Query: 1027 GFNVSDKGKAPEEVEPEEDEPEEEAGPSNSQTLKKS--RITAAHPKELILGNKDEPVRTR 1084
G NV+D K+ E E + +E P+ +Q K+ RI HPKELI+G+ + V TR
Sbjct: 2983 GDNVADTAKSAENAENSDSATDE---PNINQPDKRPSIRIQKMHPKELIIGDPNRGVTTR 3153
Query: 1085 SAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPENV 1144
S E ++S VS IEPK++ EAL D+ WI AM+EEL QF +N+VW LV +PE
Sbjct: 3154 SR----EIEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGT 3321
Query: 1145 HVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSV 1204
+VIGTKW+F+NK NE+G + RNKARLVAQGY+Q EG+D+ ETFAPVARLE+IRLL+ +
Sbjct: 3322 NVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVAC 3501
Query: 1205 NHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYE 1264
L+QMDVKSAFLNGY++EE YV QP GF D PDHV++LKK+LYGLKQAPRAWYE
Sbjct: 3502 ILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYE 3681
Query: 1265 RLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEF 1324
RL+ FL + + +G +D TLF K ++++I QIYVDDI+FG + + + F + MQ+EF
Sbjct: 3682 RLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEF 3861
Query: 1325 EMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKED 1384
EMS++GEL YFLG+QV Q + ++ QSKY K ++KKF M ++ +TP L K++
Sbjct: 3862 EMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDE 4041
Query: 1385 KSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTT 1444
V Q LYR MIGSLLYLTASRPDI ++V +CAR+Q++P+ +HL VKRIL+Y+ GT+
Sbjct: 4042 AGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTS 4221
Query: 1445 NLGLMYKKT*EYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAE 1504
+ G+MY + L GYCDAD+AG +RKSTSG C +LG+NL+SW SK+Q+ ++LSTAE
Sbjct: 4222 DYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAE 4401
Query: 1505 AEYISAAICSTQMLWMKHQLEDYQILESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYH 1564
AEYI+A +Q++WMK L++Y + + + +YCDN +AI++SKNP+ HSR KHI++++H
Sbjct: 4402 AEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHH 4581
Query: 1565 FIRDYVQKGVLLLKFVDTDHQWADIFTKPLAEDRF 1599
+IRD V V+ L+ VDT+ Q ADIFTK L ++F
Sbjct: 4582 YIRDLVDDKVITLEHVDTEEQIADIFTKALDANQF 4686
Score = 156 bits (394), Expect = 8e-38
Identities = 122/463 (26%), Positives = 213/463 (45%), Gaps = 16/463 (3%)
Frame = +1
Query: 11 KPPMFDGQRFEYWKDRLESFFLGFDADLWDIIVDGYERP--VDADGKKI----PRSEMTA 64
+PP+ DG +EYWK R+ +F D+ W ++ G+E P +D +GK P + T
Sbjct: 34 RPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTK 213
Query: 65 EQKKLYSQHHKARAILLSAISYEEYQKITDREFAKGIFDSLKMSHEGNKKVKESKALSLI 124
E+ +L + KA L + + ++ I AK ++ LK +HEG KVK S+ L
Sbjct: 214 EEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLA 393
Query: 125 QKYESFIMEPNESIEEMFSRFQLLVAGIRPLNKSYTTKDHVIRVIRCLPESWMPLVTSIE 184
K+E+ M+ E I + + L + T + V +++R LP+ + VT+IE
Sbjct: 394 TKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIE 573
Query: 185 LTRDVENMSLEELISILKCHELKRSEMQDLRKKSIALKSKSEKAKVEKSKALQAEEEESE 244
+D+ NM ++ELI L+ EL S+ + + K++A S E EE+E +
Sbjct: 574 EAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDE-----------GEEDEYD 720
Query: 245 EASEDSGEDELTLISKRLNRIWKHRQSKFK--------GSGKAKGKYESSGQKKSSIREV 296
+++ + + L+ K+ N++ + K K + S +K S + +
Sbjct: 721 LDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKKSDEKPSHSKGI 900
Query: 297 TCFECKESGHYKSDCPKLKKDKKPKKHFKTKKSLMV-TFDESESE-DVDSDGEVQGLMAI 354
C C+ GH K++CP K K +K L V D++ESE + DSD +V L
Sbjct: 901 QCHGCEGYGHIKAECPTHLK--------KQRKGLSVCRSDDTESEQESDSDRDVNALTG- 1053
Query: 355 VKDKGAESKEAVDSDSESEGDPDSDDENEVFASFSTSELKHALSDIMDKYNSLLSKHKKL 414
+ ++++ D+DSE D EL + ++ K +L + +L
Sbjct: 1054---RFESAEDSSDTDSEITFD----------------ELAISYRELCIKSEKILQQEAQL 1176
Query: 415 KKNLSAVSKTPSEHEKIISDLKNDNHALVNSNSVLKNQIAKLE 457
KK ++ + HE+ IS+LK + L NS L+N ++
Sbjct: 1177KKVIANLEAEKEAHEEEISELKGEVGFL---NSKLENMTKSIK 1296
Score = 36.6 bits (83), Expect = 0.091
Identities = 31/89 (34%), Positives = 49/89 (54%)
Frame = +3
Query: 855 SAKREGL*DCACQK*PWWRV*E*QV*ESV*FLWNCT*FLLSQNSSTKWCC*EEEQNSSGD 914
++KR+ L Q+*PW RV*+ QV + + +* L S ++TKW *+E+Q+ +
Sbjct: 2421 TSKRKRLCHQENQE*PWQRV*KQQVY*ILHI*RHHS*VLCSHYTTTKWHS*KEKQDFARS 2600
Query: 915 G*NHAPRNWHG*ALLGRGSKYSMLHSEQN 943
* HA LG ++SMLH +Q+
Sbjct: 2601 C*GHASCQRTSL*SLG*SHEHSMLHPQQS 2687
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 232 bits (591), Expect = 1e-60
Identities = 110/153 (71%), Positives = 132/153 (85%)
Frame = -3
Query: 1120 LAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQE 1179
+AM+EELNQF +N+VW LV+KPEN VIGTKWVFRNKL+E G ++RNKARLVA+GY+Q+E
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 1180 GIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDE 1239
GIDY ET+APVARLE IR+L+++ N L+QMDVKSAFLNG I EEVYV QPPGFE
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 1240 KKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLE 1272
KP HV+KL+K+LYGLKQAPRAWYER+S+FLLE
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>TC232995
Length = 1009
Score = 226 bits (576), Expect = 6e-59
Identities = 112/173 (64%), Positives = 133/173 (76%)
Frame = +2
Query: 1230 VHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTY 1289
V QPPGFE KP+HV+KL+K+LYGLKQAPRAWYERLS+FLLE EF RGKVDTTLF K
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 1290 KDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYI 1349
+DIL+VQIYVDDIIFGS N SLCKEFS MQ+EFEMSMMGELKYFLG+Q+ QT G +I
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 1350 HQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLL 1402
+QSKY KEL+K+F M + TPM C L+K++ + K YR IG ++
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 216 bits (550), Expect = 6e-56
Identities = 121/247 (48%), Positives = 158/247 (62%), Gaps = 2/247 (0%)
Frame = +3
Query: 1235 GFEDEKKPDHVFKLKKSL--YGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDD 1292
GFED+++P HVF + L G+K ++ S ++ K+
Sbjct: 330 GFEDKERPCHVFMV*NKL*ELGMKG*VHF*FQMDSPEE*RTPHYSERLK--------KET 485
Query: 1293 ILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQS 1352
LI+ IYVDDIIFG+ ++ +CKEF E+M+ FE SM GELK+ LG+Q+ Q G +IHQ
Sbjct: 486 FLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQE 665
Query: 1353 KYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDIL 1412
KYTK LK+F M E+ TPMH + I++K++K K Y GMI SL YLT+SRPDI+
Sbjct: 666 KYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIV 845
Query: 1413 FSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDADYAGDRTE 1472
F V LCARFQS P+ +H+TAVKRILRYL GTTN L +KK E+ L GYCD +AGD+ E
Sbjct: 846 FVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVE 1025
Query: 1473 RKSTSGN 1479
RKSTS N
Sbjct: 1026 RKSTSRN 1046
>TC213445
Length = 705
Score = 124 bits (311), Expect(2) = 4e-47
Identities = 58/98 (59%), Positives = 77/98 (78%)
Frame = +1
Query: 1470 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQI 1529
+T+R+STS C F+GS LVSW SK+Q+++ LSTAEAEYISA Q+ WM+ QL DY +
Sbjct: 400 KTDRESTSDTCHFIGSALVSWHSKKQNSVVLSTAEAEYISARSYYAQIFWMRQQLFDYGL 579
Query: 1530 LESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIR 1567
+IPI CDNT+AI+LSKN IL+SR KHIE+++HF+R
Sbjct: 580 KLDHIPIRCDNTSAINLSKNHILYSRTKHIEIRHHFLR 693
Score = 84.3 bits (207), Expect(2) = 4e-47
Identities = 39/68 (57%), Positives = 50/68 (73%)
Frame = +2
Query: 1397 MIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EY 1456
MI S LYL+ SRP I+FSV +C R+Q++P+E+HL+ +KRI+RYL G NLGL Y K Y
Sbjct: 197 MIESFLYLSTSRPHIMFSVCMCVRYQANPKESHLSVIKRIMRYLLGIINLGLWYPKNSSY 376
Query: 1457 KLSGYCDA 1464
L GY DA
Sbjct: 377 NLVGYSDA 400
>BM143109
Length = 415
Score = 173 bits (438), Expect = 6e-43
Identities = 86/133 (64%), Positives = 103/133 (76%)
Frame = +1
Query: 1232 QPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKD 1291
QPP ++ +KP+HVFKLKK LYGLKQA RAWYE LS FLL+ F +GKVDT LF +
Sbjct: 4 QPPVRKNSEKPNHVFKLKKVLYGLKQALRAWYELLSKFLLDKGFSKGKVDTNLFI*KKLN 183
Query: 1292 DILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQ 1351
DIL+VQIYVDDIIFGS N SLCK+FS+ MQ EFEMSMM EL +FLG+Q+ QT G +I Q
Sbjct: 184 DILLVQIYVDDIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNGIFISQ 363
Query: 1352 SKYTKELLKKFNM 1364
SKY K+L+ +F M
Sbjct: 364 SKYCKDLIHRFGM 402
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 171 bits (432), Expect = 3e-42
Identities = 90/217 (41%), Positives = 138/217 (63%), Gaps = 4/217 (1%)
Frame = +1
Query: 1394 YRGMIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK-- 1451
+R +IGSL YL SRP+I F+V L +RF PR +H+ A KR+LR +KGT G+++
Sbjct: 22 FRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFK 201
Query: 1452 -KT*EYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISA 1510
K+ + L GY D+D+ D + KST G V+ +SK+Q IALST EAEY++A
Sbjct: 202 AKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAA 381
Query: 1511 AICSTQMLWMKHQLEDYQILESN-IPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDY 1569
++ + Q +WM + LE+ ++ E + + DN +AI+L+K+P LH R+KHIE+++H+IRD
Sbjct: 382 SLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHYIRDQ 561
Query: 1570 VQKGVLLLKFVDTDHQWADIFTKPLAEDRFNFILKNL 1606
V KG + +++ + Q AD+ TKP+ RF I L
Sbjct: 562 VSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>AI959950
Length = 466
Score = 169 bits (428), Expect(2) = 1e-41
Identities = 86/130 (66%), Positives = 102/130 (78%)
Frame = -1
Query: 1121 AMEEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEG 1180
AM+EEL+QF KN+V LVK P+ V+G KW+F NKL+E G VVR KARLVA+GYSQQEG
Sbjct: 391 AMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQEG 212
Query: 1181 IDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEK 1240
IDY +TFA VARLE I +L+SF+ N+ L+QMDVKSAFLNG I +EVYV QPPGFE+E
Sbjct: 211 IDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENET 32
Query: 1241 KPDHVFKLKK 1250
HVFKL K
Sbjct: 31 LHQHVFKLNK 2
Score = 21.2 bits (43), Expect(2) = 1e-41
Identities = 8/16 (50%), Positives = 13/16 (81%)
Frame = -2
Query: 1100 LVSLIEPKSIDEALQD 1115
L+ ++PK IDEA++D
Sbjct: 453 LIFEMKPKHIDEAIKD 406
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 160 bits (404), Expect = 5e-39
Identities = 74/111 (66%), Positives = 95/111 (84%)
Frame = +3
Query: 1105 EPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVV 1164
EPK+I EA+ D +WI+ M+EELNQF +N+VW LV+KPEN VIGTKWVFRNKL+E G ++
Sbjct: 3 EPKNIKEAIVDDNWIIVMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIII 182
Query: 1165 RNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDV 1215
RNKARLVA+GY+Q+EGIDY ET+APVARLEAIR+L++++ N L+QMDV
Sbjct: 183 RNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMNFKLYQMDV 335
>CO983516
Length = 724
Score = 152 bits (385), Expect = 9e-37
Identities = 73/120 (60%), Positives = 92/120 (75%)
Frame = +2
Query: 1187 FAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVF 1246
F PVARLE+IRLL+ + L+QMDVKSAFLNGY++EEVYV QP GF D PDHV+
Sbjct: 365 FHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPDHVY 544
Query: 1247 KLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFG 1306
+LKK+LYGLKQAPRAWYERL+ L + + +G +D TLF K ++++I QIYVDDI+FG
Sbjct: 545 RLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFG 724
>NP004897 gag-protease polyprotein
Length = 1923
Score = 150 bits (380), Expect = 3e-36
Identities = 120/463 (25%), Positives = 211/463 (44%), Gaps = 16/463 (3%)
Frame = +1
Query: 11 KPPMFDGQRFEYWKDRLESFFLGFDADLWDIIVDGYERP--VDADGKKI----PRSEMTA 64
+PP+ DG +EYWK R+ +F D+ W ++ +E P +D +GK P + T
Sbjct: 34 RPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDGLKPEEDWTK 213
Query: 65 EQKKLYSQHHKARAILLSAISYEEYQKITDREFAKGIFDSLKMSHEGNKKVKESKALSLI 124
E+ +L + KA L + + ++ I AK ++ LK +HEG KVK S+ L
Sbjct: 214 EEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLA 393
Query: 125 QKYESFIMEPNESIEEMFSRFQLLVAGIRPLNKSYTTKDHVIRVIRCLPESWMPLVTSIE 184
K+E+ M+ E I + + L + T + V +++R LP+ + VT+IE
Sbjct: 394 TKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIE 573
Query: 185 LTRDVENMSLEELISILKCHELKRSEMQDLRKKSIALKSKSEKAKVEKSKALQAEEEESE 244
+D+ N+ ++ELI L+ EL S+ + + K++A S E EE+E +
Sbjct: 574 EAQDICNLRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDE-----------GEEDEYD 720
Query: 245 EASEDSGEDELTLISKRLNRIWKHRQSKFK--------GSGKAKGKYESSGQKKSSIREV 296
+++ + + L+ K+ N++ + K K + S +K S +
Sbjct: 721 LDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKRSDEKPSHSKGF 900
Query: 297 TCFECKESGHYKSDCPKLKKDKKPKKHFKTKKSLMV-TFDESESE-DVDSDGEVQGLMAI 354
C C+ GH K++CP K K +K L V D++ESE + DSD +V L
Sbjct: 901 QCHGCEGYGHIKAECPTHLK--------KQRKGLSVCRSDDTESEQESDSDRDVNALTG- 1053
Query: 355 VKDKGAESKEAVDSDSESEGDPDSDDENEVFASFSTSELKHALSDIMDKYNSLLSKHKKL 414
+ ++++ D+DSE D EL + ++ K +L + +L
Sbjct: 1054---RFESAEDSSDTDSEITFD----------------ELATSYRELCIKSEKILQQEAQL 1176
Query: 415 KKNLSAVSKTPSEHEKIISDLKNDNHALVNSNSVLKNQIAKLE 457
KK ++ + HE+ IS+LK + L NS L+N ++
Sbjct: 1177KKVIANLEAEKEAHEEEISELKGEVGFL---NSKLENMTKSIK 1296
>AI855982
Length = 484
Score = 148 bits (374), Expect = 2e-35
Identities = 78/165 (47%), Positives = 111/165 (67%)
Frame = +2
Query: 1069 PKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEEELNQ 1128
P + I+G+ + V TR + + L + VS+IEPK+I EA+ D +WI+AM+EELNQ
Sbjct: 2 PLDNIIGDISKGVTTRHSLKD----LCNNMAFVSMIEPKNIKEAIVDDNWIIAMQEELNQ 169
Query: 1129 FSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFA 1188
F +N+VW LV+KP+N VI TKWVFRNKL+E ++ +KARLVA+GY+Q +G+DY T+A
Sbjct: 170 FERNNVWKLVEKPDNYPVI*TKWVFRNKLDEHRIIIIHKARLVAEGYNQVDGLDYEHTYA 349
Query: 1189 PVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQP 1233
+ARL I + +S+ N L+ SA L+G + EVYV QP
Sbjct: 350 SIARL*VIIMPLSYVYIMNSTLYHYACVSALLHGLLLHEVYVDQP 484
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 140 bits (354), Expect = 3e-33
Identities = 65/151 (43%), Positives = 100/151 (66%), Gaps = 1/151 (0%)
Frame = +2
Query: 1457 KLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQ 1516
+LSGYCDAD+AG +R+STSG C F+G NLVSW SK+Q+ +A S+AEAEY S A+ + +
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1517 MLWMKHQLEDYQILES-NIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVL 1575
++W+K L++ + E + +YCDN AA+ ++ NP+ H R KHIE+ HFIR+ + +
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1576 LLKFVDTDHQWADIFTKPLAEDRFNFILKNL 1606
+ +F+ ++ Q DI TK L + + L
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKL 466
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 127 bits (319), Expect = 4e-29
Identities = 61/130 (46%), Positives = 90/130 (68%)
Frame = -2
Query: 1138 VKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIR 1197
V P +G +WV+ K+ G+V R KARLVA+GY+Q GIDY +TF+PVA+L +R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 1198 LLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQ 1257
L ++ + + LHQ+D+K+AFL+G + E++Y+ QPPGF + + V KL +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1258 APRAWYERLS 1267
+PRAW+ + S
Sbjct: 46 SPRAWFGKFS 17
>BU549979
Length = 615
Score = 126 bits (316), Expect = 9e-29
Identities = 65/183 (35%), Positives = 111/183 (60%), Gaps = 3/183 (1%)
Frame = -1
Query: 1420 RFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDADYAGDRTERKSTSGN 1479
R+QS+P H K+++RYL+GT + LMYK+T ++ GY D+D+AG R+STSG
Sbjct: 606 RYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGY 427
Query: 1480 CQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILES---NIPI 1536
L +VSW S +Q+ IA ST E E++ ++ +W+K + ++++S + +
Sbjct: 426 IFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKL 247
Query: 1537 YCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAE 1596
YCDN AA+ ++KN +R+KHI++KY IR+ V++ ++++ V+T+ D TK +
Sbjct: 246 YCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTP 67
Query: 1597 DRF 1599
F
Sbjct: 66 KNF 58
>CO982036
Length = 674
Score = 124 bits (312), Expect = 3e-28
Identities = 76/212 (35%), Positives = 119/212 (55%), Gaps = 5/212 (2%)
Frame = -2
Query: 1289 YKDDILIVQ--IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEG 1346
YK IL V +YVD II GS+ +L + + + + F + ++G+L YF+ I+V P+
Sbjct: 673 YKTHILTVYLLVYVDIIITGSSC-TLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDL 497
Query: 1347 TYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTA 1406
+ ++ + +K ++ +PM TC L K D YR ++G+L Y T
Sbjct: 496 LFSLRTSIFEIFCRKPR*QAQPIS-SPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 1407 SRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK---KT*EYKLSGYCD 1463
RP+I F+V+ +F S+P ++H T VKRILRYLKG+ + GL K + + G+CD
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 1464 ADYAGDRTERKSTSGNCQFLGSNLVSWASKRQ 1495
AD+A +++STSG FLG NL+SW +Q
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 118 bits (295), Expect = 2e-26
Identities = 55/131 (41%), Positives = 88/131 (66%), Gaps = 1/131 (0%)
Frame = +3
Query: 1470 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQI 1529
R +RKST+G F+G +W SK+Q + LST EAEY++A C +W+++ L++ ++
Sbjct: 6 RDDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKM 185
Query: 1530 -LESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWAD 1588
E + I DN +A++L+KNP+ H ++KHI+ +YHFIR+ ++K + LK+V + Q AD
Sbjct: 186 PQEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAAD 365
Query: 1589 IFTKPLAEDRF 1599
IFTKPL + F
Sbjct: 366 IFTKPLKLETF 398
>BG508993
Length = 374
Score = 118 bits (295), Expect = 2e-26
Identities = 53/123 (43%), Positives = 82/123 (66%), Gaps = 1/123 (0%)
Frame = +1
Query: 1441 KGTTNLGLMYKKT*EYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIAL 1500
KGT + GL Y + YKL G+CD+D+AGD +RKST+G F+G + +W+SK+Q + L
Sbjct: 4 KGTIDFGLFYSPSNNYKLVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIVTL 183
Query: 1501 STAEAEYISAAICSTQMLWMKHQLEDYQILE-SNIPIYCDNTAAISLSKNPILHSRAKHI 1559
T EAEY++A C+ +W++ LE+ Q+L+ + IY DN +A L+KN + H R+KHI
Sbjct: 184 FTCEAEYVAATSCTCHAIWLRRLLEELQLLQKESTKIYVDNRSAQELAKNSVFHERSKHI 363
Query: 1560 EVK 1562
+ +
Sbjct: 364 DTR 372
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 114 bits (284), Expect = 5e-25
Identities = 56/134 (41%), Positives = 81/134 (59%)
Frame = +3
Query: 1100 LVSLIEPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNE 1159
L SL P +I EAL W AM +E+ N W LV P +G +WV+ K+
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 1160 KGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAF 1219
G V R KARLVA+GY+Q GI+Y +TF+PV L +RL ++ + + LHQ+D+K+AF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 1220 LNGYISEEVYVHQP 1233
L+G + E++Y+ QP
Sbjct: 363 LHGDLEEDIYMEQP 404
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.335 0.145 0.463
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 68,577,265
Number of Sequences: 63676
Number of extensions: 955777
Number of successful extensions: 8058
Number of sequences better than 10.0: 167
Number of HSP's better than 10.0 without gapping: 7735
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 7992
length of query: 1610
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1500
effective length of database: 5,635,272
effective search space: 8452908000
effective search space used: 8452908000
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 66 (30.0 bits)
Lotus: description of TM0134.9