
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0134.10
(1564 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 265 1e-91
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 265 3e-90
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 210 3e-54
TC213445 84 1e-34
TC232995 134 2e-32
CO982036 115 1e-25
NP004897 gag-protease polyprotein 110 6e-24
TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (F... 106 9e-23
BM143109 105 1e-22
BI321712 105 2e-22
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 103 6e-22
AW185460 102 2e-21
CF920770 98 2e-20
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 94 4e-19
BG508993 92 2e-18
BU549979 91 5e-18
AI855899 similar to GP|2244960|emb| retrotransposon like protein... 89 1e-17
BU550037 88 3e-17
BM086359 84 6e-16
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 80 7e-15
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 265 bits (676), Expect(2) = 1e-91
Identities = 125/260 (48%), Positives = 183/260 (70%)
Frame = +1
Query: 1238 LKNIQNDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPE 1297
+K +++I QIYVDDI+FG + + + F + MQ+EFEMS++GEL YFLG+QV Q +
Sbjct: 3742 VKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED 3921
Query: 1298 GTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLLYLT 1357
++ QS+Y K ++KKF M ++ +TP L K++A V Q LY MIG+LLYLT
Sbjct: 3922 SIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLT 4101
Query: 1358 ASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDAD 1417
ASRPDI ++V +CAR+Q++P+ +HLT VKRIL+Y+ GT++ G+MY L GYCDAD
Sbjct: 4102 ASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDAD 4281
Query: 1418 YAGDRTERKSTSENCQFLGSNLVSWASKWQSTIALSTAEAEYILTAICSTQMLWMKHQLE 1477
+AG +RKSTS C +LG+NL+SW SK Q+ ++LSTAEAEYI +Q++WMK L+
Sbjct: 4282 WAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLK 4461
Query: 1478 DYQILESNISIYCDNTAAIH 1497
+Y + + +++YCDN +AI+
Sbjct: 4462 EYNVEQDVMTLYCDNMSAIN 4521
Score = 113 bits (283), Expect = 6e-25
Identities = 112/433 (25%), Positives = 185/433 (41%), Gaps = 40/433 (9%)
Frame = +1
Query: 3 SFFLGFDADLWDIIVDGYERP--VDEEGKKI----PRSEMTADQKKLYSQHHKARAILLS 56
+F D+ W ++ G+E P +D EGK P + T ++ +L + KA L +
Sbjct: 88 AFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKEEDELALGNSKALNALFN 267
Query: 57 AISYEEYEKITDREYAKGIFESLKMSHEGNKKVKESKALSLIQKYESFIMEPNESIEEMF 116
+ + I AK +E LK++HEG KVK S+ L K+E+ M+ E I +
Sbjct: 268 GVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFH 447
Query: 117 SRFQLLVAGIRPLNKSYTTKYHVIRVIRSLPESWMPLVTSIELTRDVERMSLEELISILK 176
+ L + T + V +++RSLP+ + VT+IE +D+ M ++ELI L+
Sbjct: 448 MNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQ 627
Query: 177 CHELKHSEM----------------------LDSDE---DELTLISKRLNRIWK------ 205
EL S+ LD+DE + + L+ K+ N++
Sbjct: 628 TFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDKRQ 807
Query: 206 --HKQSKYRGSGKAKGKSESSGQKKSSIKEVTCFECKESGHYKSDCPKLKKDKRPKKHFK 263
H Q+ K + S K S K + C C+ GH ++CP KKH
Sbjct: 808 KPHVQNIPFDIRKGSKYQKRSDVKPSHSKGIQCHGCEGYGHIIAECP-----THLKKH-- 966
Query: 264 TKKSLMVTFDESESE-DVDYDGEVQGLMDIVKDKGAESKDVVDSDSESEGDPNSDDENEV 322
+K L V ++ESE + D D +V L I + ++D D+DSE D
Sbjct: 967 -RKGLSVCQSDTESEQESDSDRDVNALTGIFE----TAEDSSDTDSEITFD--------- 1104
Query: 323 FASFSTSELKHALSDIMDKYNSLLSKHKKLKKDLSAASKTPYEHEKIISDLKNDNHALVN 382
EL + + K +L + +LKK ++ H++ IS+LK + L
Sbjct: 1105-------ELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFL-- 1257
Query: 383 SNSVLKNQIAKLE 395
NS L+N ++
Sbjct: 1258-NSKLENMTKSIK 1293
Score = 92.4 bits (228), Expect(2) = 1e-91
Identities = 62/153 (40%), Positives = 85/153 (55%)
Frame = +3
Query: 1070 LDSGYGRRTESIFQE*CLEPSEEASKCPCNWNKMGVQKQVE*ER*CSQKQGKASCSRLQS 1129
LD Y RR +I +E* L S A C+W+++ +Q+Q +* R +QKQG+ CSRL S
Sbjct: 3237 LDQCYARRIGAIQKE*SLGASS*A*GN*CDWHQVDLQEQNQ*RRCHNQKQGQTGCSRLHS 3416
Query: 1130 AGRNRLH*NICSSSKTRSNQTADLLLSKP*HNSTSDGC*ECIPKWIHIRGSLCSSTPRF* 1189
R RL *+ C S T +Q P + DGC E I +WI SLC +
Sbjct: 3417 D*RCRL**DFCPSC*T*VHQIITWCSLYPQIQAVPDGCEERISEWIPE*RSLCGAAKGIC 3596
Query: 1190 R*EEPRPCLQAEEISLWSEASSQSMV*ETQLIP 1222
R + R C+QA+E SLW EASS+S+V*+ +P
Sbjct: 3597 RPDSSRSCIQAQEGSLWIEASSKSLV*KANRVP 3695
Score = 38.1 bits (87), Expect = 0.030
Identities = 32/96 (33%), Positives = 51/96 (52%)
Frame = +3
Query: 789 LHSPSAKREGL*DCACQK*PWWRV*E*QV*ESV*FLWNCT*FLLSQNSSTKWCC*EEEQN 848
+ S ++KRE L Q+*PW R+*+ QV + + +* L S ++T+W *EE+Q+
Sbjct: 2406 VESKTSKRERLCHQENQE*PWQRI*KQQVH*ILHI*RHHS*VLCSHYTTTEWDS*EEKQD 2585
Query: 849 SSGDG*NHAPRNWHG*ALLGRGSKHSVLHSEQNLCE 884
+ HA LG +HS+LH +Q+ E
Sbjct: 2586 FARGCSGHASCQRTSL*SLG*SHEHSMLHPQQSHTE 2693
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 265 bits (677), Expect(2) = 3e-90
Identities = 125/260 (48%), Positives = 183/260 (70%)
Frame = +1
Query: 1238 LKNIQNDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPE 1297
+K +++I QIYVDDI+FG + + + F + MQ+EFEMS++GEL YFLG+QV Q +
Sbjct: 3745 VKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMED 3924
Query: 1298 GTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLLYLT 1357
++ QSKY K ++KKF M ++ +TP L K++A V Q LY MIG+LLYLT
Sbjct: 3925 SIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLT 4104
Query: 1358 ASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDAD 1417
ASRPDI ++V +CAR+Q++P+ +HL VKRIL+Y+ GT++ G+MY + L GYCDAD
Sbjct: 4105 ASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDAD 4284
Query: 1418 YAGDRTERKSTSENCQFLGSNLVSWASKWQSTIALSTAEAEYILTAICSTQMLWMKHQLE 1477
+AG +RKSTS C +LG+NL+SW SK Q+ ++LSTAEAEYI +Q++WMK L+
Sbjct: 4285 WAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLK 4464
Query: 1478 DYQILESNISIYCDNTAAIH 1497
+Y + + +++YCDN +AI+
Sbjct: 4465 EYNVEQDVMTLYCDNMSAIN 4524
Score = 115 bits (288), Expect = 2e-25
Identities = 111/434 (25%), Positives = 185/434 (42%), Gaps = 41/434 (9%)
Frame = +1
Query: 3 SFFLGFDADLWDIIVDGYERP--VDEEGKKI----PRSEMTADQKKLYSQHHKARAILLS 56
+F D+ W ++ G+E P +D EGK P + T ++ +L + KA L +
Sbjct: 88 AFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDELALGNSKALNALFN 267
Query: 57 AISYEEYEKITDREYAKGIFESLKMSHEGNKKVKESKALSLIQKYESFIMEPNESIEEMF 116
+ + I AK +E LK +HEG KVK S+ L K+E+ M+ E I +
Sbjct: 268 GVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFH 447
Query: 117 SRFQLLVAGIRPLNKSYTTKYHVIRVIRSLPESWMPLVTSIELTRDVERMSLEELISILK 176
+ L + T + V +++RSLP+ + VT+IE +D+ M ++ELI L+
Sbjct: 448 MNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQ 627
Query: 177 CHELKHSEM----------------------LDSDE---DELTLISKRLNRIWKHKQSKY 211
EL S+ LD+DE + + L+ K+ N++ +
Sbjct: 628 TFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDRRQ 807
Query: 212 R--------GSGKAKGKSESSGQKKSSIKEVTCFECKESGHYKSDCPKLKKDKRPKKHFK 263
+ K + S +K S K + C C+ GH K++CP K K
Sbjct: 808 KPHVRNIPFDIRKGSEYQKKSDEKPSHSKGIQCHGCEGYGHIKAECPTHLK--------K 963
Query: 264 TKKSLMV-TFDESESE-DVDYDGEVQGLMDIVKDKGAESKDVVDSDSESEGDPNSDDENE 321
+K L V D++ESE + D D +V L + ++D D+DSE D
Sbjct: 964 QRKGLSVCRSDDTESEQESDSDRDVNAL----TGRFESAEDSSDTDSEITFD-------- 1107
Query: 322 VFASFSTSELKHALSDIMDKYNSLLSKHKKLKKDLSAASKTPYEHEKIISDLKNDNHALV 381
EL + ++ K +L + +LKK ++ HE+ IS+LK + L
Sbjct: 1108--------ELAISYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEVGFL- 1260
Query: 382 NSNSVLKNQIAKLE 395
NS L+N ++
Sbjct: 1261--NSKLENMTKSIK 1296
Score = 87.4 bits (215), Expect(2) = 3e-90
Identities = 63/168 (37%), Positives = 88/168 (51%)
Frame = +3
Query: 1070 LDSGYGRRTESIFQE*CLEPSEEASKCPCNWNKMGVQKQVE*ER*CSQKQGKASCSRLQS 1129
LD Y RR +I +E* L S C+W+++ +Q+Q +* R +QKQG+ CSRL S
Sbjct: 3240 LDQCYARRIGAIQKE*SLGASS*TRGN*CDWHQVDLQEQNQ*RRCYNQKQGQTCCSRLHS 3419
Query: 1130 AGRNRLH*NICSSSKTRSNQTADLLLSKP*HNSTSDGC*ECIPKWIHIRGSLCSSTPRF* 1189
R RL *N T +Q P + DGC E + +WI SLC +
Sbjct: 3420 D*RCRL**NFRPCC*T*VHQIVTWCSLHPQIQAVPDGCEERVSEWIPE*RSLCGAAKGIC 3599
Query: 1190 R*EEPRPCLQAEEISLWSEASSQSMV*ETQLIPSGE*VCKG*SRYNSL 1237
R R C+QA+E SLW EASS+S+V*+ +P V +G + +SL
Sbjct: 3600 RSNSSRSCIQAQEGSLWIEASSKSLV*KANRVPYSARV*EGRN*QDSL 3743
Score = 38.9 bits (89), Expect = 0.018
Identities = 32/93 (34%), Positives = 51/93 (54%)
Frame = +3
Query: 789 LHSPSAKREGL*DCACQK*PWWRV*E*QV*ESV*FLWNCT*FLLSQNSSTKWCC*EEEQN 848
+ S ++KR+ L Q+*PW RV*+ QV + + +* L S ++TKW *+E+Q+
Sbjct: 2409 VESKTSKRKRLCHQENQE*PWQRV*KQQVY*ILHI*RHHS*VLCSHYTTTKWHS*KEKQD 2588
Query: 849 SSGDG*NHAPRNWHG*ALLGRGSKHSVLHSEQN 881
+ * HA LG +HS+LH +Q+
Sbjct: 2589 FARSC*GHASCQRTSL*SLG*SHEHSMLHPQQS 2687
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 210 bits (535), Expect = 3e-54
Identities = 109/198 (55%), Positives = 139/198 (70%)
Frame = +3
Query: 1234 YNSLLKNIQNDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVD 1293
Y+ LK + LI+ IYVDDIIFG+ ++ +CKEF E+M+ FE SM GELK+ LG+Q+
Sbjct: 459 YSERLK--KETFLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQII 632
Query: 1294 QTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTL 1353
Q G +IHQ KYTK LK+F M E+ TPMH + I++K++ K Y GMI +L
Sbjct: 633 QKVYGIFIHQEKYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSL 812
Query: 1354 LYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGY 1413
YLT+SRPDI+F V LCARFQS P+ +H+TAVKRILRYL GTTN L +KK E+ L GY
Sbjct: 813 SYLTSSRPDIVFVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGY 992
Query: 1414 CDADYAGDRTERKSTSEN 1431
CD +AGD+ ERKSTS N
Sbjct: 993 CDVYFAGDKVERKSTSRN 1046
>TC213445
Length = 705
Score = 83.6 bits (205), Expect(2) = 1e-34
Identities = 40/76 (52%), Positives = 54/76 (70%)
Frame = +1
Query: 1422 RTERKSTSENCQFLGSNLVSWASKWQSTIALSTAEAEYILTAICSTQMLWMKHQLEDYQI 1481
+T+R+STS+ C F+GS LVSW SK Q+++ LSTAEAEYI Q+ WM+ QL DY +
Sbjct: 400 KTDRESTSDTCHFIGSALVSWHSKKQNSVVLSTAEAEYISARSYYAQIFWMRQQLFDYGL 579
Query: 1482 LESNISIYCDNTAAIH 1497
+I I CDNT+AI+
Sbjct: 580 KLDHIPIRCDNTSAIN 627
Score = 83.2 bits (204), Expect(2) = 1e-34
Identities = 38/68 (55%), Positives = 50/68 (72%)
Frame = +2
Query: 1349 MIGTLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EY 1408
MI + LYL+ SRP I+FSV +C R+Q++P+E+HL+ +KRI+RYL G NLGL Y K Y
Sbjct: 197 MIESFLYLSTSRPHIMFSVCMCVRYQANPKESHLSVIKRIMRYLLGIINLGLWYPKNSSY 376
Query: 1409 KLSGYCDA 1416
L GY DA
Sbjct: 377 NLVGYSDA 400
>TC232995
Length = 1009
Score = 134 bits (338), Expect(2) = 2e-32
Identities = 67/117 (57%), Positives = 84/117 (71%)
Frame = +2
Query: 1238 LKNIQNDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPE 1297
+K NDIL+VQIYVDDIIFGS N SLCKEFS MQ+EFEMSMMGELKYFLG+Q+ QT
Sbjct: 170 IKRKHNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ* 349
Query: 1298 GTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLL 1354
G +I+QSKY KEL+K+F M + TPM C L+K+++ + K Y IG ++
Sbjct: 350 GIFINQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
Score = 25.0 bits (53), Expect(2) = 2e-32
Identities = 20/56 (35%), Positives = 29/56 (51%)
Frame = +1
Query: 1184 STPRF*R*EEPRPCLQAEEISLWSEASSQSMV*ETQLIPSGE*VCKG*SRYNSLLK 1239
+TP F* +PCL + SLW E S MV* + S + + + *S Y+ + K
Sbjct: 7 TTPWF*NF**TKPCL*ITKGSLWFETSP*GMV*TIK*FSS*KRILQR*SGYHIIHK 174
>CO982036
Length = 674
Score = 115 bits (289), Expect = 1e-25
Identities = 68/196 (34%), Positives = 110/196 (55%), Gaps = 3/196 (1%)
Frame = -2
Query: 1250 IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKE 1309
+YVD II GS+ +L + + + + F + ++G+L YF+ I+V P+ + ++ +
Sbjct: 640 VYVDIIITGSSC-TLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDLLFSLRTSIFEI 464
Query: 1310 LLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLLYLTASRPDILFSVHL 1369
+K ++ +PM TC L K D+ Y ++G L Y T RP+I F+V+
Sbjct: 463 FCRKPR*QAQPIS-SPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAVNK 287
Query: 1370 CARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK---KT*EYKLSGYCDADYAGDRTERK 1426
+F S+P ++H T VKRILRYLKG+ + GL K + + G+CDAD+A +++
Sbjct: 286 VCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVDDKR 107
Query: 1427 STSENCQFLGSNLVSW 1442
STS FLG NL+SW
Sbjct: 106 STSGAAVFLGPNLISW 59
>NP004897 gag-protease polyprotein
Length = 1923
Score = 110 bits (274), Expect = 6e-24
Identities = 109/434 (25%), Positives = 183/434 (42%), Gaps = 41/434 (9%)
Frame = +1
Query: 3 SFFLGFDADLWDIIVDGYERP--VDEEGKKI----PRSEMTADQKKLYSQHHKARAILLS 56
+F D+ W ++ +E P +D EGK P + T ++ +L + KA L +
Sbjct: 88 AFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDGLKPEEDWTKEEDELALGNSKALNALFN 267
Query: 57 AISYEEYEKITDREYAKGIFESLKMSHEGNKKVKESKALSLIQKYESFIMEPNESIEEMF 116
+ + I AK +E LK +HEG KVK S+ L K+E+ M+ E I +
Sbjct: 268 GVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFH 447
Query: 117 SRFQLLVAGIRPLNKSYTTKYHVIRVIRSLPESWMPLVTSIELTRDVERMSLEELISILK 176
+ L + T + V +++RSLP+ + VT+IE +D+ + ++ELI L+
Sbjct: 448 MNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNLRVDELIGSLQ 627
Query: 177 CHELKHSEM----------------------LDSDE---DELTLISKRLNRIWKHKQSKY 211
EL S+ LD+DE + + L+ K+ N++ +
Sbjct: 628 TFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDRRQ 807
Query: 212 R--------GSGKAKGKSESSGQKKSSIKEVTCFECKESGHYKSDCPKLKKDKRPKKHFK 263
+ K + S +K S K C C+ GH K++CP K K
Sbjct: 808 KPHVRNIPFDIRKGSEYQKRSDEKPSHSKGFQCHGCEGYGHIKAECPTHLK--------K 963
Query: 264 TKKSLMV-TFDESESE-DVDYDGEVQGLMDIVKDKGAESKDVVDSDSESEGDPNSDDENE 321
+K L V D++ESE + D D +V L + ++D D+DSE D
Sbjct: 964 QRKGLSVCRSDDTESEQESDSDRDVNAL----TGRFESAEDSSDTDSEITFD-------- 1107
Query: 322 VFASFSTSELKHALSDIMDKYNSLLSKHKKLKKDLSAASKTPYEHEKIISDLKNDNHALV 381
EL + ++ K +L + +LKK ++ HE+ IS+LK + L
Sbjct: 1108--------ELATSYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEVGFL- 1260
Query: 382 NSNSVLKNQIAKLE 395
NS L+N ++
Sbjct: 1261--NSKLENMTKSIK 1296
>TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (Fragment),
partial (16%)
Length = 562
Score = 106 bits (264), Expect = 9e-23
Identities = 54/125 (43%), Positives = 76/125 (60%)
Frame = +1
Query: 1243 NDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIH 1302
N LIV +YVDD++ + L +EF + M FEM+ +G + YFLGI++ Q+ I
Sbjct: 184 NYFLIVSLYVDDLLVTRDDARLVEEFKQEMMQAFEMTNLGLMTYFLGIEIKQSQNKVLIC 363
Query: 1303 QSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLLYLTASRPD 1362
Q KY KE+LKKF M E TPM+ K D + K+ + Y +IG L+YLTA+RPD
Sbjct: 364 QRKYAKEILKKFQMEECKSVSTPMNQKEKFNKVDGADKIDEGYYRSLIGCLMYLTATRPD 543
Query: 1363 ILFSV 1367
ILF++
Sbjct: 544 ILFAI 558
>BM143109
Length = 415
Score = 105 bits (263), Expect = 1e-22
Identities = 50/74 (67%), Positives = 60/74 (80%)
Frame = +1
Query: 1243 NDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIH 1302
NDIL+VQIYVDDIIFGS N SLCK+FS+ MQ EFEMSMM EL +FLG+Q+ QT G +I
Sbjct: 181 NDILLVQIYVDDIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNGIFIS 360
Query: 1303 QSKYTKELLKKFNM 1316
QSKY K+L+ +F M
Sbjct: 361 QSKYCKDLIHRFGM 402
>BI321712
Length = 399
Score = 105 bits (262), Expect = 2e-22
Identities = 55/124 (44%), Positives = 77/124 (61%)
Frame = -3
Query: 1250 IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKE 1309
+YVDD+IF N S+ +EF + M EFEM+ MG + Y+LGI+V Q +G +I Q Y KE
Sbjct: 379 LYVDDLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQEDKGIFITQEGYAKE 200
Query: 1310 LLKKFNMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLLYLTASRPDILFSVHL 1369
+LKKF M ++ TPM L K + V LY +IG+L YLT +RPDIL+ V +
Sbjct: 199 VLKKFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLTCTRPDILYVVGV 20
Query: 1370 CARF 1373
+R+
Sbjct: 19 VSRY 8
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 103 bits (257), Expect = 6e-22
Identities = 59/156 (37%), Positives = 93/156 (58%), Gaps = 4/156 (2%)
Frame = +1
Query: 1346 YHGMIGTLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMY--- 1402
+ +IG+L YL SRP+I F+V L +RF PR +H+ A KR+LR +KGT G+++
Sbjct: 22 FRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFK 201
Query: 1403 KKT*EYKLSGYCDADYAGDRTERKSTSENCQFLGSNLVSWASKWQSTIALSTAEAEYILT 1462
K+ + L GY D+D+ D + KST V+ +SK Q IALST EAEY+
Sbjct: 202 AKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAA 381
Query: 1463 AICSTQMLWMKHQLEDYQILESN-ISIYCDNTAAIH 1497
++ + Q +WM + LE+ ++ E +++ DN +AI+
Sbjct: 382 SLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAIN 489
>AW185460
Length = 411
Score = 102 bits (253), Expect = 2e-21
Identities = 52/106 (49%), Positives = 68/106 (64%)
Frame = +2
Query: 1358 ASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDAD 1417
A+RPDI+++ L +RF P + H A KRILRYL+GT G+ Y +L GY D+D
Sbjct: 89 ATRPDIMYATSLLSRFMQSPSQIHFGAGKRILRYLQGTKAFGIWYTTETNSELLGYTDSD 268
Query: 1418 YAGDRTERKSTSENCQFLGSNLVSWASKWQSTIALSTAEAEYILTA 1463
+AG + KSTS LGS + SWASK Q+T+A STAEAEY+ A
Sbjct: 269 WAGSTDDMKSTSGYAFSLGSGMFSWASKKQATVAQSTAEAEYVAVA 406
>CF920770
Length = 581
Score = 98.2 bits (243), Expect = 2e-20
Identities = 57/186 (30%), Positives = 98/186 (52%), Gaps = 13/186 (6%)
Frame = -2
Query: 9 DADLWDIIVDGYERPVDEEGKKI-------------PRSEMTADQKKLYSQHHKARAILL 55
D ++W+ I G P E I PR + + +K + KA+ I+
Sbjct: 574 DLNIWEAIEIGPYIPTTVERVSIDGSSSSESITIEKPRDRWSEEDRKRVQYNLKAKNIIT 395
Query: 56 SAISYEEYEKITDREYAKGIFESLKMSHEGNKKVKESKALSLIQKYESFIMEPNESIEEM 115
SA+ +EY ++++ + AK ++++L+++HEG VK S+ +L +YE F M NE+I+ M
Sbjct: 394 SALGMDEYFRVSNCKSAKEMWDTLRLTHEGTTDVKRSRINALTHEYELFRMNTNENIQSM 215
Query: 116 FSRFQLLVAGIRPLNKSYTTKYHVIRVIRSLPESWMPLVTSIELTRDVERMSLEELISIL 175
RF +V + L K + + + +V+R L W P VT+I +RD+ MSL L L
Sbjct: 214 QKRFTHIVNHLAALGKEFQNEDLINKVLRCLSREWQPKVTAISESRDLSNMSLATLFGKL 35
Query: 176 KCHELK 181
+ HE++
Sbjct: 34 QEHEME 17
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 94.4 bits (233), Expect = 4e-19
Identities = 44/90 (48%), Positives = 64/90 (70%), Gaps = 1/90 (1%)
Frame = +2
Query: 1409 KLSGYCDADYAGDRTERKSTSENCQFLGSNLVSWASKWQSTIALSTAEAEYILTAICSTQ 1468
+LSGYCDAD+AG +R+STS C F+G NLVSW SK Q+ +A S+AEAEY A+ + +
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1469 MLWMKHQLEDYQILES-NISIYCDNTAAIH 1497
++W+K L++ + E + +YCDN AA+H
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALH 283
>BG508993
Length = 374
Score = 92.0 bits (227), Expect = 2e-18
Identities = 43/104 (41%), Positives = 65/104 (62%), Gaps = 1/104 (0%)
Frame = +1
Query: 1393 KGTTNLGLMYKKT*EYKLSGYCDADYAGDRTERKSTSENCQFLGSNLVSWASKWQSTIAL 1452
KGT + GL Y + YKL G+CD+D+AGD +RKST+ F+G + +W+SK Q + L
Sbjct: 4 KGTIDFGLFYSPSNNYKLVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIVTL 183
Query: 1453 STAEAEYILTAICSTQMLWMKHQLEDYQILE-SNISIYCDNTAA 1495
T EAEY+ C+ +W++ LE+ Q+L+ + IY DN +A
Sbjct: 184 FTCEAEYVAATSCTCHAIWLRRLLEELQLLQKESTKIYVDNRSA 315
>BU549979
Length = 615
Score = 90.5 bits (223), Expect = 5e-18
Identities = 47/128 (36%), Positives = 76/128 (58%), Gaps = 3/128 (2%)
Frame = -1
Query: 1372 RFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDADYAGDRTERKSTSEN 1431
R+QS+P H K+++RYL+GT + LMYK+T ++ GY D+D+AG R+STS
Sbjct: 606 RYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGY 427
Query: 1432 CQFLGSNLVSWASKWQSTIALSTAEAEYILTAICSTQMLWMKHQLEDYQILES---NISI 1488
L +VSW S Q+ IA ST E E++ ++ +W+K + ++++S + +
Sbjct: 426 IFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKL 247
Query: 1489 YCDNTAAI 1496
YCDN AA+
Sbjct: 246 YCDNFAAV 223
>AI855899 similar to GP|2244960|emb| retrotransposon like protein {Arabidopsis
thaliana}, partial (18%)
Length = 418
Score = 89.4 bits (220), Expect = 1e-17
Identities = 51/136 (37%), Positives = 75/136 (54%), Gaps = 3/136 (2%)
Frame = +1
Query: 1315 NMLESTVAKTPMHPTCILEKEDASGKVCQKLYHGMIGTLLYLTASRPDILFSVHLCARFQ 1374
NML+ TPM + L K + Y ++G L Y+T +RP+I ++V+ + F
Sbjct: 10 NMLDCNGISTPMVSSYKLSKFGSELLPNAHQYRDIVGALQYVTLTRPNIAYNVNKVSEFM 189
Query: 1375 SDPRETHLTAVKRILRYLKGTTNLGLMYKKT---*EYKLSGYCDADYAGDRTERKSTSEN 1431
S P +++ VKRILRYL GT GL+ + + L Y D D+ D E +STS +
Sbjct: 190 SSPLQSY*LTVKRILRYLSGTVTQGLLLQPAHMDAKISLRAYNDLDWGSDPAEMRSTSGS 369
Query: 1432 CQFLGSNLVSWASKWQ 1447
C F GSNL++W+SK Q
Sbjct: 370 CIFSGSNLIAWSSKKQ 417
>BU550037
Length = 728
Score = 87.8 bits (216), Expect = 3e-17
Identities = 42/115 (36%), Positives = 72/115 (62%)
Frame = -3
Query: 1272 MQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCI 1331
M++EFEM+ +G++KYFLG+ + Q+ +G +I Q KY E+L+KF+M T +
Sbjct: 363 MESEFEMTDLGQMKYFLGM*IFQSEDGIFISQKKYAWEILRKFHMERCKPIATVLVVNEK 184
Query: 1332 LEKEDASGKVCQKLYHGMIGTLLYLTASRPDILFSVHLCARFQSDPRETHLTAVK 1386
K++ + +Y +IG+LLYL+A+RP+++F+ L +RF P + HL + K
Sbjct: 183 FSKDEEDNQGDASVYRSLIGSLLYLSATRPNLMFAATLLSRFTKSPSQVHLGSSK 19
>BM086359
Length = 427
Score = 83.6 bits (205), Expect = 6e-16
Identities = 53/147 (36%), Positives = 78/147 (53%), Gaps = 5/147 (3%)
Frame = +1
Query: 1288 LGIQVDQTPEGTYIHQSKYTKELLKKFNMLESTVAKTPMHPTCIL-----EKEDASGKVC 1342
LGI V Q+ G I Q KY ++L + ML+ + TPM P L E + G+ C
Sbjct: 1 LGIDVAQSSYGIVISQWKYALDILTETGMLDCLPSNTPMDPNVKLLSGQGEALEDPGR*C 180
Query: 1343 QKLYHGMIGTLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMY 1402
++G L YLT +R DI F+V + ++F DP ++ A RILRY+K GL+Y
Sbjct: 181 C-----LVGRLNYLTVTRLDITFAVGVLSQFLKDPTDSQWNATIRILRYIKNAPGPGLLY 345
Query: 1403 KKT*EYKLSGYCDADYAGDRTERKSTS 1429
+ K+ Y DAD+ G +++ STS
Sbjct: 346 EDKGNGKVVCYFDADWPGSPSDKSSTS 426
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 80.1 bits (196), Expect = 7e-15
Identities = 46/129 (35%), Positives = 74/129 (56%), Gaps = 2/129 (1%)
Frame = +3
Query: 1382 LTAVKRILRYLKGTTNLGLMYKKT*EYKLSGYCDADYAGDRTERKSTSENCQFLGSNLVS 1441
L A R+L+YLKG GL + + ++ G+ DAD+A KS + C FLGS+L+S
Sbjct: 12 LCAATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLIS 191
Query: 1442 WASKWQSTIALSTAEAEYILTAICST--QMLWMKHQLEDYQILESNISIYCDNTAAIH*V 1499
W +K Q+T++ S++ +E A+ ST ++ W+ + L+D + IYCDN +A+ *+
Sbjct: 192 WKAKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHV----TLIYCDNQSALQ*L 359
Query: 1500 RILSYIQGQ 1508
I GQ
Sbjct: 360 PIKVIYHGQ 386
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.347 0.151 0.513
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,563,871
Number of Sequences: 63676
Number of extensions: 1040352
Number of successful extensions: 10123
Number of sequences better than 10.0: 117
Number of HSP's better than 10.0 without gapping: 7270
Number of HSP's successfully gapped in prelim test: 304
Number of HSP's that attempted gapping in prelim test: 2462
Number of HSP's gapped (non-prelim): 8041
length of query: 1564
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1454
effective length of database: 5,635,272
effective search space: 8193685488
effective search space used: 8193685488
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.7 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0134.10