
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147002.8 + phase: 0 /pseudo
(1664 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits) Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...    155  1e-37
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete              140  6e-33
NP004897 gag-protease polyprotein                                      127  4e-29
CF920770                                                               110  7e-24
AI959950                                                                75  2e-13
TC232995                                                                75  2e-13
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti...     64  1e-11
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ...     68  4e-11
TC223814                                                                60  8e-09
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl...     51  5e-06
CF922226                                                                51  5e-06
BM143109                                                                46  1e-04
TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%)        46  2e-04
TC228567                                                                42  0.003
AI855982                                                                41  0.005
AI966222                                                                39  0.019
TC213445                                                                39  0.019
TC204982 PIR|T06394|T06394 isoprenylated protein - soybean (frag...     37  0.056
TC206178 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, part...     36  0.12
TC205401 similar to GB|AAP37850.1|30725656|BT008491 At4g26630 {A...     36  0.12
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 155 bits (393), Expect = 1e-37
Identities = 144/467 (30%), Positives = 235/467 (49%)
Frame = +3
Query: 1193 N*AQNC*RSSLR*WMDISYARRAKSIPKE*CVGSGTQTFSEEHYWNKMGIQKQAE*TRRS 1252
N*AQ C R + * +D YARR +I KE* +G+ + T W+++ +Q+Q +* R
Sbjct: 3198 N*AQECERGTD**VLDQCYARRIGAIQKE*SLGASS*TRGN*CDWHQVDLQEQNQ*RRCY 3377
Query: 1253 NQKQSQTCCSRLQSTRRH*LH*NICSSCKIGSNQVTSILRN*SWHNIISNGCQKCLS*WC 1312
NQKQ QTCCSRL S R L *N C +Q+ + + +GC++ +S W
Sbjct: 3378 NQKQGQTCCSRLHSD*RCRL**NFRPCC*T*VHQIVTWCSLHPQIQAVPDGCEERVSEWI 3557
Query: 1313 H*RRSVC*TTSWV*GS*AS*PCL*T*EITIWLETSSQSLV**TK*FLN*K*F*KRTS*HN 1372
RS+C + S +S C+ E ++W+E SS+SLV* *+ +* +
Sbjct: 3558 PE*RSLCGAAKGICRSNSSRSCIQAQEGSLWIEASSKSLV*KANRVPYSARV*EGRN*QD 3737
Query: 1373 TLQKDS*ERYFDCANIC**YNIWFY*CISLQRIF*VNAG*I*NEYDGRIEILSGNSNHPK 1432
+L + + D +IC**+ +W +A *I*+E R ++ SG +
Sbjct: 3738 SLCQTRC*KLDDSTDIC**HCVWRDVE*DASTFCPTDAI*I*DESCWRADLFSGTPSEAD 3917
Query: 1433 *RRSICSSNKIYKGASEEVQARRL*NDEHSNASNLHLKQRRYWNSSRPEALQRYDWFSVI 1492
R I + ++ K +EV + +++ +L +R W+ +++Q++DW I
Sbjct: 3918 GRLHIPLTKQVCKEHCQEVWDGKCQP*KNTCTYSLEAVKR*SWHQC*SKSVQKHDWELTI 4097
Query: 1493 PHCI*T*YFIQCMLVCKISIRS*RISFNCS*ENLQVSERNN*SWTPV*EIPRL*VDWIL* 1552
+ T*+ + +CKIS +S* S S EN ++ + + * W V + R W+L*
Sbjct: 4098 FNS*QT*HHLCSRCLCKISSQS*DKSLESSKENSEICKWHQ*LWDYVLSLFRFNAGWVL* 4277
Query: 1553 C*LCW**D*KKINQWKLSIPGRESDILGKQKTSNYCYVYSGSRVHFSCKLLHTTTLDETS 1612
C*L W *+K + W + + G +S + +Q+ +Y SRV+ S K L TT+LDE
Sbjct: 4278 C*LGWKCR*QKKHFWWMFLFGNQSYFMVQQEAELCVPIYC*SRVYCSRKQLFTTSLDEAD 4457
Query: 1613 VGRLSDQC*QYSHLL**YCCYLFVKESNSTFKSQAY*NQTPFYQRLC 1659
+ + + +L* + CY + +S ST ++QA+*+ T Y R C
Sbjct: 4458 AEGVQCRTRCHDIVL*QHECY*YF*KSCSTQQNQAH*H*TSLY*RSC 4598
Score = 123 bits (308), Expect = 8e-28
Identities = 125/477 (26%), Positives = 209/477 (43%), Gaps = 23/477 (4%)
Frame = +1
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKLYKKHHKI 55
WK M +F+ LD W + G + LD EG + + T + +L + K
Sbjct: 70 WKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDELALGNSKA 249
Query: 56 RGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDE 115
+ + + + ++ + AK + L EG+ KVK ++ +L ++E +MK++E
Sbjct: 250 LNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEE 429
Query: 116 SIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
I + + + + L + V KILRSLP R+ KVTAIEEA+D+ + V++
Sbjct: 430 CIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDE 609
Query: 176 LVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVK 235
L+ SL+ E+ L++ T KKSK++A S E E+ D D+DE +
Sbjct: 610 LIGSLQTFELGLSD-RTEKKSKNLAFVSN------------DEGEEDEYDLDTDEGLTNA 750
Query: 236 MAMLSNK----LEYLARKQK------KFLSKRGSYKNFKKEDQK-------GCFNCKKPG 278
+ +L + L + R+QK F ++GS + KK D+K C C+ G
Sbjct: 751 VVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGS-EYQKKSDEKPSHSKGIQCHGCEGYG 927
Query: 279 HFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKA 338
H A+CP K++ KG S S +D +SE SD +D D A
Sbjct: 928 HIKAECPTHLKKQRKGLSVCRS------------------DDTESEQESD---SDRDVNA 1044
Query: 339 AVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKY 398
G +ED ++ S+I EL S +EL E + LK+
Sbjct: 1045 LTGRF------------ESAEDSSDTDSEITFDELAISYRELCIKSEKILQQEAQLKKVI 1188
Query: 399 VDLMKQQKSTLLELKASEEELKG-FNLISATYEDRLKSLCQKLQEKCDKGSGNKHEI 454
+L ++++ E+ ELKG +++ E+ KS+ + +KGS E+
Sbjct: 1189 ANLEAEKEAHEEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKGSDMLDEV 1332
Score = 51.2 bits (121), Expect = 4e-06
Identities = 54/196 (27%), Positives = 96/196 (48%)
Frame = +3
Query: 873 C**LQQMDLGKIH*K*RLCM*SV*QLLHSNTI*KRIENFESQK*SWWRI*K*AI*TFL*K 932
C * Q+ LG+++ + +*S+ + + KR+ + E+Q+* W R+*K + L
Sbjct: 2334 CG*FLQIYLGQLYQREIRHL*SIQGVESKTSKRKRLCHQENQE*PWQRV*KQQVY*ILHI 2513
Query: 933 TWDSP*VFFS*NSTTKWGCREKEQNLTRNGQNHDP*KQLS*TFLGRSSQYFLLYSK*DLY 992
*V S +TTKW +++Q+ R+ H ++ S LG S ++ +L+ +
Sbjct: 2514 *RHHS*VLCSHYTTTKWHS*KEKQDFARSC*GHASCQRTSL*SLG*SHEHSMLHPQQSHT 2693
Query: 993 QTYVGENSL*TL*RKKTQYLLLSSVWMYLLHLKH*RLSEEI*CQGSKRNLFRLL*KVKGI 1052
+++* L R++ L +W +LH *R E+ Q RN+ +L K + I
Sbjct: 2694 *KRDSNHTV*NLEREEANCQALPHLWKSMLHFGR*RAKEKDGSQE*CRNILGILYKQQSI 2873
Query: 1053 QSV*FRNTMC*RIYAC 1068
S+ F+N C I+ C
Sbjct: 2874 *SIQFQNQNCDGIHQC 2921
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 140 bits (352), Expect = 6e-33
Identities = 138/467 (29%), Positives = 231/467 (48%)
Frame = +3
Query: 1193 N*AQNC*RSSLR*WMDISYARRAKSIPKE*CVGSGTQTFSEEHYWNKMGIQKQAE*TRRS 1252
N*AQ C R + R* +D YARR +I KE* +G+ + W+++ +Q+Q +* R
Sbjct: 3195 N*AQECERGTDR*VLDQCYARRIGAIQKE*SLGASS*A*GN*CDWHQVDLQEQNQ*RRCH 3374
Query: 1253 NQKQSQTCCSRLQSTRRH*LH*NICSSCKIGSNQVTSILRN*SWHNIISNGCQKCLS*WC 1312
NQKQ QT CSRL S R L *+ C SC +Q+ + + +GC++ +S W
Sbjct: 3375 NQKQGQTGCSRLHSD*RCRL**DFCPSC*T*VHQIITWCSLYPQIQAVPDGCEERISEWI 3554
Query: 1313 H*RRSVC*TTSWV*GS*AS*PCL*T*EITIWLETSSQSLV**TK*FLN*K*F*KRTS*HN 1372
RS+C + +S C+ E ++W+E SS+SLV* *+ +* +
Sbjct: 3555 PE*RSLCGAAKGICRPDSSRSCIQAQEGSLWIEASSKSLV*KANRVPYSARV*EGRN*QD 3734
Query: 1373 TLQKDS*ERYFDCANIC**YNIWFY*CISLQRIF*VNAG*I*NEYDGRIEILSGNSNHPK 1432
L + + DC +IC**+ +W +A *I*+E R ++ SG S+
Sbjct: 3735 PLCQTRC*KLDDCTDIC**HCVWRDVE*DASTFCSTDAI*I*DESCWRADLFSGTSSEAD 3914
Query: 1433 *RRSICSSNKIYKGASEEVQARRL*NDEHSNASNLHLKQRRYWNSSRPEALQRYDWFSVI 1492
I + ++ K +EV + + + +L + + +++Q++D I
Sbjct: 3915 GGLHIPLTKQVCKEHCQEVWDGECQS*KDTCTYSLEAVKG*SRHQC*SKSVQKHDRELTI 4094
Query: 1493 PHCI*T*YFIQCMLVCKISIRS*RISFNCS*ENLQVSERNN*SWTPV*EIPRL*VDWIL* 1552
+ T + + +CKIS +S S + S EN ++ + + * W V + + W+L*
Sbjct: 4095 FNS*QTRHHLCSRCLCKISSQSQDKSLDSSKENSEICKWH**LWDYVLSLFKSNAGWVL* 4274
Query: 1553 C*LCW**D*KKINQWKLSIPGRESDILGKQKTSNYCYVYSGSRVHFSCKLLHTTTLDETS 1612
C*L W *+K + W + + G++ + +Q+ +YS SRV+ S K L T +LDE
Sbjct: 4275 C*LGWKCR*QKKHFWWMLLFGKQPYFMVQQEAELCVPIYSRSRVYCSRKQLFTASLDEAD 4454
Query: 1613 VGRLSDQC*QYSHLL**YCCYLFVKESNSTFKSQAY*NQTPFYQRLC 1659
+ + + +L* + CY + +S ST ++QA+*+ T YQR C
Sbjct: 4455 AEGVQCRTRCHDIVL*QHECY*YF*KSCSTQQNQAH*H*TSLYQRSC 4595
Score = 128 bits (322), Expect = 2e-29
Identities = 125/478 (26%), Positives = 206/478 (42%), Gaps = 22/478 (4%)
Frame = +1
Query: 1 WKTNMYSFIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKLYKKHHKI 55
WK M +F+ LD W + G + LD EG D + T + +L + K
Sbjct: 70 WKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKEEDELALGNSKA 249
Query: 56 RGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDE 115
+ + + + ++ + AK + L EG+ KVK ++ +L ++E +MK++E
Sbjct: 250 LNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLATKFENLKMKEEE 429
Query: 116 SIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
I + + + + L + V KILRSLP R+ KVTAIEEA+D+ + V++
Sbjct: 430 CIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDE 609
Query: 176 LVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVK 235
L+ SL+ E+ L++ KKSK++A S E E+ D D+DE +
Sbjct: 610 LIGSLQTFELGLSD-RAEKKSKNLAFVSN------------DEGEEDEYDLDTDEGLTNA 750
Query: 236 MAMLSNK----LEYLARKQKKFL----------SKRGSYKNFKKEDQKG--CFNCKKPGH 279
+ +L + L + ++QK + SK + K KG C C+ GH
Sbjct: 751 VVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGIQCHGCEGYGH 930
Query: 280 FIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAA 339
IA+CP K+ KG S S D +SE SD +D D A
Sbjct: 931 IIAECPTHLKKHRKGLSVCQS-------------------DTESEQESD---SDRDVNAL 1044
Query: 340 VGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKYV 399
G+ T +ED ++ S+I EL S ++L E + LK+
Sbjct: 1045 TGIFET------------AEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIA 1188
Query: 400 DLMKQQKSTLLELKASEEELKG-FNLISATYEDRLKSLCQKLQEKCDKGSGNKHEIAL 456
DL ++++ E+ ELKG +++ E+ KS+ + +KGS E+ L
Sbjct: 1189 DLEAEKEAHKEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKGSDTLDEVLL 1335
Score = 49.3 bits (116), Expect = 1e-05
Identities = 54/196 (27%), Positives = 96/196 (48%)
Frame = +3
Query: 873 C**LQQMDLGKIH*K*RLCM*SV*QLLHSNTI*KRIENFESQK*SWWRI*K*AI*TFL*K 932
C * Q+ LGK++ + +*S+ ++ + +R+ + E+Q+* W RI*K + L
Sbjct: 2331 CG*FLQIYLGKLYQREIRNL*SIQRVESKTSKRERLCHQENQE*PWQRI*KQQVH*ILHI 2510
Query: 933 TWDSP*VFFS*NSTTKWGCREKEQNLTRNGQNHDP*KQLS*TFLGRSSQYFLLYSK*DLY 992
*V S +TT+W E++Q+ R H ++ S LG S ++ +L+ +
Sbjct: 2511 *RHHS*VLCSHYTTTEWDS*EEKQDFARGCSGHASCQRTSL*SLG*SHEHSMLHPQQSHT 2690
Query: 993 QTYVGENSL*TL*RKKTQYLLLSSVWMYLLHLKH*RLSEEI*CQGSKRNLFRLL*KVKGI 1052
+ + +* L R++ L +W +LHL *R ++ Q RN+ +L K + I
Sbjct: 2691 EKRDSNHPV*NLEREEAICQALPHLWKSMLHLGR*RAKKKDGSQE*CRNIPGILYKQQSI 2870
Query: 1053 QSV*FRNTMC*RIYAC 1068
S+ F+N I+ C
Sbjct: 2871 *SIQFQNQNSDGIHQC 2918
>NP004897 gag-protease polyprotein
Length = 1923
Score = 127 bits (319), Expect = 4e-29
Identities = 126/477 (26%), Positives = 210/477 (43%), Gaps = 23/477 (4%)
Frame = +1
Query: 1 WKTNMYSFIMGLDEELWD-ILEDGVDDLDLDEEGAAID----RKIHTPAQKKLYKKHHKI 55
WK M +F+ LD W +++D LD EG D + T + +L + K
Sbjct: 70 WKARMVAFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDGLKPEEDWTKEEDELALGNSKA 249
Query: 56 RGIIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDE 115
+ + + + ++ + AK + L EG+ KVK ++ +L ++E +MK++E
Sbjct: 250 LNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEE 429
Query: 116 SIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVED 175
I + + + + L + V KILRSLP R+ KVTAIEEA+D+ L V++
Sbjct: 430 CIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNLRVDE 609
Query: 176 LVSSLKVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVK 235
L+ SL+ E+ L++ T KKSK++A S E E+ D D+DE +
Sbjct: 610 LIGSLQTFELGLSD-RTEKKSKNLAFVSN------------DEGEEDEYDLDTDEGLTNA 750
Query: 236 MAMLSNK----LEYLARKQK------KFLSKRGSYKNFKKEDQK-------GCFNCKKPG 278
+ +L + L + R+QK F ++GS + K+ D+K C C+ G
Sbjct: 751 VVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGS-EYQKRSDEKPSHSKGFQCHGCEGYG 927
Query: 279 HFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKA 338
H A+CP K++ KG S S +D +SE SD +D D A
Sbjct: 928 HIKAECPTHLKKQRKGLSVCRS------------------DDTESEQESD---SDRDVNA 1044
Query: 339 AVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRTNELTDLKEKY 398
G +ED ++ S+I EL S +EL E + LK+
Sbjct: 1045 LTGRF------------ESAEDSSDTDSEITFDELATSYRELCIKSEKILQQEAQLKKVI 1188
Query: 399 VDLMKQQKSTLLELKASEEELKG-FNLISATYEDRLKSLCQKLQEKCDKGSGNKHEI 454
+L ++++ E+ ELKG +++ E+ KS+ + +KGS E+
Sbjct: 1189 ANLEAEKEAHEEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKGSDMLDEV 1332
>CF920770
Length = 581
Score = 110 bits (274), Expect = 7e-24
Identities = 62/187 (33%), Positives = 104/187 (55%), Gaps = 12/187 (6%)
Frame = -2
Query: 13 DEELWDILEDG------VDDLDLDEEGAAIDRKIHTPAQK------KLYKKHHKIRGIIV 60
D +W+ +E G V+ + +D ++ I P + K + + K + II
Sbjct: 574 DLNIWEAIEIGPYIPTTVERVSIDGSSSSESITIEKPRDRWSEEDRKRVQYNLKAKNIIT 395
Query: 61 ASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESIEEM 120
+++ EY ++S+ +AK M+ +L EG+ VK ++ L H+YELFRM +E+I+ M
Sbjct: 394 SALGMDEYFRVSNCKSAKEMWDTLRLTHEGTTDVKRSRINALTHEYELFRMNTNENIQSM 215
Query: 121 YSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
RF +V+ L L K + + D ++K+LR L W+PKVTAI E++DL+ +S+ L L
Sbjct: 214 QKRFTHIVNHLAALGKEFQNEDLINKVLRCLSREWQPKVTAISESRDLSNMSLATLFGKL 35
Query: 181 KVHEMSL 187
+ HEM L
Sbjct: 34 QEHEMEL 14
>AI959950
Length = 466
Score = 75.5 bits (184), Expect = 2e-13
Identities = 48/128 (37%), Positives = 72/128 (55%)
Frame = -3
Query: 1210 SYARRAKSIPKE*CVGSGTQTFSEEHYWNKMGIQKQAE*TRRSNQKQSQTCCSRLQSTRR 1269
S ARR S+ KE*C+ + T +E W++M I Q + QS+ C RL +T R
Sbjct: 392 SDARRT*SVSKE*CLEAR*ITKKKEGSWSEMDIL*QTRRGW*GCEIQSKISC*RLLTTGR 213
Query: 1270 H*LH*NICSSCKIGSNQVTSILRN*SWHNIISNGCQKCLS*WCH*RRSVC*TTSWV*GS* 1329
+ L N+C+ C SN + + N + ++SNGC+KC+S W + + S+C*TT+W+*
Sbjct: 212 YRLPKNLCTCCTFRSNMHLTFICNL**YEVVSNGCKKCISKWLNPKGSLC*TTAWI*K*N 33
Query: 1330 AS*PCL*T 1337
S C *T
Sbjct: 32 PSSTCF*T 9
>TC232995
Length = 1009
Score = 75.1 bits (183), Expect = 2e-13
Identities = 61/169 (36%), Positives = 91/169 (53%)
Frame = +1
Query: 1320 *TTSWV*GS*AS*PCL*T*EITIWLETSSQSLV**TK*FLN*K*F*KRTS*HNTLQKDS* 1379
*TT W * * + PCL* + ++W ETS +V* K*F +*K +R S ++ + K+
Sbjct: 4 *TTPWF*NF**TKPCL*ITKGSLWFETSP*GMV*TIK*FSS*KRILQR*SGYHIIHKEKA 183
Query: 1380 ERYFDCANIC**YNIWFY*CISLQRIF*VNAG*I*NEYDGRIEILSGNSNHPK*RRSICS 1439
YF +NIC**YN W +* +Q +F A *I*N DGR ++LSG +N R I
Sbjct: 184 **YFVGSNIC**YNFWIH**FIVQGVFP*YAK*I*NVNDGRTKVLSGITNQANSIRYIHQ 363
Query: 1440 SNKIYKGASEEVQARRL*NDEHSNASNLHLKQRRYWNSSRPEALQRYDW 1488
S +I +G +++ +++ L L+ R W+ R + + R W
Sbjct: 364 SIQILQGIDQKIWDG*CKTHVYTDEH*LLLR*R*IWSVYRHKTISRCYW 510
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 63.5 bits (153), Expect(2) = 1e-11
Identities = 64/176 (36%), Positives = 93/176 (52%)
Frame = +2
Query: 1388 IC**YNIWFY*CISLQRIF*VNAG*I*NEYDGRIEILSGNSNHPK*RRSICSSNKIYKGA 1447
+C**+N+W +Q +F*VN G I*NEY+ ++ SNH + S +IYK
Sbjct: 503 LC**HNLWCNLKKDVQGVF*VNEGWI*NEYER*AKVPPRTSNHSESLWDFYPSREIYKVP 682
Query: 1448 SEEVQARRL*NDEHSNASNLHLKQRRYWNSSRPEALQRYDWFSVIPHCI*T*YFIQCMLV 1507
S++VQ + AS + Q S + + YD F +I + *T Y + + +
Sbjct: 683 SKKVQNG*SQTYGNPYASFHNH*QG*ER*SYFIKGV*WYD*FFIIFNF**TRYCVCRLPL 862
Query: 1508 CKISIRS*RISFNCS*ENLQVSERNN*SWTPV*EIPRL*VDWIL*C*LCW**D*KK 1563
CKIS+ S S CS*++L++S N *S + V*E +* IL*C CW** K+
Sbjct: 863 CKISVLSKNFSCYCS*KDLKISCWNY*SLSMV*EKV*V*SFRIL*CLFCW**SRKE 1030
Score = 26.2 bits (56), Expect(2) = 1e-11
Identities = 17/42 (40%), Positives = 24/42 (56%)
Frame = +1
Query: 1342 IWLETSSQSLV**TK*FLN*K*F*KRTS*HNTLQKDS*ERYF 1383
+W ETSS+SLV* K + K +R + T+QK S + F
Sbjct: 364 LWFETSSKSLV*KAKFISSFKWIHQRNNGPRTIQKGSKRKPF 489
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 67.8 bits (164), Expect = 4e-11
Identities = 61/152 (40%), Positives = 81/152 (53%), Gaps = 4/152 (2%)
Frame = -2
Query: 1211 YARRAKSIPKE*CVGSGTQTFSEEHYWNKMGIQKQAE*TRRSNQKQSQTCCSRLQSTRRH 1270
+ARR +SI K+*CV + +T + NKMG K *T + K R+ S R +
Sbjct: 456 HARRTESI*KK*CVETSRKT*KLSCHRNKMGF*K*IR*TWHNY*K*G*ISSKRV*SRRGN 277
Query: 1271 *LH*NICSSCKIGSN-QVTSILRN*SWHN---IISNGCQKCLS*WCH*RRSVC*TTSWV* 1326
L NICSSCKI S+ SI HN +SNGC KC S W + RRS+C*TT +*
Sbjct: 276 RL*RNICSSCKIRSH*NAFSICI----HNEF*TLSNGC*KCFSKWFNSRRSIC*TTPRL* 109
Query: 1327 GS*AS*PCL*T*EITIWLETSSQSLV**TK*F 1358
+ CL* + ++W +TS +V* K F
Sbjct: 108 NPG*TNSCL*IAKGSLWFKTSP*GVV*TYKQF 13
>TC223814
Length = 607
Score = 60.1 bits (144), Expect = 8e-09
Identities = 41/101 (40%), Positives = 61/101 (59%), Gaps = 1/101 (0%)
Frame = +1
Query: 122 SRFQTLVSGLQILKKSYVS-SDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 180
S+ Q +++ L+ L K+ + DH++KIL+SL + RP V A+ ++KDL +L VE+ +L
Sbjct: 223 SKVQNIMNNLRSLSKT*DNHDDHITKILQSLLIQ*RP*VIALCDSKDLKSLPVEEFDGTL 402
Query: 181 KVHEMSLNEHETSKKSKSIALPSKGKTSKSSKAYKASESVE 221
+VHE+ L E E +K K IA SK KA K S S E
Sbjct: 403 QVHELELMEDEGQRKGKFIA-------SKVQKALKRSLSRE 504
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 50.8 bits (120), Expect = 5e-06
Identities = 37/86 (43%), Positives = 46/86 (53%)
Frame = +2
Query: 1200 RSSLR*WMDISYARRAKSIPKE*CVGSGTQTFSEEHYWNKMGIQKQAE*TRRSNQKQSQT 1259
RS R* +D +ARR K I K+ C+ +T YWNKMG K *T + K SQ
Sbjct: 20 RSHSR**LDHCHARRTKPI*KKQCMEISRKT*KLSCYWNKMGF*K*IR*TWYNY*K*SQV 199
Query: 1260 CCSRLQSTRRH*LH*NICSSCKIGSN 1285
R+ S R + L NICS CKI S+
Sbjct: 200 SSERI*SRRGNRL*RNICSCCKIRSH 277
>CF922226
Length = 667
Score = 50.8 bits (120), Expect = 5e-06
Identities = 51/197 (25%), Positives = 81/197 (40%), Gaps = 1/197 (0%)
Frame = -3
Query: 109 FRMKDDESIEEMYSRFQTLVSGLQILKKSYVSSDHVSKILRSLPSRWRPKVTAIEEAKDL 168
F+M +D S+ E F L+ L+ + + D +L LP + + +D
Sbjct: 614 FKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYLPKSYSHFKETLLFGRD- 438
Query: 169 NTLSVEDLVSSLKVHEMSLNEHETSKKSKS-IALPSKGKTSKSSKAYKASESVEESPDGD 227
++S++++ ++L E LNE + K S S L ++GKT K D
Sbjct: 437 -SVSLDEVQTALNSKE--LNERKEKKSSASGEGLTARGKTFKK----------------D 315
Query: 228 SDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDL 287
S+ D+ +KQK K G FK C++CKK GH CP+
Sbjct: 314 SEFDK---------------KKQKPENQKNGEGNIFKIR----CYHCKKEGHTRKVCPER 192
Query: 288 QKEKFKGKSKKSSFNSS 304
QK KK S N++
Sbjct: 191 QKNGGSNNRKKDSGNAA 141
>BM143109
Length = 415
Score = 46.2 bits (108), Expect = 1e-04
Identities = 44/121 (36%), Positives = 61/121 (50%)
Frame = +3
Query: 1331 S*PCL*T*EITIWLETSSQSLV**TK*FLN*K*F*KRTS*HNTLQKDS*ERYFDCANIC* 1390
+* CL*T + IW++TS LV* +* + F KR *+ E Y +IC*
Sbjct: 33 A*SCL*TEKGFIWIKTSP*GLV*TFE*ISFRQGFFKR*G*Y*PFYLKEIE*YTLSTDIC* 212
Query: 1391 *YNIWFY*CISLQRIF*VNAG*I*NEYDGRIEILSGNSNHPK*RRSICSSNKIYKGASEE 1450
*Y WF * SLQ+IF A *I*N D +++ S +N +I S KI + +
Sbjct: 213 *YYFWFN**FSLQKIFSRYAK*I*NVNDA*VKLFSWTTNQANKEWNIYQSIKILQRPDSQ 392
Query: 1451 V 1451
+
Sbjct: 393 I 395
>TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%)
Length = 1005
Score = 45.8 bits (107), Expect = 2e-04
Identities = 54/233 (23%), Positives = 87/233 (37%), Gaps = 1/233 (0%)
Frame = +3
Query: 189 EHETSKKSKSIALPSKGKTSKSSKAYKASESVEESPDGDSDEDQSVKMAMLSNKLEYLAR 248
+ + K K A KGK + +E DGD +E + K K E +
Sbjct: 237 KEKKKKYDKIDAXKVKGKEDDGKDEGNKEKKDKEKGDGDGEEKKEKKDKEKEKKKEKKDK 416
Query: 249 -KQKKFLSKRGSYKNFKKEDQKGCFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFR 307
++ L ++G KN + ED +G KK +KEK K KK K
Sbjct: 417 DEETDTLKEKG--KNDEGEDDEGNKKKKK--------DKKEKEKDHKKEKKDKEEGEKED 566
Query: 308 KQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVSSEAVSEAESDSEDENEVYSK 367
+++ S+ D+D E + E +D K E + + + +D+ E K
Sbjct: 567 SKVEVSV----RDIDIEEIKKEGEKEDKGKDG-------GKEVKEKKKKEDKDKKEKKKK 713
Query: 368 IPRQELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQKSTLLELKASEEELK 420
+ ++ L L E ++ L EK D+ +Q K E EE K
Sbjct: 714 VTGKDKTKDLSTLKQKLEKINGKIQPLLEKKADIERQIKEVEAEGHVVNEENK 872
>TC228567
Length = 1531
Score = 41.6 bits (96), Expect = 0.003
Identities = 70/342 (20%), Positives = 141/342 (40%), Gaps = 17/342 (4%)
Frame = +2
Query: 290 EKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATV--S 347
E+ K +K++ + ++ R SL + E+ S S K+ + + A V L A + S
Sbjct: 383 EELKLNIEKATSDVNRLRVA-SVSLKSKLEEEKSVLASLKQSEEKASAAVVNLQAELEKS 559
Query: 348 SEAVSEAESDSEDENEVYSKIPR--QELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQ 405
A++ + + E+ +++P+ Q+ E SL + EL + +E+ V+ K +
Sbjct: 560 RSAIAFIQMKENEAREMMTELPKKLQKASQEADEAKSLAQAAQAELIEAQEE-VEQAKAK 736
Query: 406 KSTL-LELKASEEELKGFNLISATYEDRLKSLCQKLQEKCDKGSGNKHEIALDDFIMAGI 464
STL L A+++E++ + D + +L EK + GNK++
Sbjct: 737 SSTLESSLLAAQKEIEAAKVAEMLARDAITAL-----EKSESAKGNKNDK---------- 871
Query: 465 DRSKVASMIYSTYKNKGKGIGYSEEKSK---EYSLKSYCDCIKDGLKSTFVPEGTNAKTA 521
D S + ++ Y + +EE++ E + + L+S E N + +
Sbjct: 872 DSSSMVTLTLEEYHELSRRAYKAEEQANARIEAATSQIQIARESELRSLEKLEELNEELS 1051
Query: 522 V--QSKPEASGSQAKITSKPENLKIKVMTKSDPKSQKIKILKRSE-------PVHQNLIK 572
V +S A+G+ K ++ ++ T + Q+ K + +E P H +
Sbjct: 1052 VRRESLKIATGNSEKANEGKLAVEHELRTWRAEQKQQEKATELNEQTSDPTEPAHDSS-S 1228
Query: 573 PESKIPKQKDQKNKAATASEKTIPKGVKPKVLNDQKPLSIHP 614
P+ K+P + A+ ++K KV+ HP
Sbjct: 1229 PKGKVPSNNTEAESASNKNKKKKKSSFPSKVVMFFAKKKTHP 1354
>AI855982
Length = 484
Score = 40.8 bits (94), Expect = 0.005
Identities = 43/123 (34%), Positives = 60/123 (47%)
Frame = +1
Query: 1200 RSSLR*WMDISYARRAKSIPKE*CVGSGTQTFSEEHYWNKMGIQKQAE*TRRSNQKQSQT 1259
RS R* +D +ARR +SI K+*CV + +T + NKMG+ K *T + T
Sbjct: 115 RSHSR**LDNCHARRTESI*KK*CVETSRKT**LSCHMNKMGL*K*IR*TSHNYYT*G*T 294
Query: 1260 CCSRLQSTRRH*LH*NICSSCKIGSNQVTSILRN*SWHNIISNGCQKCLS*WCH*RRSVC 1319
RL S+RR L IC C I S+ I+ + +S KC + W S+C
Sbjct: 295 SSRRL*SSRRTRL*TYICFYC*IISHYNAFIICIHNEFYTLSLCMCKCSTSWPTPT*SLC 474
Query: 1320 *TT 1322
*+T
Sbjct: 475 *ST 483
>AI966222
Length = 430
Score = 38.9 bits (89), Expect = 0.019
Identities = 24/66 (36%), Positives = 41/66 (61%)
Frame = +3
Query: 961 NGQNHDP*KQLS*TFLGRSSQYFLLYSK*DLYQTYVGENSL*TL*RKKTQYLLLSSVWMY 1020
+G NH * * LG S++Y +L S+ +LY+T++ ++SL* + KTQ+++ S +
Sbjct: 3 DG*NHAK**LNP*ALLG*SNEYCMLSSEQNLYKTHLEKDSL*IMEGTKTQHIIFLSF*V* 182
Query: 1021 LLHLKH 1026
+ H KH
Sbjct: 183 VFHYKH 200
>TC213445
Length = 705
Score = 38.9 bits (89), Expect = 0.019
Identities = 27/76 (35%), Positives = 42/76 (54%)
Frame = +3
Query: 1576 SDILGKQKTSNYCYVYSGSRVHFSCKLLHTTTLDETSVGRLSDQC*QYSHLL**YCCYLF 1635
S I+ K C + SR++F KLL T LDET+ L + * Y++ +* Y C
Sbjct: 450 SSIMA**KAK*CCLINCRSRIYFC*KLLCTNLLDETTTF*LWFET*SYTYPM*QYKCN*S 629
Query: 1636 VKESNSTFKSQAY*NQ 1651
+++S S ++AY*N+
Sbjct: 630 IQKSYSVL*NKAY*NK 677
Score = 30.4 bits (67), Expect = 6.8
Identities = 15/31 (48%), Positives = 21/31 (67%)
Frame = +1
Query: 1498 T*YFIQCMLVCKISIRS*RISFNCS*ENLQV 1528
T Y + C+ VCKIS +S RIS C *+N ++
Sbjct: 232 TSYNV*CLYVCKISSKSQRISPKCH*KNNEI 324
>TC204982 PIR|T06394|T06394 isoprenylated protein - soybean (fragment)
{Glycine max;} , complete
Length = 1230
Score = 37.4 bits (85), Expect = 0.056
Identities = 26/92 (28%), Positives = 38/92 (41%)
Frame = +3
Query: 525 KPEASGSQAKITSKPENLKIKVMTKSDPKSQKIKILKRSEPVHQNLIKPESKIPKQKDQK 584
KP+ +G + K KP K + K DP+ K K +P + K + K K++
Sbjct: 453 KPKPAGPEKKEAEKP---KAEPEKKKDPEKPKADPPKAEKPKTEPEKKKDGGGEKPKEEP 623
Query: 585 NKAATASEKTIPKGVKPKVLNDQKPLSIHPKV 616
K EK P KPK PL + P +
Sbjct: 624 EKKKDGGEKPKPGPEKPKDKPTPAPLPVQPHI 719
>TC206178 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (69%)
Length = 1138
Score = 36.2 bits (82), Expect = 0.12
Identities = 15/45 (33%), Positives = 29/45 (64%), Gaps = 8/45 (17%)
Frame = +2
Query: 251 KKFLSKRGSYKN--FKKEDQKG------CFNCKKPGHFIADCPDL 287
+K S R SY++ ++++ ++G C NCK+PGH+ +CP++
Sbjct: 260 RKIRSDRFSYRDAPYRRDSRRGFSRDNLCKNCKRPGHYARECPNV 394
Score = 31.6 bits (70), Expect = 3.0
Identities = 10/20 (50%), Positives = 14/20 (70%)
Frame = +2
Query: 267 DQKGCFNCKKPGHFIADCPD 286
++K C NC+K GH DCP+
Sbjct: 638 NEKACNNCRKTGHLARDCPN 697
Score = 31.2 bits (69), Expect = 4.0
Identities = 9/16 (56%), Positives = 13/16 (81%)
Frame = +2
Query: 271 CFNCKKPGHFIADCPD 286
C+NCK+PGH + CP+
Sbjct: 458 CWNCKEPGHMASSCPN 505
>TC205401 similar to GB|AAP37850.1|30725656|BT008491 At4g26630 {Arabidopsis
thaliana;} , partial (16%)
Length = 912
Score = 36.2 bits (82), Expect = 0.12
Identities = 50/215 (23%), Positives = 90/215 (41%), Gaps = 3/215 (1%)
Frame = +3
Query: 174 EDLVSSLKVHEMSLNEHETSKKSK--SIALPSKGKTSKSSKAYKASESVEESPDGDSDED 231
ED+ K + S + E++KKSK IA+P+K ++ K S ++ +S D D D
Sbjct: 36 EDIKEKKKHSKTSSTKKESAKKSKIEKIAVPNKSRSPPKRAPKKPSSNLSKS---DEDSD 206
Query: 232 QSVKMAMLSNKLEYLARKQKKFLSKRGSYKNFKKEDQ-KGCFNCKKPGHFIADCPDLQKE 290
+S K+ K E +++ +K S + +K + KG K + ++
Sbjct: 207 ESPKVFSRKKKNEKGGKQKTATPTKSASEEKTEKVTRGKG-----KKKEKSRPSDNQLRD 371
Query: 291 KFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVSSEA 350
K+ +FN++ F +KK DL S K ++ + ++ EA
Sbjct: 372 AICEILKEVNFNTATFTDILKKLAKQFDMDLTPRKASIKSMIQEE-------LTKLADEA 530
Query: 351 VSEAESDSEDENEVYSKIPRQELVDSLKELLSLFE 385
E + +++E S QE+ LK L+ E
Sbjct: 531 DDEDREEDAEKDEAPS--TGQEVEG*LKSRLNCLE 629
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.357 0.156 0.559
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,526,778
Number of Sequences: 63676
Number of extensions: 976711
Number of successful extensions: 11556
Number of sequences better than 10.0: 84
Number of HSP's better than 10.0 without gapping: 6502
Number of HSP's successfully gapped in prelim test: 547
Number of HSP's that attempted gapping in prelim test: 4523
Number of HSP's gapped (non-prelim): 7783
length of query: 1664
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1554
effective length of database: 5,635,272
effective search space: 8757212688
effective search space used: 8757212688
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 66 (30.0 bits)
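The statistics above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the BLAST output: it assumes the usual Karlin-Altschul conversions (bit score = (Lambda*S - ln K) / ln 2, and effective lengths obtained by subtracting the effective HSP length once per sequence), and the function names are hypothetical. All constants are copied from this report. Bit scores recomputed for individual alignments may differ slightly from the printed ones, because Lambda and K are shown rounded.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 1664          # "length of query"
DB_LEN = 12_639_632       # "length of database" (37,918,896 nucleotides / 3)
N_SEQS = 63_676           # "Number of sequences in database"
HSP_LEN = 110             # "effective HSP length"

UNGAPPED = (0.357, 0.156)   # (Lambda, K), ungapped
GAPPED = (0.267, 0.0410)    # (Lambda, K), gapped


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value to bits: Lambda * X / ln 2."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LEN, N_SEQS, HSP_LEN))
    print(round(bit_score(37, *UNGAPPED), 1))   # S1
    print(round(bit_score(66, *GAPPED), 1))     # S2
    print(round(dropoff_bits(14, UNGAPPED[0]), 1))  # X1
    print(round(dropoff_bits(38, GAPPED[0]), 1))    # X2
    print(round(dropoff_bits(64, GAPPED[0]), 1))    # X3
```

Run as a script, this should reproduce the effective lengths, the effective search space, and the bit values printed next to S1, S2, X1, X2 and X3.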
Medicago: description of AC147002.8