
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146649.9 - phase: 0 /pseudo
(1684 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 132 1e-30
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 129 1e-29
NP004897 gag-protease polyprotein 126 7e-29
CF920770 111 2e-24
AI959950 79 2e-14
TC232995 72 3e-12
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 69 2e-11
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 66 8e-11
TC223814 57 5e-08
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 52 2e-06
BM143109 47 5e-05
AI855982 47 9e-05
CF922226 45 2e-04
TC218783 similar to UP|Q40363 (Q40363) NuM1 protein, partial (19%) 44 6e-04
TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%) 41 0.004
TC218782 similar to UP|Q41042 (Q41042) Pisum sativum L. (clone n... 41 0.005
TC225033 similar to UP|O65758 (O65758) Histone H1, partial (81%) 38 0.043
TC213445 37 0.073
TC205401 similar to GB|AAP37850.1|30725656|BT008491 At4g26630 {A... 37 0.096
TC228340 similar to UP|Q9CJT2 (Q9CJT2) OppB, partial (5%) 36 0.16
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 132 bits (333), Expect = 1e-30
Identities = 142/522 (27%), Positives = 249/522 (47%)
Frame = +3
Query: 1142 GANHWKQKQS*ENKISLQTRRIFDRIAFNN*AKNC*RSSLR*WMDTSYARRAKSVSKK*C 1201
GA++ + KQ KI + N*A+ C R + * +D YARR ++ K+*
Sbjct: 3111 GADYRRSKQRSHYKIKGD*DCLQFMFCLQN*AQECERGTD**VLDQCYARRIGAIQKE*S 3290
Query: 1202 VGSGTQTQSEEHYWNKMGIQKQAE*TRRGNQKQS*TCCTRIQSTRRH*LY*NICSSCKIG 1261
+G+ + T+ W+++ +Q+Q +* R NQKQ TCC+R+ S R L *N C
Sbjct: 3291 LGASS*TRGN*CDWHQVDLQEQNQ*RRCYNQKQGQTCCSRLHSD*RCRL**NFRPCC*T* 3470
Query: 1262 SNQVTSILCN*SWHNIISNGCQKCFS*WCH*RRSVCQTTSWV*GS*AS*PCL*T**ITIR 1321
+Q+ + + +GC++ S W RS+C + S +S C+ ++
Sbjct: 3471 VHQIVTWCSLHPQIQAVPDGCEERVSEWIPE*RSLCGAAKGICRSNSSRSCIQAQEGSLW 3650
Query: 1322 LETSSQSLV**TK*FLN*K*F*KRTS*HNTFQKDS*ERHFDCANIC**YNIWFY*CISLQ 1381
+E SS+SLV* *+ +* ++ + + D +IC**+ +W
Sbjct: 3651 IEASSKSLV*KANRVPYSARV*EGRN*QDSLCQTRC*KLDDSTDIC**HCVWRDVE*DAS 3830
Query: 1382 GIF*VNAG*I*NEHDGRIEVLSWNSNQPKQRRSICSSNKIYKGASEEVQTRRL*SDEHSN 1441
+A *I*+E R ++ S ++ R I + ++ K +EV + +++
Sbjct: 3831 TFCPTDAI*I*DESCWRADLFSGTPSEADGRLHIPLTKQVCKEHCQEVWDGKCQP*KNTC 4010
Query: 1442 ASNLHLKQRRYWNCSRPEAIQRYDWFSVIPHCI*T*YFIQRMLVCKISIRS*RISFNCS* 1501
+L +R W+ +++Q++DW I + T*+ + +CKIS +S* S S
Sbjct: 4011 TYSLEAVKR*SWHQC*SKSVQKHDWELTIFNS*QT*HHLCSRCLCKISSQS*DKSLESSK 4190
Query: 1502 KNLQVSERNN*SWTPV*EIPRLQVDWIL*C*LCW*QD*KKINQ*KLSIPGRESDILGKQK 1561
+N ++ + + * W V + R W+L*C*L W *+K + + + G +S + +Q+
Sbjct: 4191 ENSEICKWHQ*LWDYVLSLFRFNAGWVL*C*LGWKCR*QKKHFWWMFLFGNQSYFMVQQE 4370
Query: 1562 TSNYCYVYSKSRIHFSCKLLYTTTLDETSVGRLSD*C*QYSHLL**YCCYLFVKESNSTF 1621
+Y SR++ S K L+TT+LDE + + +L* + CY + +S ST
Sbjct: 4371 AELCVPIYC*SRVYCSRKQLFTTSLDEADAEGVQCRTRCHDIVL*QHECY*YF*KSCSTQ 4550
Query: 1622 KSQAY*DQTPFYQRLCSKRNFRYTIH*Y*ASMG*YIYKAFIC 1663
++QA+* T Y R C ++ *+* + Y +K C
Sbjct: 4551 QNQAH*H*TSLY*RSC***SYHTGAC*H*GTNSRYFHKGIGC 4676
Score = 120 bits (301), Expect = 5e-27
Identities = 97/388 (25%), Positives = 174/388 (44%), Gaps = 25/388 (6%)
Frame = +1
Query: 18 DPEEFSWWKTNMYSYIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKT 72
D + +WK M +++ LD W + G + LD EG + + T + +
Sbjct: 49 DGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDEL 228
Query: 73 YKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANYEGSKKVREAKALMLVHQYEL 132
+ K + + + + ++ + AK + L +EG+ KV+ ++ +L ++E
Sbjct: 229 ALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFEN 408
Query: 133 FKMKDDESIEEMYSRFQTLVSGLQILKKSYVTSDLVSKILRSLPSRWRPKVTAIEEAKDL 192
KMK++E I + + + + L + LV KILRSLP R+ KVTAIEEA+D+
Sbjct: 409 LKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDI 588
Query: 193 NTLSVEDLVSSLKVHEMSLNEHESSKKSKSIALPSKGKSSKSSKAYKASESEEESPDGDS 252
+ V++L+ SL+ E+ L++ + KKSK++A S E EE+ D D+
Sbjct: 589 CNMRVDELIGSLQTFELGLSD-RTEKKSKNLAFVSN------------DEGEEDEYDLDT 729
Query: 253 DEDHSVKMAMLSNK----LEYLARKQKKFLSK-----RNGYKNWKREDQK-------GCF 296
DE + + +L + L + R+QK + R G + K+ D+K C
Sbjct: 730 DEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKKSDEKPSHSKGIQCH 909
Query: 297 NCKKPGHFIADCPDLQKEKSKS----RPKKPSFSSSKFRKQIKKSLMATWEDLDSESGSD 352
C+ GH A+CP K++ K R + +L +E + S +D
Sbjct: 910 GCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFESAEDSSDTD 1089
Query: 353 KEEADDDAKAAMRLVATVSSEAVSESES 380
E D+ + R + + SE + + E+
Sbjct: 1090SEITFDELAISYRELC-IKSEKILQQEA 1170
Score = 40.8 bits (94), Expect = 0.005
Identities = 43/157 (27%), Positives = 74/157 (46%)
Frame = +3
Query: 884 KRIENFESQK*SWWRI*K*AI*IFL*KTWDSP*IFFS*NSTKKWGCREKEQNFTRNGQNH 943
KR+ + E+Q+* W R+*K + L *+ S +T KW +++Q+F R+ H
Sbjct: 2433 KRLCHQENQE*PWQRV*KQQVY*ILHI*RHHS*VLCSHYTTTKWHS*KEKQDFARSC*GH 2612
Query: 944 DP*K*FRQTFMGRSS*YFMLYSK*DLYQTYVGENYI*TL*RKKTQYLLLSSVWMYLLHSK 1003
+ +G S + ML+ + + +* L R++ L +W +LH
Sbjct: 2613 ASCQRTSL*SLG*SHEHSMLHPQQSHT*KRDSNHTV*NLEREEANCQALPHLWKSMLHFG 2792
Query: 1004 H*RLSEEI*CQGSKRNLFRLL*KVKSVQSV*FRNTLC 1040
*R E+ Q RN+ +L K +S+ S+ F+N C
Sbjct: 2793 R*RAKEKDGSQE*CRNILGILYKQQSI*SIQFQNQNC 2903
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 129 bits (324), Expect = 1e-29
Identities = 106/405 (26%), Positives = 183/405 (45%), Gaps = 31/405 (7%)
Frame = +1
Query: 18 DPEEFSWWKTNMYSYIMGLDEELWDILEDGVDDLD-LDEEGAAIDR----KIHTPAQKKT 72
D + +WK M +++ LD W + G + LD EG D + T + +
Sbjct: 49 DGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKEEDEL 228
Query: 73 YKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANYEGSKKVREAKALMLVHQYEL 132
+ K + + + + ++ + AK + L +EG+ KV+ ++ +L ++E
Sbjct: 229 ALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLATKFEN 408
Query: 133 FKMKDDESIEEMYSRFQTLVSGLQILKKSYVTSDLVSKILRSLPSRWRPKVTAIEEAKDL 192
KMK++E I + + + + L + LV KILRSLP R+ KVTAIEEA+D+
Sbjct: 409 LKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDI 588
Query: 193 NTLSVEDLVSSLKVHEMSLNEHESSKKSKSIALPSKGKSSKSSKAYKASESEEESPDGDS 252
+ V++L+ SL+ E+ L++ + KKSK++A S E EE+ D D+
Sbjct: 589 CNMRVDELIGSLQTFELGLSD-RAEKKSKNLAFVSN------------DEGEEDEYDLDT 729
Query: 253 DEDHSVKMAMLSNK----LEYLARKQKKFLSK-----RNGYKNWKREDQK-------GCF 296
DE + + +L + L + ++QK + R G K KR D K C
Sbjct: 730 DEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGIQCH 909
Query: 297 NCKKPGHFIADCPDLQKEKSKSRPKKPSFSSSKFRKQIKK---SLMATWEDLDSESGSDK 353
C+ GH IA+CP K+ K S + S+ + +L +E + S +D
Sbjct: 910 GCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGIFETAEDSSDTDS 1089
Query: 354 EEADDDAKAAMRLVATVSSEAVSESE-------SDSEDENEVYSK 391
E D+ A+ R + + SE + + E +D E E E + +
Sbjct: 1090EITFDELAASYRKLC-IKSEKILQQEAQLKKVIADLEAEKEAHKE 1221
Score = 120 bits (300), Expect = 7e-27
Identities = 139/522 (26%), Positives = 246/522 (46%)
Frame = +3
Query: 1142 GANHWKQKQS*ENKISLQTRRIFDRIAFNN*AKNC*RSSLR*WMDTSYARRAKSVSKK*C 1201
GA++ + KQ KI R+ N*A+ C R + R* +D YARR ++ K+*
Sbjct: 3108 GADYRRSKQRGHYKIKGG*DRLKLMFCLQN*AQECERGTDR*VLDQCYARRIGAIQKE*S 3287
Query: 1202 VGSGTQTQSEEHYWNKMGIQKQAE*TRRGNQKQS*TCCTRIQSTRRH*LY*NICSSCKIG 1261
+G+ + W+++ +Q+Q +* R NQKQ T C+R+ S R L *+ C SC
Sbjct: 3288 LGASS*A*GN*CDWHQVDLQEQNQ*RRCHNQKQGQTGCSRLHSD*RCRL**DFCPSC*T* 3467
Query: 1262 SNQVTSILCN*SWHNIISNGCQKCFS*WCH*RRSVCQTTSWV*GS*AS*PCL*T**ITIR 1321
+Q+ + + +GC++ S W RS+C + +S C+ ++
Sbjct: 3468 VHQIITWCSLYPQIQAVPDGCEERISEWIPE*RSLCGAAKGICRPDSSRSCIQAQEGSLW 3647
Query: 1322 LETSSQSLV**TK*FLN*K*F*KRTS*HNTFQKDS*ERHFDCANIC**YNIWFY*CISLQ 1381
+E SS+SLV* *+ +* + + + DC +IC**+ +W
Sbjct: 3648 IEASSKSLV*KANRVPYSARV*EGRN*QDPLCQTRC*KLDDCTDIC**HCVWRDVE*DAS 3827
Query: 1382 GIF*VNAG*I*NEHDGRIEVLSWNSNQPKQRRSICSSNKIYKGASEEVQTRRL*SDEHSN 1441
+A *I*+E R ++ S S++ I + ++ K +EV S + +
Sbjct: 3828 TFCSTDAI*I*DESCWRADLFSGTSSEADGGLHIPLTKQVCKEHCQEVWDGECQS*KDTC 4007
Query: 1442 ASNLHLKQRRYWNCSRPEAIQRYDWFSVIPHCI*T*YFIQRMLVCKISIRS*RISFNCS* 1501
+L + + +++Q++D I + T + + +CKIS +S S + S
Sbjct: 4008 TYSLEAVKG*SRHQC*SKSVQKHDRELTIFNS*QTRHHLCSRCLCKISSQSQDKSLDSSK 4187
Query: 1502 KNLQVSERNN*SWTPV*EIPRLQVDWIL*C*LCW*QD*KKINQ*KLSIPGRESDILGKQK 1561
+N ++ + + * W V + + W+L*C*L W *+K + + + G++ + +Q+
Sbjct: 4188 ENSEICKWH**LWDYVLSLFKSNAGWVL*C*LGWKCR*QKKHFWWMLLFGKQPYFMVQQE 4367
Query: 1562 TSNYCYVYSKSRIHFSCKLLYTTTLDETSVGRLSD*C*QYSHLL**YCCYLFVKESNSTF 1621
+YS+SR++ S K L+T +LDE + + +L* + CY + +S ST
Sbjct: 4368 AELCVPIYSRSRVYCSRKQLFTASLDEADAEGVQCRTRCHDIVL*QHECY*YF*KSCSTQ 4547
Query: 1622 KSQAY*DQTPFYQRLCSKRNFRYTIH*Y*ASMG*YIYKAFIC 1663
++QA+* T YQR C + *+* + Y +K F C
Sbjct: 4548 QNQAH*H*TSLYQRSC***SDHTEAC*H*GTNSRYFHKGFGC 4673
Score = 38.9 bits (89), Expect = 0.019
Identities = 48/187 (25%), Positives = 89/187 (46%)
Frame = +3
Query: 851 C**LQHMDLGKIH*K*RLCM*SV*QLLRSNTI*KRIENFESQK*SWWRI*K*AI*IFL*K 910
C * + LGK++ + +*S+ ++ + +R+ + E+Q+* W RI*K + L
Sbjct: 2331 CG*FLQIYLGKLYQREIRNL*SIQRVESKTSKRERLCHQENQE*PWQRI*KQQVH*ILHI 2510
Query: 911 TWDSP*IFFS*NSTKKWGCREKEQNFTRNGQNHDP*K*FRQTFMGRSS*YFMLYSK*DLY 970
*+ S +T +W E++Q+F R H + +G S + ML+ +
Sbjct: 2511 *RHHS*VLCSHYTTTEWDS*EEKQDFARGCSGHASCQRTSL*SLG*SHEHSMLHPQQSHT 2690
Query: 971 QTYVGENYI*TL*RKKTQYLLLSSVWMYLLHSKH*RLSEEI*CQGSKRNLFRLL*KVKSV 1030
+ + +* L R++ L +W +LH *R ++ Q RN+ +L K +S+
Sbjct: 2691 EKRDSNHPV*NLEREEAICQALPHLWKSMLHLGR*RAKKKDGSQE*CRNIPGILYKQQSI 2870
Query: 1031 QSV*FRN 1037
S+ F+N
Sbjct: 2871 *SIQFQN 2891
>NP004897 gag-protease polyprotein
Length = 1923
Score = 126 bits (317), Expect = 7e-29
Identities = 100/388 (25%), Positives = 175/388 (44%), Gaps = 25/388 (6%)
Frame = +1
Query: 18 DPEEFSWWKTNMYSYIMGLDEELWD-ILEDGVDDLDLDEEGAAID----RKIHTPAQKKT 72
D + +WK M +++ LD W +++D LD EG D + T + +
Sbjct: 49 DGTNYEYWKARMVAFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDGLKPEEDWTKEEDEL 228
Query: 73 YKKHHKIRGIIVASIPRTEYMKMSDKSTAKAMFASLCANYEGSKKVREAKALMLVHQYEL 132
+ K + + + + ++ + AK + L +EG+ KV+ ++ +L ++E
Sbjct: 229 ALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFEN 408
Query: 133 FKMKDDESIEEMYSRFQTLVSGLQILKKSYVTSDLVSKILRSLPSRWRPKVTAIEEAKDL 192
KMK++E I + + + + L + LV KILRSLP R+ KVTAIEEA+D+
Sbjct: 409 LKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDI 588
Query: 193 NTLSVEDLVSSLKVHEMSLNEHESSKKSKSIALPSKGKSSKSSKAYKASESEEESPDGDS 252
L V++L+ SL+ E+ L++ + KKSK++A S E EE+ D D+
Sbjct: 589 CNLRVDELIGSLQTFELGLSD-RTEKKSKNLAFVSN------------DEGEEDEYDLDT 729
Query: 253 DEDHSVKMAMLSNK----LEYLARKQKKFLSK-----RNGYKNWKREDQK-------GCF 296
DE + + +L + L + R+QK + R G + KR D+K C
Sbjct: 730 DEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKRSDEKPSHSKGFQCH 909
Query: 297 NCKKPGHFIADCPDLQKEKSKS----RPKKPSFSSSKFRKQIKKSLMATWEDLDSESGSD 352
C+ GH A+CP K++ K R + +L +E + S +D
Sbjct: 910 GCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFESAEDSSDTD 1089
Query: 353 KEEADDDAKAAMRLVATVSSEAVSESES 380
E D+ + R + + SE + + E+
Sbjct: 1090SEITFDELATSYRELC-IKSEKILQQEA 1170
>CF920770
Length = 581
Score = 111 bits (278), Expect = 2e-24
Identities = 61/187 (32%), Positives = 105/187 (55%), Gaps = 12/187 (6%)
Frame = -2
Query: 37 DEELWDILEDG------VDDLDLDEEGAAIDRKIHTPAQK------KTYKKHHKIRGIIV 84
D +W+ +E G V+ + +D ++ I P + K + + K + II
Sbjct: 574 DLNIWEAIEIGPYIPTTVERVSIDGSSSSESITIEKPRDRWSEEDRKRVQYNLKAKNIIT 395
Query: 85 ASIPRTEYMKMSDKSTAKAMFASLCANYEGSKKVREAKALMLVHQYELFKMKDDESIEEM 144
+++ EY ++S+ +AK M+ +L +EG+ V+ ++ L H+YELF+M +E+I+ M
Sbjct: 394 SALGMDEYFRVSNCKSAKEMWDTLRLTHEGTTDVKRSRINALTHEYELFRMNTNENIQSM 215
Query: 145 YSRFQTLVSGLQILKKSYVTSDLVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 204
RF +V+ L L K + DL++K+LR L W+PKVTAI E++DL+ +S+ L L
Sbjct: 214 QKRFTHIVNHLAALGKEFQNEDLINKVLRCLSREWQPKVTAISESRDLSNMSLATLFGKL 35
Query: 205 KVHEMSL 211
+ HEM L
Sbjct: 34 QEHEMEL 14
>AI959950
Length = 466
Score = 79.0 bits (193), Expect = 2e-14
Identities = 49/128 (38%), Positives = 73/128 (56%)
Frame = -3
Query: 1188 SYARRAKSVSKK*CVGSGTQTQSEEHYWNKMGIQKQAE*TRRGNQKQS*TCCTRIQSTRR 1247
S ARR SVSK+*C+ + T+ +E W++M I Q G + QS C R+ +T R
Sbjct: 392 SDARRT*SVSKE*CLEAR*ITKKKEGSWSEMDIL*QTRRGW*GCEIQSKISC*RLLTTGR 213
Query: 1248 H*LY*NICSSCKIGSNQVTSILCN*SWHNIISNGCQKCFS*WCH*RRSVCQTTSWV*GS* 1307
+ L N+C+ C SN + +CN + ++SNGC+KC S W + + S+C TT+W+*
Sbjct: 212 YRLPKNLCTCCTFRSNMHLTFICNL**YEVVSNGCKKCISKWLNPKGSLC*TTAWI*K*N 33
Query: 1308 AS*PCL*T 1315
S C *T
Sbjct: 32 PSSTCF*T 9
>TC232995
Length = 1009
Score = 71.6 bits (174), Expect = 3e-12
Identities = 61/168 (36%), Positives = 90/168 (53%)
Frame = +1
Query: 1299 TTSWV*GS*AS*PCL*T**ITIRLETSSQSLV**TK*FLN*K*F*KRTS*HNTFQKDS*E 1358
TT W * * + PCL* ++ ETS +V* K*F +*K +R S ++ K+
Sbjct: 7 TTPWF*NF**TKPCL*ITKGSLWFETSP*GMV*TIK*FSS*KRILQR*SGYHIIHKEKA* 186
Query: 1359 RHFDCANIC**YNIWFY*CISLQGIF*VNAG*I*NEHDGRIEVLSWNSNQPKQRRSICSS 1418
+F +NIC**YN W +* +QG+F A *I*N +DGR +VLS +NQ R I S
Sbjct: 187 *YFVGSNIC**YNFWIH**FIVQGVFP*YAK*I*NVNDGRTKVLSGITNQANSIRYIHQS 366
Query: 1419 NKIYKGASEEVQTRRL*SDEHSNASNLHLKQRRYWNCSRPEAIQRYDW 1466
+I +G +++ + +++ L L+ R W+ R + I R W
Sbjct: 367 IQILQGIDQKIWDG*CKTHVYTDEH*LLLR*R*IWSVYRHKTISRCYW 510
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 68.9 bits (167), Expect = 2e-11
Identities = 48/112 (42%), Positives = 62/112 (54%)
Frame = -2
Query: 1189 YARRAKSVSKK*CVGSGTQTQSEEHYWNKMGIQKQAE*TRRGNQKQS*TCCTRIQSTRRH 1248
+ARR +S+ KK*CV + +T + NKMG K *T K * R+ S R +
Sbjct: 456 HARRTESI*KK*CVETSRKT*KLSCHRNKMGF*K*IR*TWHNY*K*G*ISSKRV*SRRGN 277
Query: 1249 *LY*NICSSCKIGSNQVTSILCN*SWHNIISNGCQKCFS*WCH*RRSVCQTT 1300
L NICSSCKI S+ +C + +SNGC KCFS W + RRS+C TT
Sbjct: 276 RL*RNICSSCKIRSH*NAFSICIHNEF*TLSNGC*KCFSKWFNSRRSIC*TT 121
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 65.9 bits (159), Expect(2) = 8e-11
Identities = 64/177 (36%), Positives = 95/177 (53%), Gaps = 6/177 (3%)
Frame = +2
Query: 1366 IC**YNIWFY*CISLQGIF*VNAG*I*NEHDGRIEVLSWNSNQPKQRRSICSSNKIYKGA 1425
+C**+N+W +QG+F*VN G I*NE++ +V SN + S +IYK
Sbjct: 503 LC**HNLWCNLKKDVQGVF*VNEGWI*NEYER*AKVPPRTSNHSESLWDFYPSREIYKVP 682
Query: 1426 SEEVQTRRL*SDEHSNA-SNLHL-----KQRRYWNCSRPEAIQRYDWFSVIPHCI*T*YF 1479
S++VQ *S + N ++ H ++ Y+ + + YD F +I + *T Y
Sbjct: 683 SKKVQNG--*SQTYGNPYASFHNH*QG*ER*SYFI----KGV*WYD*FFIIFNF**TRYC 844
Query: 1480 IQRMLVCKISIRS*RISFNCS*KNLQVSERNN*SWTPV*EIPRLQVDWIL*C*LCW* 1536
+ R+ +CKIS+ S S CS*K+L++S N *S + V*E + IL*C CW*
Sbjct: 845 VCRLPLCKISVLSKNFSCYCS*KDLKISCWNY*SLSMV*EKV*V*SFRIL*CLFCW* 1015
Score = 20.8 bits (42), Expect(2) = 8e-11
Identities = 16/39 (41%), Positives = 21/39 (53%)
Frame = +1
Query: 1323 ETSSQSLV**TK*FLN*K*F*KRTS*HNTFQKDS*ERHF 1361
ETSS+SLV* K + K +R + T QK S + F
Sbjct: 373 ETSSKSLV*KAKFISSFKWIHQRNNGPRTIQKGSKRKPF 489
>TC223814
Length = 607
Score = 57.4 bits (137), Expect = 5e-08
Identities = 40/101 (39%), Positives = 59/101 (57%), Gaps = 1/101 (0%)
Frame = +1
Query: 146 SRFQTLVSGLQILKKSYVT-SDLVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 204
S+ Q +++ L+ L K+ D ++KIL+SL + RP V A+ ++KDL +L VE+ +L
Sbjct: 223 SKVQNIMNNLRSLSKT*DNHDDHITKILQSLLIQ*RP*VIALCDSKDLKSLPVEEFDGTL 402
Query: 205 KVHEMSLNEHESSKKSKSIALPSKGKSSKSSKAYKASESEE 245
+VHE+ L E E +K K IA SK KA K S S E
Sbjct: 403 QVHELELMEDEGQRKGKFIA-------SKVQKALKRSLSRE 504
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 52.0 bits (123), Expect = 2e-06
Identities = 42/105 (40%), Positives = 52/105 (49%)
Frame = +2
Query: 1178 RSSLR*WMDTSYARRAKSVSKK*CVGSGTQTQSEEHYWNKMGIQKQAE*TRRGNQKQS*T 1237
RS R* +D +ARR K + KK C+ +T YWNKMG K *T K S
Sbjct: 20 RSHSR**LDHCHARRTKPI*KKQCMEISRKT*KLSCYWNKMGF*K*IR*TWYNY*K*SQV 199
Query: 1238 CCTRIQSTRRH*LY*NICSSCKIGSNQVTSILCN*SWHNIISNGC 1282
RI S R + L NICS CKI S+ +C + +SNGC
Sbjct: 200 SSERI*SRRGNRL*RNICSCCKIRSH*NAFGICIHNEL*TLSNGC 334
>BM143109
Length = 415
Score = 47.4 bits (111), Expect = 5e-05
Identities = 45/121 (37%), Positives = 63/121 (51%)
Frame = +3
Query: 1309 S*PCL*T**ITIRLETSSQSLV**TK*FLN*K*F*KRTS*HNTFQKDS*ERHFDCANIC* 1368
+* CL*T I ++TS LV* +* + F KR *+ F E + +IC*
Sbjct: 33 A*SCL*TEKGFIWIKTSP*GLV*TFE*ISFRQGFFKR*G*Y*PFYLKEIE*YTLSTDIC* 212
Query: 1369 *YNIWFY*CISLQGIF*VNAG*I*NEHDGRIEVLSWNSNQPKQRRSICSSNKIYKGASEE 1428
*Y WF * SLQ IF A *I*N +D +++ SW +NQ + +I S KI + +
Sbjct: 213 *YYFWFN**FSLQKIFSRYAK*I*NVNDA*VKLFSWTTNQANKEWNIYQSIKILQRPDSQ 392
Query: 1429 V 1429
+
Sbjct: 393 I 395
>AI855982
Length = 484
Score = 46.6 bits (109), Expect = 9e-05
Identities = 43/123 (34%), Positives = 60/123 (47%)
Frame = +1
Query: 1178 RSSLR*WMDTSYARRAKSVSKK*CVGSGTQTQSEEHYWNKMGIQKQAE*TRRGNQKQS*T 1237
RS R* +D +ARR +S+ KK*CV + +T + NKMG+ K *T *T
Sbjct: 115 RSHSR**LDNCHARRTESI*KK*CVETSRKT**LSCHMNKMGL*K*IR*TSHNYYT*G*T 294
Query: 1238 CCTRIQSTRRH*LY*NICSSCKIGSNQVTSILCN*SWHNIISNGCQKCFS*WCH*RRSVC 1297
R+ S+RR L IC C I S+ I+C + +S KC + W S+C
Sbjct: 295 SSRRL*SSRRTRL*TYICFYC*IISHYNAFIICIHNEFYTLSLCMCKCSTSWPTPT*SLC 474
Query: 1298 QTT 1300
+T
Sbjct: 475 *ST 483
>CF922226
Length = 667
Score = 45.4 bits (106), Expect = 2e-04
Identities = 48/193 (24%), Positives = 82/193 (41%), Gaps = 1/193 (0%)
Frame = -3
Query: 133 FKMKDDESIEEMYSRFQTLVSGLQILKKSYVTSDLVSKILRSLPSRWRPKVTAIEEAKDL 192
FKM +D S+ E F L+ L+ + + D +L LP + + +D
Sbjct: 614 FKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYLPKSYSHFKETLLFGRD- 438
Query: 193 NTLSVEDLVSSLKVHEMSLNEHESSKKSKS-IALPSKGKSSKSSKAYKASESEEESPDGD 251
++S++++ ++L E LNE + K S S L ++GK+ K D
Sbjct: 437 -SVSLDEVQTALNSKE--LNERKEKKSSASGEGLTARGKTFKK----------------D 315
Query: 252 SDEDHSVKMAMLSNKLEYLARKQKKFLSKRNGYKNWKREDQKGCFNCKKPGHFIADCPDL 311
S+ D +K++K +++NG N + C++CKK GH CP+
Sbjct: 314 SEFD----------------KKKQKPENQKNGEGNIFKIR---CYHCKKEGHTRKVCPER 192
Query: 312 QKEKSKSRPKKPS 324
QK + KK S
Sbjct: 191 QKNGGSNNRKKDS 153
>TC218783 similar to UP|Q40363 (Q40363) NuM1 protein, partial (19%)
Length = 870
Score = 43.9 bits (102), Expect = 6e-04
Identities = 52/196 (26%), Positives = 78/196 (39%), Gaps = 19/196 (9%)
Frame = +3
Query: 210 SLNEHESSKKSKSIALPSKGKSSK----SSKAYK----ASESEEESPDGDSDEDHSVKMA 261
S ++ E K + +A+PSK +S+K S+ A K AS S ES D +SDED + K
Sbjct: 84 SSSDSEDEKPAAKVAVPSKNQSAKNGTLSTLAKKGKPAASSSSSESSDDNSDEDEAPKTK 263
Query: 262 MLSNKLEYLARKQKKFLSKRNGYKNWKREDQKGCFNCKKPGHFIADCPDLQKEKSKSRPK 321
+ + +NG+ + K+ Q +S
Sbjct: 264 VAP-------------AAGKNGHASTKKT---------------------QPSESSDSDS 341
Query: 322 KPSFSSSKFRKQIKKSLMATW-----------EDLDSESGSDKEEADDDAKAAMRLVATV 370
S SSS K KK A E D ES S+ + D+DAK A+ V+
Sbjct: 342 SDSDSSSDEGKSKKKPTTAKLPTLPVAPAKKVESSDDES-SESSDEDNDAKPAVTAVSKP 518
Query: 371 SSEAVSESESDSEDEN 386
S+ A + ES D++
Sbjct: 519 SARAQKKVESSDSDDS 566
>TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%)
Length = 1005
Score = 41.2 bits (95), Expect = 0.004
Identities = 43/192 (22%), Positives = 71/192 (36%)
Frame = +3
Query: 213 EHESSKKSKSIALPSKGKSSKSSKAYKASESEEESPDGDSDEDHSVKMAMLSNKLEYLAR 272
+ + K K A KGK + ++E DGD +E K K E +
Sbjct: 237 KEKKKKYDKIDAXKVKGKEDDGKDEGNKEKKDKEKGDGDGEEKKEKKDKEKEKKKEKKDK 416
Query: 273 KQKKFLSKRNGYKNWKREDQKGCFNCKKPGHFIADCPDLQKEKSKSRPKKPSFSSSKFRK 332
++ K G KN + ED +G KK +KEK + KK K
Sbjct: 417 DEETDTLKEKG-KNDEGEDDEGNKKKKK--------DKKEKEKDHKKEKKDKEEGEKEDS 569
Query: 333 QIKKSLMATWEDLDSESGSDKEEADDDAKAAMRLVATVSSEAVSESESDSEDENEVYSKI 392
+++ S+ D+D E + E +D K E + + + +D+ E K+
Sbjct: 570 KVEVSV----RDIDIEEIKKEGEKEDKGKDG-------GKEVKEKKKKEDKDKKEKKKKV 716
Query: 393 PRQELIDSLETL 404
++ L TL
Sbjct: 717 TGKDKTKDLSTL 752
>TC218782 similar to UP|Q41042 (Q41042) Pisum sativum L. (clone na-481-5),
partial (20%)
Length = 925
Score = 40.8 bits (94), Expect = 0.005
Identities = 48/180 (26%), Positives = 70/180 (38%), Gaps = 10/180 (5%)
Frame = +1
Query: 232 SKSSKAYKASESEEESPDGDSDEDHSVKMAMLSNKLEYLARKQKKFLSKRNGYKNWKRED 291
+K KA +S S + S D SDED E +KQ K + + G + +D
Sbjct: 49 AKKGKAASSSSSSDSSEDDSSDED------------EVATKKQTKEVKVQKGKEESSSDD 192
Query: 292 QKGCFNCKKPGHFIADCPDLQKEKS-----KSRPKKPSFSSSKFRKQIKKSLMATWEDLD 346
+KP +A P Q K+ + KP+ SSS E D
Sbjct: 193 SSSESEDEKPAAKVAVPPKNQSAKNGTLSTPAEKGKPAASSSSS------------ESSD 336
Query: 347 SESGSDKEEADDDAKAAMRLVA-----TVSSEAVSESESDSEDENEVYSKIPRQELIDSL 401
+S D+ A AA + V T SE+ SES+SDS + + K P + +L
Sbjct: 337 DDSDEDEAPKSKVAPAAGKNVPASTKITQPSES-SESDSDSSSDEDKNKKKPATAKLPAL 513
Score = 32.0 bits (71), Expect = 2.4
Identities = 19/53 (35%), Positives = 28/53 (51%), Gaps = 1/53 (1%)
Frame = +1
Query: 208 EMSLNEHESSKKSKSIALPSKGKS-SKSSKAYKASESEEESPDGDSDEDHSVK 259
E S ++ + + KS P+ GK+ S+K + SES E D SDED + K
Sbjct: 325 ESSDDDSDEDEAPKSKVAPAAGKNVPASTKITQPSESSESDSDSSSDEDKNKK 483
>TC225033 similar to UP|O65758 (O65758) Histone H1, partial (81%)
Length = 1228
Score = 37.7 bits (86), Expect = 0.043
Identities = 24/73 (32%), Positives = 37/73 (49%), Gaps = 3/73 (4%)
Frame = +1
Query: 547 SEPEASGSKAKIISKPEN--LKSKVMTKPDPKTPKIKILKRSESVPQSLIK-PESKILKP 603
S+P+A+ K K ++KP+ +K KP PK K ++ P +K P+S + KP
Sbjct: 862 SKPKAAAPKKKAVAKPKAKAAATKPKAKPAKAAPKKVAAKPAKKTPVKAVKKPKSVVKKP 1041
Query: 604 KDQKNKAVTASEK 616
K K+ A A K
Sbjct: 1042KSVKSPAKKAKAK 1080
>TC213445
Length = 705
Score = 37.0 bits (84), Expect = 0.073
Identities = 27/76 (35%), Positives = 42/76 (54%)
Frame = +3
Query: 1554 SDILGKQKTSNYCYVYSKSRIHFSCKLLYTTTLDETSVGRLSD*C*QYSHLL**YCCYLF 1613
S I+ K C + +SRI+F KLL T LDET+ L * Y++ +* Y C
Sbjct: 450 SSIMA**KAK*CCLINCRSRIYFC*KLLCTNLLDETTTF*LWFET*SYTYPM*QYKCN*S 629
Query: 1614 VKESNSTFKSQAY*DQ 1629
+++S S ++AY*++
Sbjct: 630 IQKSYSVL*NKAY*NK 677
>TC205401 similar to GB|AAP37850.1|30725656|BT008491 At4g26630 {Arabidopsis
thaliana;} , partial (16%)
Length = 912
Score = 36.6 bits (83), Expect = 0.096
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 24/183 (13%)
Frame = +3
Query: 198 EDLVSSLKVHEMSLNEHESSKKSK--SIALPSKGKS-------SKSSKAYKASESEEESP 248
ED+ K + S + ES+KKSK IA+P+K +S SS K+ E +ESP
Sbjct: 36 EDIKEKKKHSKTSSTKKESAKKSKIEKIAVPNKSRSPPKRAPKKPSSNLSKSDEDSDESP 215
Query: 249 ----------DGDSDEDHSVKMAMLSNKLEYLARKQKKFLSKRNGYKNWKREDQKGCFNC 298
G + + + K E + R + K K N R+ C
Sbjct: 216 KVFSRKKKNEKGGKQKTATPTKSASEEKTEKVTRGKGKKKEKSRPSDNQLRD--AICEIL 389
Query: 299 KKPGHFIADCPDLQKEKSKS-----RPKKPSFSSSKFRKQIKKSLMATWEDLDSESGSDK 353
K+ A D+ K+ +K P+K S S ++++ K L +D D E ++K
Sbjct: 390 KEVNFNTATFTDILKKLAKQFDMDLTPRKASI-KSMIQEELTK-LADEADDEDREEDAEK 563
Query: 354 EEA 356
+EA
Sbjct: 564 DEA 572
>TC228340 similar to UP|Q9CJT2 (Q9CJT2) OppB, partial (5%)
Length = 466
Score = 35.8 bits (81), Expect = 0.16
Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 3/81 (3%)
Frame = +3
Query: 546 QSEPEASGSKAKIIS--KPENLKSKVMTKPDPKTPKIKILKRSESVPQSLIK-PESKILK 602
Q P+ K K + KP+ K K + P PK K K LK + + +K P+SK K
Sbjct: 129 QKSPKPKRRKPKTLKPPKPKRRKPKTLKPPKPKRRKPKTLKPPKPKRRKTLKSPKSKRRK 308
Query: 603 PKDQKNKAVTASEKTIPKGVK 623
PK K+ A + I K K
Sbjct: 309 PKALKSL*PMAKRRGIEKSPK 371
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.361 0.158 0.581
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 71,892,394
Number of Sequences: 63676
Number of extensions: 1033885
Number of successful extensions: 12756
Number of sequences better than 10.0: 83
Number of HSP's better than 10.0 without gapping: 7110
Number of HSP's successfully gapped in prelim test: 598
Number of HSP's that attempted gapping in prelim test: 5032
Number of HSP's gapped (non-prelim): 8562
length of query: 1684
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1574
effective length of database: 5,635,272
effective search space: 8869918128
effective search space used: 8869918128
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.9 bits)
S2: 66 (30.0 bits)
Medicago: description of AC146649.9