
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144722.10 + phase: 0 /pseudo
(1595 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 103 6e-22
NP004897 gag-protease polyprotein 101 2e-21
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 101 3e-21
CF920770 80 5e-15
TC232995 80 7e-15
BM143109 64 4e-10
TC223814 60 6e-09
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 53 2e-08
AI959950 49 1e-07
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 55 2e-07
BI469652 weakly similar to GP|18149115|dbj| reverse transcriptas... 55 2e-07
TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%) 49 2e-05
CF922226 45 3e-04
TC213445 41 0.004
TC228567 40 0.008
TC204982 PIR|T06394|T06394 isoprenylated protein - soybean (frag... 39 0.014
AI966222 38 0.031
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 33 0.14
TC218782 similar to UP|Q41042 (Q41042) Pisum sativum L. (clone n... 36 0.15
TC215289 weakly similar to UP|Q9SJB0 (Q9SJB0) Expressed protein,... 36 0.15
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 103 bits (257), Expect = 6e-22
Identities = 101/369 (27%), Positives = 161/369 (43%), Gaps = 17/369 (4%)
Frame = +1
Query: 3 QYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVASDHVSKILRSLPSRWRPKVTAIEE 62
++E +MK++E I + + + + L + V KILRSLP R+ KVTAIEE
Sbjct: 397 KFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEE 576
Query: 63 AKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKISKSSKAYKASESEEESP 122
A+D+ + V++L+ SL+ E+ L++ KKSK++A S E EE+
Sbjct: 577 AQDICNMRVDELIGSLQTFELGLSD-RAEKKSKNLAFVSN------------DEGEEDEY 717
Query: 123 DGDSDEDQSVKMAMLSNK----LEYLARKQKKFLSK-----RGSYKNSKNEDQK------ 167
D D+DE + + +L + L + ++QK + R K K D K
Sbjct: 718 DLDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKG 897
Query: 168 -GCFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSD 226
C C+ GH IA+CP K+ KG S S D +SE SD
Sbjct: 898 IQCHGCEGYGHIIAECPTHLKKHRKGLSVCQS-------------------DTESEQESD 1020
Query: 227 KEEADDDAKAAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRT 286
+D D A G+ T +ED ++ S+I EL S ++L E
Sbjct: 1021---SDRDVNALTGIFET------------AEDSSDTDSEITFDELAASYRKLCIKSEKIL 1155
Query: 287 NELTDLKEKYVDLMKQQKSTLLELKASEEELKG-FNLISTTYEDRLKSLCQKLQEKCDKG 345
+ LK+ DL ++++ E+ ELKG +++ E+ KS+ + +KG
Sbjct: 1156QQEAQLKKVIADLEAEKEAHKEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKG 1308
Query: 346 SGNKHEIAL 354
S E+ L
Sbjct: 1309SDTLDEVLL 1335
Score = 79.0 bits (193), Expect(2) = 3e-21
Identities = 112/407 (27%), Positives = 174/407 (42%), Gaps = 8/407 (1%)
Frame = +2
Query: 1153 LLKATVNKKALITLKHLLQLQDWKQSGYFYPTQLIMA*YYIKWMSKVPFLMVSLKKKCML 1212
LLKAT+ K ++ L QL D S Y+ + +WM + F M + KK M
Sbjct: 3398 LLKATLRLKV*TLMRLLPQLLDLSPSDYYLV*LVSSNSSCTRWM*RAHF*MDT*MKKSMW 3577
Query: 1213 NNLLGLRILSILTMFINLRNHYMA*NKLPELGMID*VIS*LKMILKEDKLTQHSSEGLLR 1272
++ L+ I M+ R M *+KL ELGM * S L + ++LT+ S +
Sbjct: 3578 SSQRDLQTRLIQIMYTGSRRLSMD*SKLQELGMKG*QSSLLSKGIGREELTRPSLSNKML 3757
Query: 1273 KIF*LCKYMLMI*YLVLLMHLFANNFLS*CRMNLK*V*WEN*NSFLEFKSTKVKKEYMFI 1332
K *L +YMLM L + + C +NL+*V E+* F +FK ++ + Y
Sbjct: 3758 KT**LHRYMLMTLCLEGCRMRCFDILFNRCNLNLR*VLLES*LIFWDFK*SRWRTPYSSH 3937
Query: 1333 KQNIQRSF*RSSS*KI--------VK**TLQCIQPAP*AKKILEQ**TRSYTEV*LVLCY 1384
K +QR+ RS ++ + *+ Q ++ AP K+ TE * Y
Sbjct: 3938 KAGMQRTLSRSLGWRMPVIKGHLHLLT*SCQRMKQAPVLIKVC--------TEA**GAYY 4093
Query: 1385 TSLHLDLLFYSVYACVQDFNQILENLI*LQLRESSGI*KEQLILDSCIGNP*IIS*LDSV 1444
D VQD I + *L+ RE +* + + C I L V
Sbjct: 4094 I*QLADPTSPMQ*VFVQDIKPIPR*VT*LK*REF*NM*MALVTMGLCTVIVQIQCWLGIV 4273
Query: 1445 MLIMLVIGLKENQPVEIVNSWERI*YPGQAKDKQLLLCLQQKQNTFQLQVVAHNYFG*NI 1504
MLI L + + E + + WE + G A+ + + LQQK + Q + H+ FG*+
Sbjct: 4274 MLIGLEVQMTEKALLVDASIWETTLFHGSARSRTVCPYLQQKPSILQQEAAVHS*FG*SR 4453
Query: 1505 SWKTIRLMLTVFPFIVIILLLFVCQRIQFYIQEPSILKSNTILSETM 1551
++ V +L + +I F EPS L + +SE +
Sbjct: 4454 C*RSTMSNKMS*HCTVTT*VLLIFLKILFNTAEPSTLTLDITISEIL 4594
Score = 50.4 bits (119), Expect = 6e-06
Identities = 55/196 (28%), Positives = 96/196 (48%)
Frame = +3
Query: 771 C**LQQMDLGKIH*K*RLCM*SV*QLLHSNTI*KRIENFESQK*LWWRI*K*AI*TFL*K 830
C * Q+ LGK++ + +*S+ ++ + +R+ + E+Q+* W RI*K + L
Sbjct: 2331 CG*FLQIYLGKLYQREIRNL*SIQRVESKTSKRERLCHQENQE*PWQRI*KQQVH*ILHI 2510
Query: 831 TWDSP*VFLS*NSTTKWGCREKEQNITRNGQNHDP*KQLS*TFLGRSSQYFMLYSK*DLY 890
*V S +TT+W E++Q+ R H ++ S LG S ++ ML+ +
Sbjct: 2511 *RHHS*VLCSHYTTTEWDS*EEKQDFARGCSGHASCQRTSL*SLG*SHEHSMLHPQQSHT 2690
Query: 891 QTYVGENNI*TL*RKKTQYLLLSSVWMYLLHLKH*RLSEEI*CQGSKRNLFRLL*KVKGI 950
+ + +* L R++ L +W +LHL *R ++ Q RN+ +L K + I
Sbjct: 2691 EKRDSNHPV*NLEREEAICQALPHLWKSMLHLGR*RAKKKDGSQE*CRNIPGILYKQQSI 2870
Query: 951 QSV*FRNTMC*RIYAC 966
S+ F+N I+ C
Sbjct: 2871 *SIQFQNQNSDGIHQC 2918
Score = 43.1 bits (100), Expect(2) = 3e-21
Identities = 28/67 (41%), Positives = 39/67 (57%)
Frame = +3
Query: 1085 N*A*NC*RSSLR*WMDISYARRAKSISKE*CVGSGIHTLSEEHYWNKMGIQKQAE*TRRS 1144
N*A C R + R* +D YARR +I KE* +G+ W+++ +Q+Q +* R
Sbjct: 3195 N*AQECERGTDR*VLDQCYARRIGAIQKE*SLGASS*A*GN*CDWHQVDLQEQNQ*RRCH 3374
Query: 1145 NQKQSQT 1151
NQKQ QT
Sbjct: 3375 NQKQGQT 3395
>NP004897 gag-protease polyprotein
Length = 1923
Score = 101 bits (252), Expect = 2e-21
Identities = 101/367 (27%), Positives = 166/367 (44%), Gaps = 17/367 (4%)
Frame = +1
Query: 3 QYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVASDHVSKILRSLPSRWRPKVTAIEE 62
++E +MK++E I + + + + L + V KILRSLP R+ KVTAIEE
Sbjct: 397 KFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEE 576
Query: 63 AKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKISKSSKAYKASESEEESP 122
A+D+ L V++L+ SL+ E+ L++ T KKSK++A S E EE+
Sbjct: 577 AQDICNLRVDELIGSLQTFELGLSD-RTEKKSKNLAFVSN------------DEGEEDEY 717
Query: 123 DGDSDEDQSVKMAMLSNK----LEYLARKQK------KFLSKRGSYKNSKNEDQ----KG 168
D D+DE + + +L + L + R+QK F ++GS +++++ KG
Sbjct: 718 DLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKRSDEKPSHSKG 897
Query: 169 --CFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSD 226
C C+ GH A+CP K++ KG S S +D +SE SD
Sbjct: 898 FQCHGCEGYGHIKAECPTHLKKQRKGLSVCRS------------------DDTESEQESD 1023
Query: 227 KEEADDDAKAAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRT 286
+D D A G +ED ++ S+I EL S +EL E
Sbjct: 1024---SDRDVNALTGRF------------ESAEDSSDTDSEITFDELATSYRELCIKSEKIL 1158
Query: 287 NELTDLKEKYVDLMKQQKSTLLELKASEEELKG-FNLISTTYEDRLKSLCQKLQEKCDKG 345
+ LK+ +L ++++ E+ ELKG +++ E+ KS+ + +KG
Sbjct: 1159QQEAQLKKVIANLEAEKEAHEEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKG 1311
Query: 346 SGNKHEI 352
S E+
Sbjct: 1312SDMLDEV 1332
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 101 bits (251), Expect = 3e-21
Identities = 101/367 (27%), Positives = 166/367 (44%), Gaps = 17/367 (4%)
Frame = +1
Query: 3 QYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVASDHVSKILRSLPSRWRPKVTAIEE 62
++E +MK++E I + + + + L + V KILRSLP R+ KVTAIEE
Sbjct: 397 KFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEE 576
Query: 63 AKDLNTLSVEDLVSSLKVHEMSLNEHETSKKSKSIALPSKGKISKSSKAYKASESEEESP 122
A+D+ + V++L+ SL+ E+ L++ T KKSK++A S E EE+
Sbjct: 577 AQDICNMRVDELIGSLQTFELGLSD-RTEKKSKNLAFVSN------------DEGEEDEY 717
Query: 123 DGDSDEDQSVKMAMLSNK----LEYLARKQK------KFLSKRGSYKNSKNEDQ----KG 168
D D+DE + + +L + L + R+QK F ++GS K++++ KG
Sbjct: 718 DLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKKSDEKPSHSKG 897
Query: 169 --CFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSD 226
C C+ GH A+CP K++ KG S S +D +SE SD
Sbjct: 898 IQCHGCEGYGHIKAECPTHLKKQRKGLSVCRS------------------DDTESEQESD 1023
Query: 227 KEEADDDAKAAVGLVATVSSEAVSEAESDSEDENEVYSKIPRQELVDSLKELLSLFEHRT 286
+D D A G +ED ++ S+I EL S +EL E
Sbjct: 1024---SDRDVNALTGRF------------ESAEDSSDTDSEITFDELAISYRELCIKSEKIL 1158
Query: 287 NELTDLKEKYVDLMKQQKSTLLELKASEEELKG-FNLISTTYEDRLKSLCQKLQEKCDKG 345
+ LK+ +L ++++ E+ ELKG +++ E+ KS+ + +KG
Sbjct: 1159QQEAQLKKVIANLEAEKEAHEEEI----SELKGEVGFLNSKLENMTKSI-----KMLNKG 1311
Query: 346 SGNKHEI 352
S E+
Sbjct: 1312SDMLDEV 1332
Score = 73.9 bits (180), Expect(2) = 2e-19
Identities = 115/408 (28%), Positives = 173/408 (42%), Gaps = 8/408 (1%)
Frame = +2
Query: 1152 LLLKATVNKKALITLKHLLQLQDWKQSGYFYPTQLIMA*YYIKWMSKVPFLMVSLKKKCM 1211
LLLKAT+ K +K L D S + +WM + F M + KK M
Sbjct: 3398 LLLKATLRLKV*TLMKLSPLLLDLSPSDCYLV*LASSNSSCTRWM*RARF*MDT*MKKPM 3577
Query: 1212 LNNLLGLRILSILTMFINLRNHYMA*NKLPELGMID*VIS*LKMILKEDKLTQHSSEGLL 1271
++ L I I M+ R M *+KL ELGM * S L + ++LT+ S +
Sbjct: 3578 WSSQRDL*IQLIQIMYTGSRRLSMD*SKLQELGMKG*QSSLLSKGIGREELTRLSLSNKM 3757
Query: 1272 RKIF*LCKYMLMI*YLVLLMHLFANNFLS*CRMNLK*V*WEN*NSFLEFKSTKVKKEYMF 1331
K * +YMLM L + + C +NL+*V E+* F + K ++ K Y
Sbjct: 3758 LKT***HRYMLMTLCLEGCRMRCFDILSNRCNLNLR*VLLES*LIFWDSK*SRWKTPYSS 3937
Query: 1332 IKQNIQRSF*RSSS*KI--------VK**TLQCIQPAP*AKKILEQ**TRSYTEV*LVLC 1383
K ++QR+ RS K+ + *+ Q ++ AP K+ TE *L
Sbjct: 3938 HKASMQRTLSRSLGWKMPAIKEHLHLLT*SCQKMKLAPVLIKVC--------TEA*LGAY 4093
Query: 1384 YTSLHLDLLFYSVYACVQDFNQILENLI*LQLRESSGI*KEQLILDSCIGNP*IIS*LDS 1443
Y DL VQD IL + *++ RE +* + + C I L
Sbjct: 4094 YI*QLADLTSPMQ*VFVQDIKPILR*VT*IK*REF*NM*MAPVTMGLCTVIVQIQCWLGI 4273
Query: 1444 VMLIMLVIGLKENQPVEIVNSWERI*YPGQAKDKQLLLCLQQKQNTFQLQVVAHNYFG*N 1503
VMLI L + + E + V+ WE I + G A+ + + L KQ+ Q + HN FG*+
Sbjct: 4274 VMLIGLEVQMTEKALLVDVSIWEPILFHGSARSRTVCPYLLLKQSILQQEAAVHN*FG*S 4453
Query: 1504 ISWKTIRLMLTVFPFIVIILLLFVCQRIQFYIQEPSILKSNTILSETM 1551
++ V +L + +I F EPS L + + E +
Sbjct: 4454 RC*RSTMSNKMS*HCTVTT*VLLIFLKILFNTAEPSTLTLDITILEIL 4597
Score = 52.4 bits (124), Expect = 2e-06
Identities = 55/196 (28%), Positives = 95/196 (48%)
Frame = +3
Query: 771 C**LQQMDLGKIH*K*RLCM*SV*QLLHSNTI*KRIENFESQK*LWWRI*K*AI*TFL*K 830
C * Q+ LG+++ + +*S+ + + KR+ + E+Q+* W R+*K + L
Sbjct: 2334 CG*FLQIYLGQLYQREIRHL*SIQGVESKTSKRKRLCHQENQE*PWQRV*KQQVY*ILHI 2513
Query: 831 TWDSP*VFLS*NSTTKWGCREKEQNITRNGQNHDP*KQLS*TFLGRSSQYFMLYSK*DLY 890
*V S +TTKW +++Q+ R+ H ++ S LG S ++ ML+ +
Sbjct: 2514 *RHHS*VLCSHYTTTKWHS*KEKQDFARSC*GHASCQRTSL*SLG*SHEHSMLHPQQSHT 2693
Query: 891 QTYVGENNI*TL*RKKTQYLLLSSVWMYLLHLKH*RLSEEI*CQGSKRNLFRLL*KVKGI 950
+ +* L R++ L +W +LH *R E+ Q RN+ +L K + I
Sbjct: 2694 *KRDSNHTV*NLEREEANCQALPHLWKSMLHFGR*RAKEKDGSQE*CRNILGILYKQQSI 2873
Query: 951 QSV*FRNTMC*RIYAC 966
S+ F+N C I+ C
Sbjct: 2874 *SIQFQNQNCDGIHQC 2921
Score = 42.0 bits (97), Expect(2) = 2e-19
Identities = 28/67 (41%), Positives = 39/67 (57%)
Frame = +3
Query: 1085 N*A*NC*RSSLR*WMDISYARRAKSISKE*CVGSGIHTLSEEHYWNKMGIQKQAE*TRRS 1144
N*A C R + * +D YARR +I KE* +G+ T W+++ +Q+Q +* R
Sbjct: 3198 N*AQECERGTD**VLDQCYARRIGAIQKE*SLGASS*TRGN*CDWHQVDLQEQNQ*RRCY 3377
Query: 1145 NQKQSQT 1151
NQKQ QT
Sbjct: 3378 NQKQGQT 3398
>CF920770
Length = 581
Score = 80.5 bits (197), Expect = 5e-15
Identities = 38/84 (45%), Positives = 55/84 (65%)
Frame = -2
Query: 2 HQYELFRMKDDESIEEMYSRFQTLVSGLQILKKSYVASDHVSKILRSLPSRWRPKVTAIE 61
H+YELFRM +E+I+ M RF +V+ L L K + D ++K+LR L W+PKVTAI
Sbjct: 265 HEYELFRMNTNENIQSMQKRFTHIVNHLAALGKEFQNEDLINKVLRCLSREWQPKVTAIS 86
Query: 62 EAKDLNTLSVEDLVSSLKVHEMSL 85
E++DL+ +S+ L L+ HEM L
Sbjct: 85 ESRDLSNMSLATLFGKLQEHEMEL 14
>TC232995
Length = 1009
Score = 80.1 bits (196), Expect = 7e-15
Identities = 59/128 (46%), Positives = 73/128 (56%)
Frame = +3
Query: 1212 LNNLLGLRILSILTMFINLRNHYMA*NKLPELGMID*VIS*LKMILKEDKLTQHSSEGLL 1271
LNN L L+ L TMFIN + +M *NK GM D*VI LK E K H S
Sbjct: 3 LNNPLVLKFLINQTMFINYKRLFMV*NKPLGHGMND*VIFFLKKNSPEVKWIPHYS*RES 182
Query: 1272 RKIF*LCKYMLMI*YLVLLMHLFANNFLS*CRMNLK*V*WEN*NSFLEFKSTKVKKEYMF 1331
IF KYMLMI*+L LM A +F C++NLK *WEN*++F ++KS+K+ K Y
Sbjct: 183 IMIFCWFKYMLMI*FLDPLMIHCARSFPLICKVNLKCQ*WEN*STFWDYKSSKLNKVYSS 362
Query: 1332 IKQNIQRS 1339
I N R+
Sbjct: 363 INPNTARN 386
>BM143109
Length = 415
Score = 64.3 bits (155), Expect = 4e-10
Identities = 50/126 (39%), Positives = 70/126 (54%)
Frame = +2
Query: 1214 NLLGLRILSILTMFINLRNHYMA*NKLPELGMID*VIS*LKMILKEDKLTQHSSEGLLRK 1273
NLL + L M +N + YM *NK LGM *V + ++ +L
Sbjct: 5 NLL*GKTQKSLIMSLN*KRFYMD*NKPLGLGMNF*VNFF*TRVFQKVRLILTFLFKRN*M 184
Query: 1274 IF*LCKYMLMI*YLVLLMHLFANNFLS*CRMNLK*V*WEN*NSFLEFKSTKVKKEYMFIK 1333
I+ +YMLMI +LV LM LFA NFL C+MNLK * +* FL++KS+K + EY+ +
Sbjct: 185 IYS*YRYMLMILFLVQLMILFAKNFLKICKMNLKCQ*CVS*TFFLDYKSSKQRMEYLSVN 364
Query: 1334 QNIQRS 1339
QNI ++
Sbjct: 365 QNIAKT 382
>TC223814
Length = 607
Score = 60.5 bits (145), Expect = 6e-09
Identities = 41/101 (40%), Positives = 60/101 (58%), Gaps = 1/101 (0%)
Frame = +1
Query: 20 SRFQTLVSGLQILKKS-YVASDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSL 78
S+ Q +++ L+ L K+ DH++KIL+SL + RP V A+ ++KDL +L VE+ +L
Sbjct: 223 SKVQNIMNNLRSLSKT*DNHDDHITKILQSLLIQ*RP*VIALCDSKDLKSLPVEEFDGTL 402
Query: 79 KVHEMSLNEHETSKKSKSIALPSKGKISKSSKAYKASESEE 119
+VHE+ L E E +K K IA SK KA K S S E
Sbjct: 403 QVHELELMEDEGQRKGKFIA-------SKVQKALKRSLSRE 504
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 52.8 bits (125), Expect(2) = 2e-08
Identities = 40/94 (42%), Positives = 49/94 (51%), Gaps = 3/94 (3%)
Frame = -1
Query: 1155 KATVNKKALITLKHLLQLQDWKQSGYF---YPTQLIMA*YYIKWMSKVPFLMVSLKKKCM 1211
K + K+ KH+LQLQD K F YP ++ + IKWM KV F MV KKK M
Sbjct: 301 KGIIKKRE*TMKKHMLQLQD*KSLECF*HMYP**ILNS---IKWMLKVLF*MV*FKKKYM 131
Query: 1212 LNNLLGLRILSILTMFINLRNHYMA*NKLPELGM 1245
LNN L+ MFIN + +M *NK GM
Sbjct: 130 LNNPQALKSRINQLMFINCKRLFMV*NKPLGRGM 29
Score = 25.4 bits (54), Expect(2) = 2e-08
Identities = 17/39 (43%), Positives = 22/39 (55%)
Frame = -2
Query: 1103 YARRAKSISKE*CVGSGIHTLSEEHYWNKMGIQKQAE*T 1141
+ARR +SI K+*CV + T + NKMG K *T
Sbjct: 456 HARRTESI*KK*CVETSRKT*KLSCHRNKMGF*K*IR*T 340
>AI959950
Length = 466
Score = 48.5 bits (114), Expect(2) = 1e-07
Identities = 33/77 (42%), Positives = 42/77 (53%)
Frame = -2
Query: 1153 LLKATVNKKALITLKHLLQLQDWKQSGYFYPTQLIMA*YYIKWMSKVPFLMVSLKKKCML 1212
LLK T N+K T K L L K ++ Q I+ * IKWM KV F M K+K ML
Sbjct: 240 LLKVTHNRKV*TTQKPLHLLHV*K*YASYFHLQPIVI*SCIKWM*KVHF*MA*SKRKFML 61
Query: 1213 NNLLGLRILSILTMFIN 1229
NN L L++ + MF+N
Sbjct: 60 NNRLDLKMKPFINMFLN 10
Score = 27.3 bits (59), Expect(2) = 1e-07
Identities = 15/33 (45%), Positives = 21/33 (63%)
Frame = -3
Query: 1102 SYARRAKSISKE*CVGSGIHTLSEEHYWNKMGI 1134
S ARR S+SKE*C+ + T +E W++M I
Sbjct: 392 SDARRT*SVSKE*CLEAR*ITKKKEGSWSEMDI 294
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 55.5 bits (132), Expect = 2e-07
Identities = 72/212 (33%), Positives = 102/212 (47%)
Frame = +1
Query: 1250 IS*LKMILKEDKLTQHSSEGLLRKIF*LCKYMLMI*YLVLLMHLFANNFLS*CRMNLK*V 1309
IS K I + + + +G RK F L MLM * LV A +FLS* RM+LK V
Sbjct: 412 ISSFKWIHQRNNGPRTIQKGSKRKPFLLFISMLMT*SLVQPQKGCARSFLS**RMDLKRV 591
Query: 1310 *WEN*NSFLEFKSTKVKKEYMFIKQNIQRSF*RSSS*KIVK**TLQCIQPAP*AKKILEQ 1369
* +*+S +FKS + ++ IK+NIQ *+ S CI P +
Sbjct: 592 *KVS*SSS*DFKSFRKFMGFLSIKRNIQSPI*KGSEWMKPNLWQPLCIVPQSLTRMRKVI 771
Query: 1370 **TRSYTEV*LVLCYTSLHLDLLFYSVYACVQDFNQILENLI*LQLRESSGI*KEQLILD 1429
+ V*L+L + L +D + +A VQDF+ I + L+ LQL+ S I E LI+
Sbjct: 772 ILHKRSIVV*LILYHI*LLVDQILCLSFAFVQDFSLIQKFLMLLQLKGS*DILLELLIIV 951
Query: 1430 SCIGNP*IIS*LDSVMLIMLVIGLKENQPVEI 1461
+ + D VM I+LVI K VE+
Sbjct: 952 YGLRKGLSLIF*DIVMFILLVIK*KGRALVEM 1047
>BI469652 weakly similar to GP|18149115|dbj| reverse transcriptase {Silene
noctiflora}, partial (60%)
Length = 427
Score = 55.1 bits (131), Expect = 2e-07
Identities = 31/67 (46%), Positives = 41/67 (60%)
Frame = +1
Query: 1182 YPTQLIMA*YYIKWMSKVPFLMVSLKKKCMLNNLLGLRILSILTMFINLRNHYMA*NKLP 1241
+P QLI * + KW+ KV F M LK+KCM +NL L + T+F N + +M *N+
Sbjct: 220 FPLQLIKT*IFFKWILKVVF*MTLLKRKCMSDNLQTL*TIHF*TIFSNFKRLHMV*NRHL 399
Query: 1242 ELGMID* 1248
LGMID*
Sbjct: 400 MLGMID* 420
>TC207027 weakly similar to UP|Q9LW95 (Q9LW95) KED, partial (17%)
Length = 1005
Score = 48.9 bits (115), Expect = 2e-05
Identities = 54/233 (23%), Positives = 88/233 (37%), Gaps = 1/233 (0%)
Frame = +3
Query: 87 EHETSKKSKSIALPSKGKISKSSKAYKASESEEESPDGDSDEDQSVKMAMLSNKLEYLAR 146
+ + K K A KGK + ++E DGD +E + K K E +
Sbjct: 237 KEKKKKYDKIDAXKVKGKEDDGKDEGNKEKKDKEKGDGDGEEKKEKKDKEKEKKKEKKDK 416
Query: 147 -KQKKFLSKRGSYKNSKNEDQKGCFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFR 205
++ L ++G KN + ED +G KK +KEK K KK K
Sbjct: 417 DEETDTLKEKG--KNDEGEDDEGNKKKKK--------DKKEKEKDHKKEKKDKEEGEKED 566
Query: 206 KQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVSSEAVSEAESDSEDENEVYSK 265
+++ S+ D+D E + E +D K E + + + +D+ E K
Sbjct: 567 SKVEVSV----RDIDIEEIKKEGEKEDKGKDG-------GKEVKEKKKKEDKDKKEKKKK 713
Query: 266 IPRQELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQKSTLLELKASEEELK 318
+ ++ L L E ++ L EK D+ +Q K E EE K
Sbjct: 714 VTGKDKTKDLSTLKQKLEKINGKIQPLLEKKADIERQIKEVEAEGHVVNEENK 872
>CF922226
Length = 667
Score = 44.7 bits (104), Expect = 3e-04
Identities = 49/197 (24%), Positives = 79/197 (39%), Gaps = 1/197 (0%)
Frame = -3
Query: 7 FRMKDDESIEEMYSRFQTLVSGLQILKKSYVASDHVSKILRSLPSRWRPKVTAIEEAKDL 66
F+M +D S+ E F L+ L+ + + D +L LP + + +D
Sbjct: 614 FKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYLPKSYSHFKETLLFGRD- 438
Query: 67 NTLSVEDLVSSLKVHEMSLNEHETSKKSKS-IALPSKGKISKSSKAYKASESEEESPDGD 125
++S++++ ++L E LNE + K S S L ++GK K D
Sbjct: 437 -SVSLDEVQTALNSKE--LNERKEKKSSASGEGLTARGKTFKK----------------D 315
Query: 126 SDEDQSVKMAMLSNKLEYLARKQKKFLSKRGSYKNSKNEDQKGCFNCKKPGHFIADCPDL 185
S+ D+ +KQK K G K C++CKK GH CP+
Sbjct: 314 SEFDK---------------KKQKPENQKNGEGNIFKIR----CYHCKKEGHTRKVCPER 192
Query: 186 QKEKFKGKSKKSSFNSS 202
QK KK S N++
Sbjct: 191 QKNGGSNNRKKDSGNAA 141
>TC213445
Length = 705
Score = 41.2 bits (95), Expect = 0.004
Identities = 28/75 (37%), Positives = 41/75 (54%)
Frame = +2
Query: 1468 I*YPGQAKDKQLLLCLQQKQNTFQLQVVAHNYFG*NISWKTIRLMLTVFPFIVIILLLFV 1527
+*Y G K K +L QKQN F L+V+ H FG*+ ++ T+ L ++ V I + +
Sbjct: 449 L*YHGIVKSKIVLSYQLQKQNIFLLEVIMHKSFG*DNNFLTMV*NLIIYLSDVTIQVQLI 628
Query: 1528 CQRIQFYIQEPSILK 1542
+I F E SILK
Sbjct: 629 YPKIIFCTLEQSILK 673
>TC228567
Length = 1531
Score = 40.0 bits (92), Expect = 0.008
Identities = 70/342 (20%), Positives = 140/342 (40%), Gaps = 17/342 (4%)
Frame = +2
Query: 188 EKFKGKSKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATV--S 245
E+ K +K++ + ++ R SL + E+ S S K+ + + A V L A + S
Sbjct: 383 EELKLNIEKATSDVNRLRVA-SVSLKSKLEEEKSVLASLKQSEEKASAAVVNLQAELEKS 559
Query: 246 SEAVSEAESDSEDENEVYSKIPR--QELVDSLKELLSLFEHRTNELTDLKEKYVDLMKQQ 303
A++ + + E+ +++P+ Q+ E SL + EL + +E+ V+ K +
Sbjct: 560 RSAIAFIQMKENEAREMMTELPKKLQKASQEADEAKSLAQAAQAELIEAQEE-VEQAKAK 736
Query: 304 KSTL-LELKASEEELKGFNLISTTYEDRLKSLCQKLQEKCDKGSGNKHEIALDDFIMAGI 362
STL L A+++E++ + D + +L EK + GNK++
Sbjct: 737 SSTLESSLLAAQKEIEAAKVAEMLARDAITAL-----EKSESAKGNKNDK---------- 871
Query: 363 DRSKVASMIYSTYKNKGKGIGYSEEKSK---EYSLKSYCDCIKDGLKSTFVPEGTNAITA 419
D S + ++ Y + +EE++ E + + L+S E N +
Sbjct: 872 DSSSMVTLTLEEYHELSRRAYKAEEQANARIEAATSQIQIARESELRSLEKLEELNEELS 1051
Query: 420 V--QSKPEASGSQAKITSKPENLKIKVMTKSDPKSQKIKILKRSE-------PVHQNLIK 470
V +S A+G+ K ++ ++ T + Q+ K + +E P H +
Sbjct: 1052VRRESLKIATGNSEKANEGKLAVEHELRTWRAEQKQQEKATELNEQTSDPTEPAHDSS-S 1228
Query: 471 PESKIPKQKDQKNKAATASEKTIPKGVKPKVLNDQKPLSIHP 512
P+ K+P + A+ ++K KV+ HP
Sbjct: 1229PKGKVPSNNTEAESASNKNKKKKKSSFPSKVVMFFAKKKTHP 1354
>TC204982 PIR|T06394|T06394 isoprenylated protein - soybean (fragment)
{Glycine max;} , complete
Length = 1230
Score = 39.3 bits (90), Expect = 0.014
Identities = 32/133 (24%), Positives = 54/133 (40%), Gaps = 4/133 (3%)
Frame = +3
Query: 386 EEKSKEYSLKSYCDCIKDGLKSTFVPEGTNAITAVQ----SKPEASGSQAKITSKPENLK 441
+EK + C C + ++ +G +I +++ KP+ +G + K KP K
Sbjct: 333 DEKENIVFITVVC-CSPEKIRDKLCYKGGGSIKSIEILEPPKPKPAGPEKKEAEKP---K 500
Query: 442 IKVMTKSDPKSQKIKILKRSEPVHQNLIKPESKIPKQKDQKNKAATASEKTIPKGVKPKV 501
+ K DP+ K K +P + K + K K++ K EK P KPK
Sbjct: 501 AEPEKKKDPEKPKADPPKAEKPKTEPEKKKDGGGEKPKEEPEKKKDGGEKPKPGPEKPKD 680
Query: 502 LNDQKPLSIHPKV 514
PL + P +
Sbjct: 681 KPTPAPLPVQPHI 719
>AI966222
Length = 430
Score = 38.1 bits (87), Expect = 0.031
Identities = 23/66 (34%), Positives = 41/66 (61%)
Frame = +3
Query: 859 NGQNHDP*KQLS*TFLGRSSQYFMLYSK*DLYQTYVGENNI*TL*RKKTQYLLLSSVWMY 918
+G NH * * LG S++Y ML S+ +LY+T++ ++++* + KTQ+++ S +
Sbjct: 3 DG*NHAK**LNP*ALLG*SNEYCMLSSEQNLYKTHLEKDSL*IMEGTKTQHIIFLSF*V* 182
Query: 919 LLHLKH 924
+ H KH
Sbjct: 183 VFHYKH 200
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 32.7 bits (73), Expect(2) = 0.14
Identities = 22/50 (44%), Positives = 26/50 (52%)
Frame = +2
Query: 1092 RSSLR*WMDISYARRAKSISKE*CVGSGIHTLSEEHYWNKMGIQKQAE*T 1141
RS R* +D +ARR K I K+ C+ T YWNKMG K *T
Sbjct: 20 RSHSR**LDHCHARRTKPI*KKQCMEISRKT*KLSCYWNKMGF*K*IR*T 169
Score = 21.9 bits (45), Expect(2) = 0.14
Identities = 14/42 (33%), Positives = 19/42 (44%)
Frame = +1
Query: 1155 KATVNKKALITLKHLLQLQDWKQSGYFYPTQLIMA*YYIKWM 1196
K + K+ KH+L LQD K F+ +IKWM
Sbjct: 208 KDIIKKRE*TMKKHMLLLQD*KPLECFWHMHP**TLNFIKWM 333
>TC218782 similar to UP|Q41042 (Q41042) Pisum sativum L. (clone na-481-5),
partial (20%)
Length = 925
Score = 35.8 bits (81), Expect = 0.15
Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Frame = +1
Query: 33 KKSYVASDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLVSSLKVHEMSLNEHETSK 92
KK AS S S +V ++ K++ ++ SS + S +E E K
Sbjct: 52 KKGKAASSSSSSDSSEDDSSDEDEVATKKQTKEVKVQKGKEESSS----DDSSSESEDEK 219
Query: 93 KSKSIALPSKGKISK----SSKAYK----ASESEEESPDGDSDEDQS--VKMAMLSNKLE 142
+ +A+P K + +K S+ A K AS S ES D DSDED++ K+A + K
Sbjct: 220 PAAKVAVPPKNQSAKNGTLSTPAEKGKPAASSSSSESSDDDSDEDEAPKSKVAPAAGKNV 399
Query: 143 YLARKQKKFLSKRGSYKNSKNEDQKGCFNCKKP 175
+ K + S +S +++ K N KKP
Sbjct: 400 PASTKITQPSESSESDSDSSSDEDK---NKKKP 489
>TC215289 weakly similar to UP|Q9SJB0 (Q9SJB0) Expressed protein, partial
(32%)
Length = 1476
Score = 35.8 bits (81), Expect = 0.15
Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 2/103 (1%)
Frame = +3
Query: 194 SKKSSFNSSKFRKQIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVSSEAVSEAE 253
S++SSF S + L+ ED D + ++EE DD+ KA V + SS + S
Sbjct: 900 SRRSSFYS--WSNPQSMPLLPVDEDQDYDYEEEEEEEDDEEKAR--KVPSASSSSSSSLA 1067
Query: 254 SDSEDENEVYSKIPR--QELVDSLKELLSLFEHRTNELTDLKE 294
+ + E++V ++ R + ++ L F+ R+ L DL+E
Sbjct: 1068EEKKQEDQVQLRLNRVPESYAAHMRLRLGSFKARSFSLADLQE 1196
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.356 0.157 0.537
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 63,211,760
Number of Sequences: 63676
Number of extensions: 862387
Number of successful extensions: 11334
Number of sequences better than 10.0: 87
Number of HSP's better than 10.0 without gapping: 6375
Number of HSP's successfully gapped in prelim test: 528
Number of HSP's that attempted gapping in prelim test: 4453
Number of HSP's gapped (non-prelim): 7605
length of query: 1595
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1485
effective length of database: 5,635,272
effective search space: 8368378920
effective search space used: 8368378920
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 65 (29.6 bits)
Medicago: description of AC144722.10