
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC124217.5 + phase: 0 /pseudo
(1307 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
CF922226 183 4e-46
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 154 2e-37
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 153 4e-37
CO981347 66 7e-31
AW760164 similar to GP|11994422|dbj oxidoreductase short-chain ... 89 1e-17
BI784757 86 1e-16
TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p... 80 5e-15
TC232910 similar to UP|Q6I8N0 (Q6I8N0) Pol polypeptide, partial ... 67 5e-14
TC234745 weakly similar to UP|Q9FFM0 (Q9FFM0) Copia-like retrotr... 70 6e-12
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 64 6e-10
BI424202 63 1e-09
NP004897 gag-protease polyprotein 59 1e-08
BI701169 56 9e-08
CA937893 similar to GP|20805072|dbj retrovirus-related pol polyp... 54 3e-07
BI425191 weakly similar to GP|14586969|gb| pol polyprotein {Citr... 50 7e-06
BU083646 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 47 7e-05
BI788167 46 9e-05
BI321802 similar to PIR|H72173|H7217 D5L protein - variola minor... 45 2e-04
TC213393 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 44 4e-04
TC233180 similar to UP|Q944K0 (Q944K0) At1g18030/T10F20_3, parti... 44 6e-04
>CF922226
Length = 667
Score = 183 bits (465), Expect = 4e-46
Identities = 98/220 (44%), Positives = 143/220 (64%), Gaps = 9/220 (4%)
Frame = -3
Query: 93 MTKSLAHRQLLKQQLYSFKMVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSL 152
MTKSL +R KQ LYSFKM E S+ EQL FNK+++DL NI+V +DED+ALLLLC L
Sbjct: 665 MTKSLVNRLYXKQSLYSFKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYL 486
Query: 153 PKSFEHFKDTILYGKEGTTTLEEVQAALRTKELTKFKDLKVDEGSEGLNVARG----RNE 208
PKS+ HFK+T+L+G++ + +L+EVQ AL +KEL + K+ K EGL ARG ++
Sbjct: 485 PKSYSHFKETLLFGRD-SVSLDEVQTALNSKELNERKEKKSSASGEGL-TARGKTFKKDS 312
Query: 209 HRGKGKGKSRSKSRSKGFDKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASN---- 264
K K K ++ +G + K +C+ C K+GH +K CP++ +G + + ++ N
Sbjct: 311 EFDKKKQKPENQKNGEG-NIFKIRCYHCKKEGHTRKVCPERQKNGGSNNRKKDSGNAAIV 135
Query: 265 -EEGYESTGALVVTSWKSEKSWVLDSGCSYHMCPRKEYFE 303
++GYES AL+V+ E W++DSGCS+HM P K +FE
Sbjct: 134 QDDGYESAEALMVSEKNPETKWIMDSGCSWHMTPNKSWFE 15
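The Frame value and the Sbjct coordinates of a hit can be cross-checked against each other. The following is a minimal Python sketch, assuming the usual convention that a TBLASTN frame is set by the 1-based nucleotide position where the translated segment starts (counted from the 3' end of the subject for minus frames). The function name `tblastn_frame` is illustrative and not part of BLAST.

```python
def tblastn_frame(sbjct_start: int, sbjct_end: int, sbjct_length: int) -> int:
    """Infer the reading frame of an HSP from its nucleotide coordinates.

    Coordinates are 1-based, as printed on the Sbjct lines. A start that is
    greater than the end means the hit lies on the reverse-complement strand.
    """
    if sbjct_start <= sbjct_end:
        # Plus strand: frame depends on the offset from the first nucleotide.
        return (sbjct_start - 1) % 3 + 1
    # Minus strand: frame depends on the offset from the last nucleotide.
    return -((sbjct_length - sbjct_start) % 3 + 1)


# CF922226: Length = 667, Sbjct runs from 665 down to 15.
print(tblastn_frame(665, 15, 667))  # -3, matching "Frame = -3"
```

The same check can be applied to any other hit in this report by reading its Length line and the first and last Sbjct coordinates of the alignment.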
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 154 bits (389), Expect = 2e-37
Identities = 126/471 (26%), Positives = 216/471 (45%), Gaps = 23/471 (4%)
Frame = +1
Query: 113 VESISISEQLTEFNKILVDL-ANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTT 171
++S I +Q + K++ DL A E + E+ + + L E+ +I +G+
Sbjct: 1135 IKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFLNSKLENMTKSIKMLNKGSD 1314
Query: 172 TLEEVQAALRTKELTKFKDLKVDEGSEGLNV------ARGR-----NEHRGKGKGKSRSK 220
TL+EV L K + L + S G A+ R ++HR + G + K
Sbjct: 1315 TLDEV--LLLGKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGATMSQHRSRHHGMQQKK 1488
Query: 221 SRSKGFDKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNE----EGYESTGALVV 276
S+ K K++C C K GH K C G Q + + + +++ +V
Sbjct: 1489 SKRK-----KWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVSLVVH 1653
Query: 277 TSWKS--EKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMF 334
TS ++ ++ W LDSGCS HM KE+ + V G+ K+ GMG + +
Sbjct: 1654 TSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL---VH 1824
Query: 335 DGREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHG-ALITVKGSKMNGLY 393
DG L + V V L NLIS+S G+ C +++ + + +KGS+
Sbjct: 1825 DGLPSLNK-VLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNC 2001
Query: 394 ILDGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGL---LGKDKLDKLEFC 450
L +S + +WH R GH+ RG+ ++ +G + K+++ C
Sbjct: 2002 YLWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRIC 2181
Query: 451 EHCILGKQHRVKFGSGMHHS-SRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWV 509
C +GKQ ++ H + SR+ E +H DL+GP + + GG Y ++DD+SR WV
Sbjct: 2182 GECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWV 2361
Query: 510 FVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
+++KS+TF FKE ++ + +K +R+D+G EF + +F +F E
Sbjct: 2362 NFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSE 2514
Score = 30.4 bits (67), Expect = 5.4
Identities = 45/158 (28%), Positives = 62/158 (38%), Gaps = 4/158 (2%)
Frame = +3
Query: 815 RDGFIGKEPNLETCEAT*TPKGSWMQVDLQEKGRYSGC*RSKVQSKTCGQGFHSSGGDRL 874
R G I KE +L * W QVDLQE+ + C K Q +T HS RL
Sbjct: 3258 RIGAIQKE*SLGASS*A*GN*CDWHQVDLQEQNQ*RRCHNQK-QGQTGCSRLHSD*RCRL 3434
Query: 875 Q*DLFTGGETLFHKDTYGYRESIQS*VGTNGCEDCFLTW*P*RDNLHGATGRFC----GR 930
*D T H+ Q +GCE+ W P +L GA C R
Sbjct: 3435 **DFCPSC*T*VHQIITWCSLYPQIQAVPDGCEERISEWIPE*RSLCGAAKGICRPDSSR 3614
Query: 931 QV*GMSFEEIFVWVEAKP*AMVSSV**ISFEDRLCEKR 968
+ +E +W+EA ++V + + R+ E R
Sbjct: 3615 SC--IQAQEGSLWIEASSKSLV*KANRVPYSARV*EGR 3722
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 153 bits (387), Expect = 4e-37
Identities = 126/477 (26%), Positives = 218/477 (45%), Gaps = 29/477 (6%)
Frame = +1
Query: 113 VESISISEQLTEFNKILVDL-ANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTT 171
++S I +Q + K++ +L A E + E+ + + L E+ +I +G+
Sbjct: 1138 IKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEVGFLNSKLENMTKSIKMLNKGSD 1317
Query: 172 TLEEV-----------------QAALRTKELTKFKDLKVDEGSEGLNVARGRNEHRGKGK 214
L+EV ++A RT +T+F K S G +++ R+ H G +
Sbjct: 1318 MLDEVLQLGKNVGNQRGLGFNHKSAGRTT-MTEFVPAK---NSTGATMSQHRSRHHGTQQ 1485
Query: 215 GKSRSKSRSKGFDKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNE----EGYES 270
KS+ K K++C C K GH K C G Q + + + ++
Sbjct: 1486 KKSKRK---------KWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPKHKI 1638
Query: 271 TGALVVTSWKS--EKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGN 328
+V TS ++ ++ W LDSGCS HM KE+ + V G+ K+ GMG
Sbjct: 1639 VSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGK 1818
Query: 329 VRLKMFDGREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHG-ALITVKGS 387
+ + DG L + V V L NLIS+S G+ C +++ + + +KGS
Sbjct: 1819 L---VHDGLPSLNK-VLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGS 1986
Query: 388 KMNGLYILDGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGL---LGKDKL 444
+ L +S + ++WH R GH+ RG+ ++ +G + K+
Sbjct: 1987 RSKDNCYLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKI 2166
Query: 445 DKLEFCEHCILGKQHRVKFGSGMHHS-SRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDY 503
++ C C +GKQ ++ H + SR+ E +H DL+GP + + GG Y ++DD+
Sbjct: 2167 EEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDF 2346
Query: 504 SRRVWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
SR WV +++KSDTF FKE ++ + +K +R+D+G EF + +F +F E
Sbjct: 2347 SRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSE 2517
Score = 32.0 bits (71), Expect = 1.8
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 2/156 (1%)
Frame = +3
Query: 815 RDGFIGKEPNLETCEAT*TPKGSWMQVDLQEKGRYSGC*RSKVQSKTCGQGFHSSGGDRL 874
R G I KE +L T W QVDLQE+ + C K Q +TC HS RL
Sbjct: 3261 RIGAIQKE*SLGASS*TRGN*CDWHQVDLQEQNQ*RRCYNQK-QGQTCCSRLHSD*RCRL 3437
Query: 875 Q*DLFTGGETLFHKDTYGYRESIQS*VGTNGCEDCFLTW*P*RDNLHGATGRFC--GRQV 932
*+ T H+ Q +GCE+ W P +L GA C
Sbjct: 3438 **NFRPCC*T*VHQIVTWCSLHPQIQAVPDGCEERVSEWIPE*RSLCGAAKGICRSNSSR 3617
Query: 933 *GMSFEEIFVWVEAKP*AMVSSV**ISFEDRLCEKR 968
+ +E +W+EA ++V + + R+ E R
Sbjct: 3618 SCIQAQEGSLWIEASSKSLV*KANRVPYSARV*EGR 3725
>CO981347
Length = 624
Score = 65.9 bits (159), Expect(4) = 7e-31
Identities = 32/38 (84%), Positives = 35/38 (91%)
Frame = +2
Query: 519 F*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDF 556
F*KF+E*HTLI NQ+GTKLK LRTDNGLEFV EQFN+F
Sbjct: 107 F*KFRE*HTLIGNQLGTKLKVLRTDNGLEFVLEQFNEF 220
Score = 60.8 bits (146), Expect(4) = 7e-31
Identities = 25/36 (69%), Positives = 32/36 (88%)
Frame = +3
Query: 484 PSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF 519
PS+ THGG SYFL+IIDD+SRRVW++VLK KS++F
Sbjct: 3 PSRVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSESF 110
Score = 41.6 bits (96), Expect(4) = 7e-31
Identities = 26/77 (33%), Positives = 38/77 (48%)
Frame = +1
Query: 557 LQVERNQEA*DRTENTATKWSC*THV*DSFGACEVYDSRSWVT*EFLG*SSDNGCLFDQ* 616
LQ R+Q+A + +T +W D FG EV+ ++ + LG S + +FD *
Sbjct: 220 LQENRHQKAQNSPSHTIAEWFSRKDEYDHFGKSEVHATKCKTAKDLLGRSCKHNIIFD** 399
Query: 617 MSINWDRLQDTYGGMEW 633
M R QDT G +EW
Sbjct: 400 MPFINLRFQDTNGSLEW 450
Score = 25.8 bits (55), Expect(4) = 7e-31
Identities = 17/49 (34%), Positives = 25/49 (50%)
Frame = +2
Query: 634 ETGRLLFFEGFWSFDICAYQARQA*A*SFEMCLHLLSGRCEGVQVVEIG 682
ET L +G W + + R+A ++C+H LS R VQ +EIG
Sbjct: 452 ETT*LFRIKGVWITGL*SC*TRKAGCKGCKVCVHWLS*RS*KVQAMEIG 598
>AW760164 similar to GP|11994422|dbj oxidoreductase short-chain
dehydrogenase/reductase family-like protein {Arabidopsis
thaliana}, partial (9%)
Length = 428
Score = 89.0 bits (219), Expect = 1e-17
Identities = 47/91 (51%), Positives = 65/91 (70%), Gaps = 1/91 (1%)
Frame = +3
Query: 2 MGS-KWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAK 60
MGS K+++EKFTG NDFGL +KM+A+L QQ VEAL GE + + +K+ ++ KA
Sbjct: 75 MGSAKYEVEKFTGQNDFGLC*LKMRALLVQQGLVEALDGEIKLEKMMADGDKKALLQKAY 254
Query: 61 SAIVLCLGDKVLRDVAREATAASM*AKLESL 91
+AI+L LGDKVLR V++E TA + +KLE L
Sbjct: 255 NAIILSLGDKVLRQVSKETTAVGVWSKLEVL 347
>BI784757
Length = 430
Score = 85.5 bits (210), Expect = 1e-16
Identities = 41/110 (37%), Positives = 69/110 (62%), Gaps = 1/110 (0%)
Frame = +1
Query: 448 EFCEHCILGKQHRVKFGSGMH-HSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRR 506
E C+ C+ KQ R F + + E ++SD+ GP +T + GG YF+S ID+ +R+
Sbjct: 61 EVCDGCLQCKQSRSTFKQNVPIRAKEKLEVIYSDVCGPMQTESLGGNRYFISFIDELTRK 240
Query: 507 VWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDF 556
VWV+++++KSD F F++ + + Q G+ +K LRT+ G E+VS +F +F
Sbjct: 241 VWVYLIRRKSDFFEVFEKFKNMAKKQSGSLIKILRTNGGGEYVSTEFQEF 390
>TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement pol
polyprotein, partial (4%)
Length = 919
Score = 80.5 bits (197), Expect = 5e-15
Identities = 41/102 (40%), Positives = 61/102 (59%), Gaps = 1/102 (0%)
Frame = +1
Query: 450 CEHCILGKQHRVKFGSGMH-HSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVW 508
C+ C +GK+HR F +G +L + VH DL + PTHG +YF++ IDD+S+++W
Sbjct: 331 CDTCEIGKKHRESFPTGKSWRMKKLLKIVHLDLC-TVEIPTHGDNNYFITFIDDFSKKMW 507
Query: 509 VFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVS 550
V+ LK+KS+ FK E Q G K+K L D G E++S
Sbjct: 508 VYFLKQKSEACNAFKMFKAFAEKQNGCKVKALIIDKGQEYLS 633
>TC232910 similar to UP|Q6I8N0 (Q6I8N0) Pol polypeptide, partial (3%)
Length = 690
Score = 66.6 bits (161), Expect(3) = 5e-14
Identities = 46/163 (28%), Positives = 85/163 (51%), Gaps = 6/163 (3%)
Frame = +2
Query: 301 YFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPELKRNLISLSMF 360
+ E + +G +V +GN + G+G V L G+ L DV FVP +++NL+S +
Sbjct: 206 FMEFRPIDDGSIVNMGNVATEPILGLGCVNLVFTSGKSLYL-DVLFVPGIRKNLLSGMIL 382
Query: 361 DGLGYCTRIEHGVCKIS-HGALITVKGSKMNGLYILDGSIVIGNASV-----ASVVPHNN 414
+ G+ +E +S HG+ + G + N ++ L+ + + SV +S+
Sbjct: 383 NNCGFKQVLESDKYILSRHGSFVGF-GYRCNEMFKLNIDVPFVHESVCMASCSSITNMTK 559
Query: 415 SELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGK 457
SE+WH RLGHV + L +++K ++ ++ +E C+ C+L K
Sbjct: 560 SEIWHARLGHVHYKRLKDMSKTCMIPPFDMN-IEKCKTCMLTK 685
Score = 26.2 bits (56), Expect(3) = 5e-14
Identities = 9/17 (52%), Positives = 10/17 (57%)
Frame = +3
Query: 230 KYKCFLCHKQGHFKKDC 246
K C C K GH K+DC
Sbjct: 24 KLSCSKCGKPGHLKRDC 74
Score = 23.9 bits (50), Expect(3) = 5e-14
Identities = 10/32 (31%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Frame = +1
Query: 272 GALVVTSWKSEK-SWVLDSGCSYHMCPRKEYF 302
G +++ S K + +W DSG + H+C + F
Sbjct: 112 GLMILKSNKDDDVAWWFDSGATSHVCKDRRLF 207
>TC234745 weakly similar to UP|Q9FFM0 (Q9FFM0) Copia-like retrotransposable
element, partial (5%)
Length = 494
Score = 70.1 bits (170), Expect = 6e-12
Identities = 41/84 (48%), Positives = 53/84 (62%)
Frame = -2
Query: 827 TCEAT*TPKGSWMQVDLQEKGRYSGC*RSKVQSKTCGQGFHSSGGDRLQ*DLFTGGETLF 886
TC+ T SWMQV LQEK ++S C +VQ KTCG GF++SG L *+ T G+TLF
Sbjct: 286 TCQRVNT---SWMQVGLQEKEKHSKCEEVEVQGKTCG*GFYTSGQS*L**NFLTCGKTLF 116
Query: 887 HKDTYGYRESIQS*VGTNGCEDCF 910
+KD G +S+ S V GC+D F
Sbjct: 115 YKDLDGNCKSV*SGVTIIGCQDNF 44
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 63.5 bits (153), Expect = 6e-10
Identities = 56/180 (31%), Positives = 91/180 (50%), Gaps = 5/180 (2%)
Frame = +2
Query: 386 GSKMNGLYILDGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLD 445
G + +GLY L ++ + V S V + +L H RLGH L+K ++ L+
Sbjct: 236 GIESHGLYYLKPNL----SWVCSAV--TSPKLLHERLGHP------HLSKLKIM-VPSLE 376
Query: 446 KLE--FCEHCILGKQHRVKFGSGMHHSSRL---FEYVHSDLLGPSKTPTHGGGSYFLSII 500
K++ FCE C LGK R S H SR+ F +H D+ GP++ + YF++ I
Sbjct: 377 KIKDLFCESCQLGKHVR---SSXRHVESRVDSPFLVIHXDIWGPNRVSSM-SYRYFVTFI 544
Query: 501 DDYSRRVWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
D++S+ VF++K++S+ F I+ Q G +K LR+DN E+ S + F +
Sbjct: 545 DEFSQCTRVFLMKERSEIL-SFLTSVNKIKTQFGKTIKILRSDNAKEYFSSVISPFXSAQ 721
>BI424202
Length = 421
Score = 62.8 bits (151), Expect = 1e-09
Identities = 38/108 (35%), Positives = 55/108 (50%), Gaps = 1/108 (0%)
Frame = -3
Query: 409 VVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSGMH 468
++ +S LWH RLGH+S + L +G+L E CI GKQ K G
Sbjct: 401 IMNEESSMLWHRRLGHISIERIKRLVNEGVLSTLDFADFETYVDCIKGKQTN-KSKKGAK 225
Query: 469 HSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWV-FVLKKK 515
SS L E +H+D+ P YF++ IDDYSR +++ F+L+ K
Sbjct: 224 RSSNLLEIIHTDICCPDMDA--NSLKYFITFIDDYSRYMYLYFILRMK 87
>NP004897 gag-protease polyprotein
Length = 1923
Score = 58.9 bits (141), Expect = 1e-08
Identities = 57/240 (23%), Positives = 99/240 (40%), Gaps = 23/240 (9%)
Frame = +1
Query: 113 VESISISEQLTEFNKILVDL-ANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTT 171
++S I +Q + K++ +L A E + E+ + + L E+ +I +G+
Sbjct: 1138 IKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEVGFLNSKLENMTKSIKMLNKGSD 1317
Query: 172 TLEEV---------QAALRTKE-------LTKFKDLKVDEGSEGLNVARGRNEHRGKGKG 215
L+EV Q L +T+F K+ G+ ++HR + G
Sbjct: 1318 MLDEVLQLGKNVGNQRGLGFNHKSAGRITMTEFVPAKISTGAT-------MSQHRSRHHG 1476
Query: 216 KSRSKSRSKGFDKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNE----EGYEST 271
+ KS+ K K++C C K GH K C G Q + + + ++
Sbjct: 1477 TQQKKSKRK-----KWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSRRKMMWVPKHKIV 1641
Query: 272 GALVVTSWKS--EKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNV 329
+V TS ++ ++ W LDSGCS HM KE+ + V G+ K+ GMG +
Sbjct: 1642 SLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL 1821
>BI701169
Length = 407
Score = 56.2 bits (134), Expect = 9e-08
Identities = 26/63 (41%), Positives = 40/63 (63%)
Frame = +2
Query: 494 SYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQF 553
S+FL IDD+SR+ WV+ K K + F FK+ ++E + G K+K +R+D G EF S +F
Sbjct: 110 SFFL-FIDDFSRKTWVYFFKHKLEVFENFKKFKAIVEKESGFKIKAMRSDRGGEF*SNEF 286
Query: 554 NDF 556
+
Sbjct: 287 QKY 295
>CA937893 similar to GP|20805072|dbj retrovirus-related pol polyprotein from
transposon TNT 1-94-like, partial (7%)
Length = 412
Score = 54.3 bits (129), Expect = 3e-07
Identities = 33/106 (31%), Positives = 58/106 (54%)
Frame = -2
Query: 3 GSKWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKSA 62
G+K+++ KF G+ +F LW+ +++ +L Q ++ L+ + E E+ ++ +
Sbjct: 315 GAKFEVGKFDGTGNFRLWQKRVKDLLA*QGLLKVLRDSKSNNTEALDWE--EL*ERTATT 142
Query: 63 IVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLY 108
I LCL D+ L + A + KLES YM KSL ++ L Q+LY
Sbjct: 141 IRLCLVDEFLYHMMELAFPGEVWKKLESQYMLKSLTNKLYLMQKLY 4
>BI425191 weakly similar to GP|14586969|gb| pol polyprotein {Citrus x
paradisi}, partial (7%)
Length = 423
Score = 50.1 bits (118), Expect = 7e-06
Identities = 32/101 (31%), Positives = 52/101 (50%)
Frame = +3
Query: 447 LEFCEHCILGKQHRVKFGSGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRR 506
L C CI K H +K + S++L E +H+D+ P + G YF++ IDDYS
Sbjct: 33 LNICVDCIKEK-HTMKRAT---RSTQLLEIMHTDICRPFDVNSFGKERYFITFIDDYSHY 200
Query: 507 VWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLE 547
+V++L +K + +E Q+ K+K +R+D G E
Sbjct: 201 GYVYLLHEKFQAVDVLEIHLNEVERQLDRKVKVVRSDRGGE 323
>BU083646 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 428
Score = 46.6 bits (109), Expect = 7e-05
Identities = 38/134 (28%), Positives = 58/134 (42%)
Frame = +1
Query: 259 VAEASNEEGYESTGALVVTSWKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNN 318
V + S EE +S S SW++DSGC+ HM +E F L V++ N
Sbjct: 1 VKDQSQEEQLFVVSCFAASS--ST*SWLIDSGCTNHMTYDRELFTELDEVVFSKVKIRNE 174
Query: 319 KACKVQGMGNVRLKMFDGREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISH 378
V+G V + G + L+ +V +V E+ +NL+S+ GY E C I
Sbjct: 175 AYIDVKGKETVAI*GHTGLK-LISNVLYVSEISQNLLSVPQLLKKGYKVLFEDKNCMIKD 351
Query: 379 GALITVKGSKMNGL 392
V +M G+
Sbjct: 352 SESREVFNIQMKGM 393
>BI788167
Length = 421
Score = 46.2 bits (108), Expect = 9e-05
Identities = 32/92 (34%), Positives = 48/92 (51%), Gaps = 3/92 (3%)
Frame = +2
Query: 282 EKSWVLDSGCS---YHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGRE 338
+ S +LD G S +HM P E F T G V +GN+ A ++ G+G ++LKM+DG
Sbjct: 29 DNS*LLDIGFSCATWHMTPC*E*FCTYETILEGFVFMGNDHALEIVGVGTIKLKMYDGIV 208
Query: 339 FLLRDVRFVPELKRNLISLSMFDGLGYCTRIE 370
++ V V LK+N + + D L IE
Sbjct: 209 RTIQGVLHVKGLKKNQLFVGKLDDLECKIHIE 304
>BI321802 similar to PIR|H72173|H7217 D5L protein - variola minor virus
(strain Garcia-1966), partial (17%)
Length = 431
Score = 45.1 bits (105), Expect = 2e-04
Identities = 28/96 (29%), Positives = 51/96 (52%), Gaps = 2/96 (2%)
Frame = -3
Query: 313 VRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHG 372
+ +G++K +V+ +G+ +L + G L+D VP ++NLIS+S D LGY +
Sbjct: 345 IYVGDDKLVEVKAIGHFKLLLCTGFYLDLKDTFVVPSFRQNLISVSYLDKLGYLCSFGNN 166
Query: 373 VCKISHGALITVKGSKM--NGLYILDGSIVIGNASV 406
+ ++S + I S + + LY+LD G + V
Sbjct: 165 IFRLSFKSDIVGTRSLLVNDNLYLLDTVAFYGESYV 58
>TC213393 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (23%)
Length = 678
Score = 44.3 bits (103), Expect = 4e-04
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 1/100 (1%)
Frame = -2
Query: 435 KQGLLGKDKLDKLEFCEHCILGKQHRVKFGSGMHHSSRL-FEYVHSDLLGPSKTPTHGGG 493
KQ + KL L CE C GK + F + R F VHSD+ G
Sbjct: 389 KQMVSSLSKLPTLS-CESCQFGKHVQSSFPYCVICRDRFPFVLVHSDVWGLCHGMPTLES 213
Query: 494 SYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*HTLIENQM 533
YF++ I+DYS + W+F++ +S+ F* F + I+N +
Sbjct: 212 RYFVTFIEDYS*QTWLFLMVNRSELF*IFTSFYQEIKNNL 93
>TC233180 similar to UP|Q944K0 (Q944K0) At1g18030/T10F20_3, partial (11%)
Length = 916
Score = 43.5 bits (101), Expect = 6e-04
Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 15/171 (8%)
Frame = +2
Query: 269 ESTGALVVTSWKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGV-VRLGNNKACKVQGMG 327
+S+ L + +E ++DSG + HM P YF + T G + + N + G G
Sbjct: 404 KSSSFLSFNASGTENI*IIDSGVTDHMTPHSSYFSSYTFLIGNQHIIVANGSHIPIIGCG 583
Query: 328 NVRLKMFDGREFLLRDVRFVPELKRNLISLSMFD-GLGYCTRIEHGVC---KISHGALIT 383
N++L+ L +V +VP+L NL+S+ L H C ++ G +I
Sbjct: 584 NIQLQ----SSLHLNNVLYVPKLSNNLLSIHKIT*DLNCVVTFFHSHCVF*DLAMGRMIG 751
Query: 384 VKGSKMNGLYILDGS--------IVIGNASVASVVPHNNSELW--HLRLGH 424
+ + GLY L + + S P ++S++W H LGH
Sbjct: 752 I-AKE*GGLYYLQHEDNKECTR*KALTSNHQTSSEPWSSSQIWLQHKCLGH 901
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.346 0.154 0.537
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 61,062,182
Number of Sequences: 63676
Number of extensions: 924521
Number of successful extensions: 7448
Number of sequences better than 10.0: 71
Number of HSP's better than 10.0 without gapping: 5363
Number of HSP's successfully gapped in prelim test: 165
Number of HSP's that attempted gapping in prelim test: 1769
Number of HSP's gapped (non-prelim): 5882
length of query: 1307
length of database: 12,639,632
effective HSP length: 108
effective length of query: 1199
effective length of database: 5,762,624
effective search space: 6909386176
effective search space used: 6909386176
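The length figures above are related by simple arithmetic. The sketch below reproduces them in Python, assuming that the database length is counted in translated residues (nucleotides divided by 3) and that the effective HSP length is subtracted once from the query and once per database sequence. The variable names are illustrative.

```python
db_letters = 37_918_896   # nucleotides in GMGI ("total letters")
db_sequences = 63_676     # sequences in GMGI
query_length = 1307       # "length of query"
hsp_length = 108          # "effective HSP length"

db_length = db_letters // 3                      # length of database
eff_query = query_length - hsp_length            # effective length of query
eff_db = db_length - db_sequences * hsp_length   # effective length of database
search_space = eff_query * eff_db                # effective search space

print(db_length, eff_query, eff_db, search_space)
```

With the values from this report the four numbers come out as 12639632, 1199, 5762624 and 6909386176, which agree with the lines above.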
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.7 bits)
S2: 65 (29.6 bits)
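The bit values shown next to S1 and S2, and the bit scores in the alignments, are consistent with the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. The sketch below is a hedged illustration, assuming the gapped parameters apply to S2 and to gapped alignment scores, and the ungapped parameters to S1. Because the report prints Lambda and K rounded, results agree only to about a tenth of a bit. The helper `approx_evalue` uses the textbook approximation E = search space * 2^(-S') and ignores any corrections BLAST 2.2.2 applies, so it is an order-of-magnitude check only. Both function names are illustrative.

```python
import math


def bit_score(raw: int, lam: float, k: float) -> float:
    """Convert a raw alignment score to bits with the given Lambda and K."""
    return (lam * raw - math.log(k)) / math.log(2)


def approx_evalue(bits: float, search_space: float) -> float:
    """Approximate E-value for a bit score in a given effective search space."""
    return search_space * 2.0 ** (-bits)


UNGAPPED = (0.346, 0.154)   # Lambda, K from the first statistics block
GAPPED = (0.267, 0.0410)    # Lambda, K from the "Gapped" block

print(bit_score(38, *UNGAPPED))          # compare with S1: 38 (21.7 bits)
print(bit_score(65, *GAPPED))            # compare with S2: 65 (29.6 bits)
print(bit_score(159, *GAPPED))           # compare with CO981347: 65.9 bits (159)
print(approx_evalue(89.0, 6909386176))   # compare with AW760164: 1e-17
```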