
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0220.7
(720 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 264 9e-71
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 263 2e-70
CO981347 119 9e-42
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 120 3e-27
BI784757 119 5e-27
TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p... 111 1e-24
BI701169 108 6e-24
BI427153 100 2e-21
BI424202 76 2e-20
CO983154 96 4e-20
BI702130 weakly similar to GP|4234854|gb| pol polyprotein {Zea m... 67 2e-18
CF922226 83 4e-16
CO981879 82 1e-15
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ... 76 6e-14
BI425191 weakly similar to GP|14586969|gb| pol polyprotein {Citr... 73 5e-13
BG359773 60 3e-09
BI425121 57 2e-08
TC213393 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 57 3e-08
TC213920 54 2e-07
BM892610 53 4e-07
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 264 bits (675), Expect = 9e-71
Identities = 174/553 (31%), Positives = 272/553 (48%), Gaps = 26/553 (4%)
Frame = +1
Query: 191 SSNTFGSALNTESRGRGSQKSHNQSQGRG-------RSKSRGRSQTRV-RNDITCWNCDR 242
+ N G N +S GR + ++ R RS+ G Q + R C C +
Sbjct: 1348 AGNQRGLGFNPKSAGRTTMTEFVPAKNRTGATMSQHRSRHHGMQQKKSKRKKWRCHYCGK 1527
Query: 243 KGHFTNQCKAPRKKKNYQKR*DDDESA------NAATEEVADTLICSLDSPVDSWVIDSG 296
GH C ++ + + + A V T + + S + W +DSG
Sbjct: 1528 YGHIKPFCYHLHGHPHHGTQSSNSRKKMMWVPKHKAVSLVVHTSLRA--SAKEDWYLDSG 1701
Query: 297 ASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGI 356
S H KE L N V DG I+G+G + + L +L+ V V G+
Sbjct: 1702 CSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL----VHDGLPSLNKVLLVKGL 1869
Query: 357 KRNLISIGQLDDEGYHTTFGGGAWKVT--KGNLVVARGKKRGSLYMVAEEDMIAVTEAIN 414
NLISI QL DEG++ F VT K +++ + + + Y+ ++ + ++
Sbjct: 1870 TANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLS 2049
Query: 415 SSS----IWHQRLGHMSEKGMKIMASKGKMS---NLKHVDLGVCEHCILGKQRKVSFSKA 467
S IWHQR GH+ +GMK + KG + NLK + +C C +GKQ K+S K
Sbjct: 2050 SKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKL 2229
Query: 468 GRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKW 527
++ S LEL+H D+ GP V+SLGG RY +DD +R WV F++ KS+ F VFK+
Sbjct: 2230 QHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKEL 2409
Query: 528 KTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTL 587
++ + IK ++SD+G E+++ F +FC+ GI + TP+QNG+ ER NRTL
Sbjct: 2410 SLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTL 2589
Query: 588 NERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPE---EVWYGKEVSLSHLK 644
E AR M LP W +A+NTA Y+ NR V L P E+W G++ S+ H
Sbjct: 2590 QEAARVMLHAKELPYNLWAEAMNTACYIHNR---VTLRRGTPTTLYEIWKGRKPSVKHFH 2760
Query: 645 VFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVL 704
+FG Y+L D ++R K+DPK+ F+GY ++ YR ++ + R ++ SINV ++
Sbjct: 2761 IFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLSP 2940
Query: 705 YKDRSSAESMSSS 717
+ + E + +S
Sbjct: 2941 ARKKDVEEDVRTS 2979
Score = 45.8 bits (107), Expect = 7e-05
Identities = 79/358 (22%), Positives = 144/358 (40%), Gaps = 57/358 (15%)
Frame = +1
Query: 19 WKMLMEDYLYQKMLYQPLTGKKPNDMK-QEDWD-------LLDRQALGVIRLTLSKNVAF 70
WK +++ + + KML GK +++K +EDW L + +AL + + KN+ F
Sbjct: 118 WKAVIKGWEHPKML--DTEGKPTDELKPEEDWTKEEDELALGNSKALNALFNGVDKNI-F 288
Query: 71 NIVNEKTTA----DLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTII 126
++N T A +++K K + L + NL+M E + + + I
Sbjct: 289 RLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIA 468
Query: 127 SQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDI------RDLILS 180
+ +++ +E +V +L+SLP + VTA+ + ++ D++ +L LS
Sbjct: 469 NACTALGERITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLS 648
Query: 181 EDIRRK---------DSGES------------------SNTFGSALNTESRGRGSQKSHN 213
+ +K D GE F LN + QK H
Sbjct: 649 DRAEKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDK---RQKPHV 819
Query: 214 QS---QGRGRSKSRGRSQTRVRND--ITCWNCDRKGHFTNQCKAPRKKKNYQK-----R* 263
Q+ R SK + RS + + I C C+ GH +C P K ++K +
Sbjct: 820 QNIPFDIRKGSKYQKRSDVKPSHSKGIQCHGCEGYGHIIAEC--PTHLKKHRKGLSVCQS 993
Query: 264 DDDESANAATEEVADTLICSLDSPVDSWVIDSGASFHTIPSKELLSNY--ICGKFGKV 319
D + + ++ + L ++ DS DS +F EL ++Y +C K K+
Sbjct: 994 DTESEQESDSDRDVNALTGIFETAEDSSDTDSEITF-----DELAASYRKLCIKSEKI 1152
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 263 bits (672), Expect = 2e-70
Identities = 176/552 (31%), Positives = 270/552 (48%), Gaps = 27/552 (4%)
Frame = +1
Query: 193 NTFGSALNTESRGRGS-------QKSHNQSQGRGRSKSRGRSQTRV-RNDITCWNCDRKG 244
N G N +S GR + + S + + RS+ G Q + R C C + G
Sbjct: 1357 NQRGLGFNHKSAGRTTMTEFVPAKNSTGATMSQHRSRHHGTQQKKSKRKKWRCHYCGKYG 1536
Query: 245 HFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLICSL-------DSPVDSWVIDSGA 297
H C ++ + S+ V I SL S + W +DSG
Sbjct: 1537 HIKPFCYHLHGHPHHGTQ---SSSSGRKMMWVPKHKIVSLVVHTSLRASAKEDWYLDSGC 1707
Query: 298 SFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIK 357
S H KE L N V DG I G+G + + L +L+ V V G+
Sbjct: 1708 SRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL----VHDGLPSLNKVLLVKGLT 1875
Query: 358 RNLISIGQLDDEGYHTTFGGGAWKVT--KGNLVVARGKKRGSLYMVAEEDMIAVTEAINS 415
NLISI QL DEG++ F VT K +++ + + + Y+ ++ + + S
Sbjct: 1876 ANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLFS 2055
Query: 416 SS----IWHQRLGHMSEKGMKIMASKGKMS---NLKHVDLGVCEHCILGKQRKVSFSKAG 468
IWHQR GH+ +GMK + KG + NLK + +C C +GKQ K+S K
Sbjct: 2056 KEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQ 2235
Query: 469 RKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWK 528
++ S LEL+H D+ GP V+SLGG RY +DD +R WV F++ KSD F VFK+
Sbjct: 2236 HQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELS 2415
Query: 529 TEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLN 588
++ + IK ++SD+G E+++ +F +FC+ GI + TP+QNG+ ER NRTL
Sbjct: 2416 LRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQ 2595
Query: 589 ERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPE---EVWYGKEVSLSHLKV 645
E AR M LP W +A+NTA Y+ NR V L P E+W G++ ++ H +
Sbjct: 2596 EAARVMLHAKELPYNLWAEAMNTACYIHNR---VTLRRGTPTTLYEIWKGRKPTVKHFHI 2766
Query: 646 FGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLY 705
FG Y+L D ++R K+DPK+ F+GY ++ YR ++ + R ++ SINV ++
Sbjct: 2767 FGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLTPA 2946
Query: 706 KDRSSAESMSSS 717
+ + E + +S
Sbjct: 2947 RKKDVEEDVRTS 2982
Score = 48.1 bits (113), Expect = 1e-05
Identities = 60/265 (22%), Positives = 117/265 (43%), Gaps = 15/265 (5%)
Frame = +1
Query: 19 WKMLMEDYLYQKMLYQPLTGKKPNDMK-QEDWD-------LLDRQALGVIRLTLSKNVAF 70
WK +++ + + KML GK N++K +EDW L + +AL + + KN+ F
Sbjct: 118 WKAVIKGWEHPKML--DTEGKPTNELKPEEDWTKEEDELALGNSKALNALFNGVDKNI-F 288
Query: 71 NIVNEKTTA-DLMKALSNMYE--KPFAANKVHLIRRLF-NLRMGEGNSVTEHINSFNTII 126
++N T A D + L +E +++ L+ F NL+M E + + + I
Sbjct: 289 RLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIA 468
Query: 127 SQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRK 186
+ +++ +E +V +L+SLP + VTA+ + ++ D++ + + ++
Sbjct: 469 NACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLS 648
Query: 187 DSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKS---RGRSQTRVRNDITCWNCDRK 243
D E + L S G + ++ G + + G+ +V N + + +K
Sbjct: 649 DRTEKKS---KNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRM---DRRQK 810
Query: 244 GHFTNQCKAPRKKKNYQKR*DDDES 268
H N RK YQK+ D+ S
Sbjct: 811 PHVRNIPFDIRKGSEYQKKSDEKPS 885
>CO981347
Length = 624
Score = 119 bits (298), Expect(3) = 9e-42
Identities = 58/125 (46%), Positives = 82/125 (65%)
Frame = +2
Query: 520 VFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGV 579
+F F++ T + NQ G K+K L++DNG E+ ++F +FC + GI+ K +P TP QNG+
Sbjct: 104 IF*KFRE*HTLIGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGL 283
Query: 580 AERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVS 639
AERMN T+ ER RCM + + LPK FW +A NT +YLINR PS L ++ P E W G+
Sbjct: 284 AERMNMTILERVRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTPMEAWSGETT* 463
Query: 640 LSHLK 644
L +K
Sbjct: 464 LFRIK 478
Score = 48.1 bits (113), Expect(3) = 9e-42
Identities = 20/36 (55%), Positives = 29/36 (80%)
Frame = +3
Query: 486 PAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVF 521
P+ VK+ GGS Y++T IDD +R+VW+Y LK+KS+ F
Sbjct: 3 PSRVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSESF 110
Score = 42.7 bits (99), Expect(3) = 9e-42
Identities = 21/49 (42%), Positives = 30/49 (60%)
Frame = +3
Query: 636 KEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFW 684
K + S LKVFG +++ D K+ KLD +A+KC FIGY + Y+ W
Sbjct: 453 KPPNYSGLKVFGSLAF---DHVKQGKLDARAVKCVFIGYPKGVKRYKLW 590
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 120 bits (300), Expect = 3e-27
Identities = 91/286 (31%), Positives = 143/286 (49%), Gaps = 3/286 (1%)
Frame = +2
Query: 319 VYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDD-EGYHTTFGG 377
+ LADG + GIG + SS +L++V + G N+ S+ QL TF
Sbjct: 20 ITLADGSRVVATGIGHVSPTSS----LSLNSVVFILGCPFNITSLSQLTRFRNCSVTFDA 187
Query: 378 GAWKVTKGNL--VVARGKKRGSLYMVAEEDMIAVTEAINSSSIWHQRLGHMSEKGMKIMA 435
++ + + + G + LY + + ++ V A+ S + H+RLGH +KIM
Sbjct: 188 NSFVIQECGTGWTIGVGIESHGLYYL-KPNLSWVCSAVTSPKLLHERLGHPHLSKLKIM- 361
Query: 436 SKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGS 495
+ +L+ + CE C LGK + S + S L ++H D+WGP V S+
Sbjct: 362 ----VPSLEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFL-VIHXDIWGPNRVSSMS-Y 523
Query: 496 RYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEF 555
RY+VTFID+ ++ V+ +K +S++ S F +++ Q G IK L+SDN EY S
Sbjct: 524 RYFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIKILRSDNAKEYFSSVI 700
Query: 556 KKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLP 601
F S GI + P TP+QN +AER NR L E AR + + + P
Sbjct: 701 SPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
>BI784757
Length = 430
Score = 119 bits (298), Expect = 5e-27
Identities = 54/115 (46%), Positives = 81/115 (69%)
Frame = +1
Query: 450 VCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKV 509
VC+ C+ KQ + +F + EKLE++++DV GP +SLGG+RY+++FID+ TRKV
Sbjct: 64 VCDGCLQCKQSRSTFKQNVPIRAKEKLEVIYSDVCGPMQTESLGGNRYFISFIDELTRKV 243
Query: 510 WVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGI 564
WVY ++ KSD F VF+K+K + Q+G IK L+++ GGEY S EF++FC + GI
Sbjct: 244 WVYLIRRKSDFFEVFEKFKNMAKKQSGSLIKILRTNGGGEYVSTEFQEFCDQQGI 408
>TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement pol
polyprotein, partial (4%)
Length = 919
Score = 111 bits (278), Expect = 1e-24
Identities = 58/146 (39%), Positives = 87/146 (58%)
Frame = +1
Query: 449 GVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRK 508
GVC+ C +GK+ + SF + L++VH D+ + + G + Y++TFIDD ++K
Sbjct: 325 GVCDTCEIGKKHRESFPTGKSWRMKKLLKIVHLDLC-TVEIPTHGDNNYFITFIDDFSKK 501
Query: 509 VWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIK 568
+WVYFLK KS+ + FK +K E Q G K+K+L D G EY S + F ++GI+
Sbjct: 502 MWVYFLKQKSEACNAFKMFKAFAEKQNGCKVKALIIDKGQEYLS--YTIFFEKHGIQHQL 675
Query: 569 TIPGTPEQNGVAERMNRTLNERARCM 594
T TP+ NGV ER N+T+ + RCM
Sbjct: 676 TTKYTPQHNGVTERKNKTIMDMVRCM 753
>BI701169
Length = 407
Score = 108 bits (271), Expect = 6e-24
Identities = 51/97 (52%), Positives = 69/97 (70%)
Frame = +2
Query: 498 YVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKK 557
+ FIDD +RK WVYF K K +VF FKK+K VE ++G KIK+++SD GGE+ S EF+K
Sbjct: 113 FFLFIDDFSRKTWVYFFKHKLEVFENFKKFKAIVEKESGFKIKAMRSDRGGEF*SNEFQK 292
Query: 558 FCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCM 594
+C ++GIR + +P+QNGVAER NRT+ AR M
Sbjct: 293 YCDDHGIRRPLMVLRSPQQNGVAERKNRTILNMARSM 403
Score = 29.6 bits (65), Expect = 4.9
Identities = 23/80 (28%), Positives = 33/80 (40%)
Frame = +3
Query: 461 KVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDV 520
K SF K + LEL+HTD GP SL S ++ + + K FL
Sbjct: 6 KKSFPKESNLRAKKLLELIHTD-GGPIKTSSLDKSNHFFSSLMIFQEKHGCIFLSIN*RC 182
Query: 521 FSVFKKWKTEVENQTGLKIK 540
+ K K + + LK+K
Sbjct: 183 LRISKSSKPLLRKKVVLKLK 242
>BI427153
Length = 422
Score = 100 bits (249), Expect = 2e-21
Identities = 55/134 (41%), Positives = 75/134 (55%), Gaps = 1/134 (0%)
Frame = +1
Query: 569 TIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQL 628
T P TP+QNG+AER N L E AR + + S +P W DA+ TA +LINR PS L+ Q+
Sbjct: 10 TCPHTPQQNGIAERKNHHLLETARSLMLNSNVPTHHWGDAVLTACFLINRMPSSSLENQI 189
Query: 629 PEEVWYGKEVSL-SHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQ 687
P + + ++ KVFGC +V S DKL +++KC F+GY GY +
Sbjct: 190 PHSIVFPNDLLFYVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYTCYFPN 369
Query: 688 NRKIIRSINVTFNE 701
R+ S NVTF E
Sbjct: 370 MRRYYMSANVTFFE 411
>BI424202
Length = 421
Score = 76.3 bits (186), Expect(2) = 2e-20
Identities = 39/100 (39%), Positives = 60/100 (60%)
Frame = -3
Query: 415 SSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSE 474
SS +WH+RLGH+S + +K + ++G +S L D CI GKQ + SK G K S
Sbjct: 386 SSMLWHRRLGHISIERIKRLVNEGVLSTLDFADFETYVDCIKGKQ--TNKSKKGAKRSSN 213
Query: 475 KLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFL 514
LE++HTD+ P +Y++TFIDD +R +++YF+
Sbjct: 212 LLEIIHTDIC--CPDMDANSLKYFITFIDDYSRYMYLYFI 99
Score = 42.0 bits (97), Expect(2) = 2e-20
Identities = 19/40 (47%), Positives = 27/40 (67%)
Frame = -2
Query: 509 VWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGG 548
++V L SK++ VFK +K EVE Q G +IK ++SD GG
Sbjct: 120 IYVSLLHSKNEALDVFKVFKAEVEKQCGKQIKIMRSDRGG 1
>CO983154
Length = 568
Score = 96.3 bits (238), Expect = 4e-20
Identities = 57/150 (38%), Positives = 82/150 (54%), Gaps = 2/150 (1%)
Frame = +3
Query: 573 TPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEV 632
TP+QNG+AER NR L E AR + + +P W DA+ T+ +LINR PS L+ Q+P +
Sbjct: 6 TPQQNGIAERKNRHLLETARSLMLNLNVPIHHWGDAVLTSCFLINRMPSSSLENQIPHSL 185
Query: 633 WYGKEVSLSHL--KVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRK 690
+ + L H+ KVFGC +V S DKL +++KC F+GY GY+ + R+
Sbjct: 186 VFPHD-PLFHVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYKCYSPTMRR 362
Query: 691 IIRSINVTFNESVLYKDRSSAESMSSSKQL 720
S +VTF E + S S S + L
Sbjct: 363 YYMSADVTFFEDTPFFSPSVDHSSSLQEVL 452
>BI702130 weakly similar to GP|4234854|gb| pol polyprotein {Zea mays},
partial (14%)
Length = 421
Score = 67.4 bits (163), Expect(2) = 2e-18
Identities = 37/84 (44%), Positives = 53/84 (63%)
Frame = -1
Query: 579 VAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEV 638
VAER NRTL + R MR LP+ W DA+ TAAY++NR P+ + + P E++ G +
Sbjct: 250 VAERRNRTLLDMVRSMRSNVKLPQFLWIDALKTAAYILNRVPTKAVS-KTPFELFKGWKP 74
Query: 639 SLSHLKVFGCVSYVLIDSDKRDKL 662
SL H++V+GC S V I + + KL
Sbjct: 73 SLRHIRVWGCPSEVRIYNPQEKKL 2
Score = 43.9 bits (102), Expect(2) = 2e-18
Identities = 26/57 (45%), Positives = 33/57 (57%), Gaps = 9/57 (15%)
Frame = -2
Query: 531 VENQTGLKIKSLKSDNGGEY------DSQ---EFKKFCSENGIRMIKTIPGTPEQNG 578
VE Q G +IK ++SD GGEY D Q F KF E+ I T+PG+P+QNG
Sbjct: 420 VEKQCGKQIKIVRSDRGGEYYGRYTEDGQAPGSFAKFLQEHEIVAQYTMPGSPDQNG 250
>CF922226
Length = 667
Score = 83.2 bits (204), Expect = 4e-16
Identities = 61/228 (26%), Positives = 105/228 (45%), Gaps = 12/228 (5%)
Frame = -3
Query: 96 NKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAA 155
N+++ + L++ +M E SV E ++ FN +I L ++ +T D+E L LL LP S++
Sbjct: 647 NRLYXKQSLYSFKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYLPKSYSH 468
Query: 156 TVTAVSNSARDNKLKFDDIRDLILSEDIRRKD------SGESSNTFGSALNTESR-GRGS 208
+ RD+ + D+++ + S+++ + SGE G +S +
Sbjct: 467 FKETLL-FGRDS-VSLDEVQTALNSKELNERKEKKSSASGEGLTARGKTFKKDSEFDKKK 294
Query: 209 QKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDES 268
QK NQ G G I C++C ++GH C P ++KN +S
Sbjct: 293 QKPENQKNGEGNIFK-----------IRCYHCKKEGHTRKVC--PERQKNGGSNNRKKDS 153
Query: 269 ANAATE-----EVADTLICSLDSPVDSWVIDSGASFHTIPSKELLSNY 311
NAA E A+ L+ S +P W++DSG S+H P+K +
Sbjct: 152 GNAAIVQDDGYESAEALMVSEKNPETKWIMDSGCSWHMTPNKSWFEQF 9
>CO981879
Length = 576
Score = 81.6 bits (200), Expect = 1e-15
Identities = 45/106 (42%), Positives = 61/106 (57%)
Frame = -1
Query: 522 SVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAE 581
S+FK + ++ Q +KIK +SDNG EY ++ K ENGI + TP+QNGVAE
Sbjct: 573 SIFKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAE 394
Query: 582 RMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQ 627
R NR L E AR + Q+ PK W +AI T YL N+ L++Q
Sbjct: 393 RKNRHLXEVARALLFQNKAPKYXWGEAILTGTYLKNKNA*QNLEFQ 256
Score = 57.4 bits (137), Expect = 2e-08
Identities = 32/89 (35%), Positives = 52/89 (57%), Gaps = 5/89 (5%)
Frame = -2
Query: 618 RGPSVPLDYQLPEEVWYG----KEVSLS-HLKVFGCVSYVLIDSDKRDKLDPKAIKCFFI 672
R PS L+++ P +V+ +S + LK+FGC +V I + KL+P+A KC F+
Sbjct: 284 RMPSKILNFRTPLDVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVFV 105
Query: 673 GYGSDMYGYRFWDEQNRKIIRSINVTFNE 701
GY + GY+ +D ++K +I+VTF E
Sbjct: 104 GYAPNQKGYKCFDPTSKKTFVTIDVTFFE 18
>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 (Fragment)
, partial (21%)
Length = 912
Score = 75.9 bits (185), Expect = 6e-14
Identities = 41/103 (39%), Positives = 56/103 (53%)
Frame = -2
Query: 600 LPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKR 659
+P FW A+ AAYLIN P+ L P E +G +SHL++FGC+ Y R
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732
Query: 660 DKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNES 702
KL+P+A C FIG+ + GY +D + II S NV F E+
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYEN 603
>BI425191 weakly similar to GP|14586969|gb| pol polyprotein {Citrus x
paradisi}, partial (7%)
Length = 423
Score = 72.8 bits (177), Expect = 5e-13
Identities = 41/108 (37%), Positives = 59/108 (53%)
Frame = +3
Query: 442 NLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTF 501
+L DL +C CI + K + +A R + + LE++HTD+ P V S G RY++TF
Sbjct: 15 DLAFSDLNICVDCI---KEKHTMKRATRST--QLLEIMHTDICRPFDVNSFGKERYFITF 179
Query: 502 IDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGE 549
IDD + +VY L K V + EVE Q K+K ++SD GGE
Sbjct: 180 IDDYSHYGYVYLLHEKFQAVDVLEIHLNEVERQLDRKVKVVRSDRGGE 323
>BG359773
Length = 382
Score = 60.5 bits (145), Expect = 3e-09
Identities = 43/127 (33%), Positives = 61/127 (47%), Gaps = 9/127 (7%)
Frame = -1
Query: 545 DNGGEY----DSQE-----FKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMR 595
D GGEY D E F K + GI T+ G P+ NGV+E+ NR L + M
Sbjct: 379 DRGGEYYGRYDETEQYXGPFAKLIQKRGICAQYTMLGIPQ*NGVSEKRNRILMDMVTSML 200
Query: 596 IQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLID 655
I LP W A+ YL+NR PS + + E+W + S+ HL V+G + + I
Sbjct: 199 INLTLPISLWMYALKIVMYLLNRVPSKAVP-KTHFELWTNRTPSIRHLDVWGFQTEIRIY 23
Query: 656 SDKRDKL 662
+ + KL
Sbjct: 22 NPQERKL 2
>BI425121
Length = 412
Score = 57.4 bits (137), Expect = 2e-08
Identities = 47/162 (29%), Positives = 68/162 (41%), Gaps = 10/162 (6%)
Frame = +3
Query: 491 SLGGSRYYVTFIDDSTRKVWVYFLKSKSDVF----------SVFKKWKTEVENQTGLKIK 540
SLG +Y +DD +R WVYFL K + VF EV + L+I
Sbjct: 6 SLGCKKYEFLTVDDYSRYTWVYFLAHKHESLRYFIRGFKMKKVFVFLLLEVTMELSLRIL 185
Query: 541 SLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGL 600
S FC NGI + P TP++N V ER NRTL E+AR +
Sbjct: 186 S*NH-------------FCERNGIFHNLS*PRTPQENRVVERKNRTLQEKARTI------ 308
Query: 601 PKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSH 642
+ T ++ N+ P+ + P E+W G+ +S+
Sbjct: 309 --------LFTTCFVQNKILIRPMIKKTPYELWKGRRHIISY 410
>TC213393 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (23%)
Length = 678
Score = 57.0 bits (136), Expect = 3e-08
Identities = 31/97 (31%), Positives = 51/97 (51%)
Frame = -2
Query: 437 KGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSR 496
K +S+L + CE C GK + SF LVH+DVWG SR
Sbjct: 389 KQMVSSLSKLPTLSCESCQFGKHVQSSFPYCVICRDRFPFVLVHSDVWGLCHGMPTLESR 210
Query: 497 YYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVEN 533
Y+VTFI+D + + W++ + ++S++F +F + E++N
Sbjct: 209 YFVTFIEDYS*QTWLFLMVNRSELF*IFTSFYQEIKN 99
>TC213920
Length = 428
Score = 53.9 bits (128), Expect = 2e-07
Identities = 37/87 (42%), Positives = 48/87 (54%), Gaps = 2/87 (2%)
Frame = -3
Query: 559 CSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINR 618
CSE I + TI GTP+QNGV+E+ NRTL + R M I L W + TA L+NR
Sbjct: 369 CSETCICVQYTILGTPQQNGVSEKRNRTLMDMVRSMLIN*TLSISLWMYTLKTAM*LLNR 190
Query: 619 GPS--VPLDYQLPEEVWYGKEVSLSHL 643
PS VP + P E+ + S+ HL
Sbjct: 189 VPSKVVP---KTPFEL*TNRIPSIRHL 118
>BM892610
Length = 421
Score = 53.1 bits (126), Expect = 4e-07
Identities = 32/88 (36%), Positives = 49/88 (55%)
Frame = -3
Query: 614 YLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIG 673
YLINR P+ L +Q+P V + K + + L+ FGC Y + + KLD ++ +C F+G
Sbjct: 413 YLINRLPTTSLKFQVPCTV-FNKLPNYNFLRNFGCSCYPFLRPYNKHKLDFRSHECLFLG 237
Query: 674 YGSDMYGYRFWDEQNRKIIRSINVTFNE 701
Y + GY+ N K+ S +V FNE
Sbjct: 236 YSTPHKGYKCL-SPNGKLYVSKDVIFNE 156
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.317 0.134 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,482,679
Number of Sequences: 63676
Number of extensions: 407705
Number of successful extensions: 2175
Number of sequences better than 10.0: 90
Number of HSP's better than 10.0 without gapping: 2072
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2134
length of query: 720
length of database: 12,639,632
effective HSP length: 104
effective length of query: 616
effective length of database: 6,017,328
effective search space: 3706674048
effective search space used: 3706674048
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)
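The length and score figures in the statistics block above are related by simple arithmetic, and a short Python sketch can reproduce them. It uses the standard Karlin-Altschul relations, bit score S' = (lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). All function and constant names are illustrative only. Dividing the database letter count by 3 is an inference from the reported "length of database" for this TBLASTN run, not something the report states. Lambda and K are printed to only three digits, so the recomputed bit scores and E-values are approximate.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 720            # "length of query"
DB_LETTERS = 37_918_896    # nucleotide letters in GMGI
DB_SEQS = 63_676           # sequences in GMGI
HSP_LEN = 104              # "effective HSP length"
LAMBDA_GAPPED = 0.267      # gapped Lambda (rounded in the report)
K_GAPPED = 0.0410          # gapped K (rounded in the report)


def effective_search_space():
    """Recompute the effective lengths and search space."""
    eff_query = QUERY_LEN - HSP_LEN
    # Assumption: the translated search counts the database in codons,
    # which matches the reported "length of database: 12,639,632".
    db_len = DB_LETTERS // 3
    eff_db = db_len - DB_SEQS * HSP_LEN
    return eff_query, db_len, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))
```

For example, effective_search_space() returns 616, 12,639,632, 6,017,328 and 3,706,674,048, matching the values listed above, and bit_score(62) comes out close to the 28.5 bits shown for S2. Feeding the top hit's raw score of 675 into expect_value gives a value of the same order of magnitude as the reported 9e-71.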
Lotus: description of TM0220.7