
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0032.12
(921 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 474 e-134
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 470 e-132
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 130 3e-30
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 122 6e-28
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 120 2e-27
BU548243 119 8e-27
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 114 3e-25
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 114 3e-25
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 112 6e-25
CO981879 74 6e-25
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 111 1e-24
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ... 110 3e-24
TC232995 108 8e-24
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 107 2e-23
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 101 2e-21
CO983516 100 3e-21
BE211208 98 2e-20
CO983154 97 3e-20
CO981347 69 5e-20
BM307983 96 6e-20
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 474 bits (1220), Expect = e-134
Identities = 304/905 (33%), Positives = 457/905 (49%), Gaps = 28/905 (3%)
Frame = +1
Query: 39 VWHNRLGHPTSSALNHLRNNKLIYCDPS---RSSTVCDSCVLGKHVRLPFSSSE-TITLR 94
+WH R GH + + + + P+ +C C +GK V++ + T R
Sbjct: 2071 IWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSR 2250
Query: 95 SFDILHSDLW-TSPILSTAGHRYYVLVLDDHTDFLWMFPISKKSQVYDIFTTLATLIKTQ 153
++LH DL + S G RY +V+DD + F W+ I +KS+ +++F L+ ++ +
Sbjct: 2251 VLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETFEVFKELSLRLQRE 2430
Query: 154 FSANIKCLQCDNGREYDNDSFRRYCHANGLIFRFSCPHTSSQNGKSERKIRTINNMIRTL 213
IK ++ D+GRE++N F +C + G+ FS T QNG ERK RT+ R +
Sbjct: 2431 KDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVM 2610
Query: 214 LAHSSVPPSFWHHALQMATYLLNILPRKTLRNDSPTQRLYH----RDPSYSHLRVFGCLC 269
L +P + W A+ A Y+ N R TLR +PT LY R PS H +FG C
Sbjct: 2611 LHAKELPYNLWAEAMNTACYIHN---RVTLRRGTPTT-LYEIWKGRKPSVKHFHIFGSPC 2778
Query: 270 FPFFPSATINKLQPRSSPCVFLGYPMNHRGYKCYDLSNRKLIISRHVIFDE-SRFPFADL 328
+ K+ P+S +FLGY N R Y+ ++ R ++ S +V+ D+ S D+
Sbjct: 2779 YILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLSPARKKDV 2958
Query: 329 PLESTSSYDCFTEDLPPSLIHHWQTTSTRPPDLSIPPSSPTDSTMPSPAPTSSPTSSAPL 388
+ +S D + + + + + S TD + + S T +
Sbjct: 2959 EEDVRTSGDNVAD-------------AAKSGENAENSDSATDESNINQPDKRSSTRIQKM 3099
Query: 389 PLPPVPPTPPTRTMTTHSMHGISKPKKPFSLSVSIDDPSISPLPCNPKQALSDPNWKFAM 448
+ P R +TT S F + P N K+AL+D W AM
Sbjct: 3100 HPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIE---------PKNVKEALTDEFWINAM 3252
Query: 449 QPEFNALIRNNTWELVPRPCDVNVIRYM*IFRHKKQSNGLFEHYKARLVGDGWSQIAGVD 508
Q E RN WELVPRP NVI IF++K G+ KARLV G++QI GVD
Sbjct: 3253 QEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVD 3432
Query: 509 CDETFNLVVKPATIRTVLSIALSRSWPIH*LDVHNAFLHGDLHETVYMHQPLGFRDSQHP 568
DETF V + +IR +L +A + ++ +DV +AFL+G L+E VY+ QP GF D HP
Sbjct: 3433 FDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHP 3612
Query: 569 DYVCRRKKSLYGLKQAPRAWYQRFADYVSSIGFQHSSSDHSYIL-----------LYVDD 617
D+V R KK+LYGLKQAPRAWY+R ++++ G++ D + + +YVDD
Sbjct: 3613 DHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDD 3792
Query: 618 IILVASSHDLRKSFMALLAFEFAMKDLGPLSYFLGIAVTRYVGGLFLSQSTYASEIIARA 677
I+ S+++ + F+ + EF M +G L+YFLG+ V + +FLSQS YA I+ +
Sbjct: 3793 IVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKF 3972
Query: 678 GMASCNPSATPVDTK*KLSSSSGTPCEDVTLYQSLAGALQYLTFTRPNISYVVQQVCLHM 737
GM + + TP T KLS D +LY+S+ G+L YLT +RP+I+Y V +
Sbjct: 3973 GMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQ 4152
Query: 738 HAPHTEHMLAMFVAL*LTVFTCIPPLLRNLFLIRMLTG-------GMSCTRRSTSGYCVF 790
P H+ + L T ++ ML G G + R+STSG C +
Sbjct: 4153 ANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFY 4332
Query: 791 LGDNLISWSSKRQPTLSHSSAEAEYRGVANVVSESCWIRNLLLELHFPLSQETLVHCDNV 850
LG+NLISW SK+Q +S S+AEAEY + S+ W++ +L E + TL +CDN+
Sbjct: 4333 LGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTL-YCDNM 4509
Query: 851 SSIYLSGNPVHHQRTKHIEMDIHFVREKVARGQARILHVPSRHQIADIFTKGLPRVLFDD 910
S+I +S NPV H RTKHI++ H++R+ V + HV + QIADIFTK L F+
Sbjct: 4510 SAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEK 4689
Query: 911 FRSSL 915
R L
Sbjct: 4690 LRGKL 4704
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 470 bits (1209), Expect = e-132
Identities = 299/907 (32%), Positives = 453/907 (48%), Gaps = 30/907 (3%)
Frame = +1
Query: 39 VWHNRLGHPTSSALNHLRNNKLIYCDPS---RSSTVCDSCVLGKHVRLPFSSSE-TITLR 94
+WH R GH + + + + P+ +C C +GK V++ + T R
Sbjct: 2074 IWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMSHQKLQHQTTSR 2253
Query: 95 SFDILHSDLW-TSPILSTAGHRYYVLVLDDHTDFLWMFPISKKSQVYDIFTTLATLIKTQ 153
++LH DL + S G RY +V+DD + F W+ I +KS +++F L+ ++ +
Sbjct: 2254 VLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQRE 2433
Query: 154 FSANIKCLQCDNGREYDNDSFRRYCHANGLIFRFSCPHTSSQNGKSERKIRTINNMIRTL 213
IK ++ D+GRE++N F +C + G+ FS T QNG ERK RT+ R +
Sbjct: 2434 KDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAARVM 2613
Query: 214 LAHSSVPPSFWHHALQMATYLLNILPRKTLRNDSPTQRLYH----RDPSYSHLRVFGCLC 269
L +P + W A+ A Y+ N R TLR +PT LY R P+ H +FG C
Sbjct: 2614 LHAKELPYNLWAEAMNTACYIHN---RVTLRRGTPTT-LYEIWKGRKPTVKHFHIFGSPC 2781
Query: 270 FPFFPSATINKLQPRSSPCVFLGYPMNHRGYKCYDLSNRKLIISRHVIFDESRFPFADLP 329
+ K+ P+S +FLGY N R Y+ ++ R ++ S +V+ D
Sbjct: 2782 YILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVD---------- 2931
Query: 330 LESTSSYDCFTEDLPPSLIHHWQTTSTRPPDLSIPPSSPTDSTMPSPAPTSSPTSSAPLP 389
DL P+ + D + ++ S + T P + P
Sbjct: 2932 ------------DLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDK 3075
Query: 390 LPPV--PPTPPTRTMTTHSMHGISKPKKPFSLSVSIDDPSISPL-PCNPKQALSDPNWKF 446
P + P + G++ + + + + +S + P N K+AL+D W
Sbjct: 3076 RPSIRIQKMHPKELIIGDPNRGVTTRSR--EIEIVSNSCFVSKIEPKNVKEALTDEFWIN 3249
Query: 447 AMQPEFNALIRNNTWELVPRPCDVNVIRYM*IFRHKKQSNGLFEHYKARLVGDGWSQIAG 506
AMQ E RN WELVPRP NVI IF++K G+ KARLV G++QI G
Sbjct: 3250 AMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEG 3429
Query: 507 VDCDETFNLVVKPATIRTVLSIALSRSWPIH*LDVHNAFLHGDLHETVYMHQPLGFRDSQ 566
VD DETF V + +IR +L +A + ++ +DV +AFL+G L+E Y+ QP GF D
Sbjct: 3430 VDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPT 3609
Query: 567 HPDYVCRRKKSLYGLKQAPRAWYQRFADYVSSIGFQHSSSDHSYIL-----------LYV 615
HPD+V R KK+LYGLKQAPRAWY+R ++++ G++ D + + +YV
Sbjct: 3610 HPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYV 3789
Query: 616 DDIILVASSHDLRKSFMALLAFEFAMKDLGPLSYFLGIAVTRYVGGLFLSQSTYASEIIA 675
DDI+ S+++ + F+ + EF M +G L+YFLG+ V + +FLSQS YA I+
Sbjct: 3790 DDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVK 3969
Query: 676 RAGMASCNPSATPVDTK*KLSSSSGTPCEDVTLYQSLAGALQYLTFTRPNISYVVQQVCL 735
+ GM + + TP T KLS D +LY+S+ G+L YLT +RP+I+Y V
Sbjct: 3970 KFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCAR 4149
Query: 736 HMHAPHTEHMLAMFVAL*LTVFTCIPPLLRNLFLIRMLTG-------GMSCTRRSTSGYC 788
+ P H+ + L T ++ ML G G + R+STSG C
Sbjct: 4150 YQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGC 4329
Query: 789 VFLGDNLISWSSKRQPTLSHSSAEAEYRGVANVVSESCWIRNLLLELHFPLSQETLVHCD 848
+LG NLISW SK+Q +S S+AEAEY + S+ W++ +L E + TL +CD
Sbjct: 4330 FYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTL-YCD 4506
Query: 849 NVSSIYLSGNPVHHQRTKHIEMDIHFVREKVARGQARILHVPSRHQIADIFTKGLPRVLF 908
N+S+I +S NPV H RTKHI++ H++R+ V + HV + QIADIFTK L F
Sbjct: 4507 NMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQF 4686
Query: 909 DDFRSSL 915
+ R L
Sbjct: 4687 EKLRGKL 4707
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
partial (30%)
Length = 687
Score = 130 bits (327), Expect = 3e-30
Identities = 62/128 (48%), Positives = 84/128 (65%)
Frame = +2
Query: 776 GMSCTRRSTSGYCVFLGDNLISWSSKRQPTLSHSSAEAEYRGVANVVSESCWIRNLLLEL 835
G RRSTSGYCVF+G NL+SW SK+Q ++ SSAEAEYR +A V E WI+ L EL
Sbjct: 47 GCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCELMWIKQFLQEL 226
Query: 836 HFPLSQETLVHCDNVSSIYLSGNPVHHQRTKHIEMDIHFVREKVARGQARILHVPSRHQI 895
F + ++CDN ++++++ NPV H+RTKHIE+D HF+REK+ + + S Q
Sbjct: 227 RFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEIVTEFIGSNDQP 406
Query: 896 ADIFTKGL 903
DI TK L
Sbjct: 407 VDILTKSL 430
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 122 bits (307), Expect = 6e-28
Identities = 62/150 (41%), Positives = 95/150 (63%)
Frame = -3
Query: 447 AMQPEFNALIRNNTWELVPRPCDVNVIRYM*IFRHKKQSNGLFEHYKARLVGDGWSQIAG 506
AMQ E N RNN W+LV +P + VI +FR+K +G+ KARLV G++Q G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 507 VDCDETFNLVVKPATIRTVLSIALSRSWPIH*LDVHNAFLHGDLHETVYMHQPLGFRDSQ 566
+D +ET+ V + IR +L+ ++ ++ +DV +AFL+G + E VY+ QP GF
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 567 HPDYVCRRKKSLYGLKQAPRAWYQRFADYV 596
P +V + +K+LYGLKQAPRAWY+R ++++
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 120 bits (302), Expect = 2e-27
Identities = 73/180 (40%), Positives = 99/180 (54%), Gaps = 12/180 (6%)
Frame = +1
Query: 486 NGLFEHYKARLVGDGWSQIAGVDCDETFNLVVKPATIRTVLSIALSRSWPIH*LDVHNAF 545
+G + +KARLV ++Q+ G D TF+ V K A + + S+A+ WP+ *LD NAF
Sbjct: 25 SGTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAF 204
Query: 546 LHGDLHETVYMHQPLGF-RDSQHPDYVCRRKKSLYGLKQAPRAWYQRFADYVSSIGFQHS 604
LHG L E VYM QPLGF + + VC+ +S YGLKQ+PRAW F ++I +
Sbjct: 205 LHGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSH 378
Query: 605 SSDHS-----------YILLYVDDIILVASSHDLRKSFMALLAFEFAMKDLGPLSYFLGI 653
+DHS Y+++YVDDI + S L +F KDLG L YFLGI
Sbjct: 379 EADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>BU548243
Length = 599
Score = 119 bits (297), Expect = 8e-27
Identities = 61/135 (45%), Positives = 84/135 (62%)
Frame = -1
Query: 782 RSTSGYCVFLGDNLISWSSKRQPTLSHSSAEAEYRGVANVVSESCWIRNLLLELHFPLSQ 841
RST G +FLG NLISW S++Q + SS EAEYR +A +E WI+ LL+EL P +
Sbjct: 554 RSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQALLMELQIPFTP 375
Query: 842 ETLVHCDNVSSIYLSGNPVHHQRTKHIEMDIHFVREKVARGQARILHVPSRHQIADIFTK 901
++ CDN S++ ++ N V H RTKH+E+D+ FV EKV Q +I H+P+ Q A I TK
Sbjct: 374 PVIL-CDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIPALDQWAGILTK 198
Query: 902 GLPRVLFDDFRSSLS 916
L F +S L+
Sbjct: 197 PLSSARFTFLKSKLT 153
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 114 bits (284), Expect = 3e-25
Identities = 55/135 (40%), Positives = 84/135 (61%)
Frame = +3
Query: 781 RRSTSGYCVFLGDNLISWSSKRQPTLSHSSAEAEYRGVANVVSESCWIRNLLLELHFPLS 840
R+ST+G+ F+GD +W SK+QP ++ S+ EAEY + V + W+RNLL EL P
Sbjct: 15 RKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMPQE 194
Query: 841 QETLVHCDNVSSIYLSGNPVHHQRTKHIEMDIHFVREKVARGQARILHVPSRHQIADIFT 900
+ + DN S++ L+ NPV H+++KHI+ HF+RE + + + ++ +V S+ Q ADIFT
Sbjct: 195 EPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADIFT 374
Query: 901 KGLPRVLFDDFRSSL 915
K L F RS L
Sbjct: 375 KPLKLETFVKLRSML 419
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 114 bits (284), Expect = 3e-25
Identities = 64/181 (35%), Positives = 97/181 (53%), Gaps = 1/181 (0%)
Frame = +2
Query: 41 HNRLGHPTSSALNHLRNNKLIYCDPSRSSTV-CDSCVLGKHVRLPFSSSETITLRSFDIL 99
H RLGHP HL K++ + + C+SC LGKHVR E+ F ++
Sbjct: 317 HERLGHP------HLSKLKIMVPSLEKIKDLFCESCQLGKHVRSSXRHVESRVDSPFLVI 478
Query: 100 HSDLWTSPILSTAGHRYYVLVLDDHTDFLWMFPISKKSQVYDIFTTLATLIKTQFSANIK 159
H D+W +S+ +RY+V +D+ + +F + ++S++ F T IKTQF IK
Sbjct: 479 HXDIWGPNRVSSMSYRYFVTFIDEFSQCTRVFLMKERSEILS-FLTSVNKIKTQFGKTIK 655
Query: 160 CLQCDNGREYDNDSFRRYCHANGLIFRFSCPHTSSQNGKSERKIRTINNMIRTLLAHSSV 219
L+ DN +EY + + A G++ +FSCPHT QN +ERK R + RTLL H++
Sbjct: 656 ILRSDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANE 835
Query: 220 P 220
P
Sbjct: 836 P 838
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 112 bits (281), Expect = 6e-25
Identities = 57/133 (42%), Positives = 79/133 (58%)
Frame = -2
Query: 464 VPRPCDVNVIRYM*IFRHKKQSNGLFEHYKARLVGDGWSQIAGVDCDETFNLVVKPATIR 523
VP P + ++ K G + KARLV G++Q+ G+D +TF+ V K T+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 524 TVLSIALSRSWPIH*LDVHNAFLHGDLHETVYMHQPLGFRDSQHPDYVCRRKKSLYGLKQ 583
L++A WP+H LD+ NAFLHGDL E +YM QP GF VC+ +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 584 APRAWYQRFADYV 596
+PRAW+ +F+ V
Sbjct: 46 SPRAWFGKFSHVV 8
>CO981879
Length = 576
Score = 74.3 bits (181), Expect(2) = 6e-25
Identities = 39/95 (41%), Positives = 50/95 (52%)
Frame = -1
Query: 142 IFTTLATLIKTQFSANIKCLQCDNGREYDNDSFRRYCHANGLIFRFSCPHTSSQNGKSER 201
IF T +I+TQF IK + DNGREY N + NG+I + SC T QNG +ER
Sbjct: 570 IFKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAER 391
Query: 202 KIRTINNMIRTLLAHSSVPPSFWHHALQMATYLLN 236
K R + + R LL + P W A+ TYL N
Sbjct: 390 KNRHLXEVARALLFQNKAPKYXWGEAILTGTYLKN 286
Score = 59.3 bits (142), Expect(2) = 6e-25
Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 1/99 (1%)
Frame = -2
Query: 228 LQMATYLLNI-LPRKTLRNDSPTQRLYHRDPSYSHLRVFGCLCFPFFPSATINKLQPRSS 286
++M + +LN P + P RL P L++FGC F KL+PR+
Sbjct: 287 IRMPSKILNFRTPLDVFTSAFPNNRLSCTLP----LKIFGCTVFVHIHEPNQGKLEPRAK 120
Query: 287 PCVFLGYPMNHRGYKCYDLSNRKLIISRHVIFDESRFPF 325
CVF+GY N +GYKC+D +++K ++ V F E + PF
Sbjct: 119 KCVFVGYAPNQKGYKCFDPTSKKTFVTIDVTFFE-KTPF 6
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 111 bits (278), Expect = 1e-24
Identities = 55/129 (42%), Positives = 76/129 (58%)
Frame = +3
Query: 431 LPCNPKQALSDPNWKFAMQPEFNALIRNNTWELVPRPCDVNVIRYM*IFRHKKQSNGLFE 490
+P ++AL P W+ AM E AL N TWELVP P + ++ K NG +
Sbjct: 18 VPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVD 197
Query: 491 HYKARLVGDGWSQIAGVDCDETFNLVVKPATIRTVLSIALSRSWPIH*LDVHNAFLHGDL 550
KARLV G++Q+ G++ +TF+ V T+R L++A R WP+H LD+ NAFLHGDL
Sbjct: 198 RLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDL 377
Query: 551 HETVYMHQP 559
E +YM QP
Sbjct: 378 EEDIYMEQP 404
>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 (Fragment)
, partial (21%)
Length = 912
Score = 110 bits (275), Expect = 3e-24
Identities = 48/103 (46%), Positives = 70/103 (67%)
Frame = -2
Query: 219 VPPSFWHHALQMATYLLNILPRKTLRNDSPTQRLYHRDPSYSHLRVFGCLCFPFFPSATI 278
+PP+FW++AL A YL+N +P L+N SP +RL+ P SHLR+FGCLC+ A
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732
Query: 279 NKLQPRSSPCVFLGYPMNHRGYKCYDLSNRKLIISRHVIFDES 321
KL+PR+ PC+F+G+ N +GY YDL + +I SR+V+F E+
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYEN 603
>TC232995
Length = 1009
Score = 108 bits (271), Expect = 8e-24
Identities = 63/170 (37%), Positives = 91/170 (53%), Gaps = 11/170 (6%)
Frame = +2
Query: 556 MHQPLGFRDSQHPDYVCRRKKSLYGLKQAPRAWYQRFADYVSSIGFQHSSSD-------- 607
+ QP GF S P++V + +K+LYGLKQAPRAWY+R ++++ F D
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 608 HSYILL---YVDDIILVASSHDLRKSFMALLAFEFAMKDLGPLSYFLGIAVTRYVGGLFL 664
H+ ILL YVDDII +++ L K F + EF M +G L YFLG+ + + G+F+
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 665 SQSTYASEIIARAGMASCNPSATPVDTK*KLSSSSGTPCEDVTLYQSLAG 714
+QS Y E+I R GM S +TP+ T L D+ Y+ G
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIG 511
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 107 bits (267), Expect = 2e-23
Identities = 49/99 (49%), Positives = 69/99 (69%)
Frame = +2
Query: 781 RRSTSGYCVFLGDNLISWSSKRQPTLSHSSAEAEYRGVANVVSESCWIRNLLLELHFPLS 840
R ST GYCV +G+NL+ W S + ++ SSAEAEY+ + E WI+ LL EL F +
Sbjct: 38 RGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQLLQELKFGST 217
Query: 841 QETLVHCDNVSSIYLSGNPVHHQRTKHIEMDIHFVREKV 879
Q+ + CDN ++++++ NPV H+RTKHIE+D HFVREKV
Sbjct: 218 QQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 101 bits (251), Expect = 2e-21
Identities = 55/138 (39%), Positives = 79/138 (56%), Gaps = 11/138 (7%)
Frame = -2
Query: 576 KSLYGLKQAPRAWYQRFADYVSSIGFQHSSSDHSY-----------ILLYVDDIILVASS 624
KSLYGLKQA R WY++ + + G+ S SD+S +L+YVDDIIL S
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 625 HDLRKSFMALLAFEFAMKDLGPLSYFLGIAVTRYVGGLFLSQSTYASEIIARAGMASCNP 684
D +L F +K+LG L YFLG+ V G+ +SQ Y +++ +G+ C P
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 685 SATPVDTK*KLSSSSGTP 702
++TP+DT KL S++GTP
Sbjct: 60 ASTPLDTSIKLHSAAGTP 7
>CO983516
Length = 724
Score = 100 bits (249), Expect = 3e-21
Identities = 50/122 (40%), Positives = 76/122 (61%), Gaps = 11/122 (9%)
Frame = +2
Query: 509 CDETFNLVVKPATIRTVLSIALSRSWPIH*LDVHNAFLHGDLHETVYMHQPLGFRDSQHP 568
CD+ F+ V + +IR +L +A + ++ +DV +AFL+G L+E VY+ QP GF D HP
Sbjct: 353 CDKEFHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHP 532
Query: 569 DYVCRRKKSLYGLKQAPRAWYQRFADYVSSIGFQHSSSDHSYIL-----------LYVDD 617
D+V R KK+LYGLKQAPRAWY+R + ++ G++ D + + +YVDD
Sbjct: 533 DHVYRLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDD 712
Query: 618 II 619
I+
Sbjct: 713 IV 718
>BE211208
Length = 413
Score = 97.8 bits (242), Expect = 2e-20
Identities = 52/126 (41%), Positives = 76/126 (60%), Gaps = 1/126 (0%)
Frame = +2
Query: 610 YILLYVDDIILVASSHDLRKSFMALLAFEFAMKDLGPLSYFLGIAVTRY-VGGLFLSQST 668
Y+L+YVDDII+ S+ L +S + L F++K LG L YFLGI V G + L+QS
Sbjct: 29 YLLVYVDDIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGSVLLTQSK 208
Query: 669 YASEIIARAGMASCNPSATPVDTK*KLSSSSGTPCEDVTLYQSLAGALQYLTFTRPNISY 728
Y +++ + MA P ++P+ T +LS + D T+Y+S+ GALQY T TRP IS+
Sbjct: 209 YICDLLHKTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTITRPEISF 388
Query: 729 VVQQVC 734
+VC
Sbjct: 389 AANKVC 406
>CO983154
Length = 568
Score = 97.1 bits (240), Expect = 3e-20
Identities = 64/192 (33%), Positives = 94/192 (48%), Gaps = 2/192 (1%)
Frame = +3
Query: 191 HTSSQNGKSERKIRTINNMIRTLLAHSSVPPSFWHHALQMATYLLNILPRKTLRNDSPTQ 250
HT QNG +ERK R + R+L+ + +VP W A+ + +L+N +P +L N P
Sbjct: 3 HTPQQNGIAERKNRHLLETARSLMLNLNVPIHHWGDAVLTSCFLINRMPSSSLENQIPHS 182
Query: 251 RLYHRDPSYSHL--RVFGCLCFPFFPSATINKLQPRSSPCVFLGYPMNHRGYKCYDLSNR 308
++ DP + H+ +VFGC CF S ++KL RS CVFLGY +GYKCY + R
Sbjct: 183 LVFPHDPLF-HVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYKCYSPTMR 359
Query: 309 KLIISRHVIFDESRFPFADLPLESTSSYDCFTEDLPPSLIHHWQTTSTRPPDLSIPPSSP 368
+ +S V F E F+ S+S + P L + Q S +P SSP
Sbjct: 360 RYYMSADVTFFEDTPFFSPSVDHSSSLQEVLPIPSPYPLXNSGQNVSI------VPSSSP 521
Query: 369 TDSTMPSPAPTS 380
+ P T+
Sbjct: 522 NSLEVILPPLTT 557
>CO981347
Length = 624
Score = 68.9 bits (167), Expect(3) = 5e-20
Identities = 36/103 (34%), Positives = 52/103 (49%)
Frame = +2
Query: 148 TLIKTQFSANIKCLQCDNGREYDNDSFRRYCHANGLIFRFSCPHTSSQNGKSERKIRTIN 207
TLI Q +K L+ DNG E+ + F +C G+ PHT QNG +ER TI
Sbjct: 131 TLIGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGLAERMNMTIL 310
Query: 208 NMIRTLLAHSSVPPSFWHHALQMATYLLNILPRKTLRNDSPTQ 250
+R +L + +P +FW A +YL+N P TL +P +
Sbjct: 311 ERVRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTPME 439
Score = 41.2 bits (95), Expect(3) = 5e-20
Identities = 21/51 (41%), Positives = 28/51 (54%)
Frame = +3
Query: 255 RDPSYSHLRVFGCLCFPFFPSATINKLQPRSSPCVFLGYPMNHRGYKCYDL 305
+ P+YS L+VFG L F KL R+ CVF+GYP + YK + L
Sbjct: 453 KPPNYSGLKVFGSLAFDHVKQG---KLDARAVKCVFIGYPKGVKRYKLWKL 596
Score = 26.6 bits (57), Expect(3) = 5e-20
Identities = 8/35 (22%), Positives = 21/35 (59%)
Frame = +3
Query: 106 SPILSTAGHRYYVLVLDDHTDFLWMFPISKKSQVY 140
S + + G Y++ ++DD + +W++ + KS+ +
Sbjct: 6 SRVKTHGGSSYFLTIIDDFSRRVWLYVLKNKSESF 110
>BM307983
Length = 406
Score = 96.3 bits (238), Expect = 6e-20
Identities = 54/128 (42%), Positives = 74/128 (57%), Gaps = 1/128 (0%)
Frame = +2
Query: 478 IFRHKKQSNGLFEHYKARLVGDGWSQIAGVDCDETFNLVVKPATIRTVLSIALSR-SWPI 536
I+ K ++ + YKARLV G+ Q G+D +ETF K + ++ W +
Sbjct: 17 IYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQFGWEM 196
Query: 537 H*LDVHNAFLHGDLHETVYMHQPLGFRDSQHPDYVCRRKKSLYGLKQAPRAWYQRFADYV 596
H DV NAFLHG L E VYM P G+ S + VCR KK+LYGLKQ+PRAW+ RF +
Sbjct: 197 HQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGRFTQAM 376
Query: 597 SSIGFQHS 604
S+G++ S
Sbjct: 377 LSLGYKQS 400
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.325 0.138 0.439
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 51,721,211
Number of Sequences: 63676
Number of extensions: 988781
Number of successful extensions: 26570
Number of sequences better than 10.0: 1598
Number of HSP's better than 10.0 without gapping: 12595
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18528
length of query: 921
length of database: 12,639,632
effective HSP length: 106
effective length of query: 815
effective length of database: 5,889,976
effective search space: 4800330440
effective search space used: 4800330440
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 63 (28.9 bits)
Lotus: description of TM0032.12
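
Note on the statistics footer: the effective lengths and the effective search space reported above can be reproduced from the other figures in the footer. The sketch below is only an illustration of that arithmetic and is not part of the BLAST output. The variable names are hypothetical, and it assumes that TBLASTN counts the nucleotide database in translated residues (total letters divided by 3), which is consistent with the "length of database" value shown.

```python
# Values copied from the header and statistics footer of this report.
query_length = 921                 # "length of query"
database_letters = 37_918_896      # "Number of letters in database" (nucleotides)
database_sequences = 63_676        # "Number of sequences in database"
effective_hsp_length = 106         # "effective HSP length"

# Assumption: the database length is counted in translated residues,
# i.e. nucleotide letters divided by 3.
database_length = database_letters // 3

# Each effective length is the raw length minus the HSP length correction;
# for the database the correction is applied once per sequence.
effective_query_length = query_length - effective_hsp_length
effective_database_length = (
    database_length - database_sequences * effective_hsp_length
)

# The effective search space is the product of the two effective lengths.
effective_search_space = effective_query_length * effective_database_length

print("length of database:          ", database_length)
print("effective length of query:   ", effective_query_length)
print("effective length of database:", effective_database_length)
print("effective search space:      ", effective_search_space)
```

The four printed values match the "length of database", "effective length of query", "effective length of database" and "effective search space" lines of the footer.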