
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149471.17 - phase: 0 /pseudo
(1391 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments:  Score (bits)  E Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 543 e-154
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 542 e-154
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 214 2e-55
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 189 6e-48
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 177 2e-44
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 164 2e-40
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 161 2e-39
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 160 3e-39
BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativ... 157 2e-38
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 143 5e-34
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 136 7e-32
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 126 1e-31
CO982036 132 1e-30
CO981879 94 2e-30
BM307983 128 2e-29
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ... 128 2e-29
TC232995 124 3e-28
BU548243 123 6e-28
BU764568 100 6e-27
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 102 1e-26
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 543 bits (1398), Expect = e-154
Identities = 346/1103 (31%), Positives = 560/1103 (50%), Gaps = 19/1103 (1%)
Frame = +1
Query: 302 SSSSQASSNHISANPLISTTISAPESSSAGIIPKPSYWLLDSGANEHISCNLSFFSSFYR 361
SSSS + + ++S + +SA W LDSG + H++ F +
Sbjct: 1591 SSSSGRKMMWVPKHKIVSLVVHTSLRASA-----KEDWYLDSGCSRHMTGVKEFLVNIEP 1755
Query: 362 IPPVYVSLPNKTCVLVQYAGTVSFTSNFYLSHVLYSPAFTHNLISVAKLCESLSYSLHFT 421
YV+ + + + G + L+ VL T NLIS+++LC+ ++++FT
Sbjct: 1756 CSTSYVTFGDGSKGKITGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDE-GFNVNFT 1932
Query: 422 SAHCIIQDTMSLKMIGLAKQLDGLYKYTP--SSCSSNSVFSSVSHKSCNVVATISCNSSS 479
+ C++ + S ++ ++ D Y +TP +S SS +FS
Sbjct: 1933 KSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSSTCLFSKEDEVK------------- 2073
Query: 480 SIPSNALWHFRLGHLSHQRLHSMSLLY--------PNIISSNNKDVCDLCHFAKHKHLPF 531
+WH R GHL L M + PN+ + +C C K +
Sbjct: 2074 ------IWHQRFGHL---HLRGMKKIIDKGAVRGIPNLKIEEGR-ICGECQIGKQVKMS- 2220
Query: 532 NSSISHASTN--FELLHLDIWGPLSIASVHGHRYFLTIVDDHSRFLWVILLKSKAEVSTH 589
+ + H +T+ ELLH+D+ GP+ + S+ G RY +VDD SRF WV ++ K++
Sbjct: 2221 HQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEV 2400
Query: 590 VINFITMIQTQFHITPKFIRTDNGPEF---MLSTFYASHGIIHQKSCVETPQQNGRVERK 646
+Q + K IR+D+G EF + F S GI H+ S TPQQNG VERK
Sbjct: 2401 FKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERK 2580
Query: 647 HQHILNVGRALLFQSKLPPSFWSYAILHAVFLINRVPTPILHNQSPYFVLHHQLPALNLF 706
++ + R +L +LP + W+ A+ A ++ NRV + Y + + P + F
Sbjct: 2581 NRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVKHF 2760
Query: 707 KVFGCLCYASTLQSHRTKLQPRARKSIFLGYKSGFKGFTLYDIQSREIFVSRHVTFHETF 766
+FG CY + R K+ P++ IFLGY + + + +++ ++R + S +V +
Sbjct: 2761 HIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLT 2940
Query: 767 LPYP---HTSLSTTPNWEYFSSSNFSDVSNQPTPINSPAIIDDILPPSPPINPPPPPPIP 823
+ T+ + ++ + + N + + P I PS I P +
Sbjct: 2941 PARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRIQKMHPKELI 3120
Query: 824 VVSPASRTSTRQTTTPSYLQDYVCNNIHTSPYPINNYISHHNLSNNYSSFVMSLHTTTEP 883
+ P R TT S + V N S FV + EP
Sbjct: 3121 IGDP-----NRGVTTRSREIEIVSN----------------------SCFVSKI----EP 3207
Query: 884 KSYAEASKHDCWKQAMQVELQALEKTGTWQLVDLPSNIKPIGCRWIYKVKYHADGSIERH 943
K+ EA + W AMQ EL+ ++ W+LV P IG +WI+K K + +G I R+
Sbjct: 3208 KNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRN 3387
Query: 944 KARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAFLHGDLQE 1003
KARLVA+GY QIEG+D+ +T++PVA+L +IRL++ ++ I + L+Q+DV +AFL+G L E
Sbjct: 3388 KARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNE 3567
Query: 1004 DVYMLIPPG-IKSNKPNQVCKLQKSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFV 1062
+ Y+ P G + P+ V +L+K+LYGLKQA R WYE+LT L+ Y + D +LFV
Sbjct: 3568 EAYVEQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFV 3747
Query: 1063 KKTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLG 1122
K+ + + I +YVDDI+ G S + + + F++ +G+L YFLG++V +
Sbjct: 3748 KQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDS 3927
Query: 1123 ISLCQRKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLLYLNT 1182
I L Q KY +++ G ++ TP+ + +KL +D + + D YR ++G LLYL
Sbjct: 3928 IFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTA 4107
Query: 1183 TRPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADW 1242
+RPDIT+ +++ + P +H R+L+Y+ G G+ + S L G+ DADW
Sbjct: 4108 SRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADW 4287
Query: 1243 AGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILYLLQD 1302
AG D R+S SG CF+LG +LISW +KKQ VS S++EAEY A S+ +L W+ +L++
Sbjct: 4288 AGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKE 4467
Query: 1303 IHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHIVREKVQAGILKLLPVSSQDQ 1362
++ + LYCDN SA++I+ NPV H RTKH++I H +R+ V ++ L V +++Q
Sbjct: 4468 YNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQ 4644
Query: 1363 VADFFTKALLPKPFNILLSKMGL 1385
+AD FTKAL F L K+G+
Sbjct: 4645 IADIFTKALDANQFEKLRGKLGI 4713
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 542 bits (1397), Expect = e-154
Identities = 340/1064 (31%), Positives = 549/1064 (50%), Gaps = 17/1064 (1%)
Frame = +1
Query: 339 WLLDSGANEHISCNLSFFSSFYRIPPVYVSLPNKTCVLVQYAGTVSFTSNFYLSHVLYSP 398
W LDSG + H++ F + YV+ + + + G + L+ VL
Sbjct: 1684 WYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVHDGLPSLNKVLLVK 1863
Query: 399 AFTHNLISVAKLCESLSYSLHFTSAHCIIQDTMSLKMIGLAKQLDGLYKYTPSSCSSNSV 458
T NLIS+++LC+ ++++FT + C++ + S ++ ++ D Y +TP S +S
Sbjct: 1864 GLTANLISISQLCDE-GFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQETSYSS- 2037
Query: 459 FSSVSHKSCNVVATISCNSSSSIPSNALWHFRLGHLSHQRLHSMSLLY--------PNII 510
+C SS +WH R GHL L M + PN+
Sbjct: 2038 ---------------TCLSSKE-DEVRIWHQRFGHL---HLRGMKKIIDKGAVRGIPNLK 2160
Query: 511 SSNNKDVCDLCHFAKHKHLPFNSSISHASTN--FELLHLDIWGPLSIASVHGHRYFLTIV 568
+ +C C K + + + H +T+ ELLH+D+ GP+ + S+ G RY +V
Sbjct: 2161 IEEGR-ICGECQIGKQVKMS-HQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVV 2334
Query: 569 DDHSRFLWVILLKSKAEVSTHVINFITMIQTQFHITPKFIRTDNGPEF---MLSTFYASH 625
DD SRF WV ++ K+E +Q + K IR+D+G EF + F S
Sbjct: 2335 DDFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSE 2514
Query: 626 GIIHQKSCVETPQQNGRVERKHQHILNVGRALLFQSKLPPSFWSYAILHAVFLINRVPTP 685
GI H+ S TPQQNG VERK++ + R +L +LP + W+ A+ A ++ NRV
Sbjct: 2515 GITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLR 2694
Query: 686 ILHNQSPYFVLHHQLPALNLFKVFGCLCYASTLQSHRTKLQPRARKSIFLGYKSGFKGFT 745
+ Y + + P++ F +FG CY + R K+ P++ IFLGY + + +
Sbjct: 2695 RGTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYR 2874
Query: 746 LYDIQSREIFVSRHVTFHETFLPYP---HTSLSTTPNWEYFSSSNFSDVSNQPTPINSPA 802
+++ ++R + S +V + + T+ + ++ + + N + +
Sbjct: 2875 VFNSRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESN 3054
Query: 803 IIDDILPPSPPINPPPPPPIPVVSPASRTSTRQTTTPSYLQDYVCNNIHTSPYPINNYIS 862
I S I P + + P R TT S + V N
Sbjct: 3055 INQPDKRSSTRIQKMHPKELIIGDP-----NRGVTTRSREVEIVSN-------------- 3177
Query: 863 HHNLSNNYSSFVMSLHTTTEPKSYAEASKHDCWKQAMQVELQALEKTGTWQLVDLPSNIK 922
S FV + EPK+ EA + W AMQ EL+ ++ W+LV P
Sbjct: 3178 --------SCFVSKI----EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTN 3321
Query: 923 PIGCRWIYKVKYHADGSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSI 982
IG +WI+K K + +G I R+KARLVA+GY QIEG+D+ +T++PVA+L +IRL++ ++ I
Sbjct: 3322 VIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACI 3501
Query: 983 HNWHLHQLDVNNAFLHGDLQEDVYMLIPPGIKS-NKPNQVCKLQKSLYGLKQASRKWYEK 1041
+ L+Q+DV +AFL+G L E+VY+ P G P+ V +L+K+LYGLKQA R WYE+
Sbjct: 3502 LKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLKQAPRAWYER 3681
Query: 1042 LTSVLSHHHYIQASSDHSLFVKKTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFK 1101
LT L+ Y + D +LFVK+ + + I +YVDDI+ G S + + + F+
Sbjct: 3682 LTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFE 3861
Query: 1102 IKDLGQLKYFLGIEVAHSKLGISLCQRKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSS 1161
+ +G+L YFLG++V + I L Q +Y +++ G ++ TP+ + +KL +D +
Sbjct: 3862 MSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEA 4041
Query: 1162 PSYADIPSYRRLVGRLLYLNTTRPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPG 1221
+ D YR ++G LLYL +RPDIT+ +++ + P +H T R+L+Y+ G
Sbjct: 4042 GTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSD 4221
Query: 1222 RGLFFPRNSSINLQGFSDADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEA 1281
G+ + S+ L G+ DADWAG D R+S SG CF+LGN+LISW +KKQ VS S++EA
Sbjct: 4222 YGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEA 4401
Query: 1282 EYRALASATCELQWILYLLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCH 1341
EY A S+ +L W+ +L++ ++ + LYCDN SA++I+ NPV H RTKH++I H
Sbjct: 4402 EYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTLYCDNMSAINISKNPVQHSRTKHIDIRHH 4578
Query: 1342 IVREKVQAGILKLLPVSSQDQVADFFTKALLPKPFNILLSKMGL 1385
+R+ V ++ L V +++Q+AD FTKAL F L K+G+
Sbjct: 4579 YIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKLRGKLGI 4710
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial (16%)
Length = 662
Score = 214 bits (545), Expect = 2e-55
Identities = 113/188 (60%), Positives = 142/188 (75%), Gaps = 4/188 (2%)
Frame = +3
Query: 1208 AALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADWAGCLDTRRSISGQCFFLGNSLISWR 1267
AA RVL+YLKGCP +GL F R S I + GFSDADWA C+D+ +SI+ CFFLG+SLISW+
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 1268 TKKQITVSR--SSSEAEYRALASATCELQWILYLLQDIHISCPKLPVLYCDNQSALH-IA 1324
KKQ TVSR SSSEA+YRAL S TCELQW+ YLL+D+H++ ++YCDNQSAL +
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDLHVT-----LIYCDNQSALQ*LP 362
Query: 1325 ANPVFHERTKHLEIDCHIVREKVQAGILK-LLPVSSQDQVADFFTKALLPKPFNILLSKM 1383
++H + LEIDCHIVREK Q G++ LLPVSS +Q+AD FTKAL PK F+ LSK+
Sbjct: 363 IKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLSKL 533
Query: 1384 GLINIYQP 1391
GL +I+ P
Sbjct: 534 GLSDIFLP 557
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 189 bits (481), Expect = 6e-48
Identities = 94/140 (67%), Positives = 113/140 (80%)
Frame = -2
Query: 1026 KSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFVKKTSSSFTILLVYVDDIIIAGDS 1085
KSLYGLKQASRKWYEKLT++L YIQ+ SD+SLF ++FT LLVYVDDII+AGDS
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 1086 LTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGISLCQRKYCLDLLADSGTIDSKP 1145
+ EF IK+VLD +FKIK+LG+LKYFLG+EVAHS+LGI++ QRKYCLDLL DSG + KP
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 1146 VSTPSDSSIKLHQDSSPSYA 1165
STP D+SIKLH + YA
Sbjct: 60 ASTPLDTSIKLHSAAGTPYA 1
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 177 bits (450), Expect = 2e-44
Identities = 85/159 (53%), Positives = 112/159 (69%), Gaps = 1/159 (0%)
Frame = +2
Query: 1234 LQGFSDADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCEL 1293
L G+ DADWAGC RRS SG C F+G +L+SW++KKQ V+RSS+EAEYR++A TCEL
Sbjct: 17 LSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCEL 196
Query: 1294 QWILYLLQDIHISCPKLPV-LYCDNQSALHIAANPVFHERTKHLEIDCHIVREKVQAGIL 1352
WI LQ++ C +L + LYCDNQ+ALHIA+NPVFHERTKH+EIDCH +REK+ + +
Sbjct: 197 MWIKQFLQELRF-CEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1353 KLLPVSSQDQVADFFTKALLPKPFNILLSKMGLINIYQP 1391
+ S DQ D TK+L I+ SK+G ++Y P
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYDLYAP 490
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 164 bits (416), Expect = 2e-40
Identities = 76/133 (57%), Positives = 99/133 (74%), Gaps = 1/133 (0%)
Frame = -2
Query: 915 VDLPSNIKPIGCRWIYKVKYHADGSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIR 974
V LP P+GCRW+Y VK G ++R KARLVAKGY Q+ G+DY DT+SPVAKLTT+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 975 LVIALSSIHNWHLHQLDVNNAFLHGDLQEDVYMLIPPG-IKSNKPNQVCKLQKSLYGLKQ 1033
L +A+++I +W LHQLD+ NAFLHGDL+ED+YM PPG + + VCKL +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1034 ASRKWYEKLTSVL 1046
+ R W+ K + V+
Sbjct: 46 SPRAWFGKFSHVV 8
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 161 bits (408), Expect = 2e-39
Identities = 74/134 (55%), Positives = 97/134 (72%)
Frame = +3
Query: 877 LHTTTEPKSYAEASKHDCWKQAMQVELQALEKTGTWQLVDLPSNIKPIGCRWIYKVKYHA 936
L + T P + EA H W+QAM E+QALE GTW+LV LP +GCRW+Y VK
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 937 DGSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAF 996
+G ++R KARLVAKGY Q+ G++Y DT+SPV LTT+RL +A+++I +W LHQLD+ NAF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 997 LHGDLQEDVYMLIP 1010
LHGDL+ED+YM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 160 bits (406), Expect = 3e-39
Identities = 79/150 (52%), Positives = 109/150 (72%), Gaps = 1/150 (0%)
Frame = -3
Query: 898 AMQVELQALEKTGTWQLVDLPSNIKPIGCRWIYKVKYHADGSIERHKARLVAKGYNQIEG 957
AMQ EL E+ W+LV+ P N IG +W+++ K G I R+KARLVAKGYNQ EG
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 958 LDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAFLHGDLQEDVYMLIPPGIK-SN 1016
+DY +TY+PVA+L IR+++A SI N+ L+Q+DV +AFL+G +QE+VY+ PPG + +
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 1017 KPNQVCKLQKSLYGLKQASRKWYEKLTSVL 1046
KP V KLQK+LYGLKQA R WYE++++ L
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFL 9
>BQ296988 similar to GP|21740616|em OSJNBb0089K24.12 {Oryza sativa (japonica
cultivar-group)}, partial (1%)
Length = 408
Score = 157 bits (398), Expect = 2e-38
Identities = 68/136 (50%), Positives = 104/136 (76%)
Frame = -1
Query: 33 RCNHLVQSWLINSVSDSIAQTIVFYDTAFEVWHDLQERFSKVDRIRIANLRSTINNLKQG 92
RCN L+ SW++NSV SI+++IVF D A +VW DL+ERFS+ D +R++ ++ I L QG
Sbjct: 408 RCNMLIHSWILNSVEPSISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQG 229
Query: 93 SKSVLDYFTEMKALWEELASHRPIPNCSCIHPCRCEASKVAKIHRNEDQIMQFLTGLNDQ 152
++SV ++++ KALWEEL + PIPNC+C H C C+A ++A+ H + +M+FLTGLND+
Sbjct: 228 TRSVTTFYSDKKALWEELEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDE 49
Query: 153 FSIVRTQVLLLDPLPS 168
F+ V++Q+LL++PLPS
Sbjct: 48 FNAVKSQILLIEPLPS 1
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 143 bits (361), Expect = 5e-34
Identities = 73/221 (33%), Positives = 130/221 (58%), Gaps = 3/221 (1%)
Frame = +1
Query: 1166 DIPSYRRLVGRLLYLNTTRPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGLF 1225
D+ +RRL+G L YL +RP+I F +S+F+ +P +H AA RVLR +KG G G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 1226 FP---RNSSINLQGFSDADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAE 1282
FP ++ +L G++D+DW + +S G F ++ ++ +KKQ ++ S+ EAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 1283 YRALASATCELQWILYLLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHI 1342
Y A + C+ W++ LL+++ + K L DN+SA+++A +P H R+KH+E+ H
Sbjct: 370 YVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHY 549
Query: 1343 VREKVQAGILKLLPVSSQDQVADFFTKALLPKPFNILLSKM 1383
+R++V G + + +++Q+AD TK + F + S++
Sbjct: 550 IRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 136 bits (342), Expect = 7e-32
Identities = 78/179 (43%), Positives = 106/179 (58%), Gaps = 2/179 (1%)
Frame = +1
Query: 938 GSIERHKARLVAKGYNQIEGLDYFDTYSPVAKLTTIRLVIALSSIHNWHLHQLDVNNAFL 997
G+I++ KARLVAK Y Q+ G DY T+SPVAK+ + L+ +++ + +W L LD NAFL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 998 HGDLQEDVYMLIPPGI--KSNKPNQVCKLQKSLYGLKQASRKWYEKLTSVLSHHHYIQAS 1055
HG L+E+VYM P G + N VC+L +S YGLKQ+ R W + Y
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAW--PFLYCGAAIWYDSHE 381
Query: 1056 SDHSLFVKKTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGI 1114
+DHS+F + L+VYVDDI I G T +K L F+ KDLG+L+YFLGI
Sbjct: 382 ADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 126 bits (317), Expect(2) = 1e-31
Identities = 61/109 (55%), Positives = 77/109 (69%)
Frame = +2
Query: 1239 DADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILY 1298
DA+WA R S G C +G +L+ W++ K V+RSS+EAEY+A+ ATCEL WI
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 1299 LLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHIVREKV 1347
LLQ++ + L CDNQ+ALHIA+NPVFHERTKH+EIDCH VREKV
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVREKV 334
Score = 30.0 bits (66), Expect(2) = 1e-31
Identities = 14/33 (42%), Positives = 20/33 (60%)
Frame = +3
Query: 1357 VSSQDQVADFFTKALLPKPFNILLSKMGLINIY 1389
VSS DQ+A+ FTK+L + SK+G +Y
Sbjct: 363 VSSNDQLANIFTKSLRGPRIQNICSKLGAFELY 461
>CO982036
Length = 674
Score = 132 bits (332), Expect = 1e-30
Identities = 84/211 (39%), Positives = 116/211 (54%), Gaps = 3/211 (1%)
Frame = -2
Query: 1064 KTSSSFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGI 1123
KT LLVYVD III G S T + S L++SF +K LG+L YF+ IEV S +
Sbjct: 670 KTHILTVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVK-SMPDL 497
Query: 1124 SLCQRKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLLYLNTT 1183
R ++ ++P+S+P ++ KL + S ++ YR +VG L Y
Sbjct: 496 LFSLRTSIFEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVI 317
Query: 1184 RPDITFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGL-FFPRNSS--INLQGFSDA 1240
RP+I+F ++ QF+S P +H T R+LRYLKG GL P SS + ++GF DA
Sbjct: 316 RPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDA 137
Query: 1241 DWAGCLDTRRSISGQCFFLGNSLISWRTKKQ 1271
DWA +D +RS SG FLG +LISW KQ
Sbjct: 136 DWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>CO981879
Length = 576
Score = 94.4 bits (233), Expect(2) = 2e-30
Identities = 48/92 (52%), Positives = 59/92 (63%), Gaps = 3/92 (3%)
Frame = -1
Query: 593 FITMIQTQFHITPKFIRTDNGPEFM---LSTFYASHGIIHQKSCVETPQQNGRVERKHQH 649
F MIQTQF + K R+DNG E+ LS +GIIHQ SCV+TPQQNG ERK++H
Sbjct: 558 FFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAERKNRH 379
Query: 650 ILNVGRALLFQSKLPPSFWSYAILHAVFLINR 681
+ V RALLFQ+K P W AIL +L N+
Sbjct: 378 LXEVARALLFQNKAPKYXWGEAILTGTYLKNK 283
Score = 58.5 bits (140), Expect(2) = 2e-30
Identities = 31/89 (34%), Positives = 50/89 (55%), Gaps = 5/89 (5%)
Frame = -2
Query: 681 RVPTPILHNQSPYFVLHHQLPALNL-----FKVFGCLCYASTLQSHRTKLQPRARKSIFL 735
R+P+ IL+ ++P V P L K+FGC + + ++ KL+PRA+K +F+
Sbjct: 284 RMPSKILNFRTPLDVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVFV 105
Query: 736 GYKSGFKGFTLYDIQSREIFVSRHVTFHE 764
GY KG+ +D S++ FV+ VTF E
Sbjct: 104 GYAPNQKGYKCFDPTSKKTFVTIDVTFFE 18
>BM307983
Length = 406
Score = 128 bits (321), Expect = 2e-29
Identities = 65/133 (48%), Positives = 89/133 (66%), Gaps = 2/133 (1%)
Frame = +2
Query: 924 IGCRWIYKVKYHADGSIERHKARLVAKGYNQIEGLDYFDTYSPVAK-LTTIRLVIALSSI 982
+GCRWIY VKY AD +++R+KARLVAKGY Q G+DY +T++ K + + +
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 983 HNWHLHQLDVNNAFLHGDLQEDVYMLIPPGI-KSNKPNQVCKLQKSLYGLKQASRKWYEK 1041
W +HQ DV NAFLHG L+E+VYM IPPG SN N+VC+L+K+LYGLKQ+ R W+ +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1042 LTSVLSHHHYIQA 1054
T + Y Q+
Sbjct: 362 FTQAMLSLGYKQS 400
>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 (Fragment)
, partial (21%)
Length = 912
Score = 128 bits (321), Expect = 2e-29
Identities = 54/102 (52%), Positives = 76/102 (73%)
Frame = -2
Query: 663 LPPSFWSYAILHAVFLINRVPTPILHNQSPYFVLHHQLPALNLFKVFGCLCYASTLQSHR 722
+PP+FW+YA+LHA +LIN +PTP L N SPY LH +P ++ ++FGCLCYAST++++R
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732
Query: 723 TKLQPRARKSIFLGYKSGFKGFTLYDIQSREIFVSRHVTFHE 764
KL+PRA IF+G+K KG+ LYD+ S I SR+V F+E
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYE 606
>TC232995
Length = 1009
Score = 124 bits (311), Expect = 3e-28
Identities = 63/170 (37%), Positives = 102/170 (59%), Gaps = 1/170 (0%)
Frame = +2
Query: 1010 PPGIK-SNKPNQVCKLQKSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFVKKTSSS 1068
PPG + S+KPN V KLQK+LYGLKQA R WYE+L++ L + + D +LF+K+ +
Sbjct: 11 PPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRKHND 190
Query: 1069 FTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGISLCQR 1128
++ +YVDDII + + + + F++ +G+LKYFLG+++ ++ GI + Q
Sbjct: 191 ILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFINQS 370
Query: 1129 KYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLL 1178
KYC +L+ G +K +STP ++ L +D S DI YR +G ++
Sbjct: 371 KYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
>BU548243
Length = 599
Score = 123 bits (308), Expect = 6e-28
Identities = 67/145 (46%), Positives = 89/145 (61%)
Frame = -1
Query: 1239 DADWAGCLDTRRSISGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILY 1298
DA WA +D RS G FLG +LISW ++KQ ++SS+EAEYR++A + EL WI
Sbjct: 587 DAGWASDVDDHRSTLGSAIFLGPNLISWWSRKQQVTAQSSTEAEYRSIAQTSAELTWIQA 408
Query: 1299 LLQDIHISCPKLPVLYCDNQSALHIAANPVFHERTKHLEIDCHIVREKVQAGILKLLPVS 1358
LL ++ I PV+ CDN+SA+ IA N VFH RTKH+EID V EKV + L++ +
Sbjct: 407 LLMELQIPFTP-PVILCDNKSAVAIAHNLVFHSRTKHMEIDVFFVHEKVLSKQLQIFHIP 231
Query: 1359 SQDQVADFFTKALLPKPFNILLSKM 1383
+ DQ A TK L F L SK+
Sbjct: 230 ALDQWAGILTKPLSSARFTFLKSKL 156
>BU764568
Length = 420
Score = 100 bits (248), Expect(2) = 6e-27
Identities = 46/84 (54%), Positives = 61/84 (71%)
Frame = +3
Query: 1253 SGQCFFLGNSLISWRTKKQITVSRSSSEAEYRALASATCELQWILYLLQDIHISCPKLPV 1312
SG C +G +LISW++KKQ V++SS+EAEYRA+A TCEL W+ LL ++
Sbjct: 168 SGYCVLIGGNLISWKSKKQSVVAKSSAEAEYRAMALVTCELIWLKQLL*ELKFEEDTQMT 347
Query: 1313 LYCDNQSALHIAANPVFHERTKHL 1336
L CDNQ+ALHIA+NP+FH RTKH+
Sbjct: 348 LICDNQAALHIASNPIFH*RTKHI 419
Score = 40.8 bits (94), Expect(2) = 6e-27
Identities = 20/50 (40%), Positives = 27/50 (54%)
Frame = +1
Query: 1195 SQFLSQPTQAHHTAALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADWAG 1244
SQFL+ P Q H A +L+ K PG+GL + + G+SDAD G
Sbjct: 1 SQFLNSPCQDHWNAVS*ILK*TKSAPGKGLIYEDKGHSQIIGYSDAD*VG 150
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 102 bits (253), Expect(2) = 1e-26
Identities = 59/186 (31%), Positives = 94/186 (49%)
Frame = +3
Query: 1068 SFTILLVYVDDIIIAGDSLTEFTYIKSVLDASFKIKDLGQLKYFLGIEVAHSKLGISLCQ 1127
+F I+ +YVDDII S ++ F+ G+LK+ LG+++ GI + Q
Sbjct: 483 TFLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQ 662
Query: 1128 RKYCLDLLADSGTIDSKPVSTPSDSSIKLHQDSSPSYADIPSYRRLVGRLLYLNTTRPDI 1187
KY L ++KP++TP S + +D ++ Y ++ L YL ++RPDI
Sbjct: 663 EKYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDI 842
Query: 1188 TFITQQLSQFLSQPTQAHHTAALRVLRYLKGCPGRGLFFPRNSSINLQGFSDADWAGCLD 1247
F+ ++F S P +H TA R+LRYL G L+F + S +L G+ D +AG
Sbjct: 843 VFVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKV 1022
Query: 1248 TRRSIS 1253
R+S S
Sbjct: 1023 ERKSTS 1040
Score = 38.1 bits (87), Expect(2) = 1e-26
Identities = 18/41 (43%), Positives = 26/41 (62%)
Frame = +2
Query: 1023 KLQKSLYGLKQASRKWYEKLTSVLSHHHYIQASSDHSLFVK 1063
K +YGLKQA R WYE+L+S L + + + +D +LF K
Sbjct: 347 KTLSCVYGLKQALRAWYERLSSFLVSNGFTRGITDPALFRK 469
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.133 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75,241,980
Number of Sequences: 63676
Number of extensions: 1392486
Number of successful extensions: 25098
Number of sequences better than 10.0: 761
Number of HSP's better than 10.0 without gapping: 13138
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18505
length of query: 1391
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1282
effective length of database: 5,698,948
effective search space: 7306051336
effective search space used: 7306051336
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
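The figures in the statistics block above are related by simple arithmetic, and the following minimal sketch re-derives them from the numbers printed in this report. It assumes the usual Karlin-Altschul relations (bit score S' = (lambda*S - ln K)/ln 2 and E = search space * 2^-S') and assumes that TBLASTN counts the database in amino-acid units, i.e. nucleotide letters divided by three. The function and constant names are illustrative and are not part of BLAST. Because the report prints Lambda and K rounded, recomputed bit scores can differ from the reported ones in the last digit.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 1391             # "length of query"
DB_LETTERS = 37_918_896      # "Number of letters in database" (nucleotides)
DB_SEQS = 63_676             # "Number of sequences in database"
HSP_LEN = 109                # "effective HSP length"
LAMBDA_GAPPED = 0.267        # gapped Lambda (rounded as printed)
K_GAPPED = 0.0410            # gapped K (rounded as printed)


def effective_lengths():
    """Re-derive the 'effective' lengths and the effective search space."""
    # Assumption: a translated search measures the database in residues.
    db_len = DB_LETTERS // 3
    eff_query = QUERY_LEN - HSP_LEN
    # One HSP length is subtracted per database sequence.
    eff_db = db_len - DB_SEQS * HSP_LEN
    return db_len, eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    db_len, eff_query, eff_db, space = effective_lengths()
    print("length of database:", db_len)
    print("effective length of query:", eff_query)
    print("effective length of database:", eff_db)
    print("effective search space:", space)
    bits = bit_score(1398)   # raw score of the top hit, TC204438
    print("bits for raw score 1398: %.1f" % bits)
    print("E-value: %.1e" % expect(bits, space))
```

Run against the top hit (raw score 1398), the sketch should reproduce the reported effective search space exactly and land close to the reported 543 bits and an E-value on the order of e-154.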