
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144765.7 - phase: 0
(482 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAU90262.1| integrase core domain containing protein [Oryza s... 225 3e-57
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ... 221 3e-56
gb|AAB82754.1| retrofit [Oryza longistaminata] gi|7444451|pir||T... 214 5e-54
gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cult... 214 6e-54
gb|AAU43956.1| unknown protein [Oryza sativa (japonica cultivar-... 211 4e-53
ref|NP_915223.1| putative rice retrotransposon retrofit gag/pol ... 206 1e-51
gb|AAP54977.1| putative gag-pol polyprotein [Oryza sativa (japon... 204 5e-51
gb|AAU10682.1| putative polyprotein [Oryza sativa (japonica cult... 203 9e-51
ref|XP_476197.1| putative polyprotein [Oryza sativa (japonica cu... 201 6e-50
ref|NP_909900.1| putative copia-like retrotransposon Hopscotch p... 199 2e-49
ref|NP_915573.1| putative gag/pol polyprotein [Oryza sativa (jap... 198 4e-49
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 186 1e-45
ref|XP_475911.1| putative polyprotein [Oryza sativa (japonica cu... 186 2e-45
gb|AAX96193.1| retrotransposon protein, putative, Ty1-copia sub-... 183 9e-45
gb|AAX95626.1| Zinc knuckle, putative [Oryza sativa (japonica cu... 182 2e-44
ref|XP_507315.1| PREDICTED P0623F08.21-2 gene product [Oryza sat... 181 5e-44
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ... 175 3e-42
gb|AAP94600.1| putative copia-like retrotransposon Hopscotch pol... 174 6e-42
gb|AAL66754.1| putative copia-like retrotransposon Hopscotch pol... 174 6e-42
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 174 7e-42
>gb|AAU90262.1| integrase core domain containing protein [Oryza sativa (japonica
cultivar-group)]
Length = 1021
Score = 225 bits (573), Expect = 3e-57
Identities = 127/397 (31%), Positives = 218/397 (53%), Gaps = 32/397 (8%)
Query: 15 SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSS---KSNNPAFEE 71
+V+ KL R N+ LWK+ +LPV+RG +++GY+ G + P I + D K++ PAFE
Sbjct: 16 AVAEKLTRTNFLLWKAQILPVIRGARMEGYLTGATQAPLAVIDAKDGEATVKASKPAFEM 75
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W DQ++LG++L++++ E+ TQ++ E++ Q+W + + +R++ + + +
Sbjct: 76 WITADQQVLGFLLSTLSKEILTQVISMESAAQVWKAITEMLSSQSRARALNTRLALATTL 135
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHTTLSW 190
KG++ + DY+ KMK LAD++ AG P+ + +LI L GLD +Y P+V L +S
Sbjct: 136 KGDLSVSDYISKMKVLADEMAFAGKPLDDEELISYVLAGLDDDYEPVVSSLVGKSEVVSL 195
Query: 191 VDLQAQLLTFESRIEQLNNLTN--LNLNATANVANKFD-----HRDNRFNSNNNWRGSNF 243
+ +QLL+F+SR + + + ++NA K +R R NNN R SN
Sbjct: 196 AECYSQLLSFKSRQKLRHAAAHQASSVNAARRGGGKGGGYTPFNRGGRSGGNNNGRRSNN 255
Query: 244 RGWRGGRG-----RGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNA 298
G RGGRG RG C +CGKT H +C++R+D+N+ N A + G
Sbjct: 256 GGGRGGRGNNNGDRGGKPHPVCHLCGKTGHVVADCWYRYDENFVPENKIAAAASYG---- 311
Query: 299 FIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATC 354
D +WY D+GA++H+T + +K +++GK+ + +G ++I +
Sbjct: 312 --------VDTNWYVDTGATDHITGELDKLTTREKYNGKDQIYTASGAGMDIKHIGHSVI 363
Query: 355 SSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +++ L ++L+VP KNLL +LA DN+ FVE
Sbjct: 364 CTPTRNIYLKNILHVPKAKKNLLFAHRLALDNHAFVE 400
>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
gi|7444442|pir||T02087 gag/pol polyprotein - maize
retrotransposon Hopscotch
Length = 1439
Score = 221 bits (564), Expect = 3e-56
Identities = 136/431 (31%), Positives = 227/431 (52%), Gaps = 62/431 (14%)
Query: 1 MASAANNNKNDLPSS----VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFI 56
MA ++ + + +P+S VS KL + NY LWK+ VLP +R +LD + G + CP +
Sbjct: 1 MAMQSSLSTSAIPTSFAIPVSEKLTKGNYLLWKAQVLPAIRAAQLDDILTGVEICPPK-- 58
Query: 57 TSSDSSKSN----NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLA 112
T SD+S NPA+ W A DQ +LG++L+S++ E+ + +++C TS +W +
Sbjct: 59 TISDASDRTVTVANPAYGRWIARDQAVLGYLLSSLSREVLSSVVNCSTSASVWTTLSEMY 118
Query: 113 GAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLD 172
+H+R++ + + + +KG + +Y KM+ AD+L AG P+ + + + L GLD
Sbjct: 119 SSHSRARKVNTRIALATTKKGASSVAEYFAKMRGFADELGAAGKPLDDEEFVSFLLTGLD 178
Query: 173 SEYNPI---VVKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRD 229
++NP+ VV SD T DL QLL++E+R+ ++L + ++AN +
Sbjct: 179 EDFNPLVTAVVARSDPITPG--DLYTQLLSYENRMHLQTGSSSL-MQSSANARSP----- 230
Query: 230 NRFNSNNNWRGSNFRGWRGGRGRGR------------------------SSKAPCQVCGK 265
+W S RG+ GRGRGR SS+ CQVC +
Sbjct: 231 ---GRGMSWGRSGGRGFSRGRGRGRGPSRGGFQSFGRGNNYSGATDADTSSRPRCQVCSR 287
Query: 266 TNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQT 325
HTA+NC++RFD+NY SA+S A+ + + WY D+GA++H+T
Sbjct: 288 VGHTALNCWYRFDENYVPDQRSANS----------AAHQNGSNVPWYTDTGATDHITGDL 337
Query: 326 NKFQDLTEHHGKNSLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSK 381
++ ++ G + ++ NG + I A + +SL+L VL+VP+ KNL+SV +
Sbjct: 338 DRLTMHDKYTGTDQIIAANGTGMTISNIGNAIVPTSSRSLHLRSVLHVPSTHKNLISVHR 397
Query: 382 LAADNNIFVEF 392
L DN++F+EF
Sbjct: 398 LTNDNDVFIEF 408
>gb|AAB82754.1| retrofit [Oryza longistaminata] gi|7444451|pir||T10728 probable
gag/pol polyprotein - long-staminate rice
retrotransposon retrofit
Length = 1445
Score = 214 bits (545), Expect = 5e-54
Identities = 132/403 (32%), Positives = 220/403 (53%), Gaps = 43/403 (10%)
Query: 15 SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN---NPAFEE 71
SVS KL + N+ LWK+ V VRG +L GY+ G K P+ ++ + K+ NPAFE+
Sbjct: 20 SVSEKLGKANHALWKAQVSAAVRGARLLGYLNGDIKAPDAELSVTIDGKTTTKPNPAFED 79
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W+ANDQ +LG++L+S++ ++ Q+ C+T+ + W ++L TR++ + + + +
Sbjct: 80 WEANDQLVLGYLLSSLSRDVLIQVATCKTAAEAWRSIEALYSTGTRARAVNTRLALTNTK 139
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTL 188
KG MK+ +Y+ KM+ L D++ G+P+ DL+ + GL+ +++PIV L SD T+
Sbjct: 140 KGTMKIAEYVAKMRALGDEMAAGGHPLDEEDLVQYIIAGLNEDFSPIVSNLCNKSDPITV 199
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNN----------- 237
+L +QL+ FE+ ++ + A VAN+ NNN
Sbjct: 200 G--ELYSQLVNFETLLDLYR---STGQGGAAFVANRGRGGGGGGRGNNNNSGGGGGRSAP 254
Query: 238 -WRGSNFRGWRGGRGR---GRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQ 293
RGS +G RGGRGR G+ + CQVC K HTA +C++RFD++Y
Sbjct: 255 GGRGSGSQG-RGGRGRGTGGQDRRPTCQVCFKRGHTAADCWYRFDEDY-----------V 302
Query: 294 GSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV- 351
A+ NS D +WY D+GA++H+T + K +++G + +G ++I
Sbjct: 303 ADEKLVAAATNSYGIDTNWYIDTGATDHITGELEKLTTKEKYNGGEQIHTASGAGMDISH 362
Query: 352 ---ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ ++++L++VLYVP KNL+S S+LAADN+ F+E
Sbjct: 363 IGHTIVHTPSRNIHLNNVLYVPQAKKNLISASQLAADNSAFLE 405
>gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1437
Score = 214 bits (544), Expect = 6e-54
Identities = 127/416 (30%), Positives = 219/416 (52%), Gaps = 38/416 (9%)
Query: 1 MASAANNNKND--LPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITS 58
MAS++ NN + + VS KL ++N+ +WK+ +L +RG +L+G++ G + P +
Sbjct: 1 MASSSKNNTGNPLVGQPVSEKLGKSNHAVWKAQILATIRGARLEGHLTGDDQPPAPILRR 60
Query: 59 SDSSKS---NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAH 115
+ K +NP +EEW A DQ++L ++L+SM ++ Q+ C T+ W Q + G+
Sbjct: 61 KEGEKEVVVSNPEYEEWVATDQQVLAYLLSSMTKDLLVQVATCRTAASAWSMIQGMFGSM 120
Query: 116 TRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEY 175
TR++ I + +++KG+M + Y+ KM+ LAD L G P+ + +LI GLD E+
Sbjct: 121 TRARTINTRLSLSTLQKGDMNITTYVGKMRALADDLMAVGKPVDDDELIGYIFAGLDDEF 180
Query: 176 NPI---VVKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRF 232
P+ +V D T+ + AQL++FE R+ + ++N+ + + +R
Sbjct: 181 EPVISTIVGRPDPVTIG--ETYAQLISFEQRLAHRRSGDQSSVNSASRSRGQPQRGGSRS 238
Query: 233 NSNNN-WRGSNFRGWRGGRGRGRSS------------KAPCQVCGKTNHTAINCFHRFDK 279
++N RG+ G GRGRG S + CQ+C K HT +C++R+D+
Sbjct: 239 GGDSNRGRGAPSNGANRGRGRGNPSGGRANVGGGTDNRPKCQLCYKRGHTVCDCWYRYDE 298
Query: 280 NYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNS 339
N+ A + + + D +WY D+GA++HVT + +K ++HG +
Sbjct: 299 NFVPDERFAGT-----------AVSYGVDTNWYLDTGATDHVTGELDKLTVRDKYHGNDQ 347
Query: 340 LVVGNGDKLEIVATCSSKLK----SLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G +EI +S +K +L+L DVLYVP KNL+S KL +DN F+E
Sbjct: 348 VHTASGAGMEISHIGNSVVKTPSRNLHLKDVLYVPKANKNLVSAYKLTSDNLAFIE 403
>gb|AAU43956.1| unknown protein [Oryza sativa (japonica cultivar-group)]
gi|52353503|gb|AAU44069.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1447
Score = 211 bits (537), Expect = 4e-53
Identities = 116/400 (29%), Positives = 212/400 (53%), Gaps = 34/400 (8%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN------NP 67
+++S KL ++N+ LWK+ V+ VRG +L+G++ G K P IT++ K NP
Sbjct: 16 NAISEKLSKSNHALWKAQVMAAVRGARLEGHLTGATKTPNALITTTAGDKGEKEVTVRNP 75
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
F++W A DQ++LG++L+++A ++ Q+ C T+ W + + + TR++ I +
Sbjct: 76 EFDDWVATDQQVLGFLLSTLARDVLAQVATCGTAAAAWQMLEEMYSSVTRARFINTRIAL 135
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHTT 187
+ +KG + + +Y+ KMK LAD++ AG + + DLI + GLD Y P++ + T
Sbjct: 136 SNTKKGTLSINEYVSKMKALADEMTAAGKIVDDDDLISYIIAGLDDTYEPVISTIVGKDT 195
Query: 188 LSWVDLQAQLLTFESRIE-QLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGW 246
++ + +QLL+FE R+ + +++NL R + RG N G
Sbjct: 196 MTLGEAYSQLLSFEQRLALRHGGDSSVNLANRGRGGGGGQQRGGNTGNGGRGRGGNNNGA 255
Query: 247 RGGRGRGRS----------SKAPCQVCGKTNHTAINCFHRFDKNY-SRSNYSADSDKQGS 295
GRGRG + ++ CQ+C K HT INC++R+D+++ Y+ + G
Sbjct: 256 NRGRGRGNNGGARPPGGVDNRPKCQLCYKRGHTVINCWYRYDEDFVPDEKYAGSATSYGI 315
Query: 296 HNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV---- 351
D +WY D+ A++HVT + +K + G++ + +G +EI
Sbjct: 316 ------------DTNWYVDTSATDHVTGELDKLTVRDRYKGQDQVHTASGAGMEISHIGH 363
Query: 352 ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+T + + ++L ++LYVPN KNL+S ++L +DN+ ++E
Sbjct: 364 STVRTPNRDIHLRNILYVPNANKNLVSANRLVSDNSAYME 403
>ref|NP_915223.1| putative rice retrotransposon retrofit gag/pol polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|20161626|dbj|BAB90546.1| putative rice
retrotransposon retrofit gag/pol polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1448
Score = 206 bits (524), Expect = 1e-51
Identities = 128/404 (31%), Positives = 214/404 (52%), Gaps = 41/404 (10%)
Query: 15 SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN---NPAFEE 71
SVS KL + N+ LWK+ V V G +L GY+ G K P I+ + K+ NPAFE+
Sbjct: 20 SVSEKLGKANHALWKAQVSAAVHGARLLGYLNGDIKAPNAEISVTIDGKTTTKPNPAFED 79
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W+ANDQ +LG++L+S++ ++ Q+ C+T+ + W ++L TR++ + + + +
Sbjct: 80 WEANDQLVLGYLLSSLSRDVLIQVATCKTAAEAWRNIEALYSTGTRARAVNTRLALTNTK 139
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTL 188
KG MK+ +Y+ KM+ L D++ G P+ L+ + GL+ +++PIV L SD T+
Sbjct: 140 KGTMKIAEYVAKMRALCDEMAAGGRPLDEEGLVQYIIAGLNEDFSPIVSNLCNKSDPITV 199
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNN----------- 237
+L +QL+ FE+ ++ + AN R N+NN+
Sbjct: 200 G--ELYSQLVNFETLLDLYRSTGQGGAAFVANRGRGGGGGGGRGNNNNSDGGGGGGGRGA 257
Query: 238 --WRGSNFRGWRGGRGR---GRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDK 292
RG +G RGG GR G+ + CQVC K HTA +C++RFD++Y
Sbjct: 258 PRGRGGGGQG-RGGHGRGTGGQDRRPTCQVCFKRGHTAADCWYRFDEDY----------- 305
Query: 293 QGSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV 351
A+ NS D +WY D+GA++H+T + K +++G + +G ++I
Sbjct: 306 VADEKLVAAATNSYGIDTNWYIDTGATDHITGELEKLTTKEKYNGGEQIHTASGAGMDIS 365
Query: 352 ----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ ++++L++VLYVP KNL+S S+LAADN+ F+E
Sbjct: 366 HIGHTIVHTPSRNIHLNNVLYVPQAKKNLISASQLAADNSAFLE 409
>gb|AAP54977.1| putative gag-pol polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37536776|ref|NP_922690.1| putative
gag-pol polyprotein [Oryza sativa (japonica
cultivar-group)] gi|14165328|gb|AAK55460.1| putative
gag-pol polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1031
Score = 204 bits (519), Expect = 5e-51
Identities = 129/417 (30%), Positives = 213/417 (50%), Gaps = 42/417 (10%)
Query: 1 MASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD 60
MA+A +N L +S KL + N+PLW + +L +RG +L+ +++ T P I D
Sbjct: 6 MAAAISNPLFGL--QISEKLTKQNHPLWAAQILTTLRGAQLEEHIVSTTAAPAAEIEKED 63
Query: 61 SSKSN-------NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
K NP ++ W DQ++LG++ +S++ E+ Q+ T+ Q W+ +
Sbjct: 64 GDKDKKTKIVIPNPEYKTWFVQDQQVLGFIFSSLSREVLQQVAGARTAAQAWNMIDDMFS 123
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
+++ I + + +KG M + +Y+ KM++LAD++ G P+ +L+ +NGLDS
Sbjct: 124 CKSKAGTINVLLALTTTQKGPMSISEYIAKMRSLADEMAATGKPLDEEELVAYIINGLDS 183
Query: 174 EYNPIVVKLSDHTTLSWVDLQ---AQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
E++ V L ++ V + +QLL++E+RI + L +AN AN+ R
Sbjct: 184 EFDAAVEGLMATARIAPVSISHVYSQLLSYENRI----RIRQAYLTTSANAANRGGGRGG 239
Query: 231 RFNSNNN---WRGSNFRGWRG---------GRGRGRSSKAPCQVCGKTNHTAINCFHRFD 278
R +S N RG RG RG GRGRG ++ CQVC K H A +C+HR+D
Sbjct: 240 RGSSTGNRGGRRGGFGRGGRGRGAPSGASQGRGRGNDTRPVCQVCHKRGHVASDCWHRYD 299
Query: 279 KNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKN 338
+Y +K G A+ D +WY D+GA++H+T Q +K + G +
Sbjct: 300 DSY------VPDEKLGG----AATYAYGVDTNWYVDTGATDHITGQLDKLTTKERYKGTD 349
Query: 339 SLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G+ + I A + + L+L +VL+VP KNL+SV KL ADN F+E
Sbjct: 350 QIHTASGEGMSIKHVGHAIVPTPSRPLHLKNVLHVPEAAKNLVSVHKLVADNYAFLE 406
>gb|AAU10682.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1210
Score = 203 bits (517), Expect = 9e-51
Identities = 128/418 (30%), Positives = 217/418 (51%), Gaps = 46/418 (11%)
Query: 1 MASAANNNKNDL-PSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSS 59
MAS++ N L ++S KL +NN+ LWK+ +LPV+ G +++GY+ G + P I
Sbjct: 1 MASSSGAAVNPLFGQAISEKLTKNNFSLWKTHILPVICGARMEGYLTGATQVPSAEIEVK 60
Query: 60 DSSKS------NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
+ K +NPA+E W A DQ++LG++L+S++ E+ Q+ + +T+ W L
Sbjct: 61 EGEKGEITKKVSNPAYEAWIAADQQVLGFLLSSISKEILIQVANVDTAAHAWKMIVGLLS 120
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
+R++ + + + +KGE + DY+ KMK LAD++ AG P+ + + L GLDS
Sbjct: 121 TQSRARALNTRIALATTQKGESSVSDYISKMKTLADEMASAGKPLDDEEFTSYILAGLDS 180
Query: 174 EYNPIV---VKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
+Y +V V S+ T+S ++ +QLL+FE + TN + + ++V + R+N
Sbjct: 181 DYEQVVSSIVGRSEGVTIS--EVYSQLLSFEELWQ-----TNGSSGSYSSVNSANHGRNN 233
Query: 231 RFNSNNNWRGSNF--------RGWRGGRGRGRSS-----KAPCQVCGKTNHTAINCFHRF 277
N N G F RG RGG GRG + CQ+C K H +C+H +
Sbjct: 234 GGGGNFNNGGGYFSNRGRGGGRGDRGGCGRGNGGRNFKPRPTCQLCSKVGHVVADCWHCY 293
Query: 278 DKNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGK 337
D ++ A + G D +WY D+GA++H+T++ K ++GK
Sbjct: 294 DDSFVPDARVAAAASYG------------VDSNWYVDTGATDHITNELEKLTTRDRYNGK 341
Query: 338 NSLVVGNGDKLEI----VATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G ++I +T + ++L L ++L+VP KNL+S +LA DN+ FVE
Sbjct: 342 EQIHTASGSGMDIKHIGQSTIRTPTRNLYLRNILHVPRTKKNLISAHRLAVDNHAFVE 399
>ref|XP_476197.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|46981313|gb|AAT07631.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|46981245|gb|AAT07563.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1431
Score = 201 bits (510), Expect = 6e-50
Identities = 132/417 (31%), Positives = 208/417 (49%), Gaps = 51/417 (12%)
Query: 1 MASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD 60
MA+A +N L +S KL + N+PLW + +L +RG +L+ +++ T P I D
Sbjct: 6 MAAAISNPLFGL--QISDKLTKQNHPLWAAQILTTLRGAQLEEHIVSTTAAPAAEIEKED 63
Query: 61 SSKSN-------NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
K NP ++ W DQ++LG++ +S++ E+ Q+ T+ Q W+ +
Sbjct: 64 GDKDKKTKIVIPNPEYKTWFVQDQQVLGFIFSSLSREVLQQVAGARTAAQAWNMIDDMFS 123
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
KS+ +I KG M M +Y+ KM++LADK+ G P+ +L+ +NGLDS
Sbjct: 124 C---------KSKAGTINKGPMSMSEYIAKMRSLADKMAATGKPLDEEELVAYIINGLDS 174
Query: 174 EYNPIVVKLSDHTTLSWVDLQ---AQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
E++ V L ++ V + +QLL++E+RI + L +AN AN+ R
Sbjct: 175 EFDAAVEGLMATARIAPVSISHVYSQLLSYENRI----RIRQAYLTTSANAANRGGGRGG 230
Query: 231 RFNSNNN---WRGSNFRGWRG---------GRGRGRSSKAPCQVCGKTNHTAINCFHRFD 278
R +S N RG RG G GRGRG ++ CQVC K H A +C+HR+D
Sbjct: 231 RGSSTGNRGGGRGGFGRGGHGRGAPSGASQGRGRGNDTRPVCQVCHKRGHVASDCWHRYD 290
Query: 279 KNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKN 338
+Y +K G A+ D +WY D+GA++H+T Q +K + G +
Sbjct: 291 DSY------VPDEKLGG----AATYAYGVDTNWYVDTGATDHITGQLDKLTTKERYKGTD 340
Query: 339 SLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G+ I A + L+L +VL+VP KNL+SV KL ADN F+E
Sbjct: 341 QIHTASGEGTSIKHVGHAIVPTPSHPLHLKNVLHVPEAAKNLVSVHKLVADNYAFLE 397
>ref|NP_909900.1| putative copia-like retrotransposon Hopscotch polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|12957712|gb|AAK09230.1| putative copia-like
retrotransposon Hopscotch polyprotein [Oryza sativa
(japonica cultivar-group)] gi|14718304|gb|AAK72882.1|
putative gag-pol protein [Oryza sativa]
Length = 1219
Score = 199 bits (506), Expect = 2e-49
Identities = 118/397 (29%), Positives = 207/397 (51%), Gaps = 38/397 (9%)
Query: 16 VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN------NPAF 69
VS KL + N+ LW + VL +RG +L+G++LGT PE + + K NPA+
Sbjct: 19 VSEKLTKQNHSLWSAQVLTALRGARLEGHVLGTSVPPEAELEQKEGEKGEKTVRVPNPAY 78
Query: 70 EEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHS 129
EW A DQ++LG++ +S+ E+ +Q+ T+ W ++ + + I ++ +
Sbjct: 79 GEWFATDQQVLGFLFSSLTREIRSQVAGAPTAAAAWKTIENTFSTRSHAGAINVRLALTT 138
Query: 130 IRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHTTL 188
+KG+ + +Y+ KM+ L D++ G PI + +L+ +NGLDSE++P+V L + + ++
Sbjct: 139 TQKGQSTVTEYVSKMRALGDEIAATGKPIDDEELVAYIINGLDSEFDPVVEALIAKNASV 198
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWR-------GS 241
+ ++ +QLL FE+R++ + A N+ R N R GS
Sbjct: 199 TVAEVYSQLLGFENRVK-------IRTACAATSGNRGSGNQGRGGGNPRGRGTGRGGGGS 251
Query: 242 NFRGWRGGRGRGR---SSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNA 298
RG GRG GR ++ CQVC K H +C+HR+D+NY +K G
Sbjct: 252 GGRGGGHGRGNGRGGTDNRPTCQVCHKKGHVVADCWHRYDENY------VPDEKLGG--- 302
Query: 299 FIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATC 354
A+ D +WY D+ A++H+T Q +K ++ G + + +G+ ++I +
Sbjct: 303 -AATHAYGVDTNWYVDTEATDHITGQLDKLTTREKYKGTDQIHTASGEGMDIQHIGHSYV 361
Query: 355 SSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ + L+L ++L+VP +KNL+SV +L ADN F+E
Sbjct: 362 PTSSRPLHLKNILHVPKASKNLISVHRLVADNYAFLE 398
>ref|NP_915573.1| putative gag/pol polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1449
Score = 198 bits (503), Expect = 4e-49
Identities = 125/410 (30%), Positives = 219/410 (52%), Gaps = 47/410 (11%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT------SSDSSKSNNP 67
+++S KL +NNY LWK+ VL VRG +L+G++ GT P I+ ++++ NP
Sbjct: 10 NTISEKLAKNNYALWKAQVLASVRGARLEGHLTGTTAAPAITISVPGEKEGDKATRAANP 69
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
A++EW A DQ++LG +L++++ ++ Q+ C T+ W + + + TR++ I +
Sbjct: 70 AYDEWVATDQQILGLLLSTLSKDVLAQVATCGTAAAAWSMLEEMYTSMTRARFINTRIAL 129
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHT 186
+ +KG++ + +Y+ KM+ L D + AG + + DLI + GLD Y P++ +
Sbjct: 130 SNTKKGDLSITEYVAKMRALGDDMTAAGKVVDDEDLISYIIAGLDDTYEPVISSIVGKSE 189
Query: 187 TLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANK---------------FDHRDNR 231
+S+ + +QLL+FE R NNL + ++AN+AN+ ++R
Sbjct: 190 PMSFGEAFSQLLSFEQR----NNLRH-GGESSANLANRGRGTTGGNGGQRGRGGNNRGRG 244
Query: 232 FNSNNNW--RGSNFR---GWRGGR-GRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSN 285
N NN RG R G+ GGR G G ++ CQ+C K HT INC++R+D+++
Sbjct: 245 GNGGNNSANRGKGGRGNGGFNGGRQGGGVDTRPKCQLCYKRGHTVINCWYRYDEDFVPDE 304
Query: 286 YSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
A S A+ + D +WY D+GA++HVT + K + G + + +G
Sbjct: 305 KYAGS----------AATSYGIDTNWYVDTGATDHVTGELEKLIVRDRYKGHDQVHTASG 354
Query: 346 DKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+EI S +K+ ++L ++L+VP KNL+S +L +DN+ F+E
Sbjct: 355 AGMEISHIGHSIVKTPSRDIHLRNILHVPKANKNLVSAQRLVSDNSAFME 404
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 186 bits (473), Expect = 1e-45
Identities = 129/411 (31%), Positives = 202/411 (48%), Gaps = 44/411 (10%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEE---FITSSDSSKSNNPAFE 70
SSV++KL +NY LWK+ ++ KL G++ G P + + +S+ NP +E
Sbjct: 15 SSVTLKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYE 74
Query: 71 EWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQI---IYLKSEF 127
W DQ + W+ +++ E+ + + TS+Q+W SLA +S + L+
Sbjct: 75 SWFCTDQLVRSWLFGTLSEEVLGHVHNLSTSRQIW---VSLAENFNKSSVAREFSLRQNL 131
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVV----KLS 183
+ K E Y + K + D L G P+ S I LNGL +Y+PI LS
Sbjct: 132 QLLSKKEKPFSVYCREFKTICDALSSIGKPVDESMKIFGFLNGLGRDYDPITTVIQSSLS 191
Query: 184 DHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRG--- 240
T ++ D+ +++ F+S+++ ++ + N+ + + ++N N RG
Sbjct: 192 KLPTPTFNDVVSEVQGFDSKLQSYEEAASVTPHLAFNI-ERSESGSPQYNPNQKGRGRSG 250
Query: 241 -SNFRGWRGGRGRGRSS----------KAPCQVCGKTNHTAINCFHRFDKNYSRSNYSAD 289
+ RG RGRG S + CQ+CG+T HTA+ C++RFD NY
Sbjct: 251 QNKGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNRFDNNY-------- 302
Query: 290 SDKQGSHNAFIASQNSVED-YDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKL 348
Q AF + S + +W+ DS A+ HVT TN Q TE+ G ++++VG+G L
Sbjct: 303 ---QAEIQAFSTLRVSDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYL 359
Query: 349 EIVATCSSKLKSLN----LDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
I T S+ +KS N L++VL VPNI K+LLSVSKL D V FD N
Sbjct: 360 PITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDAN 410
>ref|XP_475911.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|52353546|gb|AAU44112.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|50080247|gb|AAT69582.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1256
Score = 186 bits (471), Expect = 2e-45
Identities = 112/369 (30%), Positives = 194/369 (52%), Gaps = 45/369 (12%)
Query: 1 MASAANNNKNDLPSSVSV--KLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITS 58
MAS++ ++ + ++V KL + NY +WK VL V+RG +LD Y+ G K P I
Sbjct: 1 MASSSQSSASGSLGGITVTEKLSKGNYLIWKVQVLAVIRGARLDSYLTGATKKPSATIII 60
Query: 59 SDSSKS---NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAH 115
+ K +NPA +EW ANDQ++LG++L +M+ ++ +Q+ C ++ LW + + +
Sbjct: 61 KKNEKEVEVSNPAVDEWIANDQQVLGYLLTTMSRDVLSQVATCSSAASLWSTIEGMFSSA 120
Query: 116 TRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEY 175
TR++ I K + +KG++ + +Y+ KM+ LAD+L +G P+ DLI + GLD ++
Sbjct: 121 TRARSINTKIALTNTKKGDLGIAEYVSKMRVLADELATSGKPVDEEDLISYIIAGLDEDF 180
Query: 176 NPIV---VKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRF 232
PI+ V S+H +L + +QLL+FE R++ + +AN+AN+ R N
Sbjct: 181 EPIISSLVSKSEHVSLG--EAYSQLLSFEQRMK-------MRQEHSANLANRGRGRGNPG 231
Query: 233 NSNNNWRGSNFRGWRGG--RGRGRSS-------------KAPCQVCGKTNHTAINCFHRF 277
NN + + GG RGRGR + + CQ+C K HT I+C++R+
Sbjct: 232 RGRNNKQPQQQQRGHGGNSRGRGRGNNSNQRQGGNGVDYRPKCQLCYKRGHTVIDCWYRY 291
Query: 278 DKNY-SRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHG 336
D+++ Y+ + G D +WY D+G ++HVT + K ++ G
Sbjct: 292 DEDFVPDEKYAGTTASYG------------VDSNWYVDTGTTDHVTGELEKLTIRDKYKG 339
Query: 337 KNSLVVGNG 345
++ + NG
Sbjct: 340 QDQVQTANG 348
>gb|AAX96193.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 1621
Score = 183 bits (465), Expect = 9e-45
Identities = 122/364 (33%), Positives = 188/364 (51%), Gaps = 36/364 (9%)
Query: 2 ASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLG-TKKCPEEFITSSD 60
++AA N P +S KL + N+ LWK V VRG +L G++ G TK+ P E + D
Sbjct: 92 STAATNPFQGHP--ISEKLGKANHALWKVQVSAAVRGARLQGHLTGATKRPPAEIAVTKD 149
Query: 61 SS--KSNNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRS 118
+ K NPA E+W+A DQ++LG++L+S+ E+ Q+ C+T+ ++W + + HTR+
Sbjct: 150 GATKKEPNPAHEDWEATDQQVLGYLLSSLTREVLMQVATCDTAAEVWSAIEQMYSTHTRA 209
Query: 119 QIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLA-GNPISNSDLIIQTLNGLDSEYNP 177
+ I + + +KG M +Y KMK+L D++ A G PI +LI + GL Y+
Sbjct: 210 RAINTRFALTNTKKGNMSTPEYFAKMKSLGDEMATAGGRPIDEEELIQYIITGLGEGYSE 269
Query: 178 IVVKLSDHT-TLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANK--FDHRDNRFNS 234
+V + +S DL +Q+L FE+R + T NVAN+ F H R NS
Sbjct: 270 VVSAVCARVEPISVSDLYSQVLNFEARQAIYRGAQEV----TVNVANRGGFSH-GGRGNS 324
Query: 235 NNNWRGSNFRG--------WRGGRGRGRS----SKAP-CQVCGKTNHTAINCFHRFDKNY 281
N GS G GGRGRGR+ K P CQVC K HTA +C++R+D++Y
Sbjct: 325 NGGQGGSRGGGGGGHGRGNGGGGRGRGRTPGGVDKRPICQVCFKRGHTAADCWYRYDEDY 384
Query: 282 SRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLV 341
H A A + D +WY D+GA++HVT + +K +++G +
Sbjct: 385 V---------PDAKHVAAAAVNSYGVDTNWYIDTGATDHVTGELDKLTMKEKYNGGEQIH 435
Query: 342 VGNG 345
+G
Sbjct: 436 TASG 439
>gb|AAX95626.1| Zinc knuckle, putative [Oryza sativa (japonica cultivar-group)]
gi|50901022|ref|XP_462944.1| putative gag-pol protein
[Oryza sativa (japonica cultivar-group)]
Length = 535
Score = 182 bits (462), Expect = 2e-44
Identities = 112/394 (28%), Positives = 205/394 (51%), Gaps = 35/394 (8%)
Query: 19 KLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKS-----NNPAFEEWQ 73
K + N+ LW + +L +RG +L+GY+ GT + P D K +NP + +W
Sbjct: 134 KFTKQNHSLWSAQILTTLRGAQLEGYITGTAEAPAAECEKEDGDKKVKTTISNPEYIKWF 193
Query: 74 ANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKG 133
DQ++LG++ +S++ E+ Q+ +T+ Q W + +++ I + + +KG
Sbjct: 194 TQDQQVLGFLFSSLSREVLQQVAGAKTAAQAWSMINDMFTCKSKAGAINVLLALTTTQKG 253
Query: 134 EMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTLSW 190
+ + +Y+ KM++L D++ AG P+ + +LI +NGL+S+++ V L + LS
Sbjct: 254 PISISEYIAKMRSLGDEMAGAGKPLDDEELIAYIINGLNSDFDATVEGLMATARIAPLSI 313
Query: 191 VDLQAQLLTFESRIEQLNNLTNLNLNATANVAN------KFDHRDNRFNSNNNWRG---S 241
+ +QLL++E+RI + L +AN AN + +R R ++ + RG
Sbjct: 314 SHVYSQLLSYENRI----RIRQAYLTTSANAANRGGRGGRGGNRGGRSSAPHGGRGGGRG 369
Query: 242 NFRGWRGGRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIA 301
N G GRGRG ++ CQVC K H A +C+HR+D+NY +K G A
Sbjct: 370 NTGGANPGRGRGNDTRPVCQVCHKRGHVASDCWHRYDENY------GPDEKLGG----AA 419
Query: 302 SQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIVATCSSKLKS- 360
+ D +WY D+ A++H+T Q +K ++ G + + +G+ + + + + +
Sbjct: 420 TYAYGVDTNWYVDTRATDHITGQLDKLTTREKYKGTDLIHTVSGEGMNVKHIVHTIVPTP 479
Query: 361 ---LNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
L+L ++L+VP +KNL+ V +L ADN F++
Sbjct: 480 SCPLHLKNILHVPQASKNLVFVHRLVADNYAFLD 513
>ref|XP_507315.1| PREDICTED P0623F08.21-2 gene product [Oryza sativa (japonica
cultivar-group)]
Length = 354
Score = 181 bits (459), Expect = 5e-44
Identities = 108/344 (31%), Positives = 182/344 (52%), Gaps = 32/344 (9%)
Query: 16 VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN----NPAFEE 71
VS KL + NY LW + VL +RG +LDG++ G P I + S K+ NPA++E
Sbjct: 19 VSEKLTKGNYALWSAQVLAAIRGARLDGHITGATAAPSMEIEKTASDKTTEKIVNPAYQE 78
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W A+DQ++LG++L++++ ++ TQ+ T+ Q W + ++ A T+++ + ++ + +
Sbjct: 79 WFASDQQVLGFLLSTLSRDILTQVATASTAAQAWQQVCAMFTAQTKARSLNVRLTLTNTQ 138
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIV---VKLSDHTTL 188
KG M + +Y KMK LAD++ +G P+ DL+ LNGLD ++ P+V V ++ TT+
Sbjct: 139 KGNMSISEYCGKMKALADEIASSGKPLDEEDLVAYVLNGLDDDFEPVVSAIVARNESTTM 198
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDH-------RDNRFNSNNNWRGS 241
+ ++ +QLL FE+R Q + + NA A F R N RG
Sbjct: 199 A--EVYSQLLNFENR--QALRQAHASANAAARGRGGFQRGRGGGHGTCGRSNPAAPGRG- 253
Query: 242 NFRGWRGGRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIA 301
RG G + RG + + CQVC K H A C+HR+D+ Y SA A
Sbjct: 254 --RGTGGNQTRGGNDRPICQVCLKRGHVAAECWHRYDETYVPDERSA-----------AA 300
Query: 302 SQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
+ D +WY D+GA++H+T++ +K ++ G + + +G
Sbjct: 301 AAAYGIDTNWYLDTGATDHITNELDKLDVREKYKGGDKIHTASG 344
>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
gi|25301689|pir||C96578 hypothetical protein T18A20.5
[imported] - Arabidopsis thaliana
Length = 1522
Score = 175 bits (443), Expect = 3e-42
Identities = 121/413 (29%), Positives = 200/413 (48%), Gaps = 40/413 (9%)
Query: 11 DLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEF--ITSSD-SSKSNNP 67
++ + V+V L++ NY LWKS + G L G++ G+ P + +T ++ +S+ NP
Sbjct: 10 NISNCVTVTLNQQNYILWKSQFESFLSGQGLLGFVTGSISAPAQTRSVTHNNVTSEEPNP 69
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
F W DQ + W+L S A ++ + +++C TS Q+W + + S++ L+
Sbjct: 70 EFYTWHQTDQVVKSWLLGSFAEDILSVVVNCFTSHQVWLTLANHFNRVSSSRLFELQRRL 129
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSD--- 184
++ K + ME +L +K++ D+L G+P+ I LNGL EY PI + +
Sbjct: 130 QTLEKKDNTMEVFLKDLKHICDQLASVGSPVPEKMKIFSALNGLGREYEPIKTTIENSVD 189
Query: 185 -HTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNF 243
+ +LS ++ ++L ++ R++ ++ + NV H D+ + NNN RG
Sbjct: 190 SNPSLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNVT----HSDSGYYHNNN-RGKGR 244
Query: 244 RGWRGG------RGRG-------------RSSKAPCQVCGKTNHTAINCFHRFDKNYSRS 284
G RGRG +S CQ+CGK H A+ C+HRFD +Y
Sbjct: 245 SNSGSGKSSFSTRGRGFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHRFDNSYQHE 304
Query: 285 NYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGN 344
+ I ++W DS AS HVT+ + Q +HG +S++V +
Sbjct: 305 DL-----PMALATMRITDVTDHHGHEWIPDSAASAHVTNNRHVLQQSQPYHGSDSIMVAD 359
Query: 345 GDKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVEFD 393
G+ L I T S + S + L +VL P+I K+LLSVSKL +D VEFD
Sbjct: 360 GNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVSKLTSDYPCSVEFD 412
>gb|AAP94600.1| putative copia-like retrotransposon Hopscotch polyprotein [Zea
mays]
Length = 969
Score = 174 bits (441), Expect = 6e-42
Identities = 101/346 (29%), Positives = 171/346 (49%), Gaps = 44/346 (12%)
Query: 26 PLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSNNPAFEEWQANDQRLLGWMLN 85
P K+ V +RG +L+GY+ G K P+E + KS NPAFEEW+A DQ++L ++L+
Sbjct: 2 PCGKAQVRAAMRGARLEGYLTGATKMPDEETVDNKGKKSPNPAFEEWEAKDQQILSYLLS 61
Query: 86 SMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMK 145
S++ E+ Q+ +T+ + W +++ + TR++ + L+ + +KG M + +Y KMK
Sbjct: 62 SISREVQIQVTSAKTAAEAWHSIEAMFASQTRARAVNLRLALSTTKKGSMTVAEYYTKMK 121
Query: 146 NLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHT-TLSWVDLQAQLLTFESRI 204
D++ AG P+ + +++ L GL+ E+ P+V L +S DL +QLL FE+++
Sbjct: 122 GYGDEMAAAGRPLQDEEMVEYILTGLEEEFLPMVSALVTRVDPISLEDLYSQLLNFETKL 181
Query: 205 EQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGWRGGRGRGRSSKAP----- 259
+ + + +AN+A R N GS RG RGGRG R +A
Sbjct: 182 DLMRGGGEQH-QGSANMA-------GRGGRGNQRGGSGGRGQRGGRGSSRGGRASWSGGR 233
Query: 260 --------------------CQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAF 299
CQVC K HTA C+HRF++++ A + H
Sbjct: 234 QSNQGGYIRRSNNSSDERPVCQVCFKKGHTAARCWHRFEEDFVPDEKLAGAATNSYH--- 290
Query: 300 IASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
D +WY D+GA++H+T + K ++ G + + +G
Sbjct: 291 -------VDTNWYTDTGATDHITGELEKLSIREKYAGGDQIHTASG 329
>gb|AAL66754.1| putative copia-like retrotransposon Hopscotch polyprotein [Zea
mays]
Length = 1313
Score = 174 bits (441), Expect = 6e-42
Identities = 101/346 (29%), Positives = 171/346 (49%), Gaps = 44/346 (12%)
Query: 26 PLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSNNPAFEEWQANDQRLLGWMLN 85
P K+ V +RG +L+GY+ G K P+E + KS NPAFEEW+A DQ++L ++L+
Sbjct: 2 PCGKAQVRAAMRGARLEGYLTGATKMPDEETVDNKGKKSPNPAFEEWEAKDQQILSYLLS 61
Query: 86 SMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMK 145
S++ E+ Q+ +T+ + W +++ + TR++ + L+ + +KG M + +Y KMK
Sbjct: 62 SISREVQIQVTSAKTAAEAWHSIEAMFASQTRARAVNLRLALSTTKKGSMTVAEYYTKMK 121
Query: 146 NLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHT-TLSWVDLQAQLLTFESRI 204
D++ AG P+ + +++ L GL+ E+ P+V L +S DL +QLL FE+++
Sbjct: 122 GYGDEMAAAGRPLQDEEMVEYILTGLEEEFLPMVSALVTRVDPISLEDLYSQLLNFETKL 181
Query: 205 EQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGWRGGRGRGRSSKAP----- 259
+ + + +AN+A R N GS RG RGGRG R +A
Sbjct: 182 DLMRGGGEQH-QGSANMA-------GRGGRGNQRGGSGGRGQRGGRGSSRGGRASWSGGR 233
Query: 260 --------------------CQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAF 299
CQVC K HTA C+HRF++++ A + H
Sbjct: 234 QSNQGGYIRRSNNSSDERPVCQVCFKKGHTAARCWHRFEEDFVPDEKLAGAATNSYH--- 290
Query: 300 IASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
D +WY D+GA++H+T + K ++ G + + +G
Sbjct: 291 -------VDTNWYTDTGATDHITGELEKLSIREKYAGGDQIHTASG 329
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 174 bits (440), Expect = 7e-42
Identities = 128/411 (31%), Positives = 205/411 (49%), Gaps = 42/411 (10%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEE---FITSSDSSKSNNPAFE 70
SSV++KL+ +NY LWK+ ++ KL G++ G P + + +S+ NP +E
Sbjct: 15 SSVTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYE 74
Query: 71 EWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQI---IYLKSEF 127
+W DQ + W+ +++ E+ + + TS+Q+W SLA +S I L+
Sbjct: 75 DWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWI---SLAENFNKSSIAREFSLRRNL 131
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVV----KLS 183
+ K + + Y K + D L G P+ S I LNGL EY+PI LS
Sbjct: 132 QLLTKKDKSLSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLS 191
Query: 184 DHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRG--- 240
++ D+ +++ F+S+++ ++ ++N + N + + ++NSN+ RG
Sbjct: 192 KLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNT-ERSNSGAPQYNSNSRGRGRSG 250
Query: 241 -SNFRGWRGGRGRGRS---SKAP-------CQVCGKTNHTAINCFHRFDKNYSRSNYSAD 289
+ RG RGRG S S +P CQ+CG+ HTAI C++RFD NY
Sbjct: 251 QNRGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNY-------- 302
Query: 290 SDKQGSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKL 348
+ AF A + S E +WY DS A+ H+T T+ Q+ T + G ++++VG+G L
Sbjct: 303 -QSEVPTQAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYL 361
Query: 349 EIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
I T SS ++ L++VL P I K+LLSVSKL D V FD N
Sbjct: 362 PITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDAN 412
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.316 0.130 0.391
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 755,542,067
Number of Sequences: 2540612
Number of extensions: 29795938
Number of successful extensions: 122490
Number of sequences better than 10.0: 908
Number of HSP's better than 10.0 without gapping: 197
Number of HSP's successfully gapped in prelim test: 738
Number of HSP's that attempted gapping in prelim test: 116759
Number of HSP's gapped (non-prelim): 3389
length of query: 482
length of database: 863,360,394
effective HSP length: 132
effective length of query: 350
effective length of database: 527,999,610
effective search space: 184799863500
effective search space used: 184799863500
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)
Medicago: description of AC144765.7