
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144765.7 - phase: 0
(482 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q60DN3 Integrase core domain containing protein [Oryza... 225 3e-57
UniRef100_O24438 Retrofit [Oryza longistaminata] 214 4e-54
UniRef100_Q6ATL7 Putative polyprotein [Oryza sativa] 214 6e-54
UniRef100_Q65X82 Putative polyprotein [Oryza sativa] 211 4e-53
UniRef100_Q8RZ67 Putative rice retrotransposon retrofit gag/pol ... 206 1e-51
UniRef100_Q94LQ7 Putative gag-pol polyprotein [Oryza sativa] 204 5e-51
UniRef100_Q688S3 Putative polyprotein [Oryza sativa] 203 8e-51
UniRef100_Q75G45 Putative polyprotein [Oryza sativa] 201 5e-50
UniRef100_Q7G7H3 Putative gag-pol protein [Oryza sativa] 199 1e-49
UniRef100_Q94DD5 Putative gag/pol polyprotein [Oryza sativa] 198 3e-49
UniRef100_Q7XKV9 OSJNBa0073E02.10 protein [Oryza sativa] 187 8e-46
UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana] 186 1e-45
UniRef100_Q6F356 Putative polyprotein [Oryza sativa] 186 2e-45
UniRef100_Q94H72 Putative gag-pol protein [Oryza sativa] 182 2e-44
UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana] 175 3e-42
UniRef100_Q8W0X9 Putative copia-like retrotransposon Hopscotch p... 174 5e-42
UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana] 174 7e-42
UniRef100_Q8H7X8 Putative gag-pol polyprotein [Oryza sativa] 172 1e-41
UniRef100_Q9SLL4 Putative retroelement pol polyprotein [Arabidop... 172 3e-41
UniRef100_Q9SV56 Hypothetical protein AT4g28900 [Arabidopsis tha... 171 6e-41
>UniRef100_Q60DN3 Integrase core domain containing protein [Oryza sativa]
Length = 1021
Score = 225 bits (573), Expect = 3e-57
Identities = 127/397 (31%), Positives = 218/397 (53%), Gaps = 32/397 (8%)
Query: 15 SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSS---KSNNPAFEE 71
+V+ KL R N+ LWK+ +LPV+RG +++GY+ G + P I + D K++ PAFE
Sbjct: 16 AVAEKLTRTNFLLWKAQILPVIRGARMEGYLTGATQAPLAVIDAKDGEATVKASKPAFEM 75
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W DQ++LG++L++++ E+ TQ++ E++ Q+W + + +R++ + + +
Sbjct: 76 WITADQQVLGFLLSTLSKEILTQVISMESAAQVWKAITEMLSSQSRARALNTRLALATTL 135
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHTTLSW 190
KG++ + DY+ KMK LAD++ AG P+ + +LI L GLD +Y P+V L +S
Sbjct: 136 KGDLSVSDYISKMKVLADEMAFAGKPLDDEELISYVLAGLDDDYEPVVSSLVGKSEVVSL 195
Query: 191 VDLQAQLLTFESRIEQLNNLTN--LNLNATANVANKFD-----HRDNRFNSNNNWRGSNF 243
+ +QLL+F+SR + + + ++NA K +R R NNN R SN
Sbjct: 196 AECYSQLLSFKSRQKLRHAAAHQASSVNAARRGGGKGGGYTPFNRGGRSGGNNNGRRSNN 255
Query: 244 RGWRGGRG-----RGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNA 298
G RGGRG RG C +CGKT H +C++R+D+N+ N A + G
Sbjct: 256 GGGRGGRGNNNGDRGGKPHPVCHLCGKTGHVVADCWYRYDENFVPENKIAAAASYG---- 311
Query: 299 FIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATC 354
D +WY D+GA++H+T + +K +++GK+ + +G ++I +
Sbjct: 312 --------VDTNWYVDTGATDHITGELDKLTTREKYNGKDQIYTASGAGMDIKHIGHSVI 363
Query: 355 SSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +++ L ++L+VP KNLL +LA DN+ FVE
Sbjct: 364 CTPTRNIYLKNILHVPKAKKNLLFAHRLALDNHAFVE 400
>UniRef100_O24438 Retrofit [Oryza longistaminata]
Length = 1445
Score = 214 bits (545), Expect = 4e-54
Identities = 132/403 (32%), Positives = 220/403 (53%), Gaps = 43/403 (10%)
Query: 15 SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN---NPAFEE 71
SVS KL + N+ LWK+ V VRG +L GY+ G K P+ ++ + K+ NPAFE+
Sbjct: 20 SVSEKLGKANHALWKAQVSAAVRGARLLGYLNGDIKAPDAELSVTIDGKTTTKPNPAFED 79
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W+ANDQ +LG++L+S++ ++ Q+ C+T+ + W ++L TR++ + + + +
Sbjct: 80 WEANDQLVLGYLLSSLSRDVLIQVATCKTAAEAWRSIEALYSTGTRARAVNTRLALTNTK 139
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTL 188
KG MK+ +Y+ KM+ L D++ G+P+ DL+ + GL+ +++PIV L SD T+
Sbjct: 140 KGTMKIAEYVAKMRALGDEMAAGGHPLDEEDLVQYIIAGLNEDFSPIVSNLCNKSDPITV 199
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNN----------- 237
+L +QL+ FE+ ++ + A VAN+ NNN
Sbjct: 200 G--ELYSQLVNFETLLDLYR---STGQGGAAFVANRGRGGGGGGRGNNNNSGGGGGRSAP 254
Query: 238 -WRGSNFRGWRGGRGR---GRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQ 293
RGS +G RGGRGR G+ + CQVC K HTA +C++RFD++Y
Sbjct: 255 GGRGSGSQG-RGGRGRGTGGQDRRPTCQVCFKRGHTAADCWYRFDEDY-----------V 302
Query: 294 GSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV- 351
A+ NS D +WY D+GA++H+T + K +++G + +G ++I
Sbjct: 303 ADEKLVAAATNSYGIDTNWYIDTGATDHITGELEKLTTKEKYNGGEQIHTASGAGMDISH 362
Query: 352 ---ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ ++++L++VLYVP KNL+S S+LAADN+ F+E
Sbjct: 363 IGHTIVHTPSRNIHLNNVLYVPQAKKNLISASQLAADNSAFLE 405
>UniRef100_Q6ATL7 Putative polyprotein [Oryza sativa]
Length = 1437
Score = 214 bits (544), Expect = 6e-54
Identities = 127/416 (30%), Positives = 219/416 (52%), Gaps = 38/416 (9%)
Query: 1 MASAANNNKND--LPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITS 58
MAS++ NN + + VS KL ++N+ +WK+ +L +RG +L+G++ G + P +
Sbjct: 1 MASSSKNNTGNPLVGQPVSEKLGKSNHAVWKAQILATIRGARLEGHLTGDDQPPAPILRR 60
Query: 59 SDSSKS---NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAH 115
+ K +NP +EEW A DQ++L ++L+SM ++ Q+ C T+ W Q + G+
Sbjct: 61 KEGEKEVVVSNPEYEEWVATDQQVLAYLLSSMTKDLLVQVATCRTAASAWSMIQGMFGSM 120
Query: 116 TRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEY 175
TR++ I + +++KG+M + Y+ KM+ LAD L G P+ + +LI GLD E+
Sbjct: 121 TRARTINTRLSLSTLQKGDMNITTYVGKMRALADDLMAVGKPVDDDELIGYIFAGLDDEF 180
Query: 176 NPI---VVKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRF 232
P+ +V D T+ + AQL++FE R+ + ++N+ + + +R
Sbjct: 181 EPVISTIVGRPDPVTIG--ETYAQLISFEQRLAHRRSGDQSSVNSASRSRGQPQRGGSRS 238
Query: 233 NSNNN-WRGSNFRGWRGGRGRGRSS------------KAPCQVCGKTNHTAINCFHRFDK 279
++N RG+ G GRGRG S + CQ+C K HT +C++R+D+
Sbjct: 239 GGDSNRGRGAPSNGANRGRGRGNPSGGRANVGGGTDNRPKCQLCYKRGHTVCDCWYRYDE 298
Query: 280 NYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNS 339
N+ A + + + D +WY D+GA++HVT + +K ++HG +
Sbjct: 299 NFVPDERFAGT-----------AVSYGVDTNWYLDTGATDHVTGELDKLTVRDKYHGNDQ 347
Query: 340 LVVGNGDKLEIVATCSSKLK----SLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G +EI +S +K +L+L DVLYVP KNL+S KL +DN F+E
Sbjct: 348 VHTASGAGMEISHIGNSVVKTPSRNLHLKDVLYVPKANKNLVSAYKLTSDNLAFIE 403
>UniRef100_Q65X82 Putative polyprotein [Oryza sativa]
Length = 1447
Score = 211 bits (537), Expect = 4e-53
Identities = 116/400 (29%), Positives = 212/400 (53%), Gaps = 34/400 (8%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN------NP 67
+++S KL ++N+ LWK+ V+ VRG +L+G++ G K P IT++ K NP
Sbjct: 16 NAISEKLSKSNHALWKAQVMAAVRGARLEGHLTGATKTPNALITTTAGDKGEKEVTVRNP 75
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
F++W A DQ++LG++L+++A ++ Q+ C T+ W + + + TR++ I +
Sbjct: 76 EFDDWVATDQQVLGFLLSTLARDVLAQVATCGTAAAAWQMLEEMYSSVTRARFINTRIAL 135
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHTT 187
+ +KG + + +Y+ KMK LAD++ AG + + DLI + GLD Y P++ + T
Sbjct: 136 SNTKKGTLSINEYVSKMKALADEMTAAGKIVDDDDLISYIIAGLDDTYEPVISTIVGKDT 195
Query: 188 LSWVDLQAQLLTFESRIE-QLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGW 246
++ + +QLL+FE R+ + +++NL R + RG N G
Sbjct: 196 MTLGEAYSQLLSFEQRLALRHGGDSSVNLANRGRGGGGGQQRGGNTGNGGRGRGGNNNGA 255
Query: 247 RGGRGRGRS----------SKAPCQVCGKTNHTAINCFHRFDKNY-SRSNYSADSDKQGS 295
GRGRG + ++ CQ+C K HT INC++R+D+++ Y+ + G
Sbjct: 256 NRGRGRGNNGGARPPGGVDNRPKCQLCYKRGHTVINCWYRYDEDFVPDEKYAGSATSYGI 315
Query: 296 HNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV---- 351
D +WY D+ A++HVT + +K + G++ + +G +EI
Sbjct: 316 ------------DTNWYVDTSATDHVTGELDKLTVRDRYKGQDQVHTASGAGMEISHIGH 363
Query: 352 ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+T + + ++L ++LYVPN KNL+S ++L +DN+ ++E
Sbjct: 364 STVRTPNRDIHLRNILYVPNANKNLVSANRLVSDNSAYME 403
>UniRef100_Q8RZ67 Putative rice retrotransposon retrofit gag/pol polyprotein [Oryza
sativa]
Length = 1448
Score = 206 bits (524), Expect = 1e-51
Identities = 128/404 (31%), Positives = 214/404 (52%), Gaps = 41/404 (10%)
Query: 15 SVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN---NPAFEE 71
SVS KL + N+ LWK+ V V G +L GY+ G K P I+ + K+ NPAFE+
Sbjct: 20 SVSEKLGKANHALWKAQVSAAVHGARLLGYLNGDIKAPNAEISVTIDGKTTTKPNPAFED 79
Query: 72 WQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIR 131
W+ANDQ +LG++L+S++ ++ Q+ C+T+ + W ++L TR++ + + + +
Sbjct: 80 WEANDQLVLGYLLSSLSRDVLIQVATCKTAAEAWRNIEALYSTGTRARAVNTRLALTNTK 139
Query: 132 KGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTL 188
KG MK+ +Y+ KM+ L D++ G P+ L+ + GL+ +++PIV L SD T+
Sbjct: 140 KGTMKIAEYVAKMRALCDEMAAGGRPLDEEGLVQYIIAGLNEDFSPIVSNLCNKSDPITV 199
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNN----------- 237
+L +QL+ FE+ ++ + AN R N+NN+
Sbjct: 200 G--ELYSQLVNFETLLDLYRSTGQGGAAFVANRGRGGGGGGGRGNNNNSDGGGGGGGRGA 257
Query: 238 --WRGSNFRGWRGGRGR---GRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDK 292
RG +G RGG GR G+ + CQVC K HTA +C++RFD++Y
Sbjct: 258 PRGRGGGGQG-RGGHGRGTGGQDRRPTCQVCFKRGHTAADCWYRFDEDY----------- 305
Query: 293 QGSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV 351
A+ NS D +WY D+GA++H+T + K +++G + +G ++I
Sbjct: 306 VADEKLVAAATNSYGIDTNWYIDTGATDHITGELEKLTTKEKYNGGEQIHTASGAGMDIS 365
Query: 352 ----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ ++++L++VLYVP KNL+S S+LAADN+ F+E
Sbjct: 366 HIGHTIVHTPSRNIHLNNVLYVPQAKKNLISASQLAADNSAFLE 409
>UniRef100_Q94LQ7 Putative gag-pol polyprotein [Oryza sativa]
Length = 1031
Score = 204 bits (519), Expect = 5e-51
Identities = 129/417 (30%), Positives = 213/417 (50%), Gaps = 42/417 (10%)
Query: 1 MASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD 60
MA+A +N L +S KL + N+PLW + +L +RG +L+ +++ T P I D
Sbjct: 6 MAAAISNPLFGL--QISEKLTKQNHPLWAAQILTTLRGAQLEEHIVSTTAAPAAEIEKED 63
Query: 61 SSKSN-------NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
K NP ++ W DQ++LG++ +S++ E+ Q+ T+ Q W+ +
Sbjct: 64 GDKDKKTKIVIPNPEYKTWFVQDQQVLGFIFSSLSREVLQQVAGARTAAQAWNMIDDMFS 123
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
+++ I + + +KG M + +Y+ KM++LAD++ G P+ +L+ +NGLDS
Sbjct: 124 CKSKAGTINVLLALTTTQKGPMSISEYIAKMRSLADEMAATGKPLDEEELVAYIINGLDS 183
Query: 174 EYNPIVVKLSDHTTLSWVDLQ---AQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
E++ V L ++ V + +QLL++E+RI + L +AN AN+ R
Sbjct: 184 EFDAAVEGLMATARIAPVSISHVYSQLLSYENRI----RIRQAYLTTSANAANRGGGRGG 239
Query: 231 RFNSNNN---WRGSNFRGWRG---------GRGRGRSSKAPCQVCGKTNHTAINCFHRFD 278
R +S N RG RG RG GRGRG ++ CQVC K H A +C+HR+D
Sbjct: 240 RGSSTGNRGGRRGGFGRGGRGRGAPSGASQGRGRGNDTRPVCQVCHKRGHVASDCWHRYD 299
Query: 279 KNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKN 338
+Y +K G A+ D +WY D+GA++H+T Q +K + G +
Sbjct: 300 DSY------VPDEKLGG----AATYAYGVDTNWYVDTGATDHITGQLDKLTTKERYKGTD 349
Query: 339 SLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G+ + I A + + L+L +VL+VP KNL+SV KL ADN F+E
Sbjct: 350 QIHTASGEGMSIKHVGHAIVPTPSRPLHLKNVLHVPEAAKNLVSVHKLVADNYAFLE 406
>UniRef100_Q688S3 Putative polyprotein [Oryza sativa]
Length = 1210
Score = 203 bits (517), Expect = 8e-51
Identities = 128/418 (30%), Positives = 217/418 (51%), Gaps = 46/418 (11%)
Query: 1 MASAANNNKNDL-PSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSS 59
MAS++ N L ++S KL +NN+ LWK+ +LPV+ G +++GY+ G + P I
Sbjct: 1 MASSSGAAVNPLFGQAISEKLTKNNFSLWKTHILPVICGARMEGYLTGATQVPSAEIEVK 60
Query: 60 DSSKS------NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
+ K +NPA+E W A DQ++LG++L+S++ E+ Q+ + +T+ W L
Sbjct: 61 EGEKGEITKKVSNPAYEAWIAADQQVLGFLLSSISKEILIQVANVDTAAHAWKMIVGLLS 120
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
+R++ + + + +KGE + DY+ KMK LAD++ AG P+ + + L GLDS
Sbjct: 121 TQSRARALNTRIALATTQKGESSVSDYISKMKTLADEMASAGKPLDDEEFTSYILAGLDS 180
Query: 174 EYNPIV---VKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
+Y +V V S+ T+S ++ +QLL+FE + TN + + ++V + R+N
Sbjct: 181 DYEQVVSSIVGRSEGVTIS--EVYSQLLSFEELWQ-----TNGSSGSYSSVNSANHGRNN 233
Query: 231 RFNSNNNWRGSNF--------RGWRGGRGRGRSS-----KAPCQVCGKTNHTAINCFHRF 277
N N G F RG RGG GRG + CQ+C K H +C+H +
Sbjct: 234 GGGGNFNNGGGYFSNRGRGGGRGDRGGCGRGNGGRNFKPRPTCQLCSKVGHVVADCWHCY 293
Query: 278 DKNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGK 337
D ++ A + G D +WY D+GA++H+T++ K ++GK
Sbjct: 294 DDSFVPDARVAAAASYG------------VDSNWYVDTGATDHITNELEKLTTRDRYNGK 341
Query: 338 NSLVVGNGDKLEI----VATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G ++I +T + ++L L ++L+VP KNL+S +LA DN+ FVE
Sbjct: 342 EQIHTASGSGMDIKHIGQSTIRTPTRNLYLRNILHVPRTKKNLISAHRLAVDNHAFVE 399
>UniRef100_Q75G45 Putative polyprotein [Oryza sativa]
Length = 1431
Score = 201 bits (510), Expect = 5e-50
Identities = 132/417 (31%), Positives = 208/417 (49%), Gaps = 51/417 (12%)
Query: 1 MASAANNNKNDLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD 60
MA+A +N L +S KL + N+PLW + +L +RG +L+ +++ T P I D
Sbjct: 6 MAAAISNPLFGL--QISDKLTKQNHPLWAAQILTTLRGAQLEEHIVSTTAAPAAEIEKED 63
Query: 61 SSKSN-------NPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
K NP ++ W DQ++LG++ +S++ E+ Q+ T+ Q W+ +
Sbjct: 64 GDKDKKTKIVIPNPEYKTWFVQDQQVLGFIFSSLSREVLQQVAGARTAAQAWNMIDDMFS 123
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
KS+ +I KG M M +Y+ KM++LADK+ G P+ +L+ +NGLDS
Sbjct: 124 C---------KSKAGTINKGPMSMSEYIAKMRSLADKMAATGKPLDEEELVAYIINGLDS 174
Query: 174 EYNPIVVKLSDHTTLSWVDLQ---AQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDN 230
E++ V L ++ V + +QLL++E+RI + L +AN AN+ R
Sbjct: 175 EFDAAVEGLMATARIAPVSISHVYSQLLSYENRI----RIRQAYLTTSANAANRGGGRGG 230
Query: 231 RFNSNNN---WRGSNFRGWRG---------GRGRGRSSKAPCQVCGKTNHTAINCFHRFD 278
R +S N RG RG G GRGRG ++ CQVC K H A +C+HR+D
Sbjct: 231 RGSSTGNRGGGRGGFGRGGHGRGAPSGASQGRGRGNDTRPVCQVCHKRGHVASDCWHRYD 290
Query: 279 KNYSRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKN 338
+Y +K G A+ D +WY D+GA++H+T Q +K + G +
Sbjct: 291 DSY------VPDEKLGG----AATYAYGVDTNWYVDTGATDHITGQLDKLTTKERYKGTD 340
Query: 339 SLVVGNGDKLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ +G+ I A + L+L +VL+VP KNL+SV KL ADN F+E
Sbjct: 341 QIHTASGEGTSIKHVGHAIVPTPSHPLHLKNVLHVPEAAKNLVSVHKLVADNYAFLE 397
>UniRef100_Q7G7H3 Putative gag-pol protein [Oryza sativa]
Length = 1219
Score = 199 bits (506), Expect = 1e-49
Identities = 118/397 (29%), Positives = 207/397 (51%), Gaps = 38/397 (9%)
Query: 16 VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSN------NPAF 69
VS KL + N+ LW + VL +RG +L+G++LGT PE + + K NPA+
Sbjct: 19 VSEKLTKQNHSLWSAQVLTALRGARLEGHVLGTSVPPEAELEQKEGEKGEKTVRVPNPAY 78
Query: 70 EEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHS 129
EW A DQ++LG++ +S+ E+ +Q+ T+ W ++ + + I ++ +
Sbjct: 79 GEWFATDQQVLGFLFSSLTREIRSQVAGAPTAAAAWKTIENTFSTRSHAGAINVRLALTT 138
Query: 130 IRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHTTL 188
+KG+ + +Y+ KM+ L D++ G PI + +L+ +NGLDSE++P+V L + + ++
Sbjct: 139 TQKGQSTVTEYVSKMRALGDEIAATGKPIDDEELVAYIINGLDSEFDPVVEALIAKNASV 198
Query: 189 SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWR-------GS 241
+ ++ +QLL FE+R++ + A N+ R N R GS
Sbjct: 199 TVAEVYSQLLGFENRVK-------IRTACAATSGNRGSGNQGRGGGNPRGRGTGRGGGGS 251
Query: 242 NFRGWRGGRGRGR---SSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNA 298
RG GRG GR ++ CQVC K H +C+HR+D+NY +K G
Sbjct: 252 GGRGGGHGRGNGRGGTDNRPTCQVCHKKGHVVADCWHRYDENY------VPDEKLGG--- 302
Query: 299 FIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATC 354
A+ D +WY D+ A++H+T Q +K ++ G + + +G+ ++I +
Sbjct: 303 -AATHAYGVDTNWYVDTEATDHITGQLDKLTTREKYKGTDQIHTASGEGMDIQHIGHSYV 361
Query: 355 SSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ + L+L ++L+VP +KNL+SV +L ADN F+E
Sbjct: 362 PTSSRPLHLKNILHVPKASKNLISVHRLVADNYAFLE 398
>UniRef100_Q94DD5 Putative gag/pol polyprotein [Oryza sativa]
Length = 1449
Score = 198 bits (503), Expect = 3e-49
Identities = 125/410 (30%), Positives = 219/410 (52%), Gaps = 47/410 (11%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT------SSDSSKSNNP 67
+++S KL +NNY LWK+ VL VRG +L+G++ GT P I+ ++++ NP
Sbjct: 10 NTISEKLAKNNYALWKAQVLASVRGARLEGHLTGTTAAPAITISVPGEKEGDKATRAANP 69
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
A++EW A DQ++LG +L++++ ++ Q+ C T+ W + + + TR++ I +
Sbjct: 70 AYDEWVATDQQILGLLLSTLSKDVLAQVATCGTAAAAWSMLEEMYTSMTRARFINTRIAL 129
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHT 186
+ +KG++ + +Y+ KM+ L D + AG + + DLI + GLD Y P++ +
Sbjct: 130 SNTKKGDLSITEYVAKMRALGDDMTAAGKVVDDEDLISYIIAGLDDTYEPVISSIVGKSE 189
Query: 187 TLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANK---------------FDHRDNR 231
+S+ + +QLL+FE R NNL + ++AN+AN+ ++R
Sbjct: 190 PMSFGEAFSQLLSFEQR----NNLRH-GGESSANLANRGRGTTGGNGGQRGRGGNNRGRG 244
Query: 232 FNSNNNW--RGSNFR---GWRGGR-GRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSN 285
N NN RG R G+ GGR G G ++ CQ+C K HT INC++R+D+++
Sbjct: 245 GNGGNNSANRGKGGRGNGGFNGGRQGGGVDTRPKCQLCYKRGHTVINCWYRYDEDFVPDE 304
Query: 286 YSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
A S A+ + D +WY D+GA++HVT + K + G + + +G
Sbjct: 305 KYAGS----------AATSYGIDTNWYVDTGATDHVTGELEKLIVRDRYKGHDQVHTASG 354
Query: 346 DKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+EI S +K+ ++L ++L+VP KNL+S +L +DN+ F+E
Sbjct: 355 AGMEISHIGHSIVKTPSRDIHLRNILHVPKANKNLVSAQRLVSDNSAFME 404
>UniRef100_Q7XKV9 OSJNBa0073E02.10 protein [Oryza sativa]
Length = 1131
Score = 187 bits (474), Expect = 8e-46
Identities = 128/409 (31%), Positives = 205/409 (49%), Gaps = 33/409 (8%)
Query: 1 MASAANNNKND---LPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT 57
MAS++++ + L VS KL R+N+ +W++ VLP VRG +L GY+ GTK+ P IT
Sbjct: 1 MASSSSSTLSSSAVLGHPVSEKLSRDNFLVWRAQVLPAVRGAQLTGYLDGTKEVPSPEIT 60
Query: 58 SSDSSKSNN----PAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAG 113
+ + + R W + +L L + +
Sbjct: 61 VEKKPIHSGLRMINKYSGIYSRRFREKFWC--------KSHILRVPDKSGL--QFNEMFS 110
Query: 114 AHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDS 173
+ ++++II ++++ KG+ Y KMK LAD++ AG + + D++ L GLD+
Sbjct: 111 SQSKARIIQIRAQLARELKGDSSAAAYFTKMKGLADEMAAAGKKLDDDDIVSYILGGLDA 170
Query: 174 EYNPIVVKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDH-RDNRF 232
+YNP+V +S +S DL AQLL+FE+ LNN + +++AN A++ R
Sbjct: 171 DYNPLVASVSSKDYISLSDLYAQLLSFEA---HLNNQSEGGYHSSANSASRGGRGRGQGR 227
Query: 233 NSNNNWRGSNF------RGWRGGRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNY 286
GSNF RG RGGRGRG S+ CQ+CGK HT C+ RFD+++S ++
Sbjct: 228 GRGRGGFGSNFGSGFGGRG-RGGRGRGDGSRPSCQLCGKEGHTVHTCWKRFDRSFSGNDV 286
Query: 287 SADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGD 346
Q + +A S V D +WY D+ A++H+T + K +HG + N
Sbjct: 287 IFQQHHQQAKSASAVSSYGV-DTNWYLDTAATDHITGELKKLTTKERYHGNEQVHAANSA 345
Query: 347 KLEIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
+ I + ++L L++VLY+P KNL+S +LA DN+ FVE
Sbjct: 346 GMSISHIGRTIFHTPNRNLALNNVLYIPKAKKNLVSAHRLAYDNHAFVE 394
>UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]
Length = 1415
Score = 186 bits (473), Expect = 1e-45
Identities = 129/411 (31%), Positives = 202/411 (48%), Gaps = 44/411 (10%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEE---FITSSDSSKSNNPAFE 70
SSV++KL +NY LWK+ ++ KL G++ G P + + +S+ NP +E
Sbjct: 15 SSVTLKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYE 74
Query: 71 EWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQI---IYLKSEF 127
W DQ + W+ +++ E+ + + TS+Q+W SLA +S + L+
Sbjct: 75 SWFCTDQLVRSWLFGTLSEEVLGHVHNLSTSRQIW---VSLAENFNKSSVAREFSLRQNL 131
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVV----KLS 183
+ K E Y + K + D L G P+ S I LNGL +Y+PI LS
Sbjct: 132 QLLSKKEKPFSVYCREFKTICDALSSIGKPVDESMKIFGFLNGLGRDYDPITTVIQSSLS 191
Query: 184 DHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRG--- 240
T ++ D+ +++ F+S+++ ++ + N+ + + ++N N RG
Sbjct: 192 KLPTPTFNDVVSEVQGFDSKLQSYEEAASVTPHLAFNI-ERSESGSPQYNPNQKGRGRSG 250
Query: 241 -SNFRGWRGGRGRGRSS----------KAPCQVCGKTNHTAINCFHRFDKNYSRSNYSAD 289
+ RG RGRG S + CQ+CG+T HTA+ C++RFD NY
Sbjct: 251 QNKGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNRFDNNY-------- 302
Query: 290 SDKQGSHNAFIASQNSVED-YDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKL 348
Q AF + S + +W+ DS A+ HVT TN Q TE+ G ++++VG+G L
Sbjct: 303 ---QAEIQAFSTLRVSDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYL 359
Query: 349 EIVATCSSKLKSLN----LDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
I T S+ +KS N L++VL VPNI K+LLSVSKL D V FD N
Sbjct: 360 PITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDAN 410
>UniRef100_Q6F356 Putative polyprotein [Oryza sativa]
Length = 1256
Score = 186 bits (471), Expect = 2e-45
Identities = 112/369 (30%), Positives = 194/369 (52%), Gaps = 45/369 (12%)
Query: 1 MASAANNNKNDLPSSVSV--KLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITS 58
MAS++ ++ + ++V KL + NY +WK VL V+RG +LD Y+ G K P I
Sbjct: 1 MASSSQSSASGSLGGITVTEKLSKGNYLIWKVQVLAVIRGARLDSYLTGATKKPSATIII 60
Query: 59 SDSSKS---NNPAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAH 115
+ K +NPA +EW ANDQ++LG++L +M+ ++ +Q+ C ++ LW + + +
Sbjct: 61 KKNEKEVEVSNPAVDEWIANDQQVLGYLLTTMSRDVLSQVATCSSAASLWSTIEGMFSSA 120
Query: 116 TRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEY 175
TR++ I K + +KG++ + +Y+ KM+ LAD+L +G P+ DLI + GLD ++
Sbjct: 121 TRARSINTKIALTNTKKGDLGIAEYVSKMRVLADELATSGKPVDEEDLISYIIAGLDEDF 180
Query: 176 NPIV---VKLSDHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRF 232
PI+ V S+H +L + +QLL+FE R++ + +AN+AN+ R N
Sbjct: 181 EPIISSLVSKSEHVSLG--EAYSQLLSFEQRMK-------MRQEHSANLANRGRGRGNPG 231
Query: 233 NSNNNWRGSNFRGWRGG--RGRGRSS-------------KAPCQVCGKTNHTAINCFHRF 277
NN + + GG RGRGR + + CQ+C K HT I+C++R+
Sbjct: 232 RGRNNKQPQQQQRGHGGNSRGRGRGNNSNQRQGGNGVDYRPKCQLCYKRGHTVIDCWYRY 291
Query: 278 DKNY-SRSNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHG 336
D+++ Y+ + G D +WY D+G ++HVT + K ++ G
Sbjct: 292 DEDFVPDEKYAGTTASYG------------VDSNWYVDTGTTDHVTGELEKLTIRDKYKG 339
Query: 337 KNSLVVGNG 345
++ + NG
Sbjct: 340 QDQVQTANG 348
>UniRef100_Q94H72 Putative gag-pol protein [Oryza sativa]
Length = 535
Score = 182 bits (462), Expect = 2e-44
Identities = 112/394 (28%), Positives = 205/394 (51%), Gaps = 35/394 (8%)
Query: 19 KLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKS-----NNPAFEEWQ 73
K + N+ LW + +L +RG +L+GY+ GT + P D K +NP + +W
Sbjct: 134 KFTKQNHSLWSAQILTTLRGAQLEGYITGTAEAPAAECEKEDGDKKVKTTISNPEYIKWF 193
Query: 74 ANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKG 133
DQ++LG++ +S++ E+ Q+ +T+ Q W + +++ I + + +KG
Sbjct: 194 TQDQQVLGFLFSSLSREVLQQVAGAKTAAQAWSMINDMFTCKSKAGAINVLLALTTTQKG 253
Query: 134 EMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL---SDHTTLSW 190
+ + +Y+ KM++L D++ AG P+ + +LI +NGL+S+++ V L + LS
Sbjct: 254 PISISEYIAKMRSLGDEMAGAGKPLDDEELIAYIINGLNSDFDATVEGLMATARIAPLSI 313
Query: 191 VDLQAQLLTFESRIEQLNNLTNLNLNATANVAN------KFDHRDNRFNSNNNWRG---S 241
+ +QLL++E+RI + L +AN AN + +R R ++ + RG
Sbjct: 314 SHVYSQLLSYENRI----RIRQAYLTTSANAANRGGRGGRGGNRGGRSSAPHGGRGGGRG 369
Query: 242 NFRGWRGGRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIA 301
N G GRGRG ++ CQVC K H A +C+HR+D+NY +K G A
Sbjct: 370 NTGGANPGRGRGNDTRPVCQVCHKRGHVASDCWHRYDENY------GPDEKLGG----AA 419
Query: 302 SQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIVATCSSKLKS- 360
+ D +WY D+ A++H+T Q +K ++ G + + +G+ + + + + +
Sbjct: 420 TYAYGVDTNWYVDTRATDHITGQLDKLTTREKYKGTDLIHTVSGEGMNVKHIVHTIVPTP 479
Query: 361 ---LNLDDVLYVPNITKNLLSVSKLAADNNIFVE 391
L+L ++L+VP +KNL+ V +L ADN F++
Sbjct: 480 SCPLHLKNILHVPQASKNLVFVHRLVADNYAFLD 513
>UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana]
Length = 1522
Score = 175 bits (443), Expect = 3e-42
Identities = 121/413 (29%), Positives = 200/413 (48%), Gaps = 40/413 (9%)
Query: 11 DLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEF--ITSSD-SSKSNNP 67
++ + V+V L++ NY LWKS + G L G++ G+ P + +T ++ +S+ NP
Sbjct: 10 NISNCVTVTLNQQNYILWKSQFESFLSGQGLLGFVTGSISAPAQTRSVTHNNVTSEEPNP 69
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
F W DQ + W+L S A ++ + +++C TS Q+W + + S++ L+
Sbjct: 70 EFYTWHQTDQVVKSWLLGSFAEDILSVVVNCFTSHQVWLTLANHFNRVSSSRLFELQRRL 129
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSD--- 184
++ K + ME +L +K++ D+L G+P+ I LNGL EY PI + +
Sbjct: 130 QTLEKKDNTMEVFLKDLKHICDQLASVGSPVPEKMKIFSALNGLGREYEPIKTTIENSVD 189
Query: 185 -HTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNF 243
+ +LS ++ ++L ++ R++ ++ + NV H D+ + NNN RG
Sbjct: 190 SNPSLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNVT----HSDSGYYHNNN-RGKGR 244
Query: 244 RGWRGG------RGRG-------------RSSKAPCQVCGKTNHTAINCFHRFDKNYSRS 284
G RGRG +S CQ+CGK H A+ C+HRFD +Y
Sbjct: 245 SNSGSGKSSFSTRGRGFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHRFDNSYQHE 304
Query: 285 NYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGN 344
+ I ++W DS AS HVT+ + Q +HG +S++V +
Sbjct: 305 DL-----PMALATMRITDVTDHHGHEWIPDSAASAHVTNNRHVLQQSQPYHGSDSIMVAD 359
Query: 345 GDKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVEFD 393
G+ L I T S + S + L +VL P+I K+LLSVSKL +D VEFD
Sbjct: 360 GNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVSKLTSDYPCSVEFD 412
>UniRef100_Q8W0X9 Putative copia-like retrotransposon Hopscotch polyprotein [Zea
mays]
Length = 1313
Score = 174 bits (441), Expect = 5e-42
Identities = 101/346 (29%), Positives = 171/346 (49%), Gaps = 44/346 (12%)
Query: 26 PLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSDSSKSNNPAFEEWQANDQRLLGWMLN 85
P K+ V +RG +L+GY+ G K P+E + KS NPAFEEW+A DQ++L ++L+
Sbjct: 2 PCGKAQVRAAMRGARLEGYLTGATKMPDEETVDNKGKKSPNPAFEEWEAKDQQILSYLLS 61
Query: 86 SMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMK 145
S++ E+ Q+ +T+ + W +++ + TR++ + L+ + +KG M + +Y KMK
Sbjct: 62 SISREVQIQVTSAKTAAEAWHSIEAMFASQTRARAVNLRLALSTTKKGSMTVAEYYTKMK 121
Query: 146 NLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHT-TLSWVDLQAQLLTFESRI 204
D++ AG P+ + +++ L GL+ E+ P+V L +S DL +QLL FE+++
Sbjct: 122 GYGDEMAAAGRPLQDEEMVEYILTGLEEEFLPMVSALVTRVDPISLEDLYSQLLNFETKL 181
Query: 205 EQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNFRGWRGGRGRGRSSKAP----- 259
+ + + +AN+A R N GS RG RGGRG R +A
Sbjct: 182 DLMRGGGEQH-QGSANMA-------GRGGRGNQRGGSGGRGQRGGRGSSRGGRASWSGGR 233
Query: 260 --------------------CQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAF 299
CQVC K HTA C+HRF++++ A + H
Sbjct: 234 QSNQGGYIRRSNNSSDERPVCQVCFKKGHTAARCWHRFEEDFVPDEKLAGAATNSYH--- 290
Query: 300 IASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
D +WY D+GA++H+T + K ++ G + + +G
Sbjct: 291 -------VDTNWYTDTGATDHITGELEKLSIREKYAGGDQIHTASG 329
>UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 174 bits (440), Expect = 7e-42
Identities = 128/411 (31%), Positives = 205/411 (49%), Gaps = 42/411 (10%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEE---FITSSDSSKSNNPAFE 70
SSV++KL+ +NY LWK+ ++ KL G++ G P + + +S+ NP +E
Sbjct: 15 SSVTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYE 74
Query: 71 EWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQI---IYLKSEF 127
+W DQ + W+ +++ E+ + + TS+Q+W SLA +S I L+
Sbjct: 75 DWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWI---SLAENFNKSSIAREFSLRRNL 131
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVV----KLS 183
+ K + + Y K + D L G P+ S I LNGL EY+PI LS
Sbjct: 132 QLLTKKDKSLSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLS 191
Query: 184 DHTTLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRG--- 240
++ D+ +++ F+S+++ ++ ++N + N + + ++NSN+ RG
Sbjct: 192 KLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNT-ERSNSGAPQYNSNSRGRGRSG 250
Query: 241 -SNFRGWRGGRGRGRS---SKAP-------CQVCGKTNHTAINCFHRFDKNYSRSNYSAD 289
+ RG RGRG S S +P CQ+CG+ HTAI C++RFD NY
Sbjct: 251 QNRGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNY-------- 302
Query: 290 SDKQGSHNAFIASQNSVE-DYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKL 348
+ AF A + S E +WY DS A+ H+T T+ Q+ T + G ++++VG+G L
Sbjct: 303 -QSEVPTQAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYL 361
Query: 349 EIV----ATCSSKLKSLNLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
I T SS ++ L++VL P I K+LLSVSKL D V FD N
Sbjct: 362 PITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDAN 412
>UniRef100_Q8H7X8 Putative gag-pol polyprotein [Oryza sativa]
Length = 1247
Score = 172 bits (437), Expect = 1e-41
Identities = 109/360 (30%), Positives = 188/360 (51%), Gaps = 43/360 (11%)
Query: 14 SSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFIT------SSDSSKSNNP 67
+++S KL +NNY LWK+ VL VRG +L+G++ GT P I+ +++ NP
Sbjct: 10 NTISEKLAKNNYALWKAQVLASVRGARLEGHLTGTTAAPAITISVPGEKEGDKATRVANP 69
Query: 68 AFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEF 127
A++EW A DQ++LG +L++++ ++ Q+ C T+ W + + + TR++ I +
Sbjct: 70 AYDEWVATDQQILGLLLSTLSRDVLAQVATCGTAATAWSMLEEMYTSMTRARFINTRIAL 129
Query: 128 HSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKL-SDHT 186
+ +KG++ + +Y+ KM+ L D + AG + N DLI + GLD Y P++ +
Sbjct: 130 SNTKKGDLSITEYVAKMRALGDDMTAAGKVVDNEDLISYIIAGLDDTYEPVISSIVGKSE 189
Query: 187 TLSWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANK---------------FDHRDNR 231
+S+ + +QLL+F EQ NNL + ++AN+AN+ ++R
Sbjct: 190 PMSFGEAFSQLLSF----EQCNNLRH-GGESSANLANRGCGTTGGNGGQRGRGGNNRGRG 244
Query: 232 FNSNNNW--RGSNFR---GWRGGR-GRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSN 285
N NN RG R G+ GGR G G ++ CQ+C K HT INC++R+D+++
Sbjct: 245 GNGGNNSANRGKGGRGNGGFNGGRQGGGVDTRPKCQLCYKRGHTVINCWYRYDEDFVPDE 304
Query: 286 YSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNG 345
A S A+ + D +WY D+GA++HVT + K + G + + +G
Sbjct: 305 KYAGS----------AATSYGIDTNWYVDTGATDHVTGELEKLIVRDHYKGHDQVHTASG 354
>UniRef100_Q9SLL4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1402
Score = 172 bits (435), Expect = 3e-41
Identities = 116/416 (27%), Positives = 194/416 (45%), Gaps = 36/416 (8%)
Query: 11 DLPSSVSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPEEFITSSD----SSKSNN 66
++ + V+V L NY LWKS + G L G++ G+ P + SD +S S N
Sbjct: 10 NISNCVTVTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSVVSDIDGSTSASPN 69
Query: 67 PAFEEWQANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSE 126
P + W D+ + W+L S ++ + +++C TS ++W + + S++ L+
Sbjct: 70 PEYYTWFKTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSSRLFELQRR 129
Query: 127 FHSIRKGEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLSDHT 186
++ K + M++YL +K + D+L G+P++ I LNGL EY PI + +
Sbjct: 130 LQNVSKRDKSMDEYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPIKTTIENSM 189
Query: 187 TL----SWVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFD-HRDNRFNSNNNWRGS 241
S D+ +L ++ R++ T ++ + N+ D + FN+ N +G
Sbjct: 190 DALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYFNAYNRGKGK 249
Query: 242 NFRGWRGGRGRGR------------------SSKAPCQVCGKTNHTAINCFHRFDKNYSR 283
+ RG RGR + CQ+CGK H A+ C+HRF+
Sbjct: 250 SNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGHPALKCWHRFN----- 304
Query: 284 SNYSADSDKQGSHNAFIASQNSVEDYDWYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVG 343
++Y + + I +W DS A+ HVT+ Q +HG ++++V
Sbjct: 305 NSYQYEELPRALAAMRITDITDQHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVA 364
Query: 344 NGDKLEIVATCSSKLKS----LNLDDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
+G+ L I T S+ L S + L DVL P+ITK+LLSVSKL D VEFD +
Sbjct: 365 DGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSD 420
>UniRef100_Q9SV56 Hypothetical protein AT4g28900 [Arabidopsis thaliana]
Length = 1415
Score = 171 bits (432), Expect = 6e-41
Identities = 114/392 (29%), Positives = 190/392 (48%), Gaps = 39/392 (9%)
Query: 16 VSVKLDRNNYPLWKSLVLPVVRGCKLDGYMLGTKKCPE--EFITSSDS-SKSNNPAFEEW 72
V++KL NY LWK + +L G++ G CP I + D +++ NP F W
Sbjct: 17 VTLKLSTANYLLWKIQFETWLNNQRLLGFVTGANPCPNATRSIRNGDQVTEATNPDFLTW 76
Query: 73 QANDQRLLGWMLNSMATEMATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRK 132
NDQ+++GW+L S++ + + TS+++W + S+ L+ + + K
Sbjct: 77 VQNDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSDLQRRLNPVSK 136
Query: 133 GEMKMEDYLIKMKNLADKLKLAGNPISNSDLIIQTLNGLDSEYNPIVVKLS---DHTTLS 189
E M +YL +K + D+L G P+ ++ I LNGL EY + + D +S
Sbjct: 137 NEKSMLEYLNCVKQICDQLDSIGCPVPENEKIFGVLNGLGQEYMLVSTMIKGSMDTYPMS 196
Query: 190 WVDLQAQLLTFESRIEQLNNLTNLNLNATANVANKFDHRDNRFNSNNNWRGSNF-RGWRG 248
+ D+ +L+ F+ +++ + NR +N +G F +
Sbjct: 197 FEDVVFKLINFDDKLQNGQS------------------GGNRGRNNYTTKGRGFPQQISS 238
Query: 249 GRGRGRSSKAPCQVCGKTNHTAINCFHRFDKNYSRSNYSADSDKQGSHNAFIASQNSVED 308
G ++ CQ+C K H+A C+ RFD + ++S AF A + S +
Sbjct: 239 GSPSDSGTRPTCQICNKYGHSAYKCWKRFDHAFQSEDFS---------KAFAAMRVSDQK 289
Query: 309 YD-WYFDSGASNHVTHQTNKFQDLTEHHGKNSLVVGNGDKLEIV----ATCSSKLKSLNL 363
+ W DSGA++H+T+ T++ Q + G++S++VGN D L I A +S +L L
Sbjct: 290 SNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLPL 349
Query: 364 DDVLYVPNITKNLLSVSKLAADNNIFVEFDKN 395
DVL PNITK+LLSVSKL +D +EFD +
Sbjct: 350 RDVLVCPNITKSLLSVSKLTSDYPCVIEFDSD 381
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.316 0.130 0.391
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 756,671,546
Number of Sequences: 2790947
Number of extensions: 30119972
Number of successful extensions: 110599
Number of sequences better than 10.0: 684
Number of HSP's better than 10.0 without gapping: 168
Number of HSP's successfully gapped in prelim test: 532
Number of HSP's that attempted gapping in prelim test: 107714
Number of HSP's gapped (non-prelim): 1935
length of query: 482
length of database: 848,049,833
effective HSP length: 131
effective length of query: 351
effective length of database: 482,435,776
effective search space: 169334957376
effective search space used: 169334957376
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)
Medicago: description of AC144765.7