
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147000.15 - phase: 0 /pseudo
(485 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_O81903 Putative transposable element [Arabidopsis thal... 90 1e-16
UniRef100_Q8LNW7 Putative polyprotein [Oryza sativa] 87 1e-15
UniRef100_Q6L4V3 Putative polyprotein [Oryza sativa] 83 2e-14
UniRef100_Q9SHR5 F28L22.3 protein [Arabidopsis thaliana] 79 2e-13
UniRef100_Q9AUZ1 Polyprotein, putative [Arabidopsis thaliana] 78 7e-13
UniRef100_Q9ZQE4 Copia-like retroelement pol polyprotein [Arabid... 75 3e-12
UniRef100_Q9LJS7 Copia-like retroelement pol polyprotein-like [A... 64 1e-08
UniRef100_Q6L4X8 Putative polyprotein [Oryza sativa] 63 2e-08
UniRef100_O81449 T27D20.5 protein [Arabidopsis thaliana] 62 5e-08
UniRef100_Q02902 Orf 3 [Arabidopsis thaliana] 61 8e-08
UniRef100_Q7XP09 OSJNBb0013J13.11 protein [Oryza sativa] 60 2e-07
UniRef100_Q75HA9 Putative polyprotein [Oryza sativa] 57 1e-06
UniRef100_Q94LG0 Putative retroelement pol polyprotein [Oryza sa... 56 2e-06
UniRef100_Q7XQV8 OSJNBb0050N09.11 protein [Oryza sativa] 55 3e-06
UniRef100_Q8S757 Putative retroelement [Oryza sativa] 55 3e-06
UniRef100_Q8S6N1 Putative gag-pol polyprotein [Oryza sativa] 55 6e-06
UniRef100_Q75KL7 Putative polyprotein [Oryza sativa] 55 6e-06
UniRef100_Q8S0V1 Putative gag-pol polyprotein [Oryza sativa] 54 8e-06
UniRef100_Q7X8V4 OSJNBa0064M23.6 protein [Oryza sativa] 54 8e-06
UniRef100_Q9ZPU5 Putative retroelement pol polyprotein [Arabidop... 54 8e-06
>UniRef100_O81903 Putative transposable element [Arabidopsis thaliana]
Length = 1308
Score = 90.1 bits (222), Expect = 1e-16
Identities = 96/353 (27%), Positives = 165/353 (46%), Gaps = 38/353 (10%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV**FF 60
F++ CK +GI RH+ TP++N + RM+R +ME+ CL++ + +FW E
Sbjct: 567 FDEFCKQNGIERHRTCTYTPQQNGVAK-RMNRTLMEKVRCLLNESGLEEVFWAEAAATAA 625
Query: 61 LIS*IDSLTLNLV*RH*KRCGWSCSKP*-KYESFWLCPLYAHIKRTILN-QGLKLYVFEV 118
+ ++ + V + W KP K+ + C Y H+ + L + LK
Sbjct: 626 YL--VNRSPASAVDHNVPEELWLDKKPGYKHLRRFGCIAYVHLDQGKLKPRALKGVFLGY 683
Query: 119 SLRCKRVQIWCMEKGET*VLINKDVVFKESEMFYSKETSS*H--QP*LDCEERVQFKVE- 175
K ++W +++ + +I++++VF E++++ SS + D E +F+V
Sbjct: 684 PQGTKGYKVWLLDEEKC--VISRNIVFNENQVYKDIRESSEQSVKDISDLEGYNEFQVSV 741
Query: 176 ------SLHGSTNLDAKVKDKD*NDDTTPLFFLMENVAIRF*ISFSSSFRFKLRTSTS-- 227
S G ++ ++ D + T E + +S S R + R + +
Sbjct: 742 KEHGECSKTGGVTIEEIDQESDSENSVT-----QEPLIASIDLSNYQSARDRERRAPNPP 796
Query: 228 LSLSSNTTLEN*L**KK*VVVSGEIDGDEPVN*DGFVNASKSREWI---ASMKEELSSLH 284
L+ T L V++ EI+ +EP + +A K + WI MKEE+ SL
Sbjct: 797 QKLADYTHFALAL------VMAEEIESEEP---QCYHDAKKDKHWIKWNGGMKEEIDSLL 847
Query: 285 KNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD---*NSRFKARGFT*REGID 334
KNGTW +V PK K +SC LFKLK G+ ++ +R ARGFT ++GID
Sbjct: 848 KNGTWDIVEWPKEQKVISCRWLFKLKPGIPGVEAQRYKARLVARGFTQQKGID 900
Score = 52.0 bits (123), Expect = 4e-05
Identities = 28/65 (43%), Positives = 40/65 (61%)
Query: 405 YLNETMHKYVDYMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRD 464
Y+ E Y++L +DDML+A K E K+K L ++F+MKDM AAS IL I+R+
Sbjct: 1003 YVKEVNPGEFVYLLLYVDDMLLAAKSKSEISKLKEALSLKFEMKDMGAASRILGIDIIRN 1062
Query: 465 ISKGT 469
+GT
Sbjct: 1063 RKEGT 1067
>UniRef100_Q8LNW7 Putative polyprotein [Oryza sativa]
Length = 1280
Score = 86.7 bits (213), Expect = 1e-15
Identities = 98/361 (27%), Positives = 156/361 (43%), Gaps = 62/361 (17%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV**FF 60
F CK GIV H + TP++N + RM+ I+ + C++SNA+ FW E V
Sbjct: 606 FKSYCKSEGIVHHYTVPHTPQQNGVAE-RMNMAIISKARCMLSNADLPKQFWAEAV---- 660
Query: 61 LIS*IDSLTLNLV*RH*KRCG--------WSCSKP*KYESFWL--CPLYAHIKRTILN-Q 109
S T L+ R WS S P Y + C YAH+ L +
Sbjct: 661 ------STTCYLINRSPSYATDKKTPIEVWSGS-PANYSDLRVFGCTAYAHVDNGKLEPR 713
Query: 110 GLKLYVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMFYSKETSS*HQP*LDCEER 169
+K K ++WC E + V+I+++VVF ES + + K +++ ++ +E+
Sbjct: 714 AIKCIFLGYPSGVKGYKLWCPETKK--VVISRNVVFHESVILHDKPSTN---VPVESQEK 768
Query: 170 VQFKVESLHGSTNLDAKVKDKD*NDDTTPLFFLMENVAIRF*IS-FSSSFRFKLRTSTSL 228
+VE L S + K ENVAI S ++ S+
Sbjct: 769 ASVQVEHLISSGHAPEK-----------------ENVAINQDAPVIEDSDSSIVQQSSKR 811
Query: 229 SLSSNTTLEN*L**KK*V----------VVSGEIDGD-EPVN*DGFVNASKSREWIASMK 277
S++ + N ++ + V+ EI+G+ EP + + WI +M
Sbjct: 812 SIAKDKPKRNIKPPRRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNRWITAMH 871
Query: 278 EELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*NSRFKAR----GFT*REGI 333
+E+ SL KN TW V PK K + C +FK KEG++ D +R+KAR G++ GI
Sbjct: 872 DEMESLKKNHTWEFVKLPKEKKPIRCKWIFKRKEGMSPSD-EARYKARLVAKGYSQIPGI 930
Query: 334 D 334
D
Sbjct: 931 D 931
Score = 46.2 bits (108), Expect = 0.002
Identities = 25/53 (47%), Positives = 32/53 (60%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA K E K+K L EF MKD+ AA IL +I R+ G
Sbjct: 1044 YLLLYVDDMLIAAKDKSEIAKLKAQLSSEFGMKDLGAAKKILGMEITRERHSG 1096
>UniRef100_Q6L4V3 Putative polyprotein [Oryza sativa]
Length = 1243
Score = 83.2 bits (204), Expect = 2e-14
Identities = 99/358 (27%), Positives = 158/358 (43%), Gaps = 56/358 (15%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV**F- 59
F CK GIV H TP++N + RM+R I+ + C++SNA FW E V
Sbjct: 543 FKSYCKSEGIVCHYTAPHTPQQNDVAE-RMNRTIISKARCMLSNAGLPKQFWAEAVSTAC 601
Query: 60 FLIS*-----IDSLTLNLV*RH*KRCGWSCSKP*KYESFWL--CPLYAHIKRTILN-QGL 111
+LI+ ID T V WS S P Y + C YAH+ L + +
Sbjct: 602 YLINRSPGYAIDKKTPIEV--------WSGS-PTNYSDLRVFGCTAYAHVDNGKLEPRAI 652
Query: 112 KLYVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMFYSKETSS*HQP*LDCEERVQ 171
K + K ++WC E + V+I+++VVF ES + + K +++ ++ +E+
Sbjct: 653 KCIFLGYASGVKGYKLWCPETKK--VVISRNVVFHESVILHDKPSTNVP---VESQEKAS 707
Query: 172 FKVESLHGSTNLDAKVKDKD*NDDTTPLFFLMENVAIRF*ISFSSSFRFKLRTSTSLSLS 231
+VE L S + K +D N D + S + S S++
Sbjct: 708 VQVEHLISSGHAPEK-EDVAINQDAPVI---------------EDSDSSIVHQSPKRSIA 751
Query: 232 SNTTLEN*L**KK*VV----------VSGEIDGD-EPVN*DGFVNASKSREWIASMKEEL 280
+ N ++ + V+ +I+G+ EP + + WI +M +E+
Sbjct: 752 KDKPKRNIKPPRRYIEEAKIVAYALSVAEKIEGNAEPSTYSEAIVSDDCNRWITAMHDEM 811
Query: 281 SSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*NSRFKAR----GFT*REGID 334
SL KN TW LV PK K + C +FK KEG++ D +R+KAR G++ GID
Sbjct: 812 ESLEKNHTWELVKLPKEKKPIRCKWIFKRKEGMSPSD-EARYKARLVAKGYSQIPGID 868
Score = 50.1 bits (118), Expect = 1e-04
Identities = 27/65 (41%), Positives = 38/65 (57%)
Query: 404 CYLNETMHKYVDYMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILR 463
C + + V Y++L +DDMLIA K E +K+K L EF+MKD+ AA IL +I R
Sbjct: 969 CVYLKVVDGSVIYLLLYVDDMLIAAKDKSEIEKLKAQLSSEFEMKDLGAAKKILGMEITR 1028
Query: 464 DISKG 468
+ G
Sbjct: 1029 ERHSG 1033
>UniRef100_Q9SHR5 F28L22.3 protein [Arabidopsis thaliana]
Length = 1356
Score = 79.3 bits (194), Expect = 2e-13
Identities = 92/370 (24%), Positives = 156/370 (41%), Gaps = 62/370 (16%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV**FF 60
F+ CK+ GI RH+ TP++N + RM+R IME+ CL++ + +FW E
Sbjct: 576 FDSYCKEHGIERHRTCTYTPQQNGVAE-RMNRTIMEKVRCLLNKSGVEEVFWAEAA---- 630
Query: 61 LIS*IDSLTLNLV*RH*KRCGWSCSKP*KYESFWLC--PLYAHIKR------------TI 106
+ L+ R S E WL P Y H+++ +
Sbjct: 631 ------ATAAYLI----NRSPASAINHNVPEEMWLNRKPGYKHLRKFGSIAYVHQDQGKL 680
Query: 107 LNQGLKLYVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMFYS-----KETSS*HQ 161
+ LK + K ++W +E+ + +I+++VVF+ES ++ +T + +Q
Sbjct: 681 KPRALKGFFLGYPAGTKGYKVWLLEEEKC--VISRNVVFQESVVYRDLKVKEDDTDNLNQ 738
Query: 162 P*LDCEERVQFKVESLHGSTNLDAKVKDKD*---------NDDTTPLFFLMENVAIRF*I 212
E Q K GS + D + +++ + R +
Sbjct: 739 KETTSSEVEQNKFAEASGSGGVIQLQSDSEPITEGEQSSDSEEEVEYSEKTQETPKRTGL 798
Query: 213 SFSSSFRFKLRTS----TSLSLSSNTTLEN*L**KK*VVVSGEIDGDEPVN*DGFVNASK 268
+ R ++R + T + S+ T +VV EP + + +
Sbjct: 799 TTYKLARDRVRRNINPPTRFTEESSVTFA--------LVVVENCIVQEPQSYQEAMESQD 850
Query: 269 SREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*NSRFKAR--- 325
+W + +E+ SL KNGTW LV+KPK K + C LFKLK G+ ++ +RFKAR
Sbjct: 851 CEKWDMATHDEMDSLMKNGTWDLVDKPKDRKIIGCRWLFKLKSGIPGVE-PTRFKARLVA 909
Query: 326 -GFT*REGID 334
G+T REG+D
Sbjct: 910 KGYTQREGVD 919
Score = 47.8 bits (112), Expect = 7e-04
Identities = 28/58 (48%), Positives = 36/58 (61%), Gaps = 1/58 (1%)
Query: 411 HKYVDYMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
H ++ Y++L +DDMLIA K E ++VK L EF+MKDM AS IL I RD G
Sbjct: 1029 HDFI-YLLLYVDDMLIAGASKAEINRVKEQLSTEFEMKDMGGASRILGIDIYRDRKGG 1085
>UniRef100_Q9AUZ1 Polyprotein, putative [Arabidopsis thaliana]
Length = 855
Score = 77.8 bits (190), Expect = 7e-13
Identities = 96/368 (26%), Positives = 164/368 (44%), Gaps = 63/368 (17%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV**FF 60
F+ CK GI RHK TP++N + RM+R +ME+ CL++ + FW E
Sbjct: 259 FDSYCKKYGIERHKTCTYTPQQNGVAE-RMNRTVMEKVRCLLNESGLEEEFWAEV----- 312
Query: 61 LIS*IDSLTLNLV*RH*KRCGWSCSKP*KYESFWLC--PLYAHIKR------TILNQG-- 110
+ T + + P E WL P Y H++R ++QG
Sbjct: 313 ------ATTAVYIINRSPSAAIDHNVP---EELWLNRKPGYKHLRRLRAVAYVHVDQGKL 363
Query: 111 ----LKLYVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMFYS---KETSS*HQP* 163
+K K ++W +E+ + +I+++V+F+E ++ KET +
Sbjct: 364 KPRAIKGIFIGYPSGTKGYKVWLLEEQKC--VISRNVIFQEEVVYKDLNDKETVVKKE-- 419
Query: 164 LDCEERVQF----------KVESLHGSTNLDAKVKDKD*NDDTTPLFFLMENVAIRF*IS 213
+ R Q +V G T+++ + ++ D ND+ P + + S
Sbjct: 420 ---DIRTQTDNHLVISKTKEVSDQGGVTHIE-ECEESDENDEQEPETVNETDPTVESEGS 475
Query: 214 FSSSFRFKLRTSTSLSLSSNTTLEN*L**KK*VVVSGEIDGDEPVN*DGFVNASKSREWI 273
++ K R ++ + T E+ + +VV + +EP + + A++ +EW+
Sbjct: 476 LANYQLAKDRVRRQINPPARFTEESGV--AFALVVVESLSLEEP---ESYQEATQDKEWL 530
Query: 274 A---SMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*NSRFKAR----G 326
+ EE+ SL KNGTW LV+KP K + C LFKLK G+ ++ RFKAR G
Sbjct: 531 KWKNATHEEMDSLIKNGTWDLVDKPTNRKIIGCRWLFKLKSGIPGVE-PVRFKARLVAKG 589
Query: 327 FT*REGID 334
+T REG+D
Sbjct: 590 YTQREGVD 597
Score = 53.5 bits (127), Expect = 1e-05
Identities = 30/64 (46%), Positives = 40/64 (61%), Gaps = 1/64 (1%)
Query: 405 YLNETMHKYVDYMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRD 464
Y+ + +V Y++L +DDMLIA K E + +K L +EF+MKDM AS IL K I RD
Sbjct: 700 YVKQVEQGFV-YLLLYVDDMLIAGKSKAEVNMIKEQLSVEFEMKDMGPASRILGKDITRD 758
Query: 465 ISKG 468
KG
Sbjct: 759 RKKG 762
>UniRef100_Q9ZQE4 Copia-like retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1166
Score = 75.5 bits (184), Expect = 3e-12
Identities = 91/387 (23%), Positives = 168/387 (42%), Gaps = 70/387 (18%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV**FF 60
F+ +CK+ GI RH+ TP++N + RM++ IM+ +++ S FW E
Sbjct: 460 FDNLCKEDGIKRHRTCTYTPQQNGVSE-RMNKTIMDIVRSMLAETGMSQEFWAEAT--ST 516
Query: 61 LIS*IDSLTLNLV*RH*KRCGWSCSKP*-KYESFWLCPLYAHIKRTILNQGLKLYVFEVS 119
+ I+ + + W+ +KP + + C Y H+ + + VF +
Sbjct: 517 AVYLINRTPNSFIGFKLPEEVWTGTKPDLSHLRRFGCSAYVHVTQDKTSPRAVKGVF-MG 575
Query: 120 LRC--KRVQIWCMEKGET*VLINKDVVFKESEMFYS---------KETSS*HQP*LDCEE 168
C K ++W ++G+ +++VVF E+E++ +E ++ +
Sbjct: 576 YPCGIKGYRVWLPKEGKC--TTSRNVVFNETELYKDTLSSADERKEEAEKEYKKLKKARK 633
Query: 169 RVQFKVESLHG-------------------------STNLDAKVKDKD*----NDDTTPL 199
RV F + L G S NL+ +++ N+ +
Sbjct: 634 RVSFSHDLLRGPSTSCCDLDDSSSQGGETSSSSSESSENLEESEMNEEVVGSENEQSLDD 693
Query: 200 FFLMENVAIRF*ISFSSSFRFKLRTSTSLSLSSNTTLEN*L**KK*VVVSGEIDGDEPVN 259
+ L ++ R I S RF+ + +L++ LE +EP +
Sbjct: 694 YLLARDMKRRSNIRPPS--RFEDEDFVAYALATAEDLEE----------------EEPKS 735
Query: 260 *DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*N 319
+ + +SK ++W +MKEE+ S K+ TW L+ KP++ K + C +FKLK G+ ++
Sbjct: 736 YEEALKSSKRKQWENAMKEEMDSHKKSHTWDLIEKPEKQKLIGCKWIFKLKPGIPGVE-K 794
Query: 320 SRFKAR----GFT*REGID*CENYFLV 342
R+KAR GF+ +EGID E + LV
Sbjct: 795 QRYKARLVAKGFSQQEGIDYNEVFSLV 821
>UniRef100_Q9LJS7 Copia-like retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 348
Score = 63.5 bits (153), Expect = 1e-08
Identities = 35/96 (36%), Positives = 61/96 (63%), Gaps = 5/96 (5%)
Query: 251 EIDGDEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK 310
+++ +EP + + + SK ++W ++MKEE+ S K+ TW L+ KP++ K + C +FKLK
Sbjct: 199 DLEIEEPKSYEEAMKRSKRKQWESAMKEEMDSHQKSHTWDLIEKPEKQKLIGCKWIFKLK 258
Query: 311 EGVNYMD*NSRFK----ARGFT*REGID*CENYFLV 342
G+ ++ R+K A+GF+ +EGID E + LV
Sbjct: 259 PGIPGLE-KQRYKVRLVAKGFSQQEGIDYNEVFSLV 293
>UniRef100_Q6L4X8 Putative polyprotein [Oryza sativa]
Length = 1211
Score = 63.2 bits (152), Expect = 2e-08
Identities = 68/256 (26%), Positives = 115/256 (44%), Gaps = 40/256 (15%)
Query: 96 CPLYAHIKRTILN-QGLKLYVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMFYSK 154
C YAH+ L + +K K ++WC + + V+I+++VVF ES M + K
Sbjct: 543 CTAYAHVDNGKLEPRAIKCIFLGYPSGVKDYKLWCPKTKK--VVISRNVVFHESVMLHDK 600
Query: 155 ETSS*HQP*LDCEERVQFKVESLHGSTNLDAKVKDKD*NDDTTPLFFLMENVAIRF*IS- 213
+++ ++ +E+ +VE L S + K E+VAI S
Sbjct: 601 PSTNVP---VESQEKASVQVEHLISSGHAPEK-----------------EDVAINQDASV 640
Query: 214 FSSSFRFKLRTSTSLSLSSNTTLEN*L**KK*VV----------VSGEIDGD-EPVN*DG 262
S ++ S S++ + N ++ + V+ EI+G+ EP
Sbjct: 641 IEDSDSSAVQQSPKRSIAKDKPKRNIKPPRRYIEEANIVAYALSVAEEIEGNAEPSTYSE 700
Query: 263 FVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*NSRF 322
+ + WI +M +E+ SL KN TW LV PK K + C +FK K+G++ D +R+
Sbjct: 701 AIVSDGCNRWITAMHDEMESLEKNHTWELVKLPKEKKPIRCKWIFKRKKGMSPSD-EARY 759
Query: 323 KAR----GFT*REGID 334
KAR G++ GID
Sbjct: 760 KARLVAKGYSQIPGID 775
Score = 49.3 bits (116), Expect = 2e-04
Identities = 25/53 (47%), Positives = 34/53 (63%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA K E +K+K L EF+MKD+ AA IL +I R+ G
Sbjct: 888 YLLLYVDDMLIAAKDKSEIEKLKAQLSSEFEMKDLGAAKKILGMEITRERHSG 940
>UniRef100_O81449 T27D20.5 protein [Arabidopsis thaliana]
Length = 1104
Score = 61.6 bits (148), Expect = 5e-08
Identities = 37/92 (40%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 246 VVVSGEIDGDEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI* 305
+V++ E++ +EPV +W M EE+ SL KN TW +V+KPK K +SC
Sbjct: 664 LVMAEEVESEEPVCFHDVKEDKDWEKWHGGMIEEMDSLLKNATWDIVDKPKNQKVISCHW 723
Query: 306 LFKLK---EGVNYMD*NSRFKARGFT*REGID 334
L+K K GV +R ARGF+ REGID
Sbjct: 724 LYKKKLGIPGVELPRYKARLVARGFSHREGID 755
>UniRef100_Q02902 Orf 3 [Arabidopsis thaliana]
Length = 494
Score = 60.8 bits (146), Expect = 8e-08
Identities = 32/83 (38%), Positives = 51/83 (60%), Gaps = 3/83 (3%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVN 314
+EP + + + ++W + EE+ SL KNGTW+LV+KPK K + C LFK+K G+
Sbjct: 69 EEPQSYQEATSNKEWKKWKLATHEEMDSLIKNGTWVLVDKPKDRKIIGCRWLFKMKSGIP 128
Query: 315 YMD---*NSRFKARGFT*REGID 334
++ +R A+G+T REG+D
Sbjct: 129 GVEPVRYKARLVAKGYTQREGVD 151
Score = 52.0 bits (123), Expect = 4e-05
Identities = 28/53 (52%), Positives = 35/53 (65%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA K E +KVK L +EF+MKDM AS IL I+RD G
Sbjct: 265 YLLLYVDDMLIAGKSKSEINKVKEQLSMEFEMKDMGPASRILGIDIIRDRKNG 317
>UniRef100_Q7XP09 OSJNBb0013J13.11 protein [Oryza sativa]
Length = 576
Score = 59.7 bits (143), Expect = 2e-07
Identities = 35/83 (42%), Positives = 49/83 (58%), Gaps = 5/83 (6%)
Query: 256 EPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNY 315
EP V + WI++M+EE+ SL KNGTW LV+ PK+ K + C +FK KEG++
Sbjct: 138 EPATYTEAVVSGDRENWISAMQEEMQSLEKNGTWKLVHLPKQKKPVHCKWIFKRKEGLSP 197
Query: 316 MD*NSRFKAR----GFT*REGID 334
+ RFKAR GF+ G+D
Sbjct: 198 SE-PPRFKARLVAKGFSQIAGVD 219
Score = 49.3 bits (116), Expect = 2e-04
Identities = 26/53 (49%), Positives = 34/53 (64%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA KE+ +K L EF MKD+ AA IL +I RDI+ G
Sbjct: 332 YLLLYVDDMLIAAKSKEQITTLKKQLSSEFDMKDLGAAKKILGMEITRDINFG 384
>UniRef100_Q75HA9 Putative polyprotein [Oryza sativa]
Length = 1322
Score = 57.0 bits (136), Expect = 1e-06
Identities = 33/83 (39%), Positives = 50/83 (59%), Gaps = 5/83 (6%)
Query: 256 EPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNY 315
EP V + +WI++++EE+ SL KNGTW LV+ PK+ K + C +FK KEG++
Sbjct: 805 EPATYTEAVVSGDREKWISAIQEEMQSLEKNGTWELVHLPKQKKPVRCKWIFKRKEGLSP 864
Query: 316 MD*NSRFK----ARGFT*REGID 334
+ RFK A+GF+ G+D
Sbjct: 865 SE-PPRFKVRLVAKGFSQIAGVD 886
Score = 51.6 bits (122), Expect = 5e-05
Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 17/158 (10%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV-**F 59
F+ C+ GIVRH I TP++N + RM+R I+ + C++SNA + FW E
Sbjct: 564 FDDYCRKEGIVRHHTIPYTPQQNGVAE-RMNRTIISKARCMLSNARMNKRFWAEAANTAC 622
Query: 60 FLIS*IDSLTLNLV*RH*KRCG---WSCSKP*KYESFWL--CPLYAHIKRTILN-QGLKL 113
+LI+ S+ LN K+ WS P Y + C YAH+ L + +K
Sbjct: 623 YLINRSPSIPLN------KKTPIEIWS-GMPADYSQLRVFGCTAYAHVDNGKLEPRAIKC 675
Query: 114 YVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMF 151
K ++W E +T ++++V+F E MF
Sbjct: 676 LFLGYGSGVKGYKLWNPETNKT--FMSRNVIFNEFVMF 711
Score = 47.8 bits (112), Expect = 7e-04
Identities = 25/53 (47%), Positives = 33/53 (62%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA KE+ +K L EF MKD+ AA IL +I RD + G
Sbjct: 999 YLLLYVDDMLIAAKSKEQITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSG 1051
>UniRef100_Q94LG0 Putative retroelement pol polyprotein [Oryza sativa]
Length = 1326
Score = 56.2 bits (134), Expect = 2e-06
Identities = 30/69 (43%), Positives = 43/69 (61%), Gaps = 1/69 (1%)
Query: 256 EPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNY 315
EP V + +WI++M+EE+ SL KNGTW LV+ PK+ K + C +FK KEG++
Sbjct: 773 EPATYTEAVVSGDREKWISAMQEEMQSLEKNGTWELVHLPKQKKPVRCKWIFKRKEGLSS 832
Query: 316 MD*NSRFKA 324
+ RFKA
Sbjct: 833 SE-PPRFKA 840
Score = 54.7 bits (130), Expect = 6e-06
Identities = 49/158 (31%), Positives = 74/158 (46%), Gaps = 17/158 (10%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV-**F 59
F+ C+ GIVRH I TP++N + RM+R I+ + C++SNA + FW E
Sbjct: 564 FDDYCRKEGIVRHHTIPYTPQQNGVAE-RMNRTIISKARCMLSNARMNKRFWAEAANTAC 622
Query: 60 FLIS*IDSLTLNLV*RH*KRCG---WSCSKP*KYESFWL--CPLYAHIKRTILN-QGLKL 113
+LI+ S+ LN K+ WS P Y + C YAH+ L + +K
Sbjct: 623 YLINRSPSIPLN------KKTPIEVWS-GMPADYSQLRVFGCTAYAHVDNGKLEPRAIKC 675
Query: 114 YVFEVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMF 151
KR ++W E +T + + VVF +S MF
Sbjct: 676 LFLGYGSGVKRYKLWNPETNKT--FMRRSVVFNKSVMF 711
Score = 49.3 bits (116), Expect = 2e-04
Identities = 26/53 (49%), Positives = 33/53 (62%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA KE+ +K L EF MKD+ AA IL KI RD + G
Sbjct: 910 YLLLYVDDMLIAAKSKEQITTLKKQLSSEFDMKDLGAAKKILGMKITRDRNSG 962
>UniRef100_Q7XQV8 OSJNBb0050N09.11 protein [Oryza sativa]
Length = 1405
Score = 55.5 bits (132), Expect = 3e-06
Identities = 31/87 (35%), Positives = 48/87 (54%), Gaps = 2/87 (2%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK--EG 312
D+P + + + +++S EW+ SMK+E+ S+ N W L PK K + C ++K K
Sbjct: 884 DDPTSYEEAMRSARSSEWLESMKDEMESMKLNDVWDLEEIPKGAKTVGCKWVYKTKYDSR 943
Query: 313 VNYMD*NSRFKARGFT*REGID*CENY 339
N +R A+GFT REGID E +
Sbjct: 944 GNIEKFKARLVAKGFTQREGIDYNETF 970
>UniRef100_Q8S757 Putative retroelement [Oryza sativa]
Length = 521
Score = 55.5 bits (132), Expect = 3e-06
Identities = 31/90 (34%), Positives = 52/90 (57%), Gaps = 2/90 (2%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK-EGV 313
D+P + + + +++S EW+ +MK+E+ + N W L PK K + C ++K K +
Sbjct: 403 DDPTSYEEAMRSARSSEWLEAMKDEIKLMKLNNVWDLEEIPKEAKTVGCKWVYKTKYDSR 462
Query: 314 NYMD-*NSRFKARGFT*REGID*CENYFLV 342
+M+ +R A+GFT REGID E + LV
Sbjct: 463 GHMEKFKARLVAKGFTQREGIDYNETFSLV 492
>UniRef100_Q8S6N1 Putative gag-pol polyprotein [Oryza sativa]
Length = 1787
Score = 54.7 bits (130), Expect = 6e-06
Identities = 30/87 (34%), Positives = 48/87 (54%), Gaps = 2/87 (2%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK--EG 312
D+P + + + +++S EW+ +MK+E+ S+ N W L PK K + C ++K K
Sbjct: 759 DDPTSYEEAMRSARSSEWLEAMKDEMKSMKLNNVWDLEEIPKGAKTVGCKWVYKTKYDSR 818
Query: 313 VNYMD*NSRFKARGFT*REGID*CENY 339
N +R A+GFT REGID E +
Sbjct: 819 GNIEKFKARLVAKGFTQREGIDYNETF 845
>UniRef100_Q75KL7 Putative polyprotein [Oryza sativa]
Length = 1362
Score = 54.7 bits (130), Expect = 6e-06
Identities = 30/87 (34%), Positives = 48/87 (54%), Gaps = 2/87 (2%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK--EG 312
D+P + + + +++S EW+ +MK+E+ S+ N W L PK K + C ++K K
Sbjct: 841 DDPTSYEEAMRSARSSEWLEAMKDEMKSMKLNNVWDLEEIPKGAKTVGCKWVYKTKYDSR 900
Query: 313 VNYMD*NSRFKARGFT*REGID*CENY 339
N +R A+GFT REGID E +
Sbjct: 901 GNIEKFKARLVAKGFTQREGIDYNETF 927
>UniRef100_Q8S0V1 Putative gag-pol polyprotein [Oryza sativa]
Length = 1397
Score = 54.3 bits (129), Expect = 8e-06
Identities = 30/87 (34%), Positives = 48/87 (54%), Gaps = 2/87 (2%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK--EG 312
D+P + + + +++S EW+ +MK+E+ S+ N W L PK K + C ++K K
Sbjct: 876 DDPTSYEEAMRSARSSEWLEAMKDEMKSMKLNDVWDLEEIPKGAKTVGCKWVYKTKYDSR 935
Query: 313 VNYMD*NSRFKARGFT*REGID*CENY 339
N +R A+GFT REGID E +
Sbjct: 936 GNIEKFKARLVAKGFTQREGIDYNETF 962
>UniRef100_Q7X8V4 OSJNBa0064M23.6 protein [Oryza sativa]
Length = 1148
Score = 54.3 bits (129), Expect = 8e-06
Identities = 30/87 (34%), Positives = 48/87 (54%), Gaps = 2/87 (2%)
Query: 255 DEPVN*DGFVNASKSREWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLK--EG 312
D+P + + + +++S EW+ +MK+E+ S+ N W L PK K + C ++K K
Sbjct: 764 DDPTSYEEAMRSARSSEWLEAMKDEMKSMKLNDVWDLEEIPKGAKTVGCKWVYKTKYDSR 823
Query: 313 VNYMD*NSRFKARGFT*REGID*CENY 339
N +R A+GFT REGID E +
Sbjct: 824 GNIEKFKARLVAKGFTQREGIDYNETF 850
>UniRef100_Q9ZPU5 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1335
Score = 54.3 bits (129), Expect = 8e-06
Identities = 28/67 (41%), Positives = 44/67 (64%), Gaps = 3/67 (4%)
Query: 271 EWIASMKEELSSLHKNGTWLLVNKPKRMKNMSCI*LFKLKEGVNYMD*N---SRFKARGF 327
+W A+MKEE+ S+ KN TW LV KP+++K + C +F K G+ ++ +R A+GF
Sbjct: 830 KWNAAMKEEMVSMSKNHTWDLVTKPEKVKLIGCRWVFTRKAGIPGVEAPRFIARLVAKGF 889
Query: 328 T*REGID 334
T +EG+D
Sbjct: 890 TQKEGVD 896
Score = 48.1 bits (113), Expect = 6e-04
Identities = 44/155 (28%), Positives = 74/155 (47%), Gaps = 11/155 (7%)
Query: 1 FNQICKDSGIVRHKQIRGTPRENCLDH*RMDRKIMERFICLMSNANNSTLFWDETV-**F 59
F + CK+ GIVRHK TP++N + R++R IM++ ++S + FW E
Sbjct: 553 FEKFCKEEGIVRHKTCAYTPQQNGIAE-RLNRTIMDKVRSMLSRSGMEKKFWAEAASTAV 611
Query: 60 FLIS*IDSLTLNLV*RH*KRCGWSCSKP*KYESF--WLCPLYAHIKRTILNQGLKLYVF- 116
+LI+ S +N K W+ + P S + C Y H + LN K +F
Sbjct: 612 YLINRSPSTAINFDLPEEK---WTGALP-DLSSLRKFGCLAYIHADQGKLNPRSKKGIFT 667
Query: 117 EVSLRCKRVQIWCMEKGET*VLINKDVVFKESEMF 151
K ++W +E + +I+++V+F+E MF
Sbjct: 668 SYPEGVKGYKVWVLEDKK--CVISRNVIFREQVMF 700
Score = 44.3 bits (103), Expect = 0.008
Identities = 24/53 (45%), Positives = 34/53 (63%)
Query: 416 YMILKIDDMLIACNIKEEFDKVKVML*IEFQMKDMRAAS*ILAKKILRDISKG 468
Y++L +DDMLIA K E +++K +L EF+MKD+ A IL +I RD G
Sbjct: 1010 YLLLYVDDMLIASANKSEVNELKQLLSREFEMKDLGDAKKILGMEISRDRDAG 1062
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.361 0.161 0.572
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 625,390,308
Number of Sequences: 2790947
Number of extensions: 20971563
Number of successful extensions: 76217
Number of sequences better than 10.0: 383
Number of HSP's better than 10.0 without gapping: 195
Number of HSP's successfully gapped in prelim test: 194
Number of HSP's that attempted gapping in prelim test: 75390
Number of HSP's gapped (non-prelim): 787
length of query: 485
length of database: 848,049,833
effective HSP length: 131
effective length of query: 354
effective length of database: 482,435,776
effective search space: 170782264704
effective search space used: 170782264704
T: 11
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.9 bits)
S2: 77 (34.3 bits)
Medicago: description of AC147000.15