Miyakogusa Predicted Gene
- Lj1g3v3964490.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3964490.1 Non Characterized Hit- tr|C1E834|C1E834_MICSR
Putative uncharacterized protein OS=Micromonas sp.
(st,41.9,5e-18,seg,NULL; Sas10_Utp3_C,Sas10 C-terminal domain;
Sas10_Utp3,Sas10/Utp3/C1D; SUBFAMILY NOT NAMED,NULL;,CUFF.31604.1
(661 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments:                          Score (bits)  E-value
AT2G43650.1 | Symbols: EMB2777 | Sas10/U3 ribonucleoprotein (Utp...           379    e-105
AT3G28230.1 | Symbols: | FUNCTIONS IN: molecular_function unkno...            106    4e-23
AT3G28230.2 | Symbols: | FUNCTIONS IN: molecular_function unkno...            106    4e-23
AT1G07840.1 | Symbols: | Sas10/Utp3/C1D family | chr1:2424603-2...             78    2e-14
AT1G07840.2 | Symbols: | Sas10/Utp3/C1D family | chr1:2424603-2...             78    2e-14
AT1G07840.3 | Symbols: | Sas10/Utp3/C1D family | chr1:2424603-2...             77    3e-14
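The hit summary lines above follow a regular pattern: subject ID, a truncated description, the bit score, and the E-value. A minimal sketch of reading one such line in Python follows. The helper name parse_hit_line and the regular expression are illustrative assumptions, not part of BLAST or any library, and the "e-105" handling assumes this BLAST version abbreviates 1e-105 that way.

```python
import re

# Hypothetical pattern: ID, " | ", free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|\s*(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_hit_line(line):
    """Split one hit summary line into a dict, or return None if it does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    evalue_text = match.group("evalue")
    # Assumption: a bare exponent such as "e-105" stands for "1e-105".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": float(evalue_text),
    }

if __name__ == "__main__":
    example = ("AT2G43650.1 | Symbols: EMB2777 | "
               "Sas10/U3 ribonucleoprotein (Utp... 379 e-105")
    print(parse_hit_line(example))
```

The description is kept exactly as printed, including the trailing "..." where BLAST truncated it; the full text is available only in the per-hit records further down.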
>AT2G43650.1 | Symbols: EMB2777 | Sas10/U3 ribonucleoprotein (Utp)
family protein | chr2:18099430-18103147 FORWARD
LENGTH=654
Length = 654
Score = 379 bits (973), Expect = e-105, Method: Compositional matrix adjust.
Identities = 272/657 (41%), Positives = 366/657 (55%), Gaps = 60/657 (9%)
Query: 34 DDEIDAFHKHRDIVPLDVNDDFGESDEDDELPIFXXXXXXXXXXXXXXXXXXXXXXXXXX 93
DDEIDAFHK RDIVPLDVNDD ESDEDD P+F
Sbjct: 27 DDEIDAFHKQRDIVPLDVNDDTDESDEDDVQPVFDLQGVDDESEEDEDTEDEEEAENGL- 85
Query: 94 XXXXVAKIIRQRNFLRAKFXXXXXXXXXXXXXXXXXXKHILGGRRS-AHGADIHNIELLS 152
AK+IRQ+ +LRAKF + GGR H D + ++LS
Sbjct: 86 ----TAKMIRQKKYLRAKFGDGDDEMADDDKDKEEDKRSTWGGRSGLYHSGDNVDFDILS 141
Query: 153 SDDEAPKEEEEIAMQIQREKARSLTMEDYDLDISEDKVKDKS-TLKDASDKGNREMKS-- 209
SDDE K EEE ++++ E+ S+T D LD ++ D+ T+++ SDKG + KS
Sbjct: 142 SDDEDIKAEEEEVIRLRAEQLGSITAADAGLDDDSEEDSDRELTMEEISDKGKQATKSIT 201
Query: 210 -----PDRDITFK--AEDLNALSKEEQMNLVYRCAPELIDWLSELNEAHKQLECKINPFL 262
D+D + +D+N+LSKEEQM++VY APE++ LSELN+A ++LE KINP +
Sbjct: 202 DKKEKGDKDTHVEEIKKDINSLSKEEQMDVVYSSAPEIVGLLSELNDAVEELESKINPVM 261
Query: 263 SKVQKGEIVMEGGVRYFELKQLLLLSYCQAITFYLLLKSEGHPVHDHPVVARLAEIRELL 322
+K+++GEI + G RY E+KQLLLL+YCQ+ITFYLLLKSEG P+ DHPV+ARL EI+ LL
Sbjct: 262 NKLKEGEISLNGLARYLEVKQLLLLTYCQSITFYLLLKSEGQPIRDHPVLARLVEIKSLL 321
Query: 323 DQIKQLDAKLPVGLEDILKES--NG-LETVVNSDNENAPI--TIDSITRNQELPLVSAES 377
D+IK+LD +LP G E+ L S NG ++ VV D +P+ ++D IT++
Sbjct: 322 DKIKELDEELPPGFEESLARSIANGAVQKVVKEDQLTSPVSDSVDRITQDT--------- 372
Query: 378 LEEAMPIKVDEIKKLDSSKDGVPKARKVKHQKDHIGMRSLEMLTVRASLAEKWXXXXXXX 437
A P+K+D ++ ++ K K KHQ D + ++S EML +RA+L K
Sbjct: 373 ---AKPMKID-----NAREEKKKKGEKRKHQNDLVDVQSEEMLKLRAALEGKLRTNGVLG 424
Query: 438 XXXXXXXXGLKRSRPDNSQHGTSVFD---DDA------VGPARLSQGLRKSNLKKPKVIS 488
KR + N + T FD DDA V +L++ + S +KPK IS
Sbjct: 425 STVSKSDKAQKRQKLANRKLET--FDDYVDDADNSTHNVTADKLTKLV--STKRKPKTIS 480
Query: 489 GDDDLPEKDDIGERRRKHELRVLASAXXXXXXXXXXXXXXXXXXXXXAKQAHVEAE---- 544
GDDDLP++DDIGERRRK ELRVLA A +
Sbjct: 481 GDDDLPQRDDIGERRRKFELRVLAGAGVKSSEGDGRNKNGAFASDDEDDNDGDNNDMVDN 540
Query: 545 --DSDNEFYEQVKQXXXXXXXXXXETYSRKSAASSLSETFSETIEGKRHITSQMEKNRGL 602
+S++EFY+QVKQ E YSRK L + E ++GKRHI++QM NRGL
Sbjct: 541 DGESEDEFYKQVKQKQQAKRAAKAEIYSRK---PHLMPSSPEHVDGKRHISNQMVSNRGL 597
Query: 603 TRIRNKDKKNPRKNYKMKHQKAVKNRKGQVQAIRKPTAPYGGEATGINASISRSVRF 659
TR RN+D KNPRK Y+ ++K V RKGQV+ IRK T PY GEA GIN + SRS+R
Sbjct: 598 TRQRNRDLKNPRKKYRKNYEKKVTRRKGQVRDIRKQTGPYAGEARGINPNTSRSIRM 654
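The statistics in the alignment header above can be roughly cross-checked by hand. The sketch below, in Python, recomputes the percentages from the raw counts and estimates an E-value from the bit score with the standard relation E ≈ m × n × 2^(-S), where m is the query length (661) and n is the database size in letters (14,482,855). The function names are illustrative assumptions. BLAST applies effective-length corrections that are not reproduced here, so the estimate is expected to match the reported value only to within about an order of magnitude.

```python
def percent_truncated(count, aligned_length):
    """Integer percentage, truncated; this matches the figures printed in this report."""
    return 100 * count // aligned_length

def approx_evalue(bit_score, query_len, db_len):
    """Rough E-value from a bit score, ignoring effective-length corrections."""
    return query_len * db_len * 2.0 ** (-bit_score)

if __name__ == "__main__":
    query_len = 661          # "(661 letters)" in the report header
    db_len = 14_482_855      # "14,482,855 total letters" in TAIR10_pep

    # Counts from the AT2G43650.1 alignment: identities, positives, gaps out of 657.
    for label, count in (("Identities", 272), ("Positives", 366), ("Gaps", 60)):
        print(label, percent_truncated(count, 657))

    # Bit scores of the first two distinct hits.
    for bits in (379, 106):
        print(bits, approx_evalue(bits, query_len, db_len))
```

For the 106-bit hit this estimate comes out near 1e-22, somewhat higher than the reported 4e-23, which is the direction expected when effective lengths shorter than the raw lengths are used.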
>AT3G28230.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: gene silencing; LOCATED IN: nucleus;
EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 6
growth stages; CONTAINS InterPro DOMAIN/s: Something
about silencing protein 10 (Sas10), C-terminal
(InterPro:IPR018972); BEST Arabidopsis thaliana protein
match is: Sas10/U3 ribonucleoprotein (Utp) family
protein (TAIR:AT2G43650.1); Has 374 Blast hits to 360
proteins in 175 species: Archae - 0; Bacteria - 4;
Metazoa - 107; Fungi - 115; Plants - 56; Viruses - 0;
Other Eukaryotes - 92 (source: NCBI BLink). |
chr3:10529314-10530199 FORWARD LENGTH=173
Length = 173
Score = 106 bits (265), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 78/124 (62%), Gaps = 3/124 (2%)
Query: 538 QAHVEAEDSDNEFYEQVKQXXXXXXXXXXETYSRKSAASSLSETFSETIEGKRHITSQME 597
Q ++EDS++EFY QVKQ E YSRK L + + ++G+R I++QM
Sbjct: 53 QKRQKSEDSEDEFYRQVKQKQEAKKAAKAEIYSRKPY---LIPSSPDLVDGRRLISNQMA 109
Query: 598 KNRGLTRIRNKDKKNPRKNYKMKHQKAVKNRKGQVQAIRKPTAPYGGEATGINASISRSV 657
NRGLTR RNKD KNPRK Y+ +H+K V NRKGQV+ IR PY GE GIN SRS+
Sbjct: 110 SNRGLTRKRNKDHKNPRKKYRDQHKKIVINRKGQVRDIRTQVGPYAGETRGINPYTSRSI 169
Query: 658 RFKS 661
R K+
Sbjct: 170 RIKN 173
>AT3G28230.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: gene silencing; LOCATED IN: nucleus;
EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 6
growth stages; CONTAINS InterPro DOMAIN/s: Something
about silencing protein 10 (Sas10), C-terminal
(InterPro:IPR018972); BEST Arabidopsis thaliana protein
match is: Sas10/U3 ribonucleoprotein (Utp) family
protein (TAIR:AT2G43650.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr3:10529314-10530199 FORWARD LENGTH=174
Length = 174
Score = 106 bits (265), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 78/124 (62%), Gaps = 3/124 (2%)
Query: 538 QAHVEAEDSDNEFYEQVKQXXXXXXXXXXETYSRKSAASSLSETFSETIEGKRHITSQME 597
Q ++EDS++EFY QVKQ E YSRK L + + ++G+R I++QM
Sbjct: 54 QKRQKSEDSEDEFYRQVKQKQEAKKAAKAEIYSRKPY---LIPSSPDLVDGRRLISNQMA 110
Query: 598 KNRGLTRIRNKDKKNPRKNYKMKHQKAVKNRKGQVQAIRKPTAPYGGEATGINASISRSV 657
NRGLTR RNKD KNPRK Y+ +H+K V NRKGQV+ IR PY GE GIN SRS+
Sbjct: 111 SNRGLTRKRNKDHKNPRKKYRDQHKKIVINRKGQVRDIRTQVGPYAGETRGINPYTSRSI 170
Query: 658 RFKS 661
R K+
Sbjct: 171 RIKN 174
>AT1G07840.1 | Symbols: | Sas10/Utp3/C1D family |
chr1:2424603-2426425 FORWARD LENGTH=312
Length = 312
Score = 78.2 bits (191), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 72/137 (52%)
Query: 221 LNALSKEEQMNLVYRCAPELIDWLSELNEAHKQLECKINPFLSKVQKGEIVMEGGVRYFE 280
+ ++K + +V + AP+L L E+ + K+ + V+ GG+ Y E
Sbjct: 1 MEEITKPVLVGIVKKEAPQLASVLREMKNVLDVVRSKVEALTALVKANSFPTAGGISYLE 60
Query: 281 LKQLLLLSYCQAITFYLLLKSEGHPVHDHPVVARLAEIRELLDQIKQLDAKLPVGLEDIL 340
K LLLLSYCQ + +Y+L K++G + HP+V L EIR L++I+ +D KL ++ +
Sbjct: 61 AKHLLLLSYCQDLVYYILRKAKGLSIDGHPLVRSLVEIRMFLEKIRPIDKKLQYQIQKLT 120
Query: 341 KESNGLETVVNSDNENA 357
+ + +S+ + +
Sbjct: 121 TAGGPVTELAHSEGKGS 137
>AT1G07840.2 | Symbols: | Sas10/Utp3/C1D family |
chr1:2424603-2426425 FORWARD LENGTH=312
Length = 312
Score = 78.2 bits (191), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 72/137 (52%)
Query: 221 LNALSKEEQMNLVYRCAPELIDWLSELNEAHKQLECKINPFLSKVQKGEIVMEGGVRYFE 280
+ ++K + +V + AP+L L E+ + K+ + V+ GG+ Y E
Sbjct: 1 MEEITKPVLVGIVKKEAPQLASVLREMKNVLDVVRSKVEALTALVKANSFPTAGGISYLE 60
Query: 281 LKQLLLLSYCQAITFYLLLKSEGHPVHDHPVVARLAEIRELLDQIKQLDAKLPVGLEDIL 340
K LLLLSYCQ + +Y+L K++G + HP+V L EIR L++I+ +D KL ++ +
Sbjct: 61 AKHLLLLSYCQDLVYYILRKAKGLSIDGHPLVRSLVEIRMFLEKIRPIDKKLQYQIQKLT 120
Query: 341 KESNGLETVVNSDNENA 357
+ + +S+ + +
Sbjct: 121 TAGGPVTELAHSEGKGS 137
>AT1G07840.3 | Symbols: | Sas10/Utp3/C1D family |
chr1:2424603-2426131 FORWARD LENGTH=279
Length = 279
Score = 77.4 bits (189), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 72/137 (52%)
Query: 221 LNALSKEEQMNLVYRCAPELIDWLSELNEAHKQLECKINPFLSKVQKGEIVMEGGVRYFE 280
+ ++K + +V + AP+L L E+ + K+ + V+ GG+ Y E
Sbjct: 1 MEEITKPVLVGIVKKEAPQLASVLREMKNVLDVVRSKVEALTALVKANSFPTAGGISYLE 60
Query: 281 LKQLLLLSYCQAITFYLLLKSEGHPVHDHPVVARLAEIRELLDQIKQLDAKLPVGLEDIL 340
K LLLLSYCQ + +Y+L K++G + HP+V L EIR L++I+ +D KL ++ +
Sbjct: 61 AKHLLLLSYCQDLVYYILRKAKGLSIDGHPLVRSLVEIRMFLEKIRPIDKKLQYQIQKLT 120
Query: 341 KESNGLETVVNSDNENA 357
+ + +S+ + +
Sbjct: 121 TAGGPVTELAHSEGKGS 137
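Each alignment in this report starts with a two-line statistics header ("Score = ..." and "Identities = ..."). A minimal Python sketch for extracting those numbers follows. The function name parse_hsp_header and the patterns are assumptions written against the lines shown in this report, not an official parser, and the Gaps field is treated as optional because the AT1G07840 alignments omit it.

```python
import re

SCORE_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>[^,\s]+)"
)
COUNT_RE = re.compile(r"(?P<name>Identities|Positives|Gaps)\s*=\s*(?P<num>\d+)/(?P<den>\d+)")

def parse_hsp_header(score_line, identities_line):
    """Return the statistics of one alignment as a dict (hypothetical helper)."""
    score = SCORE_RE.search(score_line)
    if score is None:
        raise ValueError("not a Score line: %r" % score_line)
    evalue_text = score.group("evalue")
    # Assumption: a bare exponent such as "e-105" stands for "1e-105".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    stats = {
        "bits": float(score.group("bits")),
        "raw_score": int(score.group("raw")),
        "evalue": float(evalue_text),
        "gaps": 0,  # default when the report prints no Gaps field
    }
    for m in COUNT_RE.finditer(identities_line):
        stats[m.group("name").lower()] = int(m.group("num"))
        stats["aligned_length"] = int(m.group("den"))
    return stats

if __name__ == "__main__":
    print(parse_hsp_header(
        "Score = 78.2 bits (191), Expect = 2e-14, Method: Compositional matrix adjust.",
        "Identities = 39/137 (28%), Positives = 72/137 (52%)",
    ))
```

Combined with the query coordinates on the "Query:" lines, these values are enough to compare the hits, for example to see that the AT1G07840 isoforms align only to query residues 221-357 while AT2G43650.1 covers residues 34-659.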