Miyakogusa Predicted Gene

Lj5g3v2029570.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v2029570.1 Non Chatacterized Hit- tr|F4JSL7|F4JSL7_ARATH
Uncharacterized protein OS=Arabidopsis thaliana
GN=At4,36.1,6e-18,DUF4378,Domain of unknown function DUF4378;
VARLMGL,NULL; seg,NULL,CUFF.56443.1
         (674 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G51850.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   134   2e-31
AT5G62170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   113   4e-25
AT4G25430.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    89   1e-17

>AT5G51850.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G62170.1); Has 384 Blast hits to 375 proteins
           in 79 species: Archae - 0; Bacteria - 14; Metazoa - 135;
           Fungi - 31; Plants - 92; Viruses - 0; Other Eukaryotes -
           112 (source: NCBI BLink). | chr5:21079419-21081478
           FORWARD LENGTH=590
          Length = 590

 Score =  134 bits (338), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 157/501 (31%), Positives = 221/501 (44%), Gaps = 124/501 (24%)

Query: 122 PGTKTPTLVARLMGLDLLPDXXXXXXXXXXCLSTPNPHYHQNRTRQHIQIIKHRNSTDNV 181
           PG+KTP LVARLMGLDLLPD           L T + H+          I  HR S    
Sbjct: 115 PGSKTPNLVARLMGLDLLPDKTDLNHSLSD-LHTMSSHH----------ITSHRLSKKG- 162

Query: 182 STRSLPETPRISSARRSDVVEHRLSLQINKENMGLGEDLEGPRFSFSKRKYDENSSRSPS 241
            TRSLP +PRISSAR+SD   HRLSLQ+N+E            F  S+ K D+  S SP 
Sbjct: 163 -TRSLPVSPRISSARKSDFDIHRLSLQLNREK----------EFGRSRLKEDQEESHSPR 211

Query: 242 HYARQIVKQVKESV--SRKVGLDITNTVKNREQGREDVVNQSKFKKPTKISVKPLDETSP 299
            YARQIVKQ+KE V   R VG+DITN+VKNRE          + ++ T +S  P    S 
Sbjct: 212 DYARQIVKQIKERVVTRRVVGMDITNSVKNRE-----ARPSHELRRDTTVSCSPRTRFSE 266

Query: 300 GKHSNQSYSPRPSRFMDTKHKPN---TTKPSPIAPNYQNTKPSPSPPPMVNIEAELSRVL 356
            K + QS          T HKPN   +++P PI       KP P+P            +L
Sbjct: 267 -KENKQS----------TSHKPNSSSSSRPEPII-----QKPKPTPV-----------IL 299

Query: 357 TKPKPQALLQKELNNPKSVQKHKKPTQIRN--KP-PQTSIRNKQEESFITRSPSNTRAND 413
            + + Q  +++    P ++ K K  T+ R   KP P + IRN++ E+F++ S       D
Sbjct: 300 GEKQSQNRVKQRQLKPINLCK-KAETETRRPIKPSPTSDIRNRKRETFLSDS------RD 352

Query: 414 IKTKSKRTHXXXXXXXXXXXXXXXXXXXXKSNPSPPTIKIPQIQVKTQTQESDDIQEAKS 473
           +K K    H                    +    PP  +I + + +  + E+  I+    
Sbjct: 353 VKAKP--LHKIKKFKKIPKSNDLENISATR----PPHQQINERE-RLISNEAASIR---- 401

Query: 474 STQLFSSLRQSTLCTRGRTNDEDKANGVYTATGAGDEGPEYQYITTLLSRTGVHKATSLP 533
           S+ +  + + S    R    D+           A +   E  YI  +++  G+       
Sbjct: 402 SSSMHKTEKNSPQVARNHKFDD----------AATEINSEQDYIIRIMNLAGIKS----- 446

Query: 534 HHHFQWFSSTHPLDPLLFHRLEQH--YPLSNSFASSIESYRDCKFRQKNHLGPRCNRRLM 591
                   S   LD  +F +LE    YP S + A                LG  CNRRL+
Sbjct: 447 -------DSQAMLDLSIFRKLEHFGDYP-SGTLA----------------LG--CNRRLL 480

Query: 592 FDLVDELLSEILVRPKGKGQG 612
           FDLV+E+L E + + +G  QG
Sbjct: 481 FDLVNEILIETVAKRRGNYQG 501


>AT5G62170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G51850.1); Has 381 Blast hits to 359 proteins
           in 81 species: Archae - 0; Bacteria - 16; Metazoa - 101;
           Fungi - 21; Plants - 99; Viruses - 3; Other Eukaryotes -
           141 (source: NCBI BLink). | chr5:24973115-24975475
           REVERSE LENGTH=703
          Length = 703

 Score =  113 bits (283), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 133/372 (35%), Positives = 179/372 (48%), Gaps = 59/372 (15%)

Query: 61  IPNHHNTVPKGAEAPRNSLES--EDGTVTSISKEENFKIP---KIRTSGSTRGXXXXXXX 115
           I +HH  +PKG +APRNSLES  E+ + +   K+ N  I    KI+T    R        
Sbjct: 61  INHHHLHLPKGVDAPRNSLESTEEETSFSPTRKDGNLNISMGIKIKTKPQARSSSASLTP 120

Query: 116 XXXXXXPGTKTPTLVARLMGLDLLPDXXXXXXXXXXC-------LSTPNPHYHQNRTRQH 168
                 P  KTPTLVARLMGLDL+PD                  L TP    H    ++H
Sbjct: 121 TETYS-PSIKTPTLVARLMGLDLVPDNYRSSPTPSSSSSSTLIDLKTPTRSSH---AKKH 176

Query: 169 IQIIKHRNSTDNVSTRSLPETPRISSARRS---DVVEH-RLSLQINKENMGLGEDLEGP- 223
                 RNS D   TRSLPETPRIS  RRS   +  EH R SL +   N+ +  + E   
Sbjct: 177 RHYSLQRNSVDG-GTRSLPETPRISLGRRSVDVNCYEHQRSSLHLRDNNINVFPERESGI 235

Query: 224 ---RFSFSKRKYDENSSRSPSHYARQIVKQVKESVS--RKVGLDITNTVKNREQGREDVV 278
              R +  K  +++  +RSP  YARQIV Q+KE+VS  R++G DITN      Q RE  V
Sbjct: 236 NNVRLTRVKEIHEDKENRSPREYARQIVMQLKENVSRRRRMGTDITN---KETQPRE--V 290

Query: 279 NQSKFKKPTKISVKPLDETSPGKHSNQSYSPRPSRFMDTKHKPNTTKPSPIAPNYQNTKP 338
           ++SK K  +K ++   D +S         SPR    +     P  TKP+ +  N   +K 
Sbjct: 291 HESK-KASSKTTIITHDVSS---------SPR----LGLTEVPK-TKPTSLQTNNVASKI 335

Query: 339 SPSPPPMVNIEAELSRVLTKPKPQALLQKELNNPKSVQKHKKPTQIRN---KPPQTSIRN 395
             +    V  +  L  V  +P+     +KE    KS +K KKP   ++   KPPQ+    
Sbjct: 336 LETTAMKVQDKTRLPTVHEEPQGT---EKE-KQRKSTKKCKKPENFKSRLVKPPQS---- 387

Query: 396 KQEESFITRSPS 407
            QEE F+ RSP+
Sbjct: 388 MQEEPFV-RSPA 398



 Score = 90.1 bits (222), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 57/148 (38%), Positives = 86/148 (58%), Gaps = 29/148 (19%)

Query: 511 GPEYQYITTLLSRTGVHKATSLPHHHFQWFSSTHPLDPLLFHRLEQHYPLSNSFASSIES 570
           G E +YIT  L RTG+ + T  P  + +WFS +HPLDP +F+ LE H+ ++++       
Sbjct: 486 GGELEYITRTLRRTGIDRDT--PISYAKWFSPSHPLDPSIFYFLE-HFAVTST------- 535

Query: 571 YRDCKFRQKN---HLGPRCNRRLMFDLVDELLSEIL-----VRP-KGKGQGKSHRGL--- 618
                 R +N   +L  RCNR+L+F LVDE+L++IL     ++P       +S R L   
Sbjct: 536 ------RPRNSPENLSLRCNRKLLFHLVDEILADILKPHINLKPWVCHYPIRSQRNLKGS 589

Query: 619 -LLETVWKRVRSFPRAKCEVLEDIDGLI 645
            L++ + +R+  FP AKC VLEDID L+
Sbjct: 590 ELIDELSRRIERFPLAKCLVLEDIDALV 617


>AT4G25430.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G51850.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:12998571-13000211 FORWARD LENGTH=459
          Length = 459

 Score = 88.6 bits (218), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/212 (38%), Positives = 108/212 (50%), Gaps = 39/212 (18%)

Query: 70  KGAEAPRNSLE-SEDGTVTSISKEENFKIPK----IRTSG--STRGXXXXXXXXXXXXXP 122
           KG  APRNSL+ SE+  +++     N+K+ +    I   G  ST               P
Sbjct: 62  KGLVAPRNSLDLSEESPLST-----NYKLEREGLNISVGGKKSTLRGLLVDTPSHNCNLP 116

Query: 123 GTKTPTLVARLMGLDLLPDXXXXXXXXXXCLSTPNPHYHQNRTRQHIQIIKHRNSTDNVS 182
            TKTP +VARLMGLDLLPD             T +P   +N  R       HR S +   
Sbjct: 117 RTKTPNVVARLMGLDLLPDNLEL---------TRSP---RNGVR------GHRLSGNGSG 158

Query: 183 TRSLPETPRISSARRSDVVEHRLSLQINKENMGLGEDLEGPRFSFSKRKYDENSSRSPSH 242
           TRSLP +PRIS    SD   HRLSL++N+EN    +  E  R    + K DE S  SP +
Sbjct: 159 TRSLPASPRIS----SDSENHRLSLELNREN---NKHEEFVRTRLKELKQDEQSP-SPRY 210

Query: 243 YARQIVKQVKESV-SRKVGLDITNTVKNREQG 273
             RQIVKQ K+ V +RK G+D+TN ++ +  G
Sbjct: 211 SGRQIVKQTKKRVTTRKFGMDVTNLLEKKRAG 242