Miyakogusa Predicted Gene

Lj5g3v0528950.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0528950.1 Non Chatacterized Hit- tr|I3T0N3|I3T0N3_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.89,0,SUBFAMILY
NOT NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.53214.1
         (360 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G25265.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   612   e-175
AT2G25260.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   545   e-155
AT5G13500.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   495   e-140
AT5G13500.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   495   e-140
AT5G13500.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   495   e-140
AT3G01720.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    72   7e-13

>AT5G25265.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane,
           membrane; EXPRESSED IN: cultured cell, leaf; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:8754794-8756855 REVERSE LENGTH=366
          Length = 366

 Score =  612 bits (1577), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 299/366 (81%), Positives = 323/366 (88%), Gaps = 6/366 (1%)

Query: 1   MGCSGNLFFTILITFSVALITYNIIISGNAPLRQDFPGPSRRPTITIDPIIEMPL----R 56
           MGC G LF+ +LIT SVALITYNIIIS NAPL+Q FPG S    I+IDP+IE+P     R
Sbjct: 1   MGCGGTLFYPLLITLSVALITYNIIISANAPLKQGFPGRSSSSDISIDPVIELPRGGGSR 60

Query: 57  RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQAD--PDSSMGGFTRILHSGKPDA 114
            +     RLFHTAVTASDSVYNTWQCRVMY+WFKK QA   P S MGGFTRILHSGKPD 
Sbjct: 61  NNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMGGFTRILHSGKPDQ 120

Query: 115 FMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNL 174
           +MDEIPTFVAQPLPSGMDQGY+VLNRPWAFVQWLQQ DIKEDYILMSEPDHIIVKPIPNL
Sbjct: 121 YMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHIIVKPIPNL 180

Query: 175 AKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKESLKKIAPTWM 234
           AKDG+GAAFPFFYIEPKKYE VLRKY+PE  GPVTNIDPIGNSPVIVGK++LKKIAPTWM
Sbjct: 181 AKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIVGKDALKKIAPTWM 240

Query: 235 NVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKEIGKSYIIHY 294
           NVSLAMKKDPE DKAFGWVLEMYAYAV+SALHGV NIL+KDFMIQPPWD E+G  YIIHY
Sbjct: 241 NVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDIEVGDKYIIHY 300

Query: 295 TYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLVKMVNEATAS 354
           TYGCDY+MKG+LTYGKIGEWRFDKRSYD   PP+NLT+PPPGV +SVVTLVKM+NEATA+
Sbjct: 301 TYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSVVTLVKMINEATAN 360

Query: 355 IPNWYS 360
           IPNW S
Sbjct: 361 IPNWGS 366


>AT2G25260.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits
           to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr2:10755617-10757766 REVERSE LENGTH=358
          Length = 358

 Score =  545 bits (1405), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 261/361 (72%), Positives = 304/361 (84%), Gaps = 4/361 (1%)

Query: 1   MGCSGNLFFTILITFSVALIT-YNIIISGNAPLRQDFPGPSRRPTITIDPIIEMPLRRHS 59
           MG  G  FF IL+T S+ LI  YN I+S + PLRQ+ PG  RR   + D I    ++  S
Sbjct: 1   MGFRGKYFFPILMTLSLFLIIRYNYIVSDDPPLRQELPG--RRSASSGDDITYT-VKTPS 57

Query: 60  SSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTRILHSGKPDAFMDEI 119
             +KRLFHTAVTA+DSVY+TWQCRVMY+W+ +F+ +P S MGG+TRILHSG+PD  MDEI
Sbjct: 58  KKTKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDEI 117

Query: 120 PTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLAKDGM 179
           PTFVA PLPSG+D+GY+VLNRPWAFVQWLQQA I+EDYILM+EPDHIIVKPIPNLA+  +
Sbjct: 118 PTFVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGNL 177

Query: 180 GAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 239
            AAFPFFYIEPKKYE+VLRK+FP+ENGP++ IDPIGNSPVIV K +L KIAPTWMNVSLA
Sbjct: 178 AAAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSLA 237

Query: 240 MKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKEIGKSYIIHYTYGCD 299
           MK DP+TDKAFGWVLEMYAYAV+SALHGV NIL+KDFMIQPPWD E  K++IIHYTYGCD
Sbjct: 238 MKNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGCD 297

Query: 300 YNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLVKMVNEATASIPNWY 359
           ++MKG++  GKIGEWRFDKRSY    PP+NLTLPP GVPESVVTLV M+NEATA+IPNW 
Sbjct: 298 FDMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNWE 357

Query: 360 S 360
           S
Sbjct: 358 S 358


>AT5G13500.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr5:4338676-4340827 FORWARD
           LENGTH=358
          Length = 358

 Score =  495 bits (1274), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 228/313 (72%), Positives = 266/313 (84%), Gaps = 1/313 (0%)

Query: 47  IDPIIEMPLR-RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTR 105
           +DP+++MPL  R + SS   FH A+TA+D+ YN WQCR+MY+W+K+ +A P S MGGFTR
Sbjct: 43  LDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTR 102

Query: 106 ILHSGKPDAFMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDH 165
           ILHSG  D  MDEIPTFV  PLP G+D+GY+VLNRPWAFVQWL++A IKEDY+LM+EPDH
Sbjct: 103 ILHSGNSDNLMDEIPTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDH 162

Query: 166 IIVKPIPNLAKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKES 225
           + V P+PNLA  G  AAFPFFYI P+KYE ++RKY+P E GPVTNIDPIGNSPVI+ KES
Sbjct: 163 VFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKES 222

Query: 226 LKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKE 285
           L+KIAPTWMNVSL MK DPETDKAFGWVLEMY YA+ASA+HGVR+IL KDFM+QPPWD  
Sbjct: 223 LEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLS 282

Query: 286 IGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLV 345
               +IIHYTYGCDYNMKGELTYGKIGEWRFDKRS+    PP+N++LPPPGVPESVVTLV
Sbjct: 283 TKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLV 342

Query: 346 KMVNEATASIPNW 358
           KMVNEATA+IPNW
Sbjct: 343 KMVNEATATIPNW 355


>AT5G13500.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 228 Blast hits to 200 proteins in 21 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
           Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
           | chr5:4338676-4340827 FORWARD LENGTH=358
          Length = 358

 Score =  495 bits (1274), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 228/313 (72%), Positives = 266/313 (84%), Gaps = 1/313 (0%)

Query: 47  IDPIIEMPLR-RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTR 105
           +DP+++MPL  R + SS   FH A+TA+D+ YN WQCR+MY+W+K+ +A P S MGGFTR
Sbjct: 43  LDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTR 102

Query: 106 ILHSGKPDAFMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDH 165
           ILHSG  D  MDEIPTFV  PLP G+D+GY+VLNRPWAFVQWL++A IKEDY+LM+EPDH
Sbjct: 103 ILHSGNSDNLMDEIPTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDH 162

Query: 166 IIVKPIPNLAKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKES 225
           + V P+PNLA  G  AAFPFFYI P+KYE ++RKY+P E GPVTNIDPIGNSPVI+ KES
Sbjct: 163 VFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKES 222

Query: 226 LKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKE 285
           L+KIAPTWMNVSL MK DPETDKAFGWVLEMY YA+ASA+HGVR+IL KDFM+QPPWD  
Sbjct: 223 LEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLS 282

Query: 286 IGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLV 345
               +IIHYTYGCDYNMKGELTYGKIGEWRFDKRS+    PP+N++LPPPGVPESVVTLV
Sbjct: 283 TKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLV 342

Query: 346 KMVNEATASIPNW 358
           KMVNEATA+IPNW
Sbjct: 343 KMVNEATATIPNW 355


>AT5G13500.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 1807 Blast hits to 1807 proteins in 277 species:
           Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
           Plants - 385; Viruses - 0; Other Eukaryotes - 339
           (source: NCBI BLink). | chr5:4338676-4340827 FORWARD
           LENGTH=358
          Length = 358

 Score =  495 bits (1274), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 228/313 (72%), Positives = 266/313 (84%), Gaps = 1/313 (0%)

Query: 47  IDPIIEMPLR-RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTR 105
           +DP+++MPL  R + SS   FH A+TA+D+ YN WQCR+MY+W+K+ +A P S MGGFTR
Sbjct: 43  LDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTR 102

Query: 106 ILHSGKPDAFMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDH 165
           ILHSG  D  MDEIPTFV  PLP G+D+GY+VLNRPWAFVQWL++A IKEDY+LM+EPDH
Sbjct: 103 ILHSGNSDNLMDEIPTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDH 162

Query: 166 IIVKPIPNLAKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKES 225
           + V P+PNLA  G  AAFPFFYI P+KYE ++RKY+P E GPVTNIDPIGNSPVI+ KES
Sbjct: 163 VFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKES 222

Query: 226 LKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKE 285
           L+KIAPTWMNVSL MK DPETDKAFGWVLEMY YA+ASA+HGVR+IL KDFM+QPPWD  
Sbjct: 223 LEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLS 282

Query: 286 IGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLV 345
               +IIHYTYGCDYNMKGELTYGKIGEWRFDKRS+    PP+N++LPPPGVPESVVTLV
Sbjct: 283 TKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLV 342

Query: 346 KMVNEATASIPNW 358
           KMVNEATA+IPNW
Sbjct: 343 KMVNEATATIPNW 355


>AT3G01720.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 374 Blast hits to 211 proteins in 23 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316;
           Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink).
           | chr3:262412-265608 REVERSE LENGTH=802
          Length = 802

 Score = 71.6 bits (174), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 69/277 (24%), Positives = 114/277 (41%), Gaps = 46/277 (16%)

Query: 66  FHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTRIL---------HSGKPDAFM 116
            HT  +   + Y  WQ     H F++         G  TR+L         + G   A  
Sbjct: 394 IHTLFSTECTTYFDWQTVGFMHSFRQ-----SGQPGNITRLLSCTDEALKNYKGHDLAPT 448

Query: 117 DEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNL-- 174
             +P+    PL       Y  +N+P A V WL   +I  +Y+++ + D I+  PI     
Sbjct: 449 HYVPSMSRHPLTGDW---YPAINKPAAVVHWLHHTNIDAEYVVILDADMILRGPITPWEF 505

Query: 175 -AKDGMGAAFPFFYIEPKKYETV-LRKYFPEENGPVTNIDPIGNSPVIVGKESLKKIAPT 232
            A  G   + P+ Y+     +   L    PE    V  +       +I+  E L+K A  
Sbjct: 506 KAARGRPVSTPYDYLIGCDNDLARLHTRNPEACDKVGGV-------IIMHIEDLRKFAMY 558

Query: 233 WMNVSLAMKKDPE------TDKAF--GWVLEMYAYAVASALHGVRNILYKDFMIQPPWDK 284
           W+  +  ++ D E      T   +  GW+ EMY Y+  +A   +R+ + K+ MI P +  
Sbjct: 559 WLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPGYVP 618

Query: 285 EIGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSY 321
           E G  Y + + YG ++         K+G W FDK ++
Sbjct: 619 EPGADYRV-FHYGLEF---------KVGNWSFDKANW 645