Miyakogusa Predicted Gene

Lj0g3v0066399.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0066399.1 Non Chatacterized Hit- tr|C6TI18|C6TI18_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.55406
PE,82.69,0,seg,NULL; DUF4057,Domain of unknown function DUF4057;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NUL,CUFF.3127.1
         (283 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G78150.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   342   2e-94
AT1G78150.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   342   2e-94
AT1G35780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   328   2e-90
AT1G78150.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   327   7e-90
AT4G39860.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   297   6e-81
AT4G39860.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   290   6e-79
AT2G22270.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   183   1e-46

>AT1G78150.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G35780.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:29404996-29406341 FORWARD LENGTH=274
          Length = 274

 Score =  342 bits (877), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 187/286 (65%), Positives = 216/286 (75%), Gaps = 15/286 (5%)

Query: 1   MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXX-XDGIRKVVFGGQVTDEEV 59
           MERSTPVRKPHTSTADLLTWSE                     DGI KVVFGGQVTDEEV
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDSPSSASRSAVRSHQPSDGISKVVFGGQVTDEEV 60

Query: 60  ESLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISF 119
           ESLN+RKPCSE KMKEITGSGIF  N +DD +E           + +YQQAV GIS ISF
Sbjct: 61  ESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISF 110

Query: 120 GEEGSLLPKKPSSLPEVAKQRELSGTLESED-HKLNKQLSDAKCKELSGHDIFAPPPEIK 178
           GEE +L PKKP+++PEVAKQRELSGT+E+E  +KL KQLSDAK KE+SG +IFAPPPEIK
Sbjct: 111 GEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIK 170

Query: 179 PRPITARVLELKGSIDIGELAPSQ-ADGDGTTKTAKKIYKEKFADLSGNDIFRGEVPPSS 237
           PR  T R L LK + ++G  A SQ A+ D + KTAKKIY +KFA+LSGNDIF+G+   S+
Sbjct: 171 PRSGTNRALALKDNFNLG--AESQTAEEDSSVKTAKKIYDKKFAELSGNDIFKGDAASSN 228

Query: 238 TEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIALV 283
            EK LS AKL+EI G+NIFADGKVE+RD+LGGVRKPPGGE+SIALV
Sbjct: 229 VEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 274


>AT1G78150.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G35780.1); Has 152 Blast hits to 146 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 2; Plants - 149; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:29404996-29406341 FORWARD
           LENGTH=274
          Length = 274

 Score =  342 bits (877), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 187/286 (65%), Positives = 216/286 (75%), Gaps = 15/286 (5%)

Query: 1   MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXX-XDGIRKVVFGGQVTDEEV 59
           MERSTPVRKPHTSTADLLTWSE                     DGI KVVFGGQVTDEEV
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDSPSSASRSAVRSHQPSDGISKVVFGGQVTDEEV 60

Query: 60  ESLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISF 119
           ESLN+RKPCSE KMKEITGSGIF  N +DD +E           + +YQQAV GIS ISF
Sbjct: 61  ESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP----------LPVYQQAVNGISQISF 110

Query: 120 GEEGSLLPKKPSSLPEVAKQRELSGTLESED-HKLNKQLSDAKCKELSGHDIFAPPPEIK 178
           GEE +L PKKP+++PEVAKQRELSGT+E+E  +KL KQLSDAK KE+SG +IFAPPPEIK
Sbjct: 111 GEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIK 170

Query: 179 PRPITARVLELKGSIDIGELAPSQ-ADGDGTTKTAKKIYKEKFADLSGNDIFRGEVPPSS 237
           PR  T R L LK + ++G  A SQ A+ D + KTAKKIY +KFA+LSGNDIF+G+   S+
Sbjct: 171 PRSGTNRALALKDNFNLG--AESQTAEEDSSVKTAKKIYDKKFAELSGNDIFKGDAASSN 228

Query: 238 TEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIALV 283
            EK LS AKL+EI G+NIFADGKVE+RD+LGGVRKPPGGE+SIALV
Sbjct: 229 VEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 274


>AT1G35780.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G78150.2); Has 145 Blast hits to 144 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 145; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:13277778-13280113 REVERSE
           LENGTH=286
          Length = 286

 Score =  328 bits (842), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 182/290 (62%), Positives = 206/290 (71%), Gaps = 11/290 (3%)

Query: 1   MERSTPVRKPHTSTADLLTWSETX--XXXXXXXXXXXXXXXXXXDGIRKVVFGGQVTDEE 58
           ME++TPVRKPH STADLLTW E                      DGI KVVFGGQVTDEE
Sbjct: 1   MEKNTPVRKPHMSTADLLTWPENQPFESPAAVSSRSAARSHQPSDGISKVVFGGQVTDEE 60

Query: 59  VESLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGI-SHI 117
           VESLNKRKPCS  KMKEITGSGIF    E+D +E  SAN       R +QQ  A I SHI
Sbjct: 61  VESLNKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKSRTFQQPPAAIMSHI 120

Query: 118 SFGEEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNKQLSDAKCKELSGHDIFAPPPE 176
           SFGEE  + PKKP+++PEVAKQRELSGTLE + D KLNKQ SDAKCKELSGH+IFAPPPE
Sbjct: 121 SFGEEEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPPPE 180

Query: 177 IKPRPITARVLELKGSIDIGELAPSQADGDGTTKTAKKIYKEKFADLSGNDIFRGEV--P 234
           IK RP T R L  K + D+GE   S    DG  KTAKKI   KF DLSGN++F+ +V  P
Sbjct: 181 IKLRP-TVRALAYKDNFDLGE---SDTKPDGELKTAKKIADRKFTDLSGNNVFKSDVSSP 236

Query: 235 PSST-EKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIALV 283
            S+T E+ LS AKL+EISG++IFAD K +SRD+ GGVRKPPGGESSIALV
Sbjct: 237 SSATAERLLSTAKLKEISGNDIFADAKAQSRDYFGGVRKPPGGESSIALV 286


>AT1G78150.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G35780.1). | chr1:29404996-29406341 FORWARD
           LENGTH=303
          Length = 303

 Score =  327 bits (837), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 187/315 (59%), Positives = 216/315 (68%), Gaps = 44/315 (13%)

Query: 1   MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXX-XDGIRKVVFGGQVTDEEV 59
           MERSTPVRKPHTSTADLLTWSE                     DGI KVVFGGQVTDEEV
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDSPSSASRSAVRSHQPSDGISKVVFGGQVTDEEV 60

Query: 60  ESLNKR-----------------------------KPCSESKMKEITGSGIFVANGEDDT 90
           ESLN+R                             KPCSE KMKEITGSGIF  N +DD 
Sbjct: 61  ESLNRRILDDAFDSFMRLVIYTNVKTCENVYDVIRKPCSEHKMKEITGSGIFSRNEKDDA 120

Query: 91  AEDGSANPQNKTGIRMYQQAVAGISHISFGEEGSLLPKKPSSLPEVAKQRELSGTLESED 150
           +E           + +YQQAV GIS ISFGEE +L PKKP+++PEVAKQRELSGT+E+E 
Sbjct: 121 SEP----------LPVYQQAVNGISQISFGEEENLSPKKPATVPEVAKQRELSGTMENES 170

Query: 151 -HKLNKQLSDAKCKELSGHDIFAPPPEIKPRPITARVLELKGSIDIGELAPSQ-ADGDGT 208
            +KL KQLSDAK KE+SG +IFAPPPEIKPR  T R L LK + ++G  A SQ A+ D +
Sbjct: 171 ANKLQKQLSDAKYKEISGQNIFAPPPEIKPRSGTNRALALKDNFNLG--AESQTAEEDSS 228

Query: 209 TKTAKKIYKEKFADLSGNDIFRGEVPPSSTEKSLSGAKLREISGSNIFADGKVESRDFLG 268
            KTAKKIY +KFA+LSGNDIF+G+   S+ EK LS AKL+EI G+NIFADGKVE+RD+LG
Sbjct: 229 VKTAKKIYDKKFAELSGNDIFKGDAASSNVEKHLSQAKLKEIGGNNIFADGKVEARDYLG 288

Query: 269 GVRKPPGGESSIALV 283
           GVRKPPGGE+SIALV
Sbjct: 289 GVRKPPGGETSIALV 303


>AT4G39860.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G22270.1); Has 152 Blast hits to 146 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 2; Plants - 146; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
           LENGTH=299
          Length = 299

 Score =  297 bits (760), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 163/302 (53%), Positives = 204/302 (67%), Gaps = 22/302 (7%)

Query: 1   MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXXXDGIRKVVFGGQVTDEEVE 60
           MER+TPVR PHTSTADLL+WSET                   DGI K++ GGQ+TDEE +
Sbjct: 1   MERNTPVRNPHTSTADLLSWSETPPPPHHSTPSAARSHQPS-DGISKILGGGQITDEEAQ 59

Query: 61  SLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISFG 120
           SLNK K CS  K+KE+TGSGIF   G+  +  D + +P  KTG+R YQQ + G+S ISF 
Sbjct: 60  SLNKLKNCSGYKLKEMTGSGIFTDKGKVGSESDATTDP--KTGLRYYQQTLNGMSQISFS 117

Query: 121 EEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNKQLSDAKCKELSGHDIFAPPPEIKP 179
            +G++ PKKP++L EVAKQRELSG L +E D K NKQ+S AK +E+SGHDIFAPP EI+P
Sbjct: 118 ADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQP 177

Query: 180 RPITARVLELKGSIDIGELAPSQ----------ADGDGT--------TKTAKKIYKEKFA 221
           R + A   E +G+ D+GE AP            A G            KT+KKI+ +KF 
Sbjct: 178 RSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQ 237

Query: 222 DLSGNDIFRGEVPPSSTEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIA 281
           +L+GN IF+G+  P S +K LS AKLRE+SG+NIFADGK ESRD+ GGVRKPPGGESSI+
Sbjct: 238 ELTGNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGGESSIS 297

Query: 282 LV 283
           LV
Sbjct: 298 LV 299


>AT4G39860.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G22270.1); Has 148 Blast hits to 144 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 144; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr4:18499909-18501472 FORWARD
           LENGTH=298
          Length = 298

 Score =  290 bits (743), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 162/302 (53%), Positives = 203/302 (67%), Gaps = 23/302 (7%)

Query: 1   MERSTPVRKPHTSTADLLTWSETXXXXXXXXXXXXXXXXXXXDGIRKVVFGGQVTDEEVE 60
           MER+TPVR PHTSTADLL+WSET                   DGI K++ GGQ+TDEE +
Sbjct: 1   MERNTPVRNPHTSTADLLSWSETPPPPHHSTPSAARSHQPS-DGISKILGGGQITDEEAQ 59

Query: 61  SLNKRKPCSESKMKEITGSGIFVANGEDDTAEDGSANPQNKTGIRMYQQAVAGISHISFG 120
           SLNK K CS  K+KE+TGSGIF   G+  +  D + +P  KTG+R YQ  + G+S ISF 
Sbjct: 60  SLNKLKNCSGYKLKEMTGSGIFTDKGKVGSESDATTDP--KTGLRYYQ-TLNGMSQISFS 116

Query: 121 EEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNKQLSDAKCKELSGHDIFAPPPEIKP 179
            +G++ PKKP++L EVAKQRELSG L +E D K NKQ+S AK +E+SGHDIFAPP EI+P
Sbjct: 117 ADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQP 176

Query: 180 RPITARVLELKGSIDIGELAPSQ----------ADGDGT--------TKTAKKIYKEKFA 221
           R + A   E +G+ D+GE AP            A G            KT+KKI+ +KF 
Sbjct: 177 RSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQ 236

Query: 222 DLSGNDIFRGEVPPSSTEKSLSGAKLREISGSNIFADGKVESRDFLGGVRKPPGGESSIA 281
           +L+GN IF+G+  P S +K LS AKLRE+SG+NIFADGK ESRD+ GGVRKPPGGESSI+
Sbjct: 237 ELTGNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGGESSIS 296

Query: 282 LV 283
           LV
Sbjct: 297 LV 298


>AT2G22270.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G39860.2); Has 247 Blast hits to 231 proteins
           in 42 species: Archae - 0; Bacteria - 17; Metazoa - 14;
           Fungi - 5; Plants - 145; Viruses - 0; Other Eukaryotes -
           66 (source: NCBI BLink). | chr2:9463765-9465282 FORWARD
           LENGTH=328
          Length = 328

 Score =  183 bits (465), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 131/328 (39%), Positives = 176/328 (53%), Gaps = 61/328 (18%)

Query: 10  PHTSTADLLTWSETXXXXXXXXXXXXXXXXXXXDGIRKVVFGG-QVTDEEVESLN----K 64
           PH STADLL+WSE                    DG+  V+ GG Q+T+ E +SLN     
Sbjct: 8   PHHSTADLLSWSEIRRPDYSTAANRSNQPS---DGMNDVLGGGGQITNAETKSLNTNVSH 64

Query: 65  RKPCSESKMKEITGSGIFVANGEDD--------------------------TAEDGSANP 98
           RK CS  K+KE+TGS IF  +G+ D                          T  +G  +P
Sbjct: 65  RKNCSGHKLKEMTGSDIFSDDGKYDPNHQTRIHYHQDQLSQISFSGEENATTPMNGKDDP 124

Query: 99  QNKTGIRMYQQAVAGISHISFGEEGSLLPKKPSSLPEVAKQRELSGTLESE-DHKLNK-Q 156
            ++T I  +Q      S ISF  E ++ PKKP++L E AKQ+ELS T+E++ D K  K Q
Sbjct: 125 NHQTRIHYHQDQR---SQISFSGEENVTPKKPTTLNEAAKQKELSRTVETQADSKCKKKQ 181

Query: 157 LSDAKCKELSGHDIFAPPPEIKPRPIT--ARVLELKGSIDIGELAPSQADGDGTT----- 209
           +S+ K K +SGHDIFA P E +PR +   A   E+KG+ +  E AP  +     T     
Sbjct: 182 ISNTKNKAMSGHDIFASP-ESQPRRLFGGATQSEVKGNKNTEESAPRSSRASVKTSNGQS 240

Query: 210 -----------KTAKKIYKEK--FADLSGNDIFRGE-VPPSSTEKSLSGAKLREISGSNI 255
                      K++KKI+ +K  F  L+ N IF+ + +PP  +EK  S AK RE+SG NI
Sbjct: 241 SNRLFSEEHVVKSSKKIHNQKSQFQGLTSNGIFKSDKIPPGYSEKMQSSAKKREMSGHNI 300

Query: 256 FADGKVESRDFLGGVRKPPGGESSIALV 283
           FADGK E RD+ GG R+PPGGESSI+LV
Sbjct: 301 FADGKSEYRDYYGGARRPPGGESSISLV 328