Miyakogusa Predicted Gene

Lj1g3v4913230.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4913230.1 Non Characterized Hit- tr|I1JQJ5|I1JQJ5_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,68.05,0,SANT,SANT
domain; seg,NULL; FAMILY NOT NAMED,NULL;
Homeodomain-like,Homeodomain-like,CUFF.33565.1
         (842 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G47820.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   377   e-104
AT2G47820.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   377   e-104
AT1G09040.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   322   9e-88
AT1G09050.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   315   7e-86
AT1G55050.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   273   5e-73
AT1G55050.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   273   5e-73
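
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST report itself. It assumes each hit line ends with a bit score followed by an E-value, and that an E-value printed without a mantissa (for example e-104) means 1e-104. The names HIT_LINE and parse_hit_line are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical pattern: sequence ID, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit_line(line):
    """Return (id, bit_score, e_value) for one summary line, or None."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    evalue_text = match.group("evalue")
    # This BLAST version prints "e-104" instead of "1e-104"; add the mantissa.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    try:
        return match.group("id"), float(match.group("bits")), float(evalue_text)
    except ValueError:
        # The last column was not a number, so this is not a hit line.
        return None


# Two lines copied from the summary table above.
sample = [
    "AT2G47820.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   377   e-104",
    "AT1G09040.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   322   9e-88",
]
hits = [parse_hit_line(line) for line in sample]
for hit in hits:
    print(hit)
```

Lines that do not end in two numeric columns, such as the table header, are returned as None and can be filtered out by the caller.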

>AT2G47820.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 17 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G09040.1). | chr2:19588122-19590629 FORWARD
           LENGTH=805
          Length = 805

 Score =  377 bits (967), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 221/499 (44%), Positives = 287/499 (57%), Gaps = 65/499 (13%)

Query: 22  LHGVDDKFGDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPEP-----FS 76
           L+G+    GDP+VLPRVGD+YQ ++P L        L +  +   I     EP      +
Sbjct: 26  LNGI---HGDPDVLPRVGDQYQADLPVL--------LTESDRLKLITCFHSEPPLQKLLT 74

Query: 77  LGLPIPIMWAHC-SFGCENSESVTSGEGKVSSEH----ECTKVKGGNLGGFSNSQNSSKS 131
            GLPIP+MW     F       +      V  +      C K +   L          K 
Sbjct: 75  FGLPIPLMWTRSEKFRGFREADIDKASPPVDDQSLQNAACMKPRSIVLALPCQKNAKFKF 134

Query: 132 DETDIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLLKRFVG 191
           D  D              +  Y  PG L  QPW D E + FLLGLY  GKNL L++RFVG
Sbjct: 135 DWLD--------------KTLYPFPGTLG-QPWEDAEQERFLLGLYCLGKNLVLVQRFVG 179

Query: 192 SKKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPG 251
           SK MGD+LS+YYG F+RS  + RW + RKSR+RR   GQK+  GWRQQE LSR+ SHV  
Sbjct: 180 SKHMGDMLSYYYGSFYRSTEYRRWVDGRKSRSRRSVQGQKLLSGWRQQELLSRISSHVSE 239

Query: 252 ECQTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
           EC+  L+++S+ F E KI  E+Y+F LK+ VGI++L   +GIGKGK DLT  A+EP+K  
Sbjct: 240 ECKITLLKVSKAFREDKIALEDYVFTLKNTVGIDMLTQVIGIGKGKRDLTNCALEPTKLN 299

Query: 312 H-------VSVRPELPIGKACSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANG 364
           H       V +R +LPI         ADI+KFLTG +R+SK RSSDLFWEAVWPRLLA G
Sbjct: 300 HGASGNSQVRIRNDLPI---------ADIVKFLTGEYRMSKTRSSDLFWEAVWPRLLARG 350

Query: 365 WHSEQPMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLLET- 423
           WHSEQP D    G K SLVFL+P   KFS+RK+ KGNHYFDS++D+LNKVA DP LLE  
Sbjct: 351 WHSEQPKD----GPKNSLVFLVPEANKFSRRKMSKGNHYFDSLTDVLNKVALDPTLLELD 406

Query: 424 EIQATEGSSDGGKRQD-----KRDVDGLPNG-QQCHYLQSRSKCN--EDLAKLTIIDTSM 475
           E    +GS +   + D     +   D  PN  ++  YLQ RSK    +++   TIIDTS 
Sbjct: 407 EDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKKYLQPRSKTRKIQEVMLFTIIDTSE 466

Query: 476 VHNMDQRKVRQMRSLSFQT 494
            ++++   ++++RSL   T
Sbjct: 467 TNSIEGCTLKELRSLPVGT 485



 Score = 54.7 bits (130), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/122 (34%), Positives = 64/122 (52%), Gaps = 9/122 (7%)

Query: 676 TSSQAKDSVTENHMVGEVSAENSETRMLIDLNFPQVSPEF---GIDLEIPS--SQNDNQC 730
           TSS A+DS    ++  E+S E SE+R   DLN  Q+S E    G D  +      +++ C
Sbjct: 635 TSSFARDSSCRRNIDREISPERSESREDFDLNVSQISLEREADGTDTVMADVVQNSESSC 694

Query: 731 ANTLSSQSEITQLNAMHDFPDDNKEQQSTIVNLRQSTRNRPLTTKALEALQYRFLNSTKR 790
           A   S Q ++ +       P + +     +   RQSTR RPLTTKALEA  + +L ++ +
Sbjct: 695 AEQSSVQVDVEKQCK----PQELQVTADLLPERRQSTRTRPLTTKALEAFAFGYLGNSNK 750

Query: 791 KR 792
           +R
Sbjct: 751 ER 752
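
As a rough sanity check on the Expect value of the HSP above, the E-value can be approximated from the bit score S' as E = m * n * 2^(-S'), where m and n are the query and database lengths. The sketch below uses the raw lengths printed in this report (842 and 14,482,855 letters). BLAST itself uses shorter effective lengths that are not shown here, so the result is only expected to match the reported 2e-07 in order of magnitude. The function name approx_evalue is hypothetical.

```python
def approx_evalue(bit_score, query_len, db_len):
    """Approximate E-value from a bit score using raw (uncorrected) lengths."""
    return query_len * db_len * 2.0 ** (-bit_score)


# Values taken from this report: second HSP against AT2G47820.2.
estimate = approx_evalue(54.7, 842, 14_482_855)
print(f"{estimate:.1e}")
```

Because the raw lengths are larger than the effective lengths, this estimate comes out somewhat above the value BLAST reports.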


>AT2G47820.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G09040.1); Has 628 Blast hits to 543 proteins
           in 149 species: Archae - 0; Bacteria - 106; Metazoa -
           145; Fungi - 69; Plants - 97; Viruses - 10; Other
           Eukaryotes - 201 (source: NCBI BLink). |
           chr2:19588122-19590629 FORWARD LENGTH=805
          Length = 805

 Score =  377 bits (967), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 221/499 (44%), Positives = 287/499 (57%), Gaps = 65/499 (13%)

Query: 22  LHGVDDKFGDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPEP-----FS 76
           L+G+    GDP+VLPRVGD+YQ ++P L        L +  +   I     EP      +
Sbjct: 26  LNGI---HGDPDVLPRVGDQYQADLPVL--------LTESDRLKLITCFHSEPPLQKLLT 74

Query: 77  LGLPIPIMWAHC-SFGCENSESVTSGEGKVSSEH----ECTKVKGGNLGGFSNSQNSSKS 131
            GLPIP+MW     F       +      V  +      C K +   L          K 
Sbjct: 75  FGLPIPLMWTRSEKFRGFREADIDKASPPVDDQSLQNAACMKPRSIVLALPCQKNAKFKF 134

Query: 132 DETDIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLLKRFVG 191
           D  D              +  Y  PG L  QPW D E + FLLGLY  GKNL L++RFVG
Sbjct: 135 DWLD--------------KTLYPFPGTLG-QPWEDAEQERFLLGLYCLGKNLVLVQRFVG 179

Query: 192 SKKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPG 251
           SK MGD+LS+YYG F+RS  + RW + RKSR+RR   GQK+  GWRQQE LSR+ SHV  
Sbjct: 180 SKHMGDMLSYYYGSFYRSTEYRRWVDGRKSRSRRSVQGQKLLSGWRQQELLSRISSHVSE 239

Query: 252 ECQTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
           EC+  L+++S+ F E KI  E+Y+F LK+ VGI++L   +GIGKGK DLT  A+EP+K  
Sbjct: 240 ECKITLLKVSKAFREDKIALEDYVFTLKNTVGIDMLTQVIGIGKGKRDLTNCALEPTKLN 299

Query: 312 H-------VSVRPELPIGKACSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANG 364
           H       V +R +LPI         ADI+KFLTG +R+SK RSSDLFWEAVWPRLLA G
Sbjct: 300 HGASGNSQVRIRNDLPI---------ADIVKFLTGEYRMSKTRSSDLFWEAVWPRLLARG 350

Query: 365 WHSEQPMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLLET- 423
           WHSEQP D    G K SLVFL+P   KFS+RK+ KGNHYFDS++D+LNKVA DP LLE  
Sbjct: 351 WHSEQPKD----GPKNSLVFLVPEANKFSRRKMSKGNHYFDSLTDVLNKVALDPTLLELD 406

Query: 424 EIQATEGSSDGGKRQD-----KRDVDGLPNG-QQCHYLQSRSKCN--EDLAKLTIIDTSM 475
           E    +GS +   + D     +   D  PN  ++  YLQ RSK    +++   TIIDTS 
Sbjct: 407 EDLERKGSKEEVIKNDPPTNLEEFDDSSPNSKKKKKYLQPRSKTRKIQEVMLFTIIDTSE 466

Query: 476 VHNMDQRKVRQMRSLSFQT 494
            ++++   ++++RSL   T
Sbjct: 467 TNSIEGCTLKELRSLPVGT 485



 Score = 54.7 bits (130), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/122 (34%), Positives = 64/122 (52%), Gaps = 9/122 (7%)

Query: 676 TSSQAKDSVTENHMVGEVSAENSETRMLIDLNFPQVSPEF---GIDLEIPS--SQNDNQC 730
           TSS A+DS    ++  E+S E SE+R   DLN  Q+S E    G D  +      +++ C
Sbjct: 635 TSSFARDSSCRRNIDREISPERSESREDFDLNVSQISLEREADGTDTVMADVVQNSESSC 694

Query: 731 ANTLSSQSEITQLNAMHDFPDDNKEQQSTIVNLRQSTRNRPLTTKALEALQYRFLNSTKR 790
           A   S Q ++ +       P + +     +   RQSTR RPLTTKALEA  + +L ++ +
Sbjct: 695 AEQSSVQVDVEKQCK----PQELQVTADLLPERRQSTRTRPLTTKALEAFAFGYLGNSNK 750

Query: 791 KR 792
           +R
Sbjct: 751 ER 752


>AT1G09040.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: membrane;
           EXPRESSED IN: leaf; BEST Arabidopsis thaliana protein
           match is: unknown protein (TAIR:AT1G09050.1); Has 614
           Blast hits to 567 proteins in 104 species: Archae - 2;
           Bacteria - 12; Metazoa - 344; Fungi - 31; Plants - 81;
           Viruses - 0; Other Eukaryotes - 144 (source: NCBI
           BLink). | chr1:2912362-2915174 FORWARD LENGTH=911
          Length = 911

 Score =  322 bits (824), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 195/479 (40%), Positives = 276/479 (57%), Gaps = 56/479 (11%)

Query: 18  VEEDLHGVDDKF--GDPEVLPRVGDEYQVEIPSLTAAP----YLSQLAKKTKDSEIRVNV 71
            EED +  DD+F  GDP+V PRVGDE+QV+IP + +A     +LS        S      
Sbjct: 19  TEEDSY--DDEFPCGDPQVEPRVGDEFQVDIPPMMSATKRAVFLSTPVALDDSSY----- 71

Query: 72  PEPFSLGLPIPIMWAHC-----SFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQ 126
              F +GLP+ +MW          G +N +   S +   + +  C+      + G S+  
Sbjct: 72  --SFLIGLPVQVMWIDKHRRGQGNGDDNVDMNQSLKSLRAKKSRCS----AKIRGKSDKN 125

Query: 127 NSSKSDETDIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLL 186
           + +K   +++++                   ++    W D+E  SF+LGLY FGKN   +
Sbjct: 126 SETKKQRSNLEAVP-----------------VIPSSSWEDLEVASFVLGLYTFGKNFTQV 168

Query: 187 KRFVGSKKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLF 246
           K F+ +K +G+I+ FYYGKF+ S  +  WSE RK R R+C  G+ ++ GWRQQ+ L+RL 
Sbjct: 169 KNFMENKGIGEIMLFYYGKFYNSAKYHSWSESRKKRNRKCVFGRTLYSGWRQQQLLTRLM 228

Query: 247 SHVPGECQT-LLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAV 305
             +P E Q  +LV++S++F E  I  E+Y+ A+K+ VG+ LL+ AV IGK K DLT    
Sbjct: 229 PSIPDEPQKQILVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTS 288

Query: 306 EPSKT---YHVSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLL 361
            P KT   + VS +  L  G+   +SLTSA II  LTG  RLSKAR +D+FW AVWPRLL
Sbjct: 289 TPMKTKPWFTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLL 348

Query: 362 ANGWHSEQPMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLL 421
           A GWHS+QP D+    SK  +VF++PGVKKFS+++LVKG+HYFDS+SD+L KV S+P LL
Sbjct: 349 ARGWHSQQPEDRGYFKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELL 408

Query: 422 ETEIQ--ATEGSSDGGKRQDKRDVDGLPNGQQCH-YLQSRSKCNEDLA-KLTIIDTSMV 476
           E E    A E SS      DK D + +P+    H YL+S       L  K T++DTS+ 
Sbjct: 409 ENETGGVAAELSS------DKSDEESVPSDSLRHRYLRSPCSNRGTLGMKFTVVDTSLA 461


>AT1G09050.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G09040.1); Has 552 Blast hits to 499 proteins
           in 115 species: Archae - 0; Bacteria - 86; Metazoa -
           259; Fungi - 14; Plants - 77; Viruses - 0; Other
           Eukaryotes - 116 (source: NCBI BLink). |
           chr1:2918031-2920858 FORWARD LENGTH=916
          Length = 916

 Score =  315 bits (808), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 184/426 (43%), Positives = 255/426 (59%), Gaps = 32/426 (7%)

Query: 18  VEEDLHGVDDKF--GDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPEPF 75
           +EED +  DD+F  GDP+V PRVGDE+QV+IP + +A   S+ A    +     +    F
Sbjct: 19  IEEDSY--DDEFPCGDPQVEPRVGDEFQVDIPLMMSA---SKRAVFLSNPVALDDSTCSF 73

Query: 76  SLGLPIPIMWA-HCSFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQNSSKSDET 134
            +GLP+ +MW      G  N      G+G V        ++     G  +++   KSD+ 
Sbjct: 74  LVGLPVQVMWIDKVGIGQGN------GDGNVDMNQSLKSLRAKK--GRCSAKIRGKSDKN 125

Query: 135 DIDSCKGLKTELNQPRGKYLLPGLLDDQPWTDIEYDSFLLGLYVFGKNLNLLKRFVGSKK 194
                + L  E         +P +     W D+E  SF+LGLY FGKN   +  F+ +K 
Sbjct: 126 SETKKQRLNLEA--------VPAIPSSS-WDDLEVASFVLGLYTFGKNFTQMNNFMENKG 176

Query: 195 MGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPGECQ 254
           +G+I+ FYYGKF+ S  +  WSE RK R R+C +G+K++ GWRQQ+ L+RL   +P E Q
Sbjct: 177 IGEIMLFYYGKFYNSAKYHTWSESRKKRNRKCVYGRKLYSGWRQQQLLTRLMPSIPDEPQ 236

Query: 255 T-LLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKT--- 310
             +LV++S++F E  I  E+Y+ A+K+ VG+ LL+ AV IGK K DLT     P KT   
Sbjct: 237 KQMLVDVSKSFAEGTITLEKYVSAVKNLVGLRLLVDAVAIGKEKEDLTVPTSTPMKTKPW 296

Query: 311 YHVSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANGWHSEQ 369
           + VS +  L  G+   +SLTSA II  LTG  RLSKAR +D+FW AVWPRLLA GW S+Q
Sbjct: 297 FTVSSKSSLVPGEGDYNSLTSAGIINQLTGCSRLSKARCNDIFWGAVWPRLLARGWRSQQ 356

Query: 370 PMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVASDPGLLETEIQ--A 427
           P D+    SK  +VF++PGVKKFS+++LVKG+HYFDS+SD+L KV S+P LLE E    A
Sbjct: 357 PEDRGYFKSKDYIVFIVPGVKKFSRQELVKGDHYFDSVSDILTKVVSEPELLENETGGVA 416

Query: 428 TEGSSD 433
            E  SD
Sbjct: 417 AENPSD 422


>AT1G55050.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: cultured cell;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT1G09040.1); Has 30201 Blast hits to
           17322 proteins in 780 species: Archae - 12; Bacteria -
           1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
           Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr1:20542779-20545612 FORWARD LENGTH=915
          Length = 915

 Score =  273 bits (697), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 166/407 (40%), Positives = 226/407 (55%), Gaps = 52/407 (12%)

Query: 26  DDKF--GDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPE-----PFSLG 78
           D++F  GDP+V  RVGDEYQVEIP     P +S+    ++ +E+ +N  E      F++G
Sbjct: 14  DEEFVCGDPKVDIRVGDEYQVEIP-----PMMSE----SQRAELLLNPLEFDSSCSFAVG 64

Query: 79  LPIPIMWAHCSFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQNSSKSDETDIDS 138
           LP+ +MW      C + + + S    ++   +  K K    GG               D 
Sbjct: 65  LPVEVMWIETK--CRDGDGLGSDNIDMNESLKSLKRKRSRRGG--------------SDG 108

Query: 139 CKGLKTELNQPRGKYLLPGLLDDQP------WTDIEYDSFLLGLYVFGKNLNLLKRFVGS 192
             G K  +N           L+  P      W D+E D F+LGLY FGKN   +++ + S
Sbjct: 109 NSGSKRRMN-----------LEAVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157

Query: 193 KKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPGE 252
           K  G+IL FYYGKF+ S  +  WS   K R+ RC  G+K++  WR Q  LSRL   +  E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217

Query: 253 C-QTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
             +  LV++S++F E K   EEYI A+K  VG+  L+ AV IGK K DLT    +P    
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277

Query: 312 H-VSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANGWHSEQ 369
               V   +P G    +SLT   II+ L+G  R+SKAR +D+FW+AVWPRLL  GW SE 
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337

Query: 370 PMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVAS 416
           P DQ    SK+ +VFL+PGVKKFS++KLVK +HYFDSISD+L KV S
Sbjct: 338 PKDQGYIKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVS 384


>AT1G55050.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: cultured cell;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT1G09040.1); Has 2440 Blast hits to 1999
           proteins in 271 species: Archae - 0; Bacteria - 138;
           Metazoa - 960; Fungi - 166; Plants - 162; Viruses - 14;
           Other Eukaryotes - 1000 (source: NCBI BLink). |
           chr1:20542779-20545612 FORWARD LENGTH=915
          Length = 915

 Score =  273 bits (697), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 166/407 (40%), Positives = 226/407 (55%), Gaps = 52/407 (12%)

Query: 26  DDKF--GDPEVLPRVGDEYQVEIPSLTAAPYLSQLAKKTKDSEIRVNVPE-----PFSLG 78
           D++F  GDP+V  RVGDEYQVEIP     P +S+    ++ +E+ +N  E      F++G
Sbjct: 14  DEEFVCGDPKVDIRVGDEYQVEIP-----PMMSE----SQRAELLLNPLEFDSSCSFAVG 64

Query: 79  LPIPIMWAHCSFGCENSESVTSGEGKVSSEHECTKVKGGNLGGFSNSQNSSKSDETDIDS 138
           LP+ +MW      C + + + S    ++   +  K K    GG               D 
Sbjct: 65  LPVEVMWIETK--CRDGDGLGSDNIDMNESLKSLKRKRSRRGG--------------SDG 108

Query: 139 CKGLKTELNQPRGKYLLPGLLDDQP------WTDIEYDSFLLGLYVFGKNLNLLKRFVGS 192
             G K  +N           L+  P      W D+E D F+LGLY FGKN   +++ + S
Sbjct: 109 NSGSKRRMN-----------LEAVPEKSSSSWEDLEVDGFVLGLYTFGKNFAQVQKLLES 157

Query: 193 KKMGDILSFYYGKFFRSKGHSRWSECRKSRTRRCAHGQKIFMGWRQQESLSRLFSHVPGE 252
           K  G+IL FYYGKF+ S  +  WS   K R+ RC  G+K++  WR Q  LSRL   +  E
Sbjct: 158 KATGEILLFYYGKFYGSAKYKTWSNYLKKRSTRCIQGKKLYSDWRLQLLLSRLIRSITDE 217

Query: 253 C-QTLLVEISRNFVEKKILFEEYIFALKDAVGIELLIAAVGIGKGKHDLTGTAVEPSKTY 311
             +  LV++S++F E K   EEYI A+K  VG+  L+ AV IGK K DLT    +P    
Sbjct: 218 SKEQKLVDVSKSFAEGKKSLEEYINAVKKLVGLRCLVEAVAIGKDKEDLTVLTTKPVDVE 277

Query: 312 H-VSVRPELPIGKA-CSSLTSADIIKFLTGNFRLSKARSSDLFWEAVWPRLLANGWHSEQ 369
               V   +P G    +SLT   II+ L+G  R+SKAR +D+FW+AVWPRLL  GW SE 
Sbjct: 278 QWFRVSSAVPAGLGEYNSLTVEGIIEKLSGGSRVSKARCNDIFWDAVWPRLLHRGWRSEL 337

Query: 370 PMDQVVSGSKQSLVFLIPGVKKFSKRKLVKGNHYFDSISDMLNKVAS 416
           P DQ    SK+ +VFL+PGVKKFS++KLVK +HYFDSISD+L KV S
Sbjct: 338 PKDQGYIKSKEHIVFLVPGVKKFSRKKLVKRDHYFDSISDILKKVVS 384