Miyakogusa Predicted Gene

Lj0g3v0262339.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0262339.1 Non Characterized Hit- tr|D7TUW0|D7TUW0_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,56.63,3e-17,coiled-coil,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.17272.1
         (531 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G13260.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   531   e-151
AT3G48860.2 | Symbols:  | unknown protein; INVOLVED IN: biologic...   466   e-131
AT4G25070.2 | Symbols:  | unknown protein; EXPRESSED IN: culture...   411   e-115
AT4G25070.1 | Symbols:  | unknown protein; EXPRESSED IN: culture...   411   e-115
AT5G23700.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   406   e-113
AT3G48860.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   356   2e-98
AT4G08630.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   332   5e-91
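
The summary table above can be read programmatically. The following is a minimal Python sketch, not part of BLAST itself; the helper name `parse_hit_line` and the regular expression are assumptions made for illustration. It assumes every row ends with a bit score followed by an E-value, and it treats an E-value printed as `e-151` as shorthand for `1e-151`.

```python
import re

# Hypothetical pattern: sequence ID, free-text description,
# then the bit score and the E-value as the last two tokens.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s.*?\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)

def parse_hit_line(line):
    """Return (sequence_id, bit_score, e_value), or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    evalue = m.group("evalue")
    # Assumption: a bare "e-151" stands for "1e-151", so restore the leading 1.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return m.group("id"), float(m.group("bits")), float(evalue)

rows = [
    "AT5G13260.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   531   e-151",
    "AT3G48860.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   356   2e-98",
]
hits = [parse_hit_line(r) for r in rows]
```

Applied to the two sample rows, `hits` holds one tuple per row, with the E-values available as ordinary floats for sorting or thresholding.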

>AT5G13260.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48860.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:4243164-4246677 FORWARD LENGTH=537
          Length = 537

 Score =  531 bits (1367), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 307/537 (57%), Positives = 351/537 (65%), Gaps = 21/537 (3%)

Query: 1   MERKGRESPVSIRQWSSDSGNVIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRTQNXXXX 60
           MER   ESP   RQWS DSG                               KR QN    
Sbjct: 1   MERARTESPSYFRQWSGDSGTT--NAAAVAPSSPARHHHARSSSVTNMSNVKRAQNVAAK 58

Query: 61  XXXXXXXXXXXSQTADNEEDDDD--------LGFRYTAPPPLSIXXXXXXXXXXXXPA-- 110
                      SQT ++++DDDD        LGFRY APPPLS             P   
Sbjct: 59  AAAQRLAKVMASQTNNDDDDDDDDDEVGGDDLGFRYGAPPPLSFTRNPSSTIAKPKPVAS 118

Query: 111 --------LARNHLVDESMYLXXXXXXXXXXXXXLSQRTXXXXXXXXXXHNNKRFPFDTG 162
                   ++R+     S  +             L  +T             KR   D G
Sbjct: 119 SAVVPPPKISRSSSPANSPAVSVRASQPPVPPSKLRNQTTNPLPVATP-KTEKRVLADIG 177

Query: 163 LVQPKDSGDQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVRVLEKQVASLGE 222
               KDS DQ +ASALRDE+DMLQEEN+SIL+KLRLE+E CKEAEARVR LEKQV SLGE
Sbjct: 178 HFNGKDSKDQHEASALRDELDMLQEENDSILEKLRLEDEKCKEAEARVRELEKQVTSLGE 237

Query: 223 GVSLEAKLLSRKEAALRQREAALKNSRDCRDGVDTEITSLQAEVENAKIETEAAVRQLNG 282
           GVSLEAKLLSRKEAALRQREAALK++R  RDG + E T+L+++VE AK+ET A V QL G
Sbjct: 238 GVSLEAKLLSRKEAALRQREAALKDARQNRDGTNRETTALRSQVETAKLETAAIVAQLQG 297

Query: 283 AESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADVAVSKYELWSSLA 342
           AESEV  LR+MT RMILTQKEMEEVVLKRCWLARYWGLA++YGIC+D+A SKYE WSSLA
Sbjct: 298 AESEVNGLRTMTHRMILTQKEMEEVVLKRCWLARYWGLASRYGICSDIATSKYEYWSSLA 357

Query: 343 PLPFEVVVSAGQKAEEECWEKGEDAMEKRSKLVPDLNDLIGEGNIESMLSVEMGLKELAS 402
           PLPFE+V+SAGQKA+EE WEK  +  EKRS+LV D+NDL GEGNIESMLSVEMGLKEL S
Sbjct: 358 PLPFEIVLSAGQKAKEESWEKESEENEKRSQLVQDINDLTGEGNIESMLSVEMGLKELTS 417

Query: 403 LKVEDAIVQALAQQRRPNSARQLVSDIKSPGDPKFMEAFELSPEESEDVLFKEAWLTYFW 462
           LKVE AI   LAQ R  N+ R    ++KSPG PK  E  ELS EESEDVLFKEAWLTYFW
Sbjct: 418 LKVEVAITITLAQLRLANTTRLSDIELKSPGGPKITETLELSQEESEDVLFKEAWLTYFW 477

Query: 463 RRAKAHSIEEDIAKDRLHFWIGRSGHSPTSHDAVDVEQGLSELRKLGIEHRLWEASR 519
           RRA++  IE D A++RL FWI RS HSP+SHDA++VEQGL+ELRKL IE RLWEASR
Sbjct: 478 RRAQSLGIEVDTARERLRFWISRSAHSPSSHDAMEVEQGLTELRKLRIERRLWEASR 534
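
The percentages on the `Identities` line of each alignment can be recomputed from the raw counts. The sketch below is an illustration only; the helper name `parse_stats` is an assumption and not part of any BLAST tool. In this report the printed percentages agree with truncating integer division rather than rounding (for example, 21/537 is about 3.9% but is printed as 3%).

```python
import re

# Matches entries such as "Identities = 307/537 (57%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Return {name: (count, aligned_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }

line = " Identities = 307/537 (57%), Positives = 351/537 (65%), Gaps = 21/537 (3%)"
stats = parse_stats(line)

# Recompute each percentage with truncating integer division.
recomputed = {name: 100 * count // length for name, (count, length, _) in stats.items()}
```

For the line shown, `recomputed` matches the reported values for all three fields.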


>AT3G48860.2 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G23700.1); Has 12429 Blast
           hits to 9751 proteins in 897 species: Archae - 180;
           Bacteria - 1190; Metazoa - 6552; Fungi - 1361; Plants -
           886; Viruses - 50; Other Eukaryotes - 2210 (source: NCBI
           BLink). | chr3:18117619-18121853 FORWARD LENGTH=577
          Length = 577

 Score =  466 bits (1198), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 245/373 (65%), Positives = 291/373 (78%), Gaps = 5/373 (1%)

Query: 154 NKRFPFDTGLVQPKDSGDQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVRVL 213
           +KRF  D   V  K+ GDQR+ASALRDE+DMLQEENE++L+KLR  EE   EAEAR + L
Sbjct: 193 DKRFFADVPSVNSKEKGDQREASALRDELDMLQEENENVLEKLRRAEEKRVEAEARAKEL 252

Query: 214 EKQVASLGEGVSLEAKLLSRKEAALRQREAALKNSRDCRDGVDTEITSLQAEVENAKIET 273
           EKQVASLGEGVSLEAKLLSRKEAALRQREAAL  ++  + G D EI SL++E+EN K E 
Sbjct: 253 EKQVASLGEGVSLEAKLLSRKEAALRQREAALNVAKQKKSGKDEEIVSLRSELENLKDEA 312

Query: 274 EAAVRQLNGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADVAVS 333
             A  +L  AESE K+LR+MTQRMILTQ EMEEVVLKRCWLARYWGLA ++GICAD+A S
Sbjct: 313 TTAAERLQEAESEAKSLRTMTQRMILTQDEMEEVVLKRCWLARYWGLAVQHGICADIAPS 372

Query: 334 KYELWSSLAPLPFEVVVSAGQKAEEECWEKGEDAMEKRSKLVPDLNDLIGEGNIESMLSV 393
           + E WS LAPLPFE+V SA QKA+E  W+KG +    RSK   DL+DL GEGNIESMLSV
Sbjct: 373 RQEHWSKLAPLPFELVTSAAQKAKELSWDKGGN---DRSKAARDLSDLTGEGNIESMLSV 429

Query: 394 EMGLKELASLKVEDAIVQALAQQRRPNSARQLVSDIKSPGDPKFMEAFELSPEESEDVLF 453
           EMGL+ELASLKVEDA+V   AQQR+ +  R  VSD K  G+ +F++A+EL   E EDV F
Sbjct: 430 EMGLRELASLKVEDAVVLIFAQQRKLSLVRHTVSDSKGHGESRFIDAYELGEAEQEDVAF 489

Query: 454 KEAWLTYFWRRAKAHSIEEDIAKDRLHFWIGR-SGHS-PTSHDAVDVEQGLSELRKLGIE 511
           K+AWL YFW RAK H +E+DIA++R+  WI R SG S  TSHDA+DVE+GL ELRKLGIE
Sbjct: 490 KQAWLMYFWGRAKLHGVEDDIAEERVQLWISRSSGKSQTTSHDALDVERGLIELRKLGIE 549

Query: 512 HRLWEASRKEVDQ 524
            +LWEASR+E+DQ
Sbjct: 550 QQLWEASRREIDQ 562


>AT4G25070.2 | Symbols:  | unknown protein; EXPRESSED IN: cultured
           cell; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT3G48860.2); Has 30201 Blast hits
           to 17322 proteins in 780 species: Archae - 12; Bacteria
           - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
           Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr4:12872482-12876468 FORWARD LENGTH=767
          Length = 767

 Score =  411 bits (1057), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 212/364 (58%), Positives = 278/364 (76%), Gaps = 5/364 (1%)

Query: 162 GLVQPKDSG---DQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVRVLEKQVA 218
            ++ P +S    D R+ASALRDE+DMLQEEN++I+DKL+  EE  + AEAR + LEKQVA
Sbjct: 389 NILAPNNSNQQEDDREASALRDELDMLQEENDNIMDKLQRAEERREAAEARAKELEKQVA 448

Query: 219 SLGEGVSLEAKLLSRKEAALRQREAALKNSRDCRDGVDTEITSLQAEVENAKIETEAAVR 278
           SLGEG + + KLL RKEAALRQREAAL+ +   RDG + E  +L +E ++ K E E +  
Sbjct: 449 SLGEGANFDVKLLKRKEAALRQREAALRAAEQKRDGRNRETNALSSEFQSLKDEAEKSTE 508

Query: 279 QLNGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADVAVSKYELW 338
           QL   E+E+K+LR+M  R IL+Q+EMEEVVLKRCWLARYW LA ++GIC D++ S+YE W
Sbjct: 509 QLQEVEAEIKSLRTMIHRTILSQEEMEEVVLKRCWLARYWELAVQHGICEDISTSRYEHW 568

Query: 339 SSLAPLPFEVVVSAGQKAEEECWEKGEDAMEKRSKLVPDLNDLIGEGNIESMLSVEMGLK 398
           S+LAPLP EVV+SA QK+ E+ W+ G  +    SK++ + +DL GEGNIESML+VE GL+
Sbjct: 569 SALAPLPSEVVLSAAQKS-EDSWQTG-GSDRTWSKVISNFSDLNGEGNIESMLAVETGLR 626

Query: 399 ELASLKVEDAIVQALAQQRRPNSARQLVSDIKSPGDPKFMEAFELSPEESEDVLFKEAWL 458
           E+ASLKVEDA++ AL++ R+ N ARQ V+D +  G+PKF E FELS +E +D+LFKEAWL
Sbjct: 627 EIASLKVEDAVMLALSRYRQTNVARQAVTDPRVQGEPKFSETFELSHDEQQDILFKEAWL 686

Query: 459 TYFWRRAKAHSIEEDIAKDRLHFWIGRSGHSPTSHDAVDVEQGLSELRKLGIEHRLWEAS 518
            YFW+RAK H +E DIA++RL FWI R G   +SHDA+DVE+G+ ELRKLGIE +LWE S
Sbjct: 687 LYFWKRAKIHGVESDIAEERLQFWINRLGQHSSSHDAIDVERGMRELRKLGIEQQLWETS 746

Query: 519 RKEV 522
           RKE+
Sbjct: 747 RKEL 750


>AT4G25070.1 | Symbols:  | unknown protein; EXPRESSED IN: cultured
           cell; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT3G48860.2); Has 14837 Blast hits
           to 10961 proteins in 1163 species: Archae - 189;
           Bacteria - 1924; Metazoa - 7665; Fungi - 1127; Plants -
           653; Viruses - 80; Other Eukaryotes - 3199 (source: NCBI
           BLink). | chr4:12872482-12876468 FORWARD LENGTH=765
          Length = 765

 Score =  411 bits (1056), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 214/371 (57%), Positives = 281/371 (75%), Gaps = 6/371 (1%)

Query: 155 KRFPFDTGLVQPKDSG---DQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVR 211
           KR+     ++ P +S    D R+ASALRDE+DMLQEEN++I+DKL+  EE  + AEAR +
Sbjct: 381 KRY-HPANILAPNNSNQQEDDREASALRDELDMLQEENDNIMDKLQRAEERREAAEARAK 439

Query: 212 VLEKQVASLGEGVSLEAKLLSRKEAALRQREAALKNSRDCRDGVDTEITSLQAEVENAKI 271
            LEKQVASLGEG + + KLL RKEAALRQREAAL+ +   RDG + E  +L +E ++ K 
Sbjct: 440 ELEKQVASLGEGANFDVKLLKRKEAALRQREAALRAAEQKRDGRNRETNALSSEFQSLKD 499

Query: 272 ETEAAVRQLNGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADVA 331
           E E +  QL   E+E+K+LR+M  R IL+Q+EMEEVVLKRCWLARYW LA ++GIC D++
Sbjct: 500 EAEKSTEQLQEVEAEIKSLRTMIHRTILSQEEMEEVVLKRCWLARYWELAVQHGICEDIS 559

Query: 332 VSKYELWSSLAPLPFEVVVSAGQKAEEECWEKGEDAMEKRSKLVPDLNDLIGEGNIESML 391
            S+YE WS+LAPLP EVV+SA QK+ E+ W+ G  +    SK++ + +DL GEGNIESML
Sbjct: 560 TSRYEHWSALAPLPSEVVLSAAQKS-EDSWQTG-GSDRTWSKVISNFSDLNGEGNIESML 617

Query: 392 SVEMGLKELASLKVEDAIVQALAQQRRPNSARQLVSDIKSPGDPKFMEAFELSPEESEDV 451
           +VE GL+E+ASLKVEDA++ AL++ R+ N ARQ V+D +  G+PKF E FELS +E +D+
Sbjct: 618 AVETGLREIASLKVEDAVMLALSRYRQTNVARQAVTDPRVQGEPKFSETFELSHDEQQDI 677

Query: 452 LFKEAWLTYFWRRAKAHSIEEDIAKDRLHFWIGRSGHSPTSHDAVDVEQGLSELRKLGIE 511
           LFKEAWL YFW+RAK H +E DIA++RL FWI R G   +SHDA+DVE+G+ ELRKLGIE
Sbjct: 678 LFKEAWLLYFWKRAKIHGVESDIAEERLQFWINRLGQHSSSHDAIDVERGMRELRKLGIE 737

Query: 512 HRLWEASRKEV 522
            +LWE SRKE+
Sbjct: 738 QQLWETSRKEL 748


>AT5G23700.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48860.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:7992851-7996420 FORWARD LENGTH=573
          Length = 573

 Score =  406 bits (1044), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 232/394 (58%), Positives = 275/394 (69%), Gaps = 46/394 (11%)

Query: 163 LVQPKDSGDQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVRVLEKQVASLGE 222
           LV  +D G QR+ASALRDEVDMLQEENE +L+KL   EE  + AEAR R LEKQVASLGE
Sbjct: 176 LVNSRDKGYQREASALRDEVDMLQEENEIVLEKLHRAEEMREAAEARARELEKQVASLGE 235

Query: 223 GVSLEAKLLSRKEAALRQREAALKNSRDCRDGVDTEITSLQAEVENAKIETEAAVRQLNG 282
           GVSLEAKLLSRKEAALRQREAALK + + +DG   E+ SL++E++  K E E A   L  
Sbjct: 236 GVSLEAKLLSRKEAALRQREAALKAANEKKDGKKEEVVSLRSEIQILKDEAETAAECLQE 295

Query: 283 AESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADVAVSKYELWSSLA 342
           AESE KALR MTQRM+LTQ EMEEV LKRCWLARYWGLA ++GICAD+A S++E WS+LA
Sbjct: 296 AESEAKALRIMTQRMVLTQDEMEEVALKRCWLARYWGLAVQHGICADIAPSRHEKWSALA 355

Query: 343 PLPFEVVVSAGQKAEEECWEKGEDAMEKRSKLVPDLNDLIGEGNIESMLSVEMGLKELAS 402
           PLPFE+V+SA QK +++           +SK    L+DL GEGNIESMLSVEMGL+ELAS
Sbjct: 356 PLPFELVISAAQKTKDD-----------QSKTARFLSDLPGEGNIESMLSVEMGLRELAS 404

Query: 403 LKVEDAIVQALAQQRRPNSARQLVSDIKSPGDPKFMEAF--------------------- 441
           LKVEDA++ A AQ+R P+  RQ   D K  G+  F+E++                     
Sbjct: 405 LKVEDAVMLAFAQKRTPSLVRQ---DSKGHGELSFVESYGKRRESKHAQYIISAVKLDEI 461

Query: 442 ----------ELSPEESEDVLFKEAWLTYFWRRAKAHSIEEDIAKDRLHFWIGRS-GHSP 490
                     E+   E EDV FK+AWL YFW RAK HS+EEDIA +R  FW  RS G SP
Sbjct: 462 LTMLSHFSNAEIKEGEQEDVAFKQAWLMYFWGRAKLHSVEEDIADERFQFWTSRSEGKSP 521

Query: 491 TSHDAVDVEQGLSELRKLGIEHRLWEASRKEVDQ 524
           TS DAVDVE+GL ELRKLG+E +LWEA RKE DQ
Sbjct: 522 TSQDAVDVERGLLELRKLGVEQQLWEACRKETDQ 555


>AT3G48860.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G23700.1); Has 12232 Blast
           hits to 9546 proteins in 892 species: Archae - 172;
           Bacteria - 1174; Metazoa - 6487; Fungi - 1343; Plants -
           856; Viruses - 50; Other Eukaryotes - 2150 (source: NCBI
           BLink). | chr3:18117619-18120865 FORWARD LENGTH=494
          Length = 494

 Score =  356 bits (913), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 189/288 (65%), Positives = 224/288 (77%), Gaps = 3/288 (1%)

Query: 154 NKRFPFDTGLVQPKDSGDQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVRVL 213
           +KRF  D   V  K+ GDQR+ASALRDE+DMLQEENE++L+KLR  EE   EAEAR + L
Sbjct: 193 DKRFFADVPSVNSKEKGDQREASALRDELDMLQEENENVLEKLRRAEEKRVEAEARAKEL 252

Query: 214 EKQVASLGEGVSLEAKLLSRKEAALRQREAALKNSRDCRDGVDTEITSLQAEVENAKIET 273
           EKQVASLGEGVSLEAKLLSRKEAALRQREAAL  ++  + G D EI SL++E+EN K E 
Sbjct: 253 EKQVASLGEGVSLEAKLLSRKEAALRQREAALNVAKQKKSGKDEEIVSLRSELENLKDEA 312

Query: 274 EAAVRQLNGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADVAVS 333
             A  +L  AESE K+LR+MTQRMILTQ EMEEVVLKRCWLARYWGLA ++GICAD+A S
Sbjct: 313 TTAAERLQEAESEAKSLRTMTQRMILTQDEMEEVVLKRCWLARYWGLAVQHGICADIAPS 372

Query: 334 KYELWSSLAPLPFEVVVSAGQKAEEECWEKGEDAMEKRSKLVPDLNDLIGEGNIESMLSV 393
           + E WS LAPLPFE+V SA QKA+E  W+KG +    RSK   DL+DL GEGNIESMLSV
Sbjct: 373 RQEHWSKLAPLPFELVTSAAQKAKELSWDKGGN---DRSKAARDLSDLTGEGNIESMLSV 429

Query: 394 EMGLKELASLKVEDAIVQALAQQRRPNSARQLVSDIKSPGDPKFMEAF 441
           EMGL+ELASLKVEDA+V   AQQR+ +  R  VSD K  G+ +F++A+
Sbjct: 430 EMGLRELASLKVEDAVVLIFAQQRKLSLVRHTVSDSKGHGESRFIDAY 477


>AT4G08630.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48860.2); Has 1487 Blast hits to 747 proteins
           in 184 species: Archae - 0; Bacteria - 56; Metazoa -
           305; Fungi - 197; Plants - 180; Viruses - 3; Other
           Eukaryotes - 746 (source: NCBI BLink). |
           chr4:5506998-5511959 REVERSE LENGTH=845
          Length = 845

 Score =  332 bits (850), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 193/431 (44%), Positives = 267/431 (61%), Gaps = 64/431 (14%)

Query: 154 NKRFPFDTGLV-QPKDSGDQRQASALRDEVDMLQEENESILDKLRLEEESCKEAEARVRV 212
           +KRF  D G     ++ G QR  SAL+DEVDMLQEENES+L+KLRL E+ C+EA+AR + 
Sbjct: 418 DKRFSMDLGSSGNLRELGSQRSTSALQDEVDMLQEENESLLEKLRLAEDKCEEADARAKQ 477

Query: 213 LEKQVASLGEGVSLEAKLLSRKEAALRQ--REAALKNSRDCRDGVDTEITSLQAEVENAK 270
           LEKQV  LGEGV+++A+LLSR+ + L    +  +    R C +   T+  S + E  ++ 
Sbjct: 478 LEKQVEILGEGVTMDARLLSRQASVLFNFWKRGSSTTERGCFENCITK--SWRKEGRSSS 535

Query: 271 IETEAAVRQLNGAESEVKALRSMTQRMILTQKEMEEVVLKRCWLARYWGLAAKYGICADV 330
           ++      QL+  E E+ +L+++T+R+ILTQ+EMEEVVLKRCWL+RYWGL  ++GI  D+
Sbjct: 536 LD------QLHEVELELNSLKTVTKRLILTQEEMEEVVLKRCWLSRYWGLCVRHGIQPDI 589

Query: 331 AVSKYELWSSLAPLPFEVVVSAGQKAEE---EC-------------------------W- 361
           A  K+E WSS APLP E+V+SAGQ+A +   +C                         W 
Sbjct: 590 AGGKHEYWSSFAPLPLEIVLSAGQRARDGVSQCNIFHLAAEISLELFGIVLTSLVLTLWS 649

Query: 362 --EKGEDAMEKRSKLVPDLNDLIGEGNIESMLSVEMGLKELASLKVEDAIVQ-------- 411
             +   +   +R K + +L +  GEGN+E+M+ VE GL+ELASLK + +++Q        
Sbjct: 650 PHQAANNTYGEREKSLQNLQETSGEGNLENMIWVEKGLRELASLKNQSSVIQETDLKYDS 709

Query: 412 ------------ALAQQRRPNSARQLVSD-IKSPGDPKFMEAFELSPEESEDVLFKEAWL 458
                        +AQ RR  S++  VSD +K P D +F EAFELS EE EDV FK+AWL
Sbjct: 710 LRCLKVQEAVAFVMAQNRRNTSSKFFVSDEVKMPMDGQF-EAFELSDEEVEDVNFKQAWL 768

Query: 459 TYFWRRAKAHSIEEDIAKDRLHFWIGRSGHSPTSHDAVDVEQGLSELRKLGIEHRLWEAS 518
           +YFWRRAK H IE D+  +RL +WI +   S TS DAVDVE+GL ELRKL IE +LW+ S
Sbjct: 769 SYFWRRAKNHEIESDLVDERLQYWINQGTRSATSQDAVDVERGLMELRKLNIESQLWQKS 828

Query: 519 RKEVDQDHTIS 529
           RK +D +   S
Sbjct: 829 RKGLDHESNPS 839