Miyakogusa Predicted Gene
- Lj0g3v0281249.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0281249.1 tr|Q9SI42|Q9SI42_ARATH At2g13350 OS=Arabidopsis
thaliana GN=At2g13350 PE=2 SV=1,42.96,4e-18,seg,NULL,44481_g.1
(149 letters)
Database: trembl
41,451,118 sequences; 13,208,986,710 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
G7I2P8_MEDTR (tr|G7I2P8) RNA-binding protein 12B OS=Medicago tru... 160 1e-37
K7N4L6_SOYBN (tr|K7N4L6) Uncharacterized protein OS=Glycine max ... 155 4e-36
K7LJ48_SOYBN (tr|K7LJ48) Uncharacterized protein OS=Glycine max ... 145 4e-33
I1NEL6_SOYBN (tr|I1NEL6) Uncharacterized protein OS=Glycine max ... 126 2e-27
G7I6R0_MEDTR (tr|G7I6R0) Putative uncharacterized protein OS=Med... 122 5e-26
M5XQ83_PRUPE (tr|M5XQ83) Uncharacterized protein OS=Prunus persi... 120 2e-25
B9SQS8_RICCO (tr|B9SQS8) Putative uncharacterized protein OS=Ric... 117 2e-24
B9HKV3_POPTR (tr|B9HKV3) Predicted protein OS=Populus trichocarp... 106 3e-21
B9HSU6_POPTR (tr|B9HSU6) Predicted protein OS=Populus trichocarp... 105 5e-21
A5BDH1_VITVI (tr|A5BDH1) Putative uncharacterized protein OS=Vit... 104 1e-20
F6H8S6_VITVI (tr|F6H8S6) Putative uncharacterized protein OS=Vit... 104 1e-20
R0HRX6_9BRAS (tr|R0HRX6) Uncharacterized protein OS=Capsella rub... 98 1e-18
D7LFS6_ARALL (tr|D7LFS6) C2 domain-containing protein OS=Arabido... 98 1e-18
O22783_ARATH (tr|O22783) Calcium-dependent lipid-binding domain-... 97 2e-18
D7KDF9_ARALL (tr|D7KDF9) Putative uncharacterized protein OS=Ara... 96 4e-18
F4I5P7_ARATH (tr|F4I5P7) Calcium-dependent lipid-binding domain-... 96 4e-18
R0IEI3_9BRAS (tr|R0IEI3) Uncharacterized protein OS=Capsella rub... 95 7e-18
M1CV62_SOLTU (tr|M1CV62) Uncharacterized protein OS=Solanum tube... 95 9e-18
M4DZ96_BRARP (tr|M4DZ96) Uncharacterized protein OS=Brassica rap... 94 1e-17
K4DHH0_SOLLC (tr|K4DHH0) Uncharacterized protein OS=Solanum lyco... 94 2e-17
M4DFQ2_BRARP (tr|M4DFQ2) Uncharacterized protein OS=Brassica rap... 88 1e-15
K4BDZ6_SOLLC (tr|K4BDZ6) Uncharacterized protein OS=Solanum lyco... 86 4e-15
M4CMR9_BRARP (tr|M4CMR9) Uncharacterized protein OS=Brassica rap... 83 3e-14
M4CMR8_BRARP (tr|M4CMR8) Uncharacterized protein OS=Brassica rap... 83 3e-14
K4CWB8_SOLLC (tr|K4CWB8) Uncharacterized protein OS=Solanum lyco... 80 3e-13
M0ZMB3_SOLTU (tr|M0ZMB3) Uncharacterized protein OS=Solanum tube... 78 1e-12
M4EUM2_BRARP (tr|M4EUM2) Uncharacterized protein OS=Brassica rap... 78 1e-12
M4D9I7_BRARP (tr|M4D9I7) Uncharacterized protein OS=Brassica rap... 77 3e-12
M1B1G6_SOLTU (tr|M1B1G6) Uncharacterized protein OS=Solanum tube... 76 3e-12
M4F7Z5_BRARP (tr|M4F7Z5) Uncharacterized protein OS=Brassica rap... 76 4e-12
O23030_ARATH (tr|O23030) T1G11.21 protein OS=Arabidopsis thalian... 75 6e-12
Q9SI42_ARATH (tr|Q9SI42) At2g13350 OS=Arabidopsis thaliana GN=AT... 75 1e-11
R0HT42_9BRAS (tr|R0HT42) Uncharacterized protein OS=Capsella rub... 74 1e-11
D7L022_ARALL (tr|D7L022) Putative uncharacterized protein OS=Ara... 74 1e-11
B9DHP7_ARATH (tr|B9DHP7) AT2G13350 protein (Fragment) OS=Arabido... 74 1e-11
B9S7M7_RICCO (tr|B9S7M7) Putative uncharacterized protein OS=Ric... 55 9e-06
>G7I2P8_MEDTR (tr|G7I2P8) RNA-binding protein 12B OS=Medicago truncatula
GN=MTR_1g073210 PE=4 SV=1
Length = 682
Score = 160 bits (406), Expect = 1e-37, Method: Composition-based stats.
Identities = 85/127 (66%), Positives = 91/127 (71%), Gaps = 8/127 (6%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDESVEGLQSKLE 60
MRSNL N+RPV+MTESELGPSPSEVAA MAR+P I+E E STVGGWSLDESVEGLQSKLE
Sbjct: 537 MRSNLANMRPVIMTESELGPSPSEVAAAMARKPIIDE-ENSTVGGWSLDESVEGLQSKLE 595
Query: 61 RWRTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVEC 120
RWRTELPPVID GE SS+P LFSCFS ICGVEC
Sbjct: 596 RWRTELPPVIDHGELSSFP-------TTSSKTSRHSRRHTEGGSGNGLFSCFSNICGVEC 648
Query: 121 SIVCGGD 127
S+VCGGD
Sbjct: 649 SVVCGGD 655
>K7N4L6_SOYBN (tr|K7N4L6) Uncharacterized protein OS=Glycine max PE=4 SV=1
Length = 642
Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats.
Identities = 83/130 (63%), Positives = 91/130 (70%), Gaps = 6/130 (4%)
Query: 2 RSNLGNIRPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLD-ESVEGLQSKLE 60
RSNL N+ VV+TESELGPSPSEVAA +AR+P I+EGE STVGGWSLD ESVEGLQSKLE
Sbjct: 498 RSNLANMGHVVITESELGPSPSEVAAAIARKPVIDEGENSTVGGWSLDAESVEGLQSKLE 557
Query: 61 RWRTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVEC 120
RWRT+LPPV+DRGE SSYP LFSCFS ICGVEC
Sbjct: 558 RWRTDLPPVVDRGEVSSYP-----TTSTTKTSRHSRRHTDGGSTGSGLFSCFSNICGVEC 612
Query: 121 SIVCGGDPKG 130
+ CGGDPKG
Sbjct: 613 YVGCGGDPKG 622
>K7LJ48_SOYBN (tr|K7LJ48) Uncharacterized protein OS=Glycine max PE=4 SV=1
Length = 657
Score = 145 bits (367), Expect = 4e-33, Method: Composition-based stats.
Identities = 78/130 (60%), Positives = 89/130 (68%), Gaps = 2/130 (1%)
Query: 2 RSNLGNIRPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLD-ESVEGLQSKLE 60
RSNL N+ VV+TESELGPS SEVAAV+A++P I+E E STVGGWSLD ES+EGL+SKLE
Sbjct: 508 RSNLANMGHVVITESELGPSASEVAAVIAQKPVIDEAENSTVGGWSLDAESMEGLESKLE 567
Query: 61 RWRTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVEC 120
RW+T+LPPVID GE SSYP LFSCFS ICGVEC
Sbjct: 568 RWQTKLPPVIDHGELSSYP-TTSTTKTSRHSRRHTDGGSIGSGSGSGLFSCFSNICGVEC 626
Query: 121 SIVCGGDPKG 130
+ CGGDPKG
Sbjct: 627 YVGCGGDPKG 636
>I1NEL6_SOYBN (tr|I1NEL6) Uncharacterized protein OS=Glycine max PE=4 SV=2
Length = 449
Score = 126 bits (317), Expect = 2e-27, Method: Composition-based stats.
Identities = 75/125 (60%), Positives = 81/125 (64%), Gaps = 13/125 (10%)
Query: 2 RSNLGNIRPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDESVEGLQSKLER 61
RSNLG RP MT+SELGPS SEVAAV+AR P IEEGE STVGGWSLD+SVEGLQ K+ER
Sbjct: 317 RSNLGR-RPF-MTDSELGPSASEVAAVVARLP-IEEGENSTVGGWSLDDSVEGLQPKVER 373
Query: 62 WRTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECS 121
W+TELPPV D E S+ LFSCFS ICGVECS
Sbjct: 374 WQTELPPVYDGSERSN----------MTTSSKKGKHSRRQTDGGNGLFSCFSVICGVECS 423
Query: 122 IVCGG 126
IVCGG
Sbjct: 424 IVCGG 428
>G7I6R0_MEDTR (tr|G7I6R0) Putative uncharacterized protein OS=Medicago truncatula
GN=MTR_1g045780 PE=4 SV=1
Length = 459
Score = 122 bits (305), Expect = 5e-26, Method: Composition-based stats.
Identities = 66/128 (51%), Positives = 80/128 (62%), Gaps = 3/128 (2%)
Query: 2 RSNLGNIRPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDESVEGLQSKLER 61
R+N G+ RP++ T+SELGPS SEVA +AR+P ++EGE S V GWSL+ESVE LQ K+ER
Sbjct: 318 RTNKGHHRPII-TDSELGPSASEVAEAVARQPVMDEGESSIVTGWSLNESVEDLQPKIER 376
Query: 62 WRTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECS 121
W+T+L PV D E SS P LFSCFS ICG+ECS
Sbjct: 377 WQTDLAPVHDGREMSSKP--TTSSKKKDKHSRRRTNGGGGDGGDNGLFSCFSVICGLECS 434
Query: 122 IVCGGDPK 129
IVCGGD K
Sbjct: 435 IVCGGDKK 442
>M5XQ83_PRUPE (tr|M5XQ83) Uncharacterized protein OS=Prunus persica
GN=PRUPE_ppa026634mg PE=4 SV=1
Length = 492
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/124 (55%), Positives = 79/124 (63%), Gaps = 15/124 (12%)
Query: 7 NIRPV-VMTESELGPSPSEVAAVMARRPRIEEGEISTV-GGWSLDESVEGLQSKLERWRT 64
NI PV +TESELGPSPSEVAA +A+ ++ E S V G W+ ++SVEGLQSKLERWRT
Sbjct: 352 NIMPVPFITESELGPSPSEVAAAIAKERLDQDAESSVVVGAWNEEDSVEGLQSKLERWRT 411
Query: 65 ELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVC 124
ELPPV DRGEFSS+P LFSCFS ICG+ECSIVC
Sbjct: 412 ELPPVYDRGEFSSFP------------SSDERHERRHSDGGSGLFSCFSNICGIECSIVC 459
Query: 125 G-GD 127
G GD
Sbjct: 460 GSGD 463
>B9SQS8_RICCO (tr|B9SQS8) Putative uncharacterized protein OS=Ricinus communis
GN=RCOM_1217340 PE=4 SV=1
Length = 472
Score = 117 bits (292), Expect = 2e-24, Method: Composition-based stats.
Identities = 62/124 (50%), Positives = 71/124 (57%), Gaps = 11/124 (8%)
Query: 6 GNIRPVVMTESELGPSPSEVAAVMARRP---RIEEGEISTVGGWSLDESVEGLQSKLERW 62
G P+ +TESELGPS SEVAA+M ++EE E +G WSL+ S+EGLQSKLERW
Sbjct: 334 GRTAPLHITESELGPSASEVAAIMVNNKNQYKVEEAESEIMGSWSLESSMEGLQSKLERW 393
Query: 63 RTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSI 122
R ELPPV DR E SSYP FSCF T CG+ECSI
Sbjct: 394 RAELPPVYDRSELSSYPISSVAAGGNRHNRRRSDGDGA--------FSCFGTFCGMECSI 445
Query: 123 VCGG 126
VCGG
Sbjct: 446 VCGG 449
>B9HKV3_POPTR (tr|B9HKV3) Predicted protein OS=Populus trichocarpa
GN=POPTRDRAFT_766502 PE=2 SV=1
Length = 492
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/123 (48%), Positives = 70/123 (56%), Gaps = 7/123 (5%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRP--RIEEGEISTVGGWSLDESVEGLQSKLERWRTEL 66
RP + T+SELGPSPSEVAAVM R+ R E E +G SLD S+EGLQSKLERWR EL
Sbjct: 355 RPAI-TDSELGPSPSEVAAVMTRKKNRRFVEIESEIMGVMSLDGSMEGLQSKLERWRAEL 413
Query: 67 PPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCGG 126
PPV D + SS+P F+CF CG+ECSIVCGG
Sbjct: 414 PPVYDASDISSFPASSTSKESKIVKQHSRRHSADDDGT----FTCFGRFCGLECSIVCGG 469
Query: 127 DPK 129
P+
Sbjct: 470 PPR 472
>B9HSU6_POPTR (tr|B9HSU6) Predicted protein OS=Populus trichocarpa
GN=POPTRDRAFT_565827 PE=4 SV=1
Length = 460
Score = 105 bits (262), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/120 (49%), Positives = 68/120 (56%), Gaps = 7/120 (5%)
Query: 12 VMTESELGPSPSEVAAVMARRP--RIEEGEISTVGGWSLDESVEGLQSKLERWRTELPPV 69
V+TES+LGPS SEVAAV+AR R+EE E +G SLD S+E LQSKLERWRTELPP
Sbjct: 326 VITESQLGPSASEVAAVIARNKHRRVEETESEIIGEMSLDGSMEALQSKLERWRTELPPA 385
Query: 70 IDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCGGDPK 129
D SS+P FSCF CG+ECSIVCGG P+
Sbjct: 386 YDASNISSFPTSGTSKGGKVVKRHNHKHTDDDGT-----FSCFGRFCGLECSIVCGGPPR 440
>A5BDH1_VITVI (tr|A5BDH1) Putative uncharacterized protein OS=Vitis vinifera
GN=VITISV_026570 PE=4 SV=1
Length = 494
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 71/120 (59%), Gaps = 11/120 (9%)
Query: 12 VMTESELGPSPSEVAAVMA--RRPRIEEGEISTVGGWSLDESVEGLQSKLERWRTELPPV 69
+++ESE+GPSPSEVAA +A R + E+G S + GWSL+ S EGL+SKL+RWRTELPP+
Sbjct: 379 MLSESEVGPSPSEVAAAIAHDRCRQAEDGNNSALDGWSLNSSEEGLRSKLQRWRTELPPL 438
Query: 70 IDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCGGDPK 129
DRG + Y LFSCF ICG ECSIVCGG+P
Sbjct: 439 YDRGAYGIY---------RTPGGHVRRHTEGDEPDGSGLFSCFGNICGYECSIVCGGNPN 489
>F6H8S6_VITVI (tr|F6H8S6) Putative uncharacterized protein OS=Vitis vinifera
GN=VIT_05s0049g01960 PE=4 SV=1
Length = 494
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/120 (47%), Positives = 71/120 (59%), Gaps = 11/120 (9%)
Query: 12 VMTESELGPSPSEVAAVMA--RRPRIEEGEISTVGGWSLDESVEGLQSKLERWRTELPPV 69
+++ESE+GPSPSEVAA +A R + E+G S + GWSL+ S EGL+SKL+RWRTELPP+
Sbjct: 379 MLSESEVGPSPSEVAAAIAHDRCRQAEDGNNSALDGWSLNSSEEGLRSKLQRWRTELPPL 438
Query: 70 IDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCGGDPK 129
DRG + Y LFSCF ICG ECSIVCGG+P
Sbjct: 439 YDRGAYGIY---------RTPGGHVRRHTEGDEPDGSGLFSCFGNICGYECSIVCGGNPN 489
>R0HRX6_9BRAS (tr|R0HRX6) Uncharacterized protein OS=Capsella rubella
GN=CARUB_v10022860mg PE=4 SV=1
Length = 597
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 79/148 (53%), Gaps = 22/148 (14%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE--SVEGLQS 57
MRSNL RP+ +TESELGPSPSEVA MA+ R E E S + WSLD+ ++EGL+S
Sbjct: 432 MRSNLAG-RPI-LTESELGPSPSEVAQKMAKERSLANETESSILSEWSLDDDSNMEGLRS 489
Query: 58 KLERWRTELPPVIDRGEFSSYPXXXX-------------XXXXXXXXXXXXXXXXXXXXX 104
KLERWRTELPP+ D G SS+
Sbjct: 490 KLERWRTELPPLYDLG--SSHQSSDVGSGAIVVANAGGGKSSRKKTPAVKKKHNRRHTEG 547
Query: 105 XXXLFSCFSTICGVECSIVCGG--DPKG 130
LFSCFS +CGVEC+ VCGG DP G
Sbjct: 548 GNGLFSCFSNLCGVECTFVCGGGSDPDG 575
>D7LFS6_ARALL (tr|D7LFS6) C2 domain-containing protein OS=Arabidopsis lyrata
subsp. lyrata GN=ARALYDRAFT_902404 PE=4 SV=1
Length = 595
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/142 (45%), Positives = 77/142 (54%), Gaps = 20/142 (14%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE--SVEGLQS 57
MRSNL RP+ +TESELGPSPSEVA MA+ R + E E S + WSLD+ ++EGL+S
Sbjct: 430 MRSNLAG-RPI-LTESELGPSPSEVAQKMAKERSQANETESSILSEWSLDDDSNIEGLRS 487
Query: 58 KLERWRTELPPVIDRGEFSSYPXXXX-------------XXXXXXXXXXXXXXXXXXXXX 104
KLERWRTELPP+ D G SS+
Sbjct: 488 KLERWRTELPPLYDLG--SSHQSSDVGSGAIVVANVGGGKSSRKKTPVVKKKHNRRHTEG 545
Query: 105 XXXLFSCFSTICGVECSIVCGG 126
LFSCFS +CGVEC+ VCGG
Sbjct: 546 GNGLFSCFSNLCGVECTFVCGG 567
>O22783_ARATH (tr|O22783) Calcium-dependent lipid-binding domain-containing
protein OS=Arabidopsis thaliana GN=AT2G33320 PE=4 SV=1
Length = 602
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 77/143 (53%), Gaps = 21/143 (14%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE--SVEGLQS 57
MRSNL RPV +TESELGPSPSEVA MA+ R + E E S + WSLD+ ++EGL+S
Sbjct: 436 MRSNLAG-RPV-LTESELGPSPSEVAQKMAKERSQAYETESSILSEWSLDDDSNIEGLRS 493
Query: 58 KLERWRTELPPVIDRGEFSSYPXXX--------------XXXXXXXXXXXXXXXXXXXXX 103
KLERWRTELPP+ D G SS+
Sbjct: 494 KLERWRTELPPLYDLG--SSHQSSDVGSGAMVVANVGGGKSSRKKTPAVKKKHNRRHTEG 551
Query: 104 XXXXLFSCFSTICGVECSIVCGG 126
LFSCFS +CGVEC+ VCGG
Sbjct: 552 GGNGLFSCFSNLCGVECTFVCGG 574
>D7KDF9_ARALL (tr|D7KDF9) Putative uncharacterized protein OS=Arabidopsis lyrata
subsp. lyrata GN=ARALYDRAFT_470434 PE=4 SV=1
Length = 577
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/127 (46%), Positives = 70/127 (55%), Gaps = 13/127 (10%)
Query: 12 VMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDES-VEGLQSKLERWRTELPPVI 70
++TESELGPSPSEVA +A+ R E E S + WS+DES +EGL+SKLERWRTELPP+
Sbjct: 421 ILTESELGPSPSEVADKLAK-DRSHETESSILSEWSIDESSIEGLRSKLERWRTELPPLY 479
Query: 71 DRG---------EFSSYPXXXX--XXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVE 119
D G + +S P LFSCFS ICGVE
Sbjct: 480 DIGSSHISSSNYDGASVPAATAGGGMSSRRKTPTAKKHNRRHTDGGNGLFSCFSKICGVE 539
Query: 120 CSIVCGG 126
CS VCGG
Sbjct: 540 CSFVCGG 546
>F4I5P7_ARATH (tr|F4I5P7) Calcium-dependent lipid-binding domain-containing
protein OS=Arabidopsis thaliana GN=AT1G04540 PE=2 SV=1
Length = 601
Score = 95.9 bits (237), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/127 (46%), Positives = 70/127 (55%), Gaps = 13/127 (10%)
Query: 12 VMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDES-VEGLQSKLERWRTELPPVI 70
++TESELGPSPSEVA +A+ R E E S + WS+DES +EGL+SKLERWRTELPP+
Sbjct: 445 ILTESELGPSPSEVADKLAK-DRSHETESSILSEWSIDESSIEGLRSKLERWRTELPPLY 503
Query: 71 DRG---------EFSSYPXXXX--XXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVE 119
D G + +S P LFSCFS ICGVE
Sbjct: 504 DIGSSHISSTDYDGASVPAATAGGGMSSRRKTPTTKKHNRRHTDGGNGLFSCFSKICGVE 563
Query: 120 CSIVCGG 126
CS VCGG
Sbjct: 564 CSFVCGG 570
>R0IEI3_9BRAS (tr|R0IEI3) Uncharacterized protein OS=Capsella rubella
GN=CARUB_v10011922mg PE=4 SV=1
Length = 600
Score = 95.1 bits (235), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 59/130 (45%), Positives = 70/130 (53%), Gaps = 13/130 (10%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDES-VEGLQSKLERWRTELP 67
R ++TESELGPSPSEVA +A+ + E E S + WS+DES +EGL+SKLERWRTELP
Sbjct: 442 RRRILTESELGPSPSEVAEKLAK-DKSHETESSILSEWSIDESSIEGLRSKLERWRTELP 500
Query: 68 PVIDRG----EFSSYPXXXXXXXXX-------XXXXXXXXXXXXXXXXXXXLFSCFSTIC 116
P+ D G S+Y LFSCFS IC
Sbjct: 501 PLYDIGSSHISSSNYDGASVHVATAGGGMSSRRKTPATKKHNRRHTDGGNGLFSCFSKIC 560
Query: 117 GVECSIVCGG 126
GVECS VCGG
Sbjct: 561 GVECSFVCGG 570
>M1CV62_SOLTU (tr|M1CV62) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG400029340 PE=4 SV=1
Length = 524
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 65/115 (56%), Gaps = 13/115 (11%)
Query: 12 VMTESELGPSPSEVAAVMARRPR-IEEGEISTVGGWSLDESVEGLQSKLERWRTELPPVI 70
V ++SE+GPSPSEVAA +A + +EE + S + GWS+DESVEGL+SKLERWRTELPP+
Sbjct: 400 VWSDSEVGPSPSEVAAAIAEKKYPLEEEKSSVLDGWSIDESVEGLRSKLERWRTELPPLY 459
Query: 71 DRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCG 125
DRG SS LFSCF G EC VCG
Sbjct: 460 DRGMASS------------SYHSTGRHTRRHTDGGSGLFSCFGNFYGYECQCVCG 502
>M4DZ96_BRARP (tr|M4DZ96) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra021843 PE=4 SV=1
Length = 611
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 74/143 (51%), Gaps = 19/143 (13%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE--SVEGLQS 57
MRSNL +++TESELGPSPSEVA MA+ R + + E S + WSLD+ ++EGL+S
Sbjct: 443 MRSNLAGR--LILTESELGPSPSEVANKMAKERSQANDTESSILSEWSLDDDSNIEGLRS 500
Query: 58 KLERWRTELPPVIDRGE--------------FSSYPXXXXXXXXXXXXXXXXXXXXXXXX 103
KLERWRTELPP+ D G +S
Sbjct: 501 KLERWRTELPPLYDLGSSHQSSDVGSAIVPASASAGGGKISRRKTPTVKKKKKHQRRHTE 560
Query: 104 XXXXLFSCFSTICGVECSIVCGG 126
LFSCFS ICG ECS VCGG
Sbjct: 561 GGNGLFSCFSNICGAECSFVCGG 583
>K4DHH0_SOLLC (tr|K4DHH0) Uncharacterized protein OS=Solanum lycopersicum
GN=Solyc12g096970.1 PE=4 SV=1
Length = 534
Score = 93.6 bits (231), Expect = 2e-17, Method: Composition-based stats.
Identities = 54/115 (46%), Positives = 65/115 (56%), Gaps = 13/115 (11%)
Query: 12 VMTESELGPSPSEVAAVMARRPR-IEEGEISTVGGWSLDESVEGLQSKLERWRTELPPVI 70
V ++SE+GPSPSEVAA +A + +EE + S + GWS+DESVEGL+SKLERWRTELPP+
Sbjct: 410 VWSDSEVGPSPSEVAAAIAEKKYPLEEEKSSVLDGWSIDESVEGLRSKLERWRTELPPLY 469
Query: 71 DRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCG 125
DRG SS LFSCF G EC VCG
Sbjct: 470 DRGMASS------------SYHSTGRHTRRHTDGGSGLFSCFGNFYGYECQCVCG 512
>M4DFQ2_BRARP (tr|M4DFQ2) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra015325 PE=4 SV=1
Length = 596
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/132 (43%), Positives = 70/132 (53%), Gaps = 12/132 (9%)
Query: 3 SNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE-SVEGLQSKLE 60
SNL R ++T+SELGPSPSE+A +A+ R + E S + WS+DE SVEGL+SKLE
Sbjct: 434 SNLAGRR--ILTDSELGPSPSEIAEQLAKNRSHANDTESSILSEWSIDETSVEGLRSKLE 491
Query: 61 RWRTELPPVIDRG--EFSSYPXXXXX------XXXXXXXXXXXXXXXXXXXXXXXLFSCF 112
RWRTELPP+ D G + SS LFSCF
Sbjct: 492 RWRTELPPLYDIGSSQVSSTEYDGSTIVPAGGRSSRRKTPAVKKHSRRHTEGGNGLFSCF 551
Query: 113 STICGVECSIVC 124
S ICGVECS C
Sbjct: 552 SKICGVECSFAC 563
>K4BDZ6_SOLLC (tr|K4BDZ6) Uncharacterized protein OS=Solanum lycopersicum
GN=Solyc03g005720.2 PE=4 SV=1
Length = 485
Score = 85.9 bits (211), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 63/118 (53%), Gaps = 13/118 (11%)
Query: 13 MTESELGPSPSEVAAVMARRPRIEEGEISTV-GGWSLDESVEGLQSKLERWRTELPPVID 71
+TESE+GPS S VAA + R + + + S+V GWS+DES EGL+SKLERWR E+PPV +
Sbjct: 361 LTESEIGPSASIVAAALVERGYLLDDKRSSVLDGWSIDESTEGLRSKLERWRNEIPPVQN 420
Query: 72 RGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCGGDPK 129
RG SS LFSCF ICG EC +CG K
Sbjct: 421 RGTGSS------------SYRSTGRHPRRRSSGGSSLFSCFGNICGYECQCMCGKPKK 466
>M4CMR9_BRARP (tr|M4CMR9) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra005507 PE=4 SV=1
Length = 271
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 72/145 (49%), Gaps = 23/145 (15%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE--SVEGLQS 57
MRSNL +++TESELGPS S V ++ R + + E S + WSLD+ ++EGL+S
Sbjct: 102 MRSNLAGR--LILTESELGPSSSGVTNQKSKERSQANDTESSILSEWSLDDDSNIEGLRS 159
Query: 58 KLERWRTELPPVIDRGEFSSYPXX----------------XXXXXXXXXXXXXXXXXXXX 101
KLERWRTELPP+ D G SS+
Sbjct: 160 KLERWRTELPPLYDLG--SSHQSSDVGREIVPVSANGGGGKSSRRKTPTAKKKKKHNRRH 217
Query: 102 XXXXXXLFSCFSTICGVECSIVCGG 126
LFSCFS +CGVEC+ VCGG
Sbjct: 218 TEGGNGLFSCFSNLCGVECTFVCGG 242
>M4CMR8_BRARP (tr|M4CMR8) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra005506 PE=4 SV=1
Length = 561
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 72/145 (49%), Gaps = 23/145 (15%)
Query: 1 MRSNLGNIRPVVMTESELGPSPSEVAAVMAR-RPRIEEGEISTVGGWSLDE--SVEGLQS 57
MRSNL +++TESELGPS S V ++ R + + E S + WSLD+ ++EGL+S
Sbjct: 392 MRSNLAGR--LILTESELGPSSSGVTNQKSKERSQANDTESSILSEWSLDDDSNIEGLRS 449
Query: 58 KLERWRTELPPVIDRGEFSSYPXX----------------XXXXXXXXXXXXXXXXXXXX 101
KLERWRTELPP+ D G SS+
Sbjct: 450 KLERWRTELPPLYDLG--SSHQSSDVGREIVPVSANGGGGKSSRRKTPTAKKKKKHNRRH 507
Query: 102 XXXXXXLFSCFSTICGVECSIVCGG 126
LFSCFS +CGVEC+ VCGG
Sbjct: 508 TEGGNGLFSCFSNLCGVECTFVCGG 532
>K4CWB8_SOLLC (tr|K4CWB8) Uncharacterized protein OS=Solanum lycopersicum
GN=Solyc09g090920.2 PE=4 SV=1
Length = 521
Score = 79.7 bits (195), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/113 (46%), Positives = 65/113 (57%), Gaps = 13/113 (11%)
Query: 14 TESELGPSPSEVAAVMARRPR-IEEGEISTVGGWSLDESVEGLQSKLERWRTELPPVIDR 72
T+SE+GPS SEVAA +A + +++ + S + GWSLDESVEGL+SKLERWRTE+PPV DR
Sbjct: 400 TDSEIGPSASEVAAAVAEKKYPLDDQKSSMLDGWSLDESVEGLRSKLERWRTEVPPVYDR 459
Query: 73 GEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCG 125
G+ SS LFSCF I G EC +CG
Sbjct: 460 GQASS------------SYRSTGRHARRHARGSSGLFSCFGNIMGFECQCICG 500
>M0ZMB3_SOLTU (tr|M0ZMB3) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG400001497 PE=4 SV=1
Length = 521
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 53/117 (45%), Positives = 66/117 (56%), Gaps = 13/117 (11%)
Query: 14 TESELGPSPSEVAAVMA-RRPRIEEGEISTVGGWSLDESVEGLQSKLERWRTELPPVIDR 72
T+SE+GPS SEVAA +A ++ +++ + S + GWSLDESVEGL+SKLERWRTE+PPV DR
Sbjct: 400 TDSEIGPSASEVAAAVAEKKYPLDDQKSSVLDGWSLDESVEGLRSKLERWRTEVPPVYDR 459
Query: 73 GEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCGGDPK 129
G SS LFSCF I G EC +CG K
Sbjct: 460 GHASS------------SYHSTGRHGRRHARGSSGLFSCFGNIMGYECQCICGKPQK 504
>M4EUM2_BRARP (tr|M4EUM2) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra032504 PE=4 SV=1
Length = 593
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 65/132 (49%), Gaps = 17/132 (12%)
Query: 3 SNLGNIRPVVMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDES-VEGLQSKLER 61
SNL R ++T+SELGPS SEVA + + E E S + WS+D+S +EG +SKLE
Sbjct: 437 SNLAGRR--ILTDSELGPSSSEVA-----KNKSHETESSILSDWSVDDSSIEGARSKLEM 489
Query: 62 WRTELPPVIDRG---------EFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCF 112
WRTELPP+ D G + S LFSCF
Sbjct: 490 WRTELPPLYDIGSSQVSSTDYDGSVVYAANGGRSSRRKTPAAKNANRRHSSEGNGLFSCF 549
Query: 113 STICGVECSIVC 124
S ICGVECS VC
Sbjct: 550 SKICGVECSFVC 561
>M4D9I7_BRARP (tr|M4D9I7) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra013147 PE=4 SV=1
Length = 378
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 65/142 (45%), Gaps = 30/142 (21%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRP----RIEEGEISTVGGWSLDESVEGLQSKLERWRT 64
RPVV+TES+LGPS S VAA +A+ R E + +VG SVEGL+SKLERW+
Sbjct: 231 RPVVITESDLGPSASVVAAQIAKEKALTGRDAESTVISVGA----RSVEGLRSKLERWQA 286
Query: 65 ELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXX----------------- 107
LP V+D G SSY
Sbjct: 287 NLPVVLDVG--SSYQPSSDYKTSSNFNPKSSYKPNEAVPRNQQMIVAPPQKQGGTKKKGG 344
Query: 108 ---LFSCFSTICGVECSIVCGG 126
LFSCF ICG+ECSIVCGG
Sbjct: 345 DNGLFSCFGNICGIECSIVCGG 366
>M1B1G6_SOLTU (tr|M1B1G6) Uncharacterized protein OS=Solanum tuberosum
GN=PGSC0003DMG400013409 PE=4 SV=1
Length = 482
Score = 76.3 bits (186), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 62/114 (54%), Gaps = 13/114 (11%)
Query: 13 MTESELGPSPSEVAAVMARRPR-IEEGEISTVGGWSLDESVEGLQSKLERWRTELPPVID 71
+TESE+GPS S VAA +A R +++ S + GWS+DES EGL+SKLERWR E+PP+ +
Sbjct: 358 LTESEIGPSASVVAAALAERGYPLDDKRSSVLEGWSIDESTEGLKSKLERWRNEIPPIQN 417
Query: 72 RGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFSCFSTICGVECSIVCG 125
RG SS LFSCF ICG EC +CG
Sbjct: 418 RGTGSS------------SYHSTGRHTRRHSSGGSSLFSCFGNICGYECQCMCG 459
>M4F7Z5_BRARP (tr|M4F7Z5) Uncharacterized protein OS=Brassica rapa subsp.
pekinensis GN=Bra037206 PE=4 SV=1
Length = 407
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 66/142 (46%), Gaps = 30/142 (21%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRP----RIEEGEISTVGGWSLDESVEGLQSKLERWRT 64
RPVV+TES+LGPS S VAA +A+ R E + +VG + SVEGL+SKLERW+
Sbjct: 258 RPVVITESDLGPSASVVAAQIAKEKALTGRDAESTVISVG----ERSVEGLRSKLERWQA 313
Query: 65 ELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXX----------------- 107
LP V+D G SSY
Sbjct: 314 NLPVVLDVG--SSYQPSSDYKTSSNFKPKSSYKPNETVPRNQQMIVAPLPKQGGRKKKGG 371
Query: 108 ---LFSCFSTICGVECSIVCGG 126
LFSCF ICG+ECSIVCGG
Sbjct: 372 DNGLFSCFGNICGIECSIVCGG 393
>O23030_ARATH (tr|O23030) T1G11.21 protein OS=Arabidopsis thaliana GN=T1G11.21
PE=2 SV=1
Length = 578
Score = 75.5 bits (184), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/63 (63%), Positives = 49/63 (77%), Gaps = 2/63 (3%)
Query: 12 VMTESELGPSPSEVAAVMARRPRIEEGEISTVGGWSLDES-VEGLQSKLERWRTELPPVI 70
++TESELGPSPSEVA +A+ R E E S + WS+DES +EGL+SKLERWRTELPP+
Sbjct: 442 ILTESELGPSPSEVADKLAK-DRSHETESSILSEWSIDESSIEGLRSKLERWRTELPPLY 500
Query: 71 DRG 73
D G
Sbjct: 501 DIG 503
>Q9SI42_ARATH (tr|Q9SI42) At2g13350 OS=Arabidopsis thaliana GN=AT2G13350 PE=2
SV=1
Length = 401
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 66/142 (46%), Gaps = 30/142 (21%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRPRI----EEGEISTVGGWSLDESVEGLQSKLERWRT 64
RP+V+TES+LGPS S VAA +A+ + E + +VG + SVEGL+SKLERW+
Sbjct: 252 RPIVITESDLGPSASVVAAQIAKEKALTGKDAESTVISVG----ERSVEGLRSKLERWQA 307
Query: 65 ELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXX--------------------XXXXX 104
LP V+D G SSY
Sbjct: 308 NLPVVLDVG--SSYQPSSDYKTNSNFNPKSSYKPNEIVPRNPQMIGAPIQKPSGRNKKSG 365
Query: 105 XXXLFSCFSTICGVECSIVCGG 126
LFSCF ICG+ECSIVCGG
Sbjct: 366 DNGLFSCFGNICGIECSIVCGG 387
>R0HT42_9BRAS (tr|R0HT42) Uncharacterized protein OS=Capsella rubella
GN=CARUB_v10016294mg PE=4 SV=1
Length = 404
Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 66/143 (46%), Gaps = 31/143 (21%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRPRI-----EEGEISTVGGWSLDESVEGLQSKLERWR 63
RP+V+TES+LGPS S VAA +A+ + E + +VG + SVEGL+SKLERW+
Sbjct: 253 RPIVITESDLGPSASVVAAQIAKEKALTGKLDAESTVISVG----ERSVEGLRSKLERWQ 308
Query: 64 TELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXXXXXXXXXX---------------- 107
LP V+D G SSY
Sbjct: 309 ANLPVVVDVG--SSYQPSSDYKTNSFNVPKSSYKPNEIVPRNTQMIVPLPKQQGGRNKKG 366
Query: 108 ----LFSCFSTICGVECSIVCGG 126
LFSCF ICG+ECSIVCGG
Sbjct: 367 GDNGLFSCFGNICGIECSIVCGG 389
>D7L022_ARALL (tr|D7L022) Putative uncharacterized protein OS=Arabidopsis lyrata
subsp. lyrata GN=ARALYDRAFT_480303 PE=4 SV=1
Length = 402
Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 66/142 (46%), Gaps = 30/142 (21%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRPRI----EEGEISTVGGWSLDESVEGLQSKLERWRT 64
RP+V+TES+LGPS S VAA +A+ + E + +VG + SVEGL+SKLERW+
Sbjct: 253 RPIVITESDLGPSASVVAAQIAKEKALTGKDAESTVISVG----ERSVEGLRSKLERWQA 308
Query: 65 ELPPVIDRGEFSSYPXXXXXXXXXXXXXX--------------------XXXXXXXXXXX 104
LP V+D G SSY
Sbjct: 309 NLPVVLDVG--SSYQPSSDYKTNSNFNPKSSYKPNEIVPRNPQMIGAPIQKQSGRNKKGG 366
Query: 105 XXXLFSCFSTICGVECSIVCGG 126
LFSCF ICG+ECSIVCGG
Sbjct: 367 DNGLFSCFGNICGIECSIVCGG 388
>B9DHP7_ARATH (tr|B9DHP7) AT2G13350 protein (Fragment) OS=Arabidopsis thaliana
GN=AT2G13350 PE=2 SV=1
Length = 224
Score = 74.3 bits (181), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 66/142 (46%), Gaps = 30/142 (21%)
Query: 9 RPVVMTESELGPSPSEVAAVMARRPRI----EEGEISTVGGWSLDESVEGLQSKLERWRT 64
RP+V+TES+LGPS S VAA +A+ + E + +VG + SVEGL+SKLERW+
Sbjct: 75 RPIVITESDLGPSASVVAAQIAKEKALTGKDAESTVISVG----ERSVEGLRSKLERWQA 130
Query: 65 ELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXXXX--------------------XXXXX 104
LP V+D G SSY
Sbjct: 131 NLPVVLDVG--SSYQPSSDYKTNSNFNPKSSYKPNEIVPRNPQVIGAPIQKPSGRNKKSG 188
Query: 105 XXXLFSCFSTICGVECSIVCGG 126
LFSCF ICG+ECSIVCGG
Sbjct: 189 DNGLFSCFGNICGIECSIVCGG 210
>B9S7M7_RICCO (tr|B9S7M7) Putative uncharacterized protein OS=Ricinus communis
GN=RCOM_0609800 PE=4 SV=1
Length = 380
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 39/89 (43%), Gaps = 12/89 (13%)
Query: 38 GEISTVGGWSLDESVEGLQSKLERWRTELPPVIDRGEFSSYPXXXXXXXXXXXXXXXXXX 97
G S + W+ ++SVEGL++KLERWRTELPP+ D
Sbjct: 281 GGSSIIDDWTENDSVEGLRTKLERWRTELPPIYDSNAKKM------------KSKSRRKQ 328
Query: 98 XXXXXXXXXXLFSCFSTICGVECSIVCGG 126
LF+CF G E SI CGG
Sbjct: 329 HHRRRSDNPGLFTCFGNAFGCEISITCGG 357