
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140773.5 - phase: 0 /pseudo
(194 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAW41048.1| hypothetical protein CNA04750 [Cryptococcus neofo... 39 0.10
gb|EAL23342.1| hypothetical protein CNBA4580 [Cryptococcus neofo... 39 0.10
gb|AAB94189.2| Hypothetical protein W03F8.2 [Caenorhabditis eleg... 36 0.67
gb|AAK82166.1| 305L [Chilo iridescent virus] gi|15079017|ref|NP_... 36 0.67
ref|NP_701782.1| hypothetical protein PFL2110c [Plasmodium falci... 35 1.1
ref|NP_070947.1| hypothetical protein AF2122 [Archaeoglobus fulg... 35 1.1
gb|AAG34740.1| similar to j-domain of DnaJ [Mycoplasma pneumonia... 35 1.5
gb|AAH53260.1| Hypothetical protein LOC393349 [Danio rerio] gi|4... 34 2.0
ref|NP_757638.1| hypothetical protein MYPE2520 [Mycoplasma penet... 34 2.6
emb|CAH15389.1| hypothetical protein [Legionella pneumophila str... 33 3.3
ref|ZP_00106036.1| COG1205: Distinct helicase family with a uniq... 33 3.3
gb|AAC60011.1| phospholipase C beta [Meleagris gallopavo] gi|213... 33 3.3
ref|NP_228481.1| hypothetical protein TM0672 [Thermotoga maritim... 33 4.4
emb|CAF29918.1| Conserved Hypothetical Protein [Methanococcus ma... 33 4.4
ref|NP_500739.1| predicted CDS, protein kinase and SH2 motif fam... 33 4.4
ref|NP_197352.1| hypothetical protein [Arabidopsis thaliana] 33 5.7
dbj|BAB88834.1| axonemal p83.9 [Ciona intestinalis] 33 5.7
ref|ZP_00472243.1| hypothetical protein CsalDRAFT_2418 [Chromoha... 32 7.5
gb|AAH76809.1| MGC83764 protein [Xenopus laevis] 32 7.5
gb|AAO52105.1| similar to Plasmodium falciparum. Phosphatidylino... 32 7.5
>gb|AAW41048.1| hypothetical protein CNA04750 [Cryptococcus neoformans var.
neoformans JEC21] gi|58258909|ref|XP_566867.1|
hypothetical protein CNA04750 [Cryptococcus neoformans
var. neoformans JEC21]
Length = 405
Score = 38.5 bits (88), Expect = 0.10
Identities = 28/104 (26%), Positives = 44/104 (41%), Gaps = 9/104 (8%)
Query: 33 SPSPSSTLRFQNIVHNSTKN---FEIFMARVEGSSSQPTKQHEPNVNLPVNTQFMEEPD- 88
SP P S RF N+ N +N F + S++ P K E + EP
Sbjct: 11 SPLPKSEHRFGNVNINDNENANAFSNYSLPKSPSANGPEKVQEEQTEEKTKVETETEPSR 70
Query: 89 -----VKEAEDKIWKSQVIISCPNSSHEYLGPLSDESDDQKKLD 127
E++D+ +QV S SH++L P + SD ++K +
Sbjct: 71 EEINGASESDDEAQVAQVQSSPAQESHQWLSPQVEASDQEEKAE 114
>gb|EAL23342.1| hypothetical protein CNBA4580 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 485
Score = 38.5 bits (88), Expect = 0.10
Identities = 28/104 (26%), Positives = 44/104 (41%), Gaps = 9/104 (8%)
Query: 33 SPSPSSTLRFQNIVHNSTKN---FEIFMARVEGSSSQPTKQHEPNVNLPVNTQFMEEPD- 88
SP P S RF N+ N +N F + S++ P K E + EP
Sbjct: 11 SPLPKSEHRFGNVNINDNENANAFSNYSLPKSPSANGPEKVQEEQTEEKTKVETETEPSR 70
Query: 89 -----VKEAEDKIWKSQVIISCPNSSHEYLGPLSDESDDQKKLD 127
E++D+ +QV S SH++L P + SD ++K +
Sbjct: 71 EEINGASESDDEAQVAQVQSSPAQESHQWLSPQVEASDQEEKAE 114
>gb|AAB94189.2| Hypothetical protein W03F8.2 [Caenorhabditis elegans]
Length = 448
Score = 35.8 bits (81), Expect = 0.67
Identities = 36/149 (24%), Positives = 65/149 (43%), Gaps = 10/149 (6%)
Query: 11 IYMTLSHIFTL-TSFLLSLLLLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTK 69
++ + H+ L T FLL L + + LRF I+ NF+++ ++ SS+PTK
Sbjct: 35 VFQKIVHLVHLQTPFLLKALRILTFLSHRGLRFSEII-----NFDLWFFFIKYGSSEPTK 89
Query: 70 QHEPNVNLPVNTQFMEEP--DVKEAEDKIWKSQVIISCPNSSHEYLGPLSDESDDQKKLD 127
Q + + V + E DVK E + K + + NS + +KK +
Sbjct: 90 QEKEDDKKEVEEKKENEKPVDVKVEEKVMVKVEATQNTDNSQELKAPEEKPKKKKKKKSN 149
Query: 128 AVFPWNEAQLPLIFSKGPYDLSFLKSKFR 156
+ + ++ ++ K D S L+ K R
Sbjct: 150 IIATASVEKISIVPEK--EDFSILEKKLR 176
>gb|AAK82166.1| 305L [Chilo iridescent virus] gi|15079017|ref|NP_149768.1| 305L
[Invertebrate iridescent virus 6]
Length = 208
Score = 35.8 bits (81), Expect = 0.67
Identities = 27/88 (30%), Positives = 44/88 (49%), Gaps = 11/88 (12%)
Query: 4 ICPISKAIYMTLSHIFTLTSFLLSLLLLFSP------SPSSTLRFQNIVHNSTKNFEIFM 57
IC +++ S + FL++LL++FSP SP S+L F +V + NF F
Sbjct: 109 ICKLARFPKSFCSSVNFPVRFLVTLLVVFSPFVTMSFSPFSSLTFLILVFGTFFNFVSFR 168
Query: 58 ARVEGSSSQPTKQHEPNVNLPVNTQFME 85
+V SS++ T + LP + F+E
Sbjct: 169 LKVFDSSTRST-----STKLPFTSSFLE 191
>ref|NP_701782.1| hypothetical protein PFL2110c [Plasmodium falciparum 3D7]
gi|23496955|gb|AAN36506.1| hypothetical protein PFL2110c
[Plasmodium falciparum 3D7]
Length = 1846
Score = 35.0 bits (79), Expect = 1.1
Identities = 24/91 (26%), Positives = 48/91 (52%), Gaps = 16/91 (17%)
Query: 41 RFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEPNVNLPVNTQFMEEPDVKEAEDKIWKSQ 100
+F+NIV+ S N E + EG +++ + ++PN TQ EP + EA++K + +
Sbjct: 5 KFKNIVNLSNSNTE----KEEGKNAEINENNDPN------TQLAHEPSIVEADEKTFFDE 54
Query: 101 VIISCPNSSHEYLGPLSD------ESDDQKK 125
++ + + E+L L + +++D KK
Sbjct: 55 MLGAPQYTYDEFLQVLKNPIEERGDTEDMKK 85
>ref|NP_070947.1| hypothetical protein AF2122 [Archaeoglobus fulgidus DSM 4304]
gi|2648426|gb|AAB89149.1| A. fulgidus predicted coding
region AF2122 [Archaeoglobus fulgidus DSM 4304]
gi|7483766|pir||B69515 hypothetical protein AF2122 -
Archaeoglobus fulgidus gi|14424277|sp|O28158|YL22_ARCFU
Hypothetical protein AF2122
Length = 142
Score = 35.0 bits (79), Expect = 1.1
Identities = 23/82 (28%), Positives = 38/82 (46%)
Query: 11 IYMTLSHIFTLTSFLLSLLLLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQ 70
IY LS + T FL ++L + + + ++ F +++H ++N F VEG S
Sbjct: 44 IYGQLSLLLWGTKFLATILGVTANNATAMANFTDVLHTLSENSYHFFGTVEGESGMAYIA 103
Query: 71 HEPNVNLPVNTQFMEEPDVKEA 92
+ L N Q E+ VK A
Sbjct: 104 KHSYIELSQNQQLSEDMAVKFA 125
>gb|AAG34740.1| similar to j-domain of DnaJ [Mycoplasma pneumoniae M129]
gi|2494157|sp|Q50312|DNJL_MYCPN DnaJ-like protein MG002
homolog gi|1209516|gb|AAC43644.1| DnaJ protein homolog;
similar to Xdj1 protein from yeast
gi|13507741|ref|NP_109690.1| similar to j-domain of DnaJ
[Mycoplasma pneumoniae M129]
Length = 309
Score = 34.7 bits (78), Expect = 1.5
Identities = 28/93 (30%), Positives = 45/93 (48%), Gaps = 7/93 (7%)
Query: 87 PDV-KEAED---KIWKSQVIISCPNSSHEYLGPL--SDESDDQKKLDAVFPWNEAQLPLI 140
PD+ K+ D KI + ++S EY L S+ D K+LD W+E + +
Sbjct: 31 PDINKQGADTFVKINNAYAVLSDTTQKAEYDAMLRFSEFEDRVKRLDFSIKWHEQFMEEL 90
Query: 141 FSKGPYDLSFLKSKFRTEPKIENNEKYLDWLDK 173
+D F++++ T+P NN KY +LDK
Sbjct: 91 QFHHNWDFDFIRNREYTQPTPTNN-KYSSFLDK 122
>gb|AAH53260.1| Hypothetical protein LOC393349 [Danio rerio]
gi|41055205|ref|NP_956672.1| hypothetical protein
LOC393349 [Danio rerio]
Length = 215
Score = 34.3 bits (77), Expect = 2.0
Identities = 33/132 (25%), Positives = 51/132 (38%), Gaps = 11/132 (8%)
Query: 14 TLSHIFTLTSFLLSLLLLFSPSPSSTLRF------QNIVHNSTKNFEIFMARVEGSSSQP 67
+L+HI + L P + LR Q + + FEI R++G
Sbjct: 39 SLNHILVAVESVFVKRLCPGRDPPAELRLTARESTQKLRQFLQERFEIMFQRMKGMLIDR 98
Query: 68 TKQHEPNVNLPVNTQFMEEPDVKEAEDKIWKS-----QVIISCPNSSHEYLGPLSDESDD 122
NV LP + + P+ KE K+ +S Q + S L L ++ +
Sbjct: 99 MLSIPQNVLLPEDQMHQKYPEGKEELTKLQESIAKLLQAYEAEVCSKQALLAELEEQKET 158
Query: 123 QKKLDAVFPWNE 134
QK+LD V W E
Sbjct: 159 QKQLDEVLRWIE 170
>ref|NP_757638.1| hypothetical protein MYPE2520 [Mycoplasma penetrans HF-2]
gi|26453711|dbj|BAC44042.1| conserved hypothetical
protein [Mycoplasma penetrans HF-2]
Length = 682
Score = 33.9 bits (76), Expect = 2.6
Identities = 19/59 (32%), Positives = 29/59 (48%), Gaps = 2/59 (3%)
Query: 36 PSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEPNVNLPVNTQFMEEPDVKEAED 94
P S I+ N K+FEI +V +S KQH+PN +N + + D+K+ D
Sbjct: 208 PLSLDNLLTIISNGNKDFEILSKKVNKNSKHNVKQHKPNKK--INYEEITIQDIKDNWD 264
>emb|CAH15389.1| hypothetical protein [Legionella pneumophila str. Lens]
gi|54294086|ref|YP_126501.1| hypothetical protein
lpl1150 [Legionella pneumophila str. Lens]
Length = 157
Score = 33.5 bits (75), Expect = 3.3
Identities = 30/110 (27%), Positives = 44/110 (39%)
Query: 16 SHIFTLTSFLLSLLLLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEPNV 75
S I TL F + L++ P L +I + T F E S + NV
Sbjct: 36 SSILTLFGFTPNALVMPCPHGHGHLHIMDIQEDDTLVFATMNGSPEIMSLLALQFLMQNV 95
Query: 76 NLPVNTQFMEEPDVKEAEDKIWKSQVIISCPNSSHEYLGPLSDESDDQKK 125
NL ++ E D KEAE+K + N E S ES++++K
Sbjct: 96 NLNLDDMLAAETDAKEAEEKEEQELSSSESVNEEKEESEFSSSESENEEK 145
>ref|ZP_00106036.1| COG1205: Distinct helicase family with a unique C-terminal domain
including a metal-binding cysteine cluster [Nostoc
punctiforme PCC 73102]
Length = 759
Score = 33.5 bits (75), Expect = 3.3
Identities = 46/205 (22%), Positives = 82/205 (39%), Gaps = 42/205 (20%)
Query: 7 ISKAIYMTLSHIFTLTSFLLSLLLLFS-PSPSSTLRFQNIVHNSTKNFEIFMARVEGSSS 65
+ Y+ SHIF ++LL + P P+ L QN+ K +A ++G S
Sbjct: 197 VGMLFYLAYSHIFRFFENGRKVVLLTATPEPACELALQNLKQAGVK-----IAIIDGESG 251
Query: 66 QPT---KQHEPNVNL----------------PVNTQFMEEPD-----VKEAEDKIWKSQV 101
Q N+ L V +F+E+PD + ++ D I +
Sbjct: 252 NTNLLPSQTAVNLELRPKPDSKEEWLAELAAEVIQRFLEKPDENGAVILDSLDNINRLSD 311
Query: 102 IISCPNSSHEYLGPLSDESDDQKKLDAVFPWNEAQLPLIFSKGPYDLSFLKSKFRTEPKI 161
++ S+ Y+G ++ + Q + A+ Q +I + D+ F + EPK
Sbjct: 312 LLRQQGLSN-YIGRITGPAPKQDRQRAM------QCQIILATSTVDVGFNFER-HPEPKR 363
Query: 162 ENNEKYLDWLDKVEKIKGPFWKEMG 186
+N LDWL + + FW+ +G
Sbjct: 364 QN----LDWLIFSARDRAAFWQRIG 384
>gb|AAC60011.1| phospholipase C beta [Meleagris gallopavo] gi|2134446|pir||S68251
phospholipase C, inositol-lipid specific (EC 3.1.4.-)
isoform beta - turkey
Length = 1211
Score = 33.5 bits (75), Expect = 3.3
Identities = 25/94 (26%), Positives = 40/94 (41%), Gaps = 12/94 (12%)
Query: 34 PSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEPNVNLPVNTQFMEEPDVKEAE 93
PSP L + ++ N K F ++ SS + K EP + +E+P +AE
Sbjct: 449 PSPKDLLG-KILIKNKKKQF---VSEKRQSSMKKGKNVEPEI--------IEQPAFTDAE 496
Query: 94 DKIWKSQVIISCPNSSHEYLGPLSDESDDQKKLD 127
D +W V P + LG L +E + + D
Sbjct: 497 DTVWAGDVAEEEPEEEDDQLGNLDEEEIKKMQSD 530
>ref|NP_228481.1| hypothetical protein TM0672 [Thermotoga maritima MSB8]
gi|4981196|gb|AAD35756.1| hypothetical protein TM0672
[Thermotoga maritima MSB8] gi|7462692|pir||G72346
hypothetical protein - Thermotoga maritima (strain MSB8)
Length = 647
Score = 33.1 bits (74), Expect = 4.4
Identities = 26/104 (25%), Positives = 44/104 (42%), Gaps = 5/104 (4%)
Query: 89 VKEAEDKIWKSQVIISCPNSSHEYLGPLSDESDDQKKLDAVF-----PWNEAQLPLIFSK 143
VK++E+K SQ+ +SS YL + S+ KKL + F P +++ +F +
Sbjct: 199 VKDSENKESASQISFEKKHSSIPYLESETSRSESNKKLSSSFRTPAEPSLKSESTKVFKE 258
Query: 144 GPYDLSFLKSKFRTEPKIENNEKYLDWLDKVEKIKGPFWKEMGV 187
P ++K I N + L K+ F K G+
Sbjct: 259 QPISAQDKETKHSARQPIFNETRSLPQNSDAAKLSRDFEKTTGI 302
>emb|CAF29918.1| Conserved Hypothetical Protein [Methanococcus maripaludis S2]
gi|45357925|ref|NP_987482.1| hypothetical protein
MMP0362 [Methanococcus maripaludis S2]
Length = 433
Score = 33.1 bits (74), Expect = 4.4
Identities = 31/106 (29%), Positives = 44/106 (41%), Gaps = 15/106 (14%)
Query: 15 LSHIFTLTSFLLSLLLLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEPN 74
L +F +T L LLL FS P R QN FE+ VE S + + N
Sbjct: 25 LPTLFGITILLFILLLKFSFDPVFETRLQN------SEFEV----VENESFEIKLNIKNN 74
Query: 75 VNLPVNTQFMEEPDVKEAEDKIWKSQVIISCPNSSHEYLGPLSDES 120
P+N + E D + W+S ++ P S E L L+ +S
Sbjct: 75 SKCPLNFKIKENND-----NFTWESNNVLILPKKSAEILVSLTPKS 115
>ref|NP_500739.1| predicted CDS, protein kinase and SH2 motif family member (4F957)
[Caenorhabditis elegans] gi|7508905|pir||T15104
hypothetical protein W03F8.2 - Caenorhabditis elegans
Length = 617
Score = 33.1 bits (74), Expect = 4.4
Identities = 36/126 (28%), Positives = 58/126 (45%), Gaps = 17/126 (13%)
Query: 11 IYMTLSHIFTL-TSFLLSLLLLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTK 69
++ + H+ L T FLL L + + LRF I+ NF+++ ++ SS+PTK
Sbjct: 35 VFQKIVHLVHLQTPFLLKALRILTFLSHRGLRFSEII-----NFDLWFFFIKYGSSEPTK 89
Query: 70 QHEPNVNLPVNTQFMEEP--DVKEAEDKIWKSQVIISCPNSSH--EYLGP---LSDESDD 122
Q + + V + E DVK E K V + SSH YL P + +D+
Sbjct: 90 QEKEDDKKEVEEKKENEKPVDVKVEE----KVMVKVEVRASSHFTVYLCPDLDATQNTDN 145
Query: 123 QKKLDA 128
++L A
Sbjct: 146 SQELKA 151
>ref|NP_197352.1| hypothetical protein [Arabidopsis thaliana]
Length = 702
Score = 32.7 bits (73), Expect = 5.7
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 167 YLDWLDKVEKIKGPFWKEMGVFDLIQLS 194
+L WL K++ + P WK+ G+F+ I+ S
Sbjct: 60 FLSWLGKMQALYEPIWKKAGIFEAIKAS 87
>dbj|BAB88834.1| axonemal p83.9 [Ciona intestinalis]
Length = 737
Score = 32.7 bits (73), Expect = 5.7
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 59 RVEGSSSQ-PTKQHEPNVNL----PVNTQFMEEPDVK--EAEDKIWKSQVI 102
R EGS S+ P +Q +P + L P N F EEP V ++ DK WK+ I
Sbjct: 436 RDEGSPSKTPLEQQQPPIGLTFALPSNVMFFEEPQVASWDSSDKHWKTSGI 486
>ref|ZP_00472243.1| hypothetical protein CsalDRAFT_2418 [Chromohalobacter salexigens DSM
3043] gi|67520608|gb|EAM24551.1| hypothetical protein
CsalDRAFT_2418 [Chromohalobacter salexigens DSM 3043]
Length = 1269
Score = 32.3 bits (72), Expect = 7.5
Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 6/64 (9%)
Query: 119 ESDDQKKLDAVFPWNEAQL-----PLIFSKGPYDLSFLKSKFR-TEPKIENNEKYLDWLD 172
E D+K + A+FP L P + ++ PY+L + + TEP++ ++ +YL +D
Sbjct: 1205 EISDEKPITALFPETTLDLMNRVTPQVLTRPPYELPKVLALIEETEPELISDPRYLRLID 1264
Query: 173 KVEK 176
VE+
Sbjct: 1265 LVER 1268
>gb|AAH76809.1| MGC83764 protein [Xenopus laevis]
Length = 562
Score = 32.3 bits (72), Expect = 7.5
Identities = 28/131 (21%), Positives = 56/131 (42%), Gaps = 3/131 (2%)
Query: 30 LLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEPNVNLPVNTQFMEEPDV 89
+ SPS L + S+ ++ +V GS+S P+ Q + ++ + P
Sbjct: 95 ITLSPSQQKLLGVPYSLAQSSPQRDLIPNKVPGSTSPPSMQGQNVLSYSPSRSPSSSPKF 154
Query: 90 KEAEDKIWKSQVIISCPNSSHEYLGPLSDESDDQKKLDAVFPWNEA-QLPLIFSKGPYDL 148
+ + Q+ PNS + +S S+ K+ + P + + Q P + GP +
Sbjct: 155 SPSCISGYSPQIQAMLPNSGSPFTSVVSYSSNSFPKIISYSPSSSSPQYP--SNLGPVES 212
Query: 149 SFLKSKFRTEP 159
L+S++R+ P
Sbjct: 213 GGLRSRYRSSP 223
>gb|AAO52105.1| similar to Plasmodium falciparum. Phosphatidylinositol 4-kinase,
putative (EC 2.7.1.67) [Dictyostelium discoideum]
Length = 514
Score = 32.3 bits (72), Expect = 7.5
Identities = 28/117 (23%), Positives = 53/117 (44%), Gaps = 13/117 (11%)
Query: 15 LSHIFTLTSFLLS-LLLLFSPSPSSTLRFQNIVHNSTKNFEIFMARVEGSSSQPTKQHEP 73
+++ T+ F + +LL+F+ SS +S E F ++ SSS T +
Sbjct: 378 INYFITVKQFFSNQILLIFTNCKSSK--------DSVDFIEFFKKKLLSSSSNNTNNNNN 429
Query: 74 NVNLPVNTQFMEEPDVKEAEDKIWKSQVIISCP----NSSHEYLGPLSDESDDQKKL 126
N+N+ + FM E V + ++ I K + + P N S+ + G D+ + +L
Sbjct: 430 NINVEIPPIFMYEHHVNQIKEYILKFKNFPASPFKKNNLSNMFNGLTVDQQNQMDRL 486
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.317 0.134 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 341,838,373
Number of Sequences: 2540612
Number of extensions: 14263291
Number of successful extensions: 37887
Number of sequences better than 10.0: 29
Number of HSP's better than 10.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 26
Number of HSP's that attempted gapping in prelim test: 37884
Number of HSP's gapped (non-prelim): 29
length of query: 194
length of database: 863,360,394
effective HSP length: 121
effective length of query: 73
effective length of database: 555,946,342
effective search space: 40584082966
effective search space used: 40584082966
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)
Medicago: description of AC140773.5