
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147178.12 + phase: 0
(163 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q8LCA0 Hypothetical protein [Arabidopsis thaliana] 214 5e-55
UniRef100_Q941E5 AT3g50680/T3A5_60 [Arabidopsis thaliana] 212 2e-54
UniRef100_Q67YF1 Hypothetical protein At3g50685 [Arabidopsis tha... 208 3e-53
UniRef100_Q9SCQ8 Hypothetical protein T3A5.60 [Arabidopsis thali... 190 1e-47
UniRef100_Q8S5E8 Hypothetical protein OSJNBb0047B19.16 [Oryza sa... 180 1e-44
UniRef100_UPI00002EA0BB UPI00002EA0BB UniRef100 entry 122 3e-27
UniRef100_UPI0000289DAC UPI0000289DAC UniRef100 entry 43 0.003
UniRef100_Q43480 Gamma-TIP-like protein [Hordeum vulgare] 37 0.18
UniRef100_Q9ZR95 Gamma-type tonoplast intrinsic protein [Triticu... 36 0.31
UniRef100_Q7NG85 Gll3288 protein [Gloeobacter violaceus] 36 0.40
UniRef100_Q64PU8 Hypothetical protein [Bacteroides fragilis] 35 0.68
UniRef100_UPI000042D24C UPI000042D24C UniRef100 entry 35 0.89
UniRef100_UPI00003C232C UPI00003C232C UniRef100 entry 34 1.5
UniRef100_Q96898 Polyprotein [Hepatitis GB virus A] 33 2.6
UniRef100_Q7WK38 Serine protease [Bordetella bronchiseptica] 33 2.6
UniRef100_Q7W8S8 Serine protease [Bordetella parapertussis] 33 2.6
UniRef100_Q9JIX6 RNA binding protein NAPOR-3 [Rattus norvegicus] 33 3.4
UniRef100_Q7VZ29 Serine protease [Bordetella pertussis] 33 3.4
UniRef100_UPI000023603E UPI000023603E UniRef100 entry 32 4.4
UniRef100_Q8YQ95 All3939 protein [Anabaena sp.] 32 4.4
>UniRef100_Q8LCA0 Hypothetical protein [Arabidopsis thaliana]
Length = 158
Score = 214 bits (546), Expect = 5e-55
Identities = 108/163 (66%), Positives = 128/163 (78%), Gaps = 5/163 (3%)
Query: 1 MLLFSPISTPKTFTNHLNHPLRKPKSHHVRLKLKITNMAKEGSDTNSSPTEIAAIAGGLI 60
MLL SPIS + H + +R+ ++ ++ AK+ +DT E AAIAGGL+
Sbjct: 1 MLLLSPISASLPPSFHRGNLIRRS----IKPLGRVVAKAKDNTDTGGF-LETAAIAGGLV 55
Query: 61 STPVIGWSLYTLKTTGCGLPPGPGGSIGALEGVSYLVVVGIVAWSIYTKTKTGSGLPNGP 120
STPVIGWSLYTLKTTGCGLPPGP G IGALEGVSYLVVVGIV WS+YTKTKTGSGLPNGP
Sbjct: 56 STPVIGWSLYTLKTTGCGLPPGPAGLIGALEGVSYLVVVGIVGWSLYTKTKTGSGLPNGP 115
Query: 121 FGLLGAVEGLSYLALVAIVVVFGLQYFQQGYIPGPLPADQCFG 163
FGLLGAVEGLSYL+++AI+VVFG+Q+ G +PGPLP+DQCFG
Sbjct: 116 FGLLGAVEGLSYLSVLAILVVFGIQFLDNGSVPGPLPSDQCFG 158
>UniRef100_Q941E5 AT3g50680/T3A5_60 [Arabidopsis thaliana]
Length = 158
Score = 212 bits (540), Expect = 2e-54
Identities = 107/163 (65%), Positives = 127/163 (77%), Gaps = 5/163 (3%)
Query: 1 MLLFSPISTPKTFTNHLNHPLRKPKSHHVRLKLKITNMAKEGSDTNSSPTEIAAIAGGLI 60
MLL SPIS + H + +R+ ++ ++ AK+ +DT E AAIAG L+
Sbjct: 1 MLLLSPISASLPPSFHRGNLIRRS----IKPLGRVVAKAKDNTDTGGF-LETAAIAGSLV 55
Query: 61 STPVIGWSLYTLKTTGCGLPPGPGGSIGALEGVSYLVVVGIVAWSIYTKTKTGSGLPNGP 120
STPVIGWSLYTLKTTGCGLPPGP G IGALEGVSYLVVVGIV WS+YTKTKTGSGLPNGP
Sbjct: 56 STPVIGWSLYTLKTTGCGLPPGPAGLIGALEGVSYLVVVGIVGWSLYTKTKTGSGLPNGP 115
Query: 121 FGLLGAVEGLSYLALVAIVVVFGLQYFQQGYIPGPLPADQCFG 163
FGLLGAVEGLSYL+++AI+VVFG+Q+ G +PGPLP+DQCFG
Sbjct: 116 FGLLGAVEGLSYLSVLAILVVFGIQFLDNGSVPGPLPSDQCFG 158
>UniRef100_Q67YF1 Hypothetical protein At3g50685 [Arabidopsis thaliana]
Length = 158
Score = 208 bits (530), Expect = 3e-53
Identities = 106/163 (65%), Positives = 126/163 (77%), Gaps = 5/163 (3%)
Query: 1 MLLFSPISTPKTFTNHLNHPLRKPKSHHVRLKLKITNMAKEGSDTNSSPTEIAAIAGGLI 60
MLL SPIS + H + +R+ ++ ++ AK+ +DT E AAIAG L+
Sbjct: 1 MLLLSPISASLPPSFHRGNLIRRS----IKPLGRVVAKAKDNTDTGGF-LETAAIAGSLV 55
Query: 61 STPVIGWSLYTLKTTGCGLPPGPGGSIGALEGVSYLVVVGIVAWSIYTKTKTGSGLPNGP 120
STPVIGWSLYTLKTTGCGLPPGP G IGALEGVSYLVVVGIV WS+YTKTKTGSGLPNGP
Sbjct: 56 STPVIGWSLYTLKTTGCGLPPGPAGLIGALEGVSYLVVVGIVGWSLYTKTKTGSGLPNGP 115
Query: 121 FGLLGAVEGLSYLALVAIVVVFGLQYFQQGYIPGPLPADQCFG 163
FGLLGAVEGLSYL+++AI+VVFG+Q+ G +PGPLP+DQ FG
Sbjct: 116 FGLLGAVEGLSYLSVLAILVVFGIQFLDNGSVPGPLPSDQSFG 158
>UniRef100_Q9SCQ8 Hypothetical protein T3A5.60 [Arabidopsis thaliana]
Length = 319
Score = 190 bits (483), Expect = 1e-47
Identities = 98/153 (64%), Positives = 117/153 (76%), Gaps = 5/153 (3%)
Query: 1 MLLFSPISTPKTFTNHLNHPLRKPKSHHVRLKLKITNMAKEGSDTNSSPTEIAAIAGGLI 60
MLL SPIS + H + +R+ ++ ++ AK+ +DT E AAIAG L+
Sbjct: 1 MLLLSPISASLPPSFHRGNLIRRS----IKPLGRVVAKAKDNTDTGGF-LETAAIAGSLV 55
Query: 61 STPVIGWSLYTLKTTGCGLPPGPGGSIGALEGVSYLVVVGIVAWSIYTKTKTGSGLPNGP 120
STPVIGWSLYTLKTTGCGLPPGP G IGALEGVSYLVVVGIV WS+YTKTKTGSGLPNGP
Sbjct: 56 STPVIGWSLYTLKTTGCGLPPGPAGLIGALEGVSYLVVVGIVGWSLYTKTKTGSGLPNGP 115
Query: 121 FGLLGAVEGLSYLALVAIVVVFGLQYFQQGYIP 153
FGLLGAVEGLSYL+++AI+VVFG+Q+ G +P
Sbjct: 116 FGLLGAVEGLSYLSVLAILVVFGIQFLDNGSVP 148
>UniRef100_Q8S5E8 Hypothetical protein OSJNBb0047B19.16 [Oryza sativa]
Length = 163
Score = 180 bits (456), Expect = 1e-44
Identities = 83/130 (63%), Positives = 100/130 (76%)
Query: 34 KITNMAKEGSDTNSSPTEIAAIAGGLISTPVIGWSLYTLKTTGCGLPPGPGGSIGALEGV 93
++T K +S E+AA A GL S+ V+ WSLYTLKTTGCGLPPGPGG++GA EGV
Sbjct: 34 RLTTTCKAEPSGGNSTLELAAGAAGLASSSVVAWSLYTLKTTGCGLPPGPGGALGAAEGV 93
Query: 94 SYLVVVGIVAWSIYTKTKTGSGLPNGPFGLLGAVEGLSYLALVAIVVVFGLQYFQQGYIP 153
SYLVV ++ WS+ TK +TGSGLP GPFGLLGA EG+SYLA AI VVFG Q+F+ G +P
Sbjct: 94 SYLVVAALIGWSLTTKVRTGSGLPAGPFGLLGAAEGVSYLAAAAIAVVFGFQFFEVGSLP 153
Query: 154 GPLPADQCFG 163
GPLP+DQCFG
Sbjct: 154 GPLPSDQCFG 163
>UniRef100_UPI00002EA0BB UPI00002EA0BB UniRef100 entry
Length = 162
Score = 122 bits (306), Expect = 3e-27
Identities = 63/145 (43%), Positives = 88/145 (60%), Gaps = 1/145 (0%)
Query: 16 HLNHPLRK-PKSHHVRLKLKITNMAKEGSDTNSSPTEIAAIAGGLISTPVIGWSLYTLKT 74
H+ P ++ +S R+ LK ++ + +SS E+ GG + PV+ +S +TLKT
Sbjct: 11 HIRPPAKRFNRSLASRVALKASSFEEGPEGGDSSTAELIYGLGGTLLAPVVLYSEFTLKT 70
Query: 75 TGCGLPPGPGGSIGALEGVSYLVVVGIVAWSIYTKTKTGSGLPNGPFGLLGAVEGLSYLA 134
TGCGLP GP G++G EGV YL V+ WS TK +TGSGLP GPFGLLG EGLS+LA
Sbjct: 71 TGCGLPAGPFGALGLAEGVGYLSVIAFCFWSFGTKLRTGSGLPAGPFGLLGVAEGLSWLA 130
Query: 135 LVAIVVVFGLQYFQQGYIPGPLPAD 159
+ + V G Q G++P +P +
Sbjct: 131 GIGGLAVLGFQLADYGFVPEAIPTE 155
>UniRef100_UPI0000289DAC UPI0000289DAC UniRef100 entry
Length = 106
Score = 42.7 bits (99), Expect = 0.003
Identities = 28/76 (36%), Positives = 42/76 (54%), Gaps = 5/76 (6%)
Query: 49 PTEIAAIAGG--LISTPVIGWSLYTLKTTGCGLPPGP-GGSIGALEGVSYLVVVGIVAWS 105
P + A + GG ++++ V+ WS YTLKTTGCGLP G GS+ L+ L ++ +
Sbjct: 2 PAQEATVIGGAGVLASCVMLWSEYTLKTTGCGLPAGTLSGSL--LQVTCTLPSCSVLTAA 59
Query: 106 IYTKTKTGSGLPNGPF 121
T T G +GP+
Sbjct: 60 CCTATAQDLGACSGPW 75
>UniRef100_Q43480 Gamma-TIP-like protein [Hordeum vulgare]
Length = 250
Score = 37.0 bits (84), Expect = 0.18
Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 16/112 (14%)
Query: 48 SPTEIAAIAGGLISTPVIGWSLYTLKTTGCGLPPGP-----------GGSIGALEGVSYL 96
SP +A AG + + ++L+ + G + G GG+I G+ Y
Sbjct: 48 SPDGVATPAGLISAAIAHAFALFVAVSVGANISGGHVNPAVTFGAFVGGNITLFRGLLYW 107
Query: 97 V--VVGIVAWSIYTKTKTGSGLPNGPFGLLGAVEGLSYLALVAIVVVFGLQY 146
V ++G A + TG GLP G FGL G G ++ IV+ FGL Y
Sbjct: 108 VAQLLGSTAACFLLRFSTG-GLPTGTFGLTGI--GAWEAVVLEIVMTFGLVY 156
>UniRef100_Q9ZR95 Gamma-type tonoplast intrinsic protein [Triticum aestivum]
Length = 250
Score = 36.2 bits (82), Expect = 0.31
Identities = 33/108 (30%), Positives = 48/108 (43%), Gaps = 17/108 (15%)
Query: 53 AAIAGGLISTPVI-GWSLYTLKTTGCGLPPGP-----------GGSIGALEGVSYLV--V 98
AA GLIS + ++L+ + G + G GG+I G+ Y + +
Sbjct: 52 AATPAGLISAAIAHAFALFVAVSVGANISGGHVNPAVTFGAFVGGNITLFRGLLYWIAQL 111
Query: 99 VGIVAWSIYTKTKTGSGLPNGPFGLLGAVEGLSYLALVAIVVVFGLQY 146
+G A + TG GLP G FGL G G ++ IV+ FGL Y
Sbjct: 112 LGSTAACFLLRFSTG-GLPTGTFGLSGI--GAWEAVVLEIVMTFGLVY 156
>UniRef100_Q7NG85 Gll3288 protein [Gloeobacter violaceus]
Length = 319
Score = 35.8 bits (81), Expect = 0.40
Identities = 30/132 (22%), Positives = 56/132 (41%), Gaps = 21/132 (15%)
Query: 40 KEGSDTNSSPTEIAAIAGGLISTPVIG----WSLYTLKTTGCGLPPGPGGSIGALEGVSY 95
K+ T+ + +AGGL ++ +G W L L T P LEG+
Sbjct: 32 KKAGHTHLNRWVWGGVAGGLAASTAVGLLFGWLLGNLSTANQKYAPAVEP---LLEGIFG 88
Query: 96 LVVVGIVAWSIYTKTKTGSGLPNGPFGLLGAVEG--------LSYLALVAIV------VV 141
++ +G+++W + T+ L +G G +++L L ++ V+
Sbjct: 89 VLAIGMLSWMLIWMTRQARTLKGQVEASVGQALGQERAAGWAIAWLILFTVLREGFETVL 148
Query: 142 FGLQYFQQGYIP 153
F YF+QG++P
Sbjct: 149 FVTSYFRQGFVP 160
>UniRef100_Q64PU8 Hypothetical protein [Bacteroides fragilis]
Length = 156
Score = 35.0 bits (79), Expect = 0.68
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 58 GLISTPVIGWSLYTLKTTGCGLPPGPGGSIGALEGVSYLVVVGIVAWSI---YTKTKTG- 113
GL V+ + + G L P G+I +L G+ LV+V +VAWS+ T T G
Sbjct: 56 GLFGIAVVATVVAAIFQFGSALKDNPKGAIRSLLGLILLVLVLVVAWSMGSGETLTIQGY 115
Query: 114 SGLPNGPFGLLGA---VEGLSYLALVAIVVVFG 143
G N PF L + + +L LV ++ + G
Sbjct: 116 EGTDNVPFWLKLTDMFLYSIYFLMLVTVLAIIG 148
>UniRef100_UPI000042D24C UPI000042D24C UniRef100 entry
Length = 477
Score = 34.7 bits (78), Expect = 0.89
Identities = 31/103 (30%), Positives = 42/103 (40%), Gaps = 12/103 (11%)
Query: 43 SDTNSSPTEIAAIAGGLISTPVIGWSLYTLKTTGCGLPPGPGGSIGALEG--VSYLVVVG 100
S S +EI I G + +GW +K+ G L G G S+G EG V Y V VG
Sbjct: 179 SAAGSGISEIKCIVSGFVMDGFLGWPTLFIKSLGLPLAIGSGLSVGK-EGPSVHYAVCVG 237
Query: 101 IVAWSIYTKTKTGSGLPNGPFGLLGAVEGLSYLALVAIVVVFG 143
+ TK + + A E L+ A + V FG
Sbjct: 238 NSIAKLITKYRKSAS---------RAREFLTATAAAGVAVAFG 271
>UniRef100_UPI00003C232C UPI00003C232C UniRef100 entry
Length = 591
Score = 33.9 bits (76), Expect = 1.5
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 12/87 (13%)
Query: 47 SSPTEIAAIAGGLISTPVIGWSLYT------------LKTTGCGLPPGPGGSIGALEGVS 94
SSP ++IAG + I + YT L+ G GLPPG GG G G S
Sbjct: 484 SSPAPTSSIAGSIGIGAEIPFLAYTTPNQDMVNNVSWLRAGGGGLPPGVGGFGGGGGGGS 543
Query: 95 YLVVVGIVAWSIYTKTKTGSGLPNGPF 121
+ W+ +T TG G + F
Sbjct: 544 SSHKSVVQEWNEFTAGSTGGGSSSSGF 570
>UniRef100_Q96898 Polyprotein [Hepatitis GB virus A]
Length = 2954
Score = 33.1 bits (74), Expect = 2.6
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 9/54 (16%)
Query: 49 PTEIAAIAGGLISTPVIGWSLYTLKTTGCGLPPGPGGSIGALEGVSYLVVVGIV 102
P AA+A L+ TPV+GW+ TGC +G + V+YL V+G V
Sbjct: 652 PVVEAALAPELVCTPVVGWAAQEWWFTGC---------LGVMCVVAYLNVLGSV 696
>UniRef100_Q7WK38 Serine protease [Bordetella bronchiseptica]
Length = 1076
Score = 33.1 bits (74), Expect = 2.6
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 37 NMAKEGSDTNSSPTEIAAIAGGL--ISTPVIGWSLY---TLKTTGCGLPPGPG------G 85
++A S SPT + GG +S ++G +L L+ TG L PG G
Sbjct: 529 SLAGGSSLGGGSPTTPIVVTGGADAVSGAILGGNLDIGGALRMTGASLAPGNSIGTVTVG 588
Query: 86 SIGALEGVSYLVVVGIVAWSIYTKTKTGS 114
S+GAL G +YL V S ++G+
Sbjct: 589 SVGALSGSTYLAEVNAKGQSDLLVVRSGN 617
>UniRef100_Q7W8S8 Serine protease [Bordetella parapertussis]
Length = 1076
Score = 33.1 bits (74), Expect = 2.6
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 37 NMAKEGSDTNSSPTEIAAIAGGL--ISTPVIGWSLY---TLKTTGCGLPPGPG------G 85
++A S SPT + GG +S ++G +L L+ TG L PG G
Sbjct: 529 SLAGGSSLGGGSPTTPIVVTGGADAVSGAILGGNLDIGGALRMTGASLAPGNSIGTVTVG 588
Query: 86 SIGALEGVSYLVVVGIVAWSIYTKTKTGS 114
S+GAL G +YL V S ++G+
Sbjct: 589 SVGALSGSTYLAEVNAKGQSDLLVVRSGN 617
>UniRef100_Q9JIX6 RNA binding protein NAPOR-3 [Rattus norvegicus]
Length = 226
Score = 32.7 bits (73), Expect = 3.4
Identities = 31/123 (25%), Positives = 53/123 (42%), Gaps = 15/123 (12%)
Query: 40 KEGSDTNSSPTEIAAIAGGLISTPV--------IGWSLYTLKTTGCGLPPGPGGSIGALE 91
KEG TN++P + A G +++PV G ++ +L T G G G+ L
Sbjct: 7 KEGMGTNANPLSSTSSALGALTSPVAASTPNSTAGAAMNSL--TSLGTLQGLAGATVGLN 64
Query: 92 GVSYLVVVGIVAWSIYTKTKTG-SGLPNGPFGLLGAV----EGLSYLALVAIVVVFGLQY 146
++ L V +++ G +GL NG G + A+ G+ A A+ ++
Sbjct: 65 NINALAVAQMLSGMAALNGGLGATGLTNGTAGTMDALTQAYSGIQQYAAAALPTLYSQSL 124
Query: 147 FQQ 149
QQ
Sbjct: 125 LQQ 127
>UniRef100_Q7VZ29 Serine protease [Bordetella pertussis]
Length = 1076
Score = 32.7 bits (73), Expect = 3.4
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 37 NMAKEGSDTNSSPTEIAAIAGGL--ISTPVIGWSLY---TLKTTGCGLPPGPG------G 85
++A S SPT + GG +S ++G +L L+ TG L PG G
Sbjct: 529 SLAGGSSLGGGSPTTPIVVIGGADAVSGAILGGNLDIGGALRMTGASLAPGNSIGTVTVG 588
Query: 86 SIGALEGVSYLVVVGIVAWSIYTKTKTGS 114
S+GAL G +YL V S ++G+
Sbjct: 589 SVGALSGSTYLAEVNAKGQSDLLVVRSGN 617
>UniRef100_UPI000023603E UPI000023603E UniRef100 entry
Length = 460
Score = 32.3 bits (72), Expect = 4.4
Identities = 19/63 (30%), Positives = 28/63 (44%)
Query: 90 LEGVSYLVVVGIVAWSIYTKTKTGSGLPNGPFGLLGAVEGLSYLALVAIVVVFGLQYFQQ 149
L G+S G+ A I+ K GSG +G AV G + ++ GLQ F +
Sbjct: 371 LGGISQPAGAGLAALWIWGARKAGSGAGDGQNETSWAVYGCMFAVTAGVMTSVGLQLFTE 430
Query: 150 GYI 152
G +
Sbjct: 431 GLV 433
>UniRef100_Q8YQ95 All3939 protein [Anabaena sp.]
Length = 315
Score = 32.3 bits (72), Expect = 4.4
Identities = 34/132 (25%), Positives = 58/132 (43%), Gaps = 17/132 (12%)
Query: 38 MAKEGSDTNSSPTEIAAIAGGLISTPVIGWSLYT--LKTTGCGLPPGPGGSIGALEGVSY 95
+ K+ + +P A +A G++ + +IG L+T ++ G P LEGV
Sbjct: 30 LLKKAKQSRLNPWVYAGVAVGIVISALIG-VLFTWIIQAVGAANPQYTVVVEPMLEGVFS 88
Query: 96 LVVVGIVAWSIYTKTKTGSGLPNGPFGLL--------GAVEGLSYLALVAIV------VV 141
++ + +++W + TK + G + A G+ L LVA+V V+
Sbjct: 89 VLAIAMLSWMLIWMTKQARFMKAQVEGAVTDALTQNSNAGWGVFSLILVAVVREGFETVL 148
Query: 142 FGLQYFQQGYIP 153
F FQQG IP
Sbjct: 149 FIAANFQQGLIP 160
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.318 0.140 0.429
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 321,644,058
Number of Sequences: 2790947
Number of extensions: 14995745
Number of successful extensions: 37918
Number of sequences better than 10.0: 37
Number of HSP's better than 10.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 24
Number of HSP's that attempted gapping in prelim test: 37887
Number of HSP's gapped (non-prelim): 42
length of query: 163
length of database: 848,049,833
effective HSP length: 117
effective length of query: 46
effective length of database: 521,509,034
effective search space: 23989415564
effective search space used: 23989415564
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 69 (31.2 bits)
Medicago: description of AC147178.12