Miyakogusa Predicted Gene
- Lj0g3v0359049.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0359049.1 tr|G7K6Y7|G7K6Y7_MEDTR Microtubule-associated
protein TORTIFOLIA1 OS=Medicago truncatula
GN=MTR_5g03,70.15,0,seg,NULL; HEAT,HEAT; ARM repeat,Armadillo-type
fold; no description,Armadillo-like helical; FAMILY N,CUFF.24700.1
(660 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G27210.1 | Symbols: | ARM repeat superfamily protein | chr1:... 411 e-115
AT5G62580.1 | Symbols: | ARM repeat superfamily protein | chr5:... 302 4e-82
AT1G59850.1 | Symbols: | ARM repeat superfamily protein | chr1:... 233 2e-61
AT2G07170.1 | Symbols: | ARM repeat superfamily protein | chr2:... 157 2e-38
AT4G27060.1 | Symbols: TOR1, SPR2, CN | ARM repeat superfamily p... 135 8e-32
AT1G50890.1 | Symbols: | ARM repeat superfamily protein | chr1:... 115 1e-25
>AT1G27210.1 | Symbols: | ARM repeat superfamily protein |
chr1:9455669-9457917 REVERSE LENGTH=625
Length = 625
Score = 411 bits (1056), Expect = e-115, Method: Compositional matrix adjust.
Identities = 272/610 (44%), Positives = 351/610 (57%), Gaps = 76/610 (12%)
Query: 22 ELRQRVIACLNKLSDRDTLACATAELESIARTLTHDSFSSFLSCIHNTDSSSKSPVRKQC 81
+L+QRVIACLNKL+DRDTLA A+AEL+SIAR LTHDSFS FL+CIHNTDSS KSPVRKQC
Sbjct: 34 DLKQRVIACLNKLADRDTLALASAELDSIARNLTHDSFSPFLNCIHNTDSSVKSPVRKQC 93
Query: 82 VHLLTLLSRFHGDALSPFLSKMISTVLLRLRDTDSAVRSACVDAVSAMSSRITRPSFSVS 141
V LL++LSR+HGD+L+P L+KM+STV+ RLRD DS+VRSAC A + MS+ +TR F+ S
Sbjct: 94 VALLSVLSRYHGDSLTPHLAKMVSTVIRRLRDPDSSVRSACAVATADMSAHVTRQPFA-S 152
Query: 142 FLKPFMDALAQEQDGNSQIGXXXXXXXXXXXXXXXXXXSLRRSALPRLGKLLKKDTCKAR 201
KP ++ L QE D N QIG LR+S LP++GKLLK D KA+
Sbjct: 153 VAKPLIETLIQEGDSNLQIGAALCLAASVDAATDPESEQLRKS-LPKIGKLLKSDGFKAK 211
Query: 202 AALLVLVGSV-------------------VEFLSSEDWTVRKASAEALGKVASVEKGLAS 242
AALL VGS+ +EFLSSEDW RK++AEALGKVA+ E LAS
Sbjct: 212 AALLSAVGSIITAGGAGTKPVLDWLVPVLIEFLSSEDWAARKSAAEALGKVATAED-LAS 270
Query: 243 QHKVLCVDSLQNRRFDKVKVVRETMNLTLEMWKEVTNVSEDVPAPVKSACASVEDGKGQC 302
Q+K C +L++RRFDKVK VRETMN L +WKEV+ E +P +S S +DG C
Sbjct: 271 QYKKTCTTALESRRFDKVKSVRETMNRALNLWKEVSTDDEASLSPSRS---STDDGNIGC 327
Query: 303 ---VAKSSPC-IGPKSSQSKKTVP-SSRSP--PSTASFVSSIKREGALKSNNKDLRVRAM 355
V +SS +G KS++ KK P RSP P S+ ++ ++E K N ++ M
Sbjct: 328 FSSVTRSSTIDVGLKSARPKKVTPIMKRSPSLPVNRSYAATRQKENLPKRNQGNM---TM 384
Query: 356 HQQNHSEISDEKLDTPVLKLPPSNISFDDIKRCDFEASKPAPYQNGVNSRVDIKRVLFSK 415
+ S + + K P F +K+ E + A + DI + S+
Sbjct: 385 LVEEASSVDN--------KGP----HFTPVKKSSEETEEKAN-----SGGPDIIKHTISE 427
Query: 416 MS--DEKVRKFGG--SKPRVVPCYXXXXXXXXVRVSNVSEACESSQDVEDLSLIREQLIX 471
S D KV FGG S RV PC + + ES +D E+LSLIREQL
Sbjct: 428 KSREDSKVSSFGGLRSGSRVAPCSDDGDSVKNCK----DDVEESKKDSEELSLIREQLAL 483
Query: 472 XXXXXXXXXXXXXRFIGSSQTGMNSLETRVYGLEMALDEISHDLALSSGRIP--NTDAIE 529
+F+G+SQ+G+ SLE+RV GLEMALDEIS DLA+S+GR+P ++
Sbjct: 484 IENQQSSLLDLLQKFMGTSQSGIQSLESRVSGLEMALDEISCDLAVSNGRVPRNSSGCAG 543
Query: 530 DTCCKLPGTDFLSSKFWKRNEGRYSTSRRFSLGSSASPTAVHNATDGNGSREVLTANNKK 589
D+C KLPGT+FLS KFW++ E R TA A G RE NN +
Sbjct: 544 DSCSKLPGTEFLSPKFWRKTEERPRNRN----------TANEMAAYDQGMRESTDTNNGQ 593
Query: 590 NQHRRGGGGF 599
RGG F
Sbjct: 594 ----RGGSVF 599
>AT5G62580.1 | Symbols: | ARM repeat superfamily protein |
chr5:25120010-25122259 FORWARD LENGTH=615
Length = 615
Score = 302 bits (774), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 216/629 (34%), Positives = 327/629 (51%), Gaps = 80/629 (12%)
Query: 19 STQELRQRVIACLNKLSDRDTLACATAELESIARTLTHDSFS----SFLSCIHNTDSSSK 74
+T+ +Q + L KL DRDT A EL+ +AR + S S SF+S I + D+ K
Sbjct: 2 ATKNSKQNMSVLLTKLGDRDTFTMAARELDLMARQIDPSSSSGNLQSFISVILSVDTGDK 61
Query: 75 SPVRKQCVHLLTLLS-RFHGDALSPFLSKMISTVLLRLRDTDSAVRSACVDAVSAMSSRI 133
VRK C+HLL +LS ++LSPFLSK+++ + RLRD DS++RS CV AVSA+SSR
Sbjct: 62 PAVRKHCIHLLAVLSVSLPLNSLSPFLSKILTRITRRLRDPDSSIRSTCVAAVSAISSRT 121
Query: 134 TRPSFSVSFLKPFMDALAQEQDGNSQIGXXXXXXXXXXXXXXXXXXSLRRSALPRLGKLL 193
T+P F +F+KP D L EQ+ N+QIG L ++ LPRL KL+
Sbjct: 122 TKPPFYSAFMKPLADTLFTEQEVNAQIGAALCLAAAIDSASDPDPVRLGQTLLPRLEKLV 181
Query: 194 KKDTCKARAALLVLVGSVV-----------------------EFLSSEDWTVRKASAEAL 230
K + KA++A +V++GSV+ FL SEDW RKA+AEAL
Sbjct: 182 KCNAFKAKSAGVVVIGSVIGAGGLSGTSVSSGGLKGLVDCLLSFLVSEDWAARKAAAEAL 241
Query: 231 GKVASVEKGLASQHKVLCVDSLQNRRFDKVKVVRETMNLTLEMWKEVTNVSEDVPAPVKS 290
G++A++E+ + K C+ ++R++DKVK VRE MN +E WK+V ++SE+V P +
Sbjct: 242 GRLATMERNELGEFKAKCLKIFESRKYDKVKAVREVMNQMMEAWKQVPDLSEEVSPPRSN 301
Query: 291 ACASVEDGKGQCVAKSSPCIGPKSSQSKKTVPSSRSPPSTASFVSSIKREGALKS--NNK 348
A + + G+ + S +G ++S+ + + +PP + S ++ +++ KS K
Sbjct: 302 ASSKGDASDGRYPSGSR--VGSTPAKSRTHLVNRSTPPGS-SLATTARKQANRKSIDQKK 358
Query: 349 DLRVRAMHQQNHSEISDEKLDTPVLKLPPSNISFDDIKRCDFEASKPAPYQNGVNSRVDI 408
++ + N + K + P+ +S +D + CD + +N +
Sbjct: 359 TSLTASLTKPNVRRRLEWKAGGASI---PTGVSLEDEQHCDHD-------ENAKET---- 404
Query: 409 KRVLFSKMSDEKVRKFGGS----KPRVVPCYXXXXXXXXVRVSNV-SEACESSQDVEDLS 463
S S V+K GG + P V N S C+ +ED+S
Sbjct: 405 -----SHSSHNTVQKLGGVSSSLNGNIPPSGATMVTGHHVLSENPNSNNCKG---LEDIS 456
Query: 464 LIREQLIXXXXXXXXXXXXXXRFIGSSQTGMNSLETRVYGLEMALDEISHDLALSSGRIP 523
LIR QL+ RF+GSSQ GM LETRV+GLE+ALDEIS+DLA+S+GR+
Sbjct: 457 LIRNQLVQIEQQQANLMDLLQRFVGSSQHGMRGLETRVHGLELALDEISYDLAVSNGRMS 516
Query: 524 NTDAIEDTCCKLPGTDFLSSKFWKRNEGRYSTSRRFSL-GSSASPTAVHNATDGNGSREV 582
N + + CC LP F+ SKFWK+++ +YS SR + +A T + N+
Sbjct: 517 NGSS-RNNCCLLPSGSFIKSKFWKKHDSKYSASRMSTYRNRNAETTEIQNS--------- 566
Query: 583 LTANNKKNQHR-RGGGGFLVNPLAEVQSD 610
+HR G GF+VNPLAE++ D
Sbjct: 567 --------RHRFNGSPGFIVNPLAEIRPD 587
>AT1G59850.1 | Symbols: | ARM repeat superfamily protein |
chr1:22028509-22030207 REVERSE LENGTH=498
Length = 498
Score = 233 bits (595), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/277 (49%), Positives = 186/277 (67%), Gaps = 23/277 (8%)
Query: 22 ELRQRVIACLNKLSDRDTLACATAELESIARTLTHDSFSSFLSCIHNTDSSSKSPVRKQC 81
+L+QRVIACLN+LSDRDTLA A AEL+SIA L+ ++FS F++C+ +TDSS+KSPVRK C
Sbjct: 21 DLKQRVIACLNRLSDRDTLALAAAELDSIALNLSPETFSLFINCLQSTDSSAKSPVRKHC 80
Query: 82 VHLLTLLSRFHGDALSPFLSKMISTVLLRLRDTDSAVRSACVDAVSAMSSRITRPSFSVS 141
V LL++LSR HGD+L+P LSKM+STVL RLRD DS+VR+ACV A M++ IT FS+
Sbjct: 81 VSLLSVLSRSHGDSLAPHLSKMVSTVLRRLRDPDSSVRAACVAASVDMTTNITGQPFSIL 140
Query: 142 FLKPFMDALAQEQDGNSQIGXXXXXXXXXXXXXXXXXXSLRRSALPRLGKLLKKDTCKAR 201
F P ++ + + D N+QI L++ ALP++GKLLK + KA+
Sbjct: 141 F-GPMIETVIHDCDPNAQISAAMCLAAAVDAADEPDVEQLQK-ALPKIGKLLKSEGFKAK 198
Query: 202 AALLVLVGSVV---------------------EFLSSEDWTVRKASAEALGKVASVEKGL 240
A LL +G+V+ EFLSS+DW RKA+AEA+ +VA VE+ L
Sbjct: 199 AELLGAIGTVIGAVGGRNSEKAVLDWLLPNVSEFLSSDDWRARKAAAEAMARVAMVEEEL 258
Query: 241 ASQHKVLCVDSLQNRRFDKVKVVRETMNLTLEMWKEV 277
A +K C+ L++RRFDKVK+VRETMN TL +WK++
Sbjct: 259 APLYKKTCLGILESRRFDKVKLVRETMNRTLGLWKQL 295
Score = 50.8 bits (120), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 6/125 (4%)
Query: 409 KRVLF-SKMSDEKVRKFGGSKPRVVPCYXXXXXXXXVRVSNVSEACESSQDVEDLSLIRE 467
KRVLF +KM KV++ G +K +VV S+ S+A + E+LSLIR
Sbjct: 376 KRVLFPAKM--HKVKENGSNKSQVVQSSDEESPKTDSGSSSSSQA---KSNAEELSLIRH 430
Query: 468 QLIXXXXXXXXXXXXXXRFIGSSQTGMNSLETRVYGLEMALDEISHDLALSSGRIPNTDA 527
Q+ +F+ SS GM SLE RV GLE + IS DL +S N +
Sbjct: 431 QITQIEKQQSSLLDLFQKFMESSHNGMQSLERRVRGLETSFSVISTDLLVSRSITQNGNH 490
Query: 528 IEDTC 532
+ C
Sbjct: 491 KRNAC 495
>AT2G07170.1 | Symbols: | ARM repeat superfamily protein |
chr2:2975565-2978692 FORWARD LENGTH=820
Length = 820
Score = 157 bits (398), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 173/342 (50%), Gaps = 36/342 (10%)
Query: 22 ELRQRVIACLNKLSDRDTLACATAELESIARTLTHDSFSSFLSCIHNTDSSSKSPVRKQC 81
EL+++V+ LNKL+DRDT ELE L D S FLSCI +TDS KS VRK+C
Sbjct: 26 ELKKKVVIALNKLADRDTYQRGVDELEKTVEHLAPDKVSCFLSCILDTDSEQKSAVRKEC 85
Query: 82 VHLLTLLSRFHGDALSPFLSKMISTVLLRLRDTDSAVRSACVDAVSAMSSRIT---RPSF 138
+ L+ L+RFH + P+L KM+S+++ RL+D DS VR AC++ + ++S+++ +F
Sbjct: 86 IRLMGTLARFHEGLVGPYLGKMVSSIVKRLKDPDSVVRDACIETMGVLASKMSCYEDQNF 145
Query: 139 S--VSFLKPFMDALAQEQDGNSQIGXXXXXXXXXXXXXXXXXXSLRRSALPRLGKLLKKD 196
VS +KP +A+ +Q+ Q G ++R L R KLL
Sbjct: 146 GVFVSLVKPLFEAIG-DQNKYVQSGAALCLARVIDSSPEAPVAIIQR-MLMRTVKLLNNS 203
Query: 197 TCKARAALLVLVGSVV-------------------EFLSSEDWTVRKASAEALGKVASVE 237
A+ A++ L S++ + L ++DWT RKA++ AL ++A+
Sbjct: 204 HFIAKPAVIELNRSIILAGGATSKSVLSSAMSSFQDALKNKDWTTRKAASVALMEIAATG 263
Query: 238 KGLASQHKVLCVDSLQNRRFDKVKVVRETMNLTLEMWKEVTNVSEDVPAPVKSACASVED 297
+ K C+ SL++ RFDKVK VR+++ L L+ WK V P+ +S+ +
Sbjct: 264 EKFLGPLKASCICSLESCRFDKVKPVRDSVILALKYWKGVPGSDSPEPSETESSVKESYN 323
Query: 298 GKGQCV---------AKSSPCIGPKSSQSKKTVP-SSRSPPS 329
G + K I + ++K VP S+R PP+
Sbjct: 324 GARESSELFSTSDFKVKDGMSIKYVTDVTRKKVPVSARQPPT 365
>AT4G27060.1 | Symbols: TOR1, SPR2, CN | ARM repeat superfamily
protein | chr4:13581581-13585070 REVERSE LENGTH=864
Length = 864
Score = 135 bits (341), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 90/286 (31%), Positives = 151/286 (52%), Gaps = 31/286 (10%)
Query: 22 ELRQRVIACLNKLSDRDTLACATAELESIARTLTHDSFSSFLSCIHNTDSSSKSPVRKQC 81
EL+Q+++ ++KL+DRDT A +LE ++LT ++ FL+C++++ S K V+K+C
Sbjct: 41 ELKQKILTSISKLADRDTYQIAVEDLEKTIQSLTPETLPMFLNCLYDSCSDPKPAVKKEC 100
Query: 82 VHLLTLLSRFHGDALSPFLSKMISTVLLRLRDTDSAVRSACVDAVSAMSSRITR------ 135
+HLL+ + H D+ + L+K+I+ ++ RL+D+DS VR AC D + A+S +
Sbjct: 101 LHLLSYVCSLHCDSTAAHLTKIIAQIVKRLKDSDSGVRDACRDTIGALSGIYLKGKEEGT 160
Query: 136 ----PSFSVS-FLKPFMDALAQEQDGNSQIGXXXXXXXXXXXXXXXXXXSLRRSALPRLG 190
S +V F+KP +A+ EQ+ Q G S ++ PR+
Sbjct: 161 NTGSASLAVGLFVKPLFEAMG-EQNKVVQSGASMCMARMVESAASPPVTSFQK-LCPRIC 218
Query: 191 KLLKKDTCKARAALLVLVGSVVEF------------------LSSEDWTVRKASAEALGK 232
KLL + A+A+LL +V S+ + L S DW RKA+AE L
Sbjct: 219 KLLSNSSFLAKASLLPVVSSLSQVGAIAPQSLESLLESIHDCLGSTDWVTRKAAAETLTA 278
Query: 233 VASVEKGLASQHKVLCVDSLQNRRFDKVKVVRETMNLTLEMWKEVT 278
+AS GL + + L+ RFDK+K VRE++ L++WK+++
Sbjct: 279 LASHSSGLIKEKTDSTITVLETCRFDKIKPVRESVTEALQLWKKIS 324
>AT1G50890.1 | Symbols: | ARM repeat superfamily protein |
chr1:18862295-18865752 REVERSE LENGTH=821
Length = 821
Score = 115 bits (288), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 141/292 (48%), Gaps = 33/292 (11%)
Query: 22 ELRQRVIACLNKLSDRDTLACATAELESIARTL--THDSFSSFLSCIHNTDSSSKSPVRK 79
EL+QR++ L++L DRDT A +LE I ++ + + L C+ ++ S K+PV++
Sbjct: 36 ELKQRILTSLSRLGDRDTYQIAVDDLEKIVVSVPDSPEILPVLLHCLFDSSSDLKAPVKR 95
Query: 80 QCVHLLTLLSRFHGDALSPFLSKMISTVLLRLRDTDSAVRSACVDAVSAMSSRITRP--- 136
+ + LL+ L + D L+K+IS ++ RL+D D+ VR AC DA+ ++S++ +
Sbjct: 96 ESIRLLSFLCLSYTDLSFSQLAKIISHIVKRLKDADNGVRDACRDAIGSLSAQFLKEKEV 155
Query: 137 --------SFSVSFLKPFMDALAQEQDGNSQIGXXXXXXXXXXXXXXXXXXSLRRSALPR 188
S F KP +A+A EQ+ + Q G + ++ PR
Sbjct: 156 ENGNYVGSSLVGLFAKPLFEAMA-EQNKSLQSGAAICMGKMIDSATEPPVAAFQK-LCPR 213
Query: 189 LGKLLKKDTCKARAALLVLVG------------------SVVEFLSSEDWTVRKASAEAL 230
+ KLL +A+LL +VG S+ E L +W RKA+A+ L
Sbjct: 214 ISKLLNSPNYITKASLLPVVGSLSQVGAIAPQSLESLLHSIHECLGCTNWVTRKAAADVL 273
Query: 231 GKVASVEKGLASQHKVLCVDSLQNRRFDKVKVVRETMNLTLEMWKEVTNVSE 282
+A L + + +L+ RFDK+K VRE+++ L +WK + E
Sbjct: 274 ISLAVHSSSLVADKTDSTLTALEACRFDKIKPVRESLSEALNVWKNIAGKGE 325