Miyakogusa Predicted Gene
- Lj4g3v1562770.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1562770.1 tr|F4QCB8|F4QCB8_DICFS Inositol 5-phosphatase 2
OS=Dictyostelium fasciculatum (strain SH3)
GN=DFA_12,22.9,3e-18,seg,NULL; no description,Armadillo-like helical;
FAMILY NOT NAMED,NULL; HEAT,HEAT; Cohesin_HEAT,HEAT,CUFF.49421.1
(537 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G62580.1 | Symbols: | ARM repeat superfamily protein | chr5:... 406 e-113
AT1G27210.1 | Symbols: | ARM repeat superfamily protein | chr1:... 398 e-111
AT1G59850.1 | Symbols: | ARM repeat superfamily protein | chr1:... 250 2e-66
AT4G27060.1 | Symbols: TOR1, SPR2, CN | ARM repeat superfamily p... 209 6e-54
AT2G07170.1 | Symbols: | ARM repeat superfamily protein | chr2:... 196 2e-50
AT1G50890.1 | Symbols: | ARM repeat superfamily protein | chr1:... 160 2e-39
>AT5G62580.1 | Symbols: | ARM repeat superfamily protein |
chr5:25120010-25122259 FORWARD LENGTH=615
Length = 615
Score = 406 bits (1043), Expect = e-113, Method: Compositional matrix adjust.
Identities = 259/579 (44%), Positives = 342/579 (59%), Gaps = 95/579 (16%)
Query: 8 KQRVLGCLTKLSDRDTHSLAVAELESIARNLD----AATVPAFLSCIYSTDATDKSPVRK 63
KQ + LTKL DRDT ++A EL+ +AR +D + + +F+S I S D DK VRK
Sbjct: 7 KQNMSVLLTKLGDRDTFTMAARELDLMARQIDPSSSSGNLQSFISVILSVDTGDKPAVRK 66
Query: 64 QCVRLVAFLSEAHG-NALSPFLSKILASIIRRLRDPDSSVRSACVDSVSALSCHVTKQPF 122
C+ L+A LS + N+LSPFLSKIL I RRLRDPDSS+RS CV +VSA+S TK PF
Sbjct: 67 HCIHLLAVLSVSLPLNSLSPFLSKILTRITRRLRDPDSSIRSTCVAAVSAISSRTTKPPF 126
Query: 123 AS-FLKPLAEALFTEQDQNXXXXXXXXXXXXIDGAPDPDPAKLGK-LLPRFEKLLKRDGF 180
S F+KPLA+ LFTEQ+ N ID A DPDP +LG+ LLPR EKL+K + F
Sbjct: 127 YSAFMKPLADTLFTEQEVNAQIGAALCLAAAIDSASDPDPVRLGQTLLPRLEKLVKCNAF 186
Query: 181 RAKPALLTLIGSIVEAGGASGQAV----LKNLIPCLVESLSSNDWAVRKASAEALVVLAT 236
+AK A + +IGS++ AGG SG +V LK L+ CL+ L S DWA RKA+AEAL LAT
Sbjct: 187 KAKSAGVVVIGSVIGAGGLSGTSVSSGGLKGLVDCLLSFLVSEDWAARKAAAEALGRLAT 246
Query: 237 VEREFLSEFKAECLKVFENRRFDKVKLVREVMNQMLESWKQIPDVSDELXXXXXXXXXXK 296
+ER L EFKA+CLK+FE+R++DKVK VREVMNQM+E+WKQ+PD+S+E+ K
Sbjct: 247 MERNELGEFKAKCLKIFESRKYDKVKAVREVMNQMMEAWKQVPDLSEEV-SPPRSNASSK 305
Query: 297 GNGSDGHGRYPPVSQNSCSPGLVMANLRKKSASVSRFSPQDSSSASNAKSTKVLSSNKRM 356
G+ SD GRYP S+ +P + ++ V+R +P SS A+ A+
Sbjct: 306 GDASD--GRYPSGSRVGSTPA------KSRTHLVNRSTPPGSSLATTAR----------- 346
Query: 357 SSVASSRKLNHKNWDAQIFQSNTPSAAMADQGDLQGKSRIFKPEVKRALKYGGSKAGSRV 416
++ N K+ D Q T A + KP V+R L++ KAG
Sbjct: 347 ------KQANRKSID----QKKTSLTA-----------SLTKPNVRRRLEW---KAGGAS 382
Query: 417 VPY-----------HEE-----SQETQDAVSVSNDVSKDLN------------------- 441
+P H+E S + + V VS LN
Sbjct: 383 IPTGVSLEDEQHCDHDENAKETSHSSHNTVQKLGGVSSSLNGNIPPSGATMVTGHHVLSE 442
Query: 442 ----RNDKESEDLSLIRDQLLQIEKQQSYLVDLLQKFIGSSQNGMHSLETRVHGLELSLD 497
N K ED+SLIR+QL+QIE+QQ+ L+DLLQ+F+GSSQ+GM LETRVHGLEL+LD
Sbjct: 443 NPNSNNCKGLEDISLIRNQLVQIEQQQANLMDLLQRFVGSSQHGMRGLETRVHGLELALD 502
Query: 498 EISYDLAVSSGRMTSSDAPAHTCCLPPGADFLSSKFWRR 536
EISYDLAVS+GRM S+ + + CCL P F+ SKFW++
Sbjct: 503 EISYDLAVSNGRM-SNGSSRNNCCLLPSGSFIKSKFWKK 540
>AT1G27210.1 | Symbols: | ARM repeat superfamily protein |
chr1:9455669-9457917 REVERSE LENGTH=625
Length = 625
Score = 398 bits (1023), Expect = e-111, Method: Compositional matrix adjust.
Identities = 242/545 (44%), Positives = 329/545 (60%), Gaps = 32/545 (5%)
Query: 7 IKQRVLGCLTKLSDRDTHSLAVAELESIARNLDAATVPAFLSCIYSTDATDKSPVRKQCV 66
+KQRV+ CL KL+DRDT +LA AEL+SIARNL + FL+CI++TD++ KSPVRKQCV
Sbjct: 35 LKQRVIACLNKLADRDTLALASAELDSIARNLTHDSFSPFLNCIHNTDSSVKSPVRKQCV 94
Query: 67 RLVAFLSEAHGNALSPFLSKILASIIRRLRDPDSSVRSACVDSVSALSCHVTKQPFASFL 126
L++ LS HG++L+P L+K+++++IRRLRDPDSSVRSAC + + +S HVT+QPFAS
Sbjct: 95 ALLSVLSRYHGDSLTPHLAKMVSTVIRRLRDPDSSVRSACAVATADMSAHVTRQPFASVA 154
Query: 127 KPLAEALFTEQDQNXXXXXXXXXXXXIDGAPDPDPAKLGKLLPRFEKLLKRDGFRAKPAL 186
KPL E L E D N +D A DP+ +L K LP+ KLLK DGF+AK AL
Sbjct: 155 KPLIETLIQEGDSNLQIGAALCLAASVDAATDPESEQLRKSLPKIGKLLKSDGFKAKAAL 214
Query: 187 LTLIGSIVEAGGASGQAVLKNLIPCLVESLSSNDWAVRKASAEALVVLATVEREFLSEFK 246
L+ +GSI+ AGGA + VL L+P L+E LSS DWA RK++AEAL +AT E + S++K
Sbjct: 215 LSAVGSIITAGGAGTKPVLDWLVPVLIEFLSSEDWAARKSAAEALGKVATAE-DLASQYK 273
Query: 247 AECLKVFENRRFDKVKLVREVMNQMLESWKQIPDVSDELXXXXXXXXXXKGNGSDGHGRY 306
C E+RRFDKVK VRE MN+ L WK++ DE GN G +
Sbjct: 274 KTCTTALESRRFDKVKSVRETMNRALNLWKEV-STDDEASLSPSRSSTDDGN----IGCF 328
Query: 307 PPVSQNSCSPGLVMANLRKKSASVSRFSP----QDSSSASNAKST--KVLSSNKRMSSVA 360
V+++S + + KK + + SP S +A+ K K N M
Sbjct: 329 SSVTRSSTIDVGLKSARPKKVTPIMKRSPSLPVNRSYAATRQKENLPKRNQGNMTMLVEE 388
Query: 361 SSRKLNHKNWDAQIFQSNTPSAAMADQGD-------LQGKSRIFKPEVKRALKYGGSKAG 413
+S N + +S+ + A+ G + KSR E + +GG ++G
Sbjct: 389 ASSVDNKGPHFTPVKKSSEETEEKANSGGPDIIKHTISEKSR----EDSKVSSFGGLRSG 444
Query: 414 SRVVPYHEESQETQDAVSVSNDVSKDLNRNDKESEDLSLIRDQLLQIEKQQSYLVDLLQK 473
SRV P + D SV N D+ + K+SE+LSLIR+QL IE QQS L+DLLQK
Sbjct: 445 SRVAPCSD------DGDSVKN-CKDDVEESKKDSEELSLIREQLALIENQQSSLLDLLQK 497
Query: 474 FIGSSQNGMHSLETRVHGLELSLDEISYDLAVSSGRM--TSSDAPAHTCCLPPGADFLSS 531
F+G+SQ+G+ SLE+RV GLE++LDEIS DLAVS+GR+ SS +C PG +FLS
Sbjct: 498 FMGTSQSGIQSLESRVSGLEMALDEISCDLAVSNGRVPRNSSGCAGDSCSKLPGTEFLSP 557
Query: 532 KFWRR 536
KFWR+
Sbjct: 558 KFWRK 562
>AT1G59850.1 | Symbols: | ARM repeat superfamily protein |
chr1:22028509-22030207 REVERSE LENGTH=498
Length = 498
Score = 250 bits (639), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 134/278 (48%), Positives = 187/278 (67%), Gaps = 2/278 (0%)
Query: 3 TPQTIKQRVLGCLTKLSDRDTHSLAVAELESIARNLDAATVPAFLSCIYSTDATDKSPVR 62
T +KQRV+ CL +LSDRDT +LA AEL+SIA NL T F++C+ STD++ KSPVR
Sbjct: 18 TVTDLKQRVIACLNRLSDRDTLALAAAELDSIALNLSPETFSLFINCLQSTDSSAKSPVR 77
Query: 63 KQCVRLVAFLSEAHGNALSPFLSKILASIIRRLRDPDSSVRSACVDSVSALSCHVTKQPF 122
K CV L++ LS +HG++L+P LSK++++++RRLRDPDSSVR+ACV + ++ ++T QPF
Sbjct: 78 KHCVSLLSVLSRSHGDSLAPHLSKMVSTVLRRLRDPDSSVRAACVAASVDMTTNITGQPF 137
Query: 123 ASFLKPLAEALFTEQDQNXXXXXXXXXXXXIDGAPDPDPAKLGKLLPRFEKLLKRDGFRA 182
+ P+ E + + D N +D A +PD +L K LP+ KLLK +GF+A
Sbjct: 138 SILFGPMIETVIHDCDPNAQISAAMCLAAAVDAADEPDVEQLQKALPKIGKLLKSEGFKA 197
Query: 183 KPALLTLIGSIVEAGGA--SGQAVLKNLIPCLVESLSSNDWAVRKASAEALVVLATVERE 240
K LL IG+++ A G S +AVL L+P + E LSS+DW RKA+AEA+ +A VE E
Sbjct: 198 KAELLGAIGTVIGAVGGRNSEKAVLDWLLPNVSEFLSSDDWRARKAAAEAMARVAMVEEE 257
Query: 241 FLSEFKAECLKVFENRRFDKVKLVREVMNQMLESWKQI 278
+K CL + E+RRFDKVKLVRE MN+ L WKQ+
Sbjct: 258 LAPLYKKTCLGILESRRFDKVKLVRETMNRTLGLWKQL 295
>AT4G27060.1 | Symbols: TOR1, SPR2, CN | ARM repeat superfamily
protein | chr4:13581581-13585070 REVERSE LENGTH=864
Length = 864
Score = 209 bits (531), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 167/541 (30%), Positives = 268/541 (49%), Gaps = 54/541 (9%)
Query: 7 IKQRVLGCLTKLSDRDTHSLAVAELESIARNLDAATVPAFLSCIYSTDATDKSPVRKQCV 66
+KQ++L ++KL+DRDT+ +AV +LE ++L T+P FL+C+Y + + K V+K+C+
Sbjct: 42 LKQKILTSISKLADRDTYQIAVEDLEKTIQSLTPETLPMFLNCLYDSCSDPKPAVKKECL 101
Query: 67 RLVAFLSEAHGNALSPFLSKILASIIRRLRDPDSSVRSACVDSVSALSCHVTKQ------ 120
L++++ H ++ + L+KI+A I++RL+D DS VR AC D++ ALS K
Sbjct: 102 HLLSYVCSLHCDSTAAHLTKIIAQIVKRLKDSDSGVRDACRDTIGALSGIYLKGKEEGTN 161
Query: 121 ------PFASFLKPLAEALFTEQDQNXXXXXXXXXXXXIDGAPDPDPAKLGKLLPRFEKL 174
F+KPL EA+ EQ++ ++ A P KL PR KL
Sbjct: 162 TGSASLAVGLFVKPLFEAM-GEQNKVVQSGASMCMARMVESAASPPVTSFQKLCPRICKL 220
Query: 175 LKRDGFRAKPALLTLIGSIVEAGGASGQAVLKNLIPCLVESLSSNDWAVRKASAEALVVL 234
L F AK +LL ++ S+ + G + Q+ L++L+ + + L S DW RKA+AE L L
Sbjct: 221 LSNSSFLAKASLLPVVSSLSQVGAIAPQS-LESLLESIHDCLGSTDWVTRKAAAETLTAL 279
Query: 235 ATVEREFLSEFKAECLKVFENRRFDKVKLVREVMNQMLESWKQIP-----DVSDELXXXX 289
A+ + E + V E RFDK+K VRE + + L+ WK+I SD+
Sbjct: 280 ASHSSGLIKEKTDSTITVLETCRFDKIKPVRESVTEALQLWKKISGKYVDGASDDSKLSA 339
Query: 290 XXXXXXKGNG--------------SDGHGRYPPVSQ--NSCSPGLVMANLRKKSASVS-- 331
+ NG SDG P + C P + L+KK+ +S
Sbjct: 340 SEQLGSEKNGEKRSNLADLMKKEASDGSTLSPDSASKGKGCFPEKAVGLLKKKAPVLSDK 399
Query: 332 RFSPQDSSSASNAKSTKVLSSNKRMSSVASSRKLNHKNWDAQIFQSNTPSAAMADQGDLQ 391
F+P+ +S +V+ + ++ L+ N + SN AD ++
Sbjct: 400 DFNPEFFQRLERRQSVEVVVPRRCKNNDEEESGLDDLN---AMGSSNRLKNTQADDKQVK 456
Query: 392 GKSRIFKPEVKRALKYGGSKAG---SRVVPYHEESQETQDAVSVSNDVSKDLNRNDKESE 448
G+ F +A G KAG + P H D S + S NR
Sbjct: 457 GR---FDGNGSQARTSGDDKAGVVNGKETPGHHAPVSNTDNQSEGSFTS---NRG----- 505
Query: 449 DLSLIRDQLLQIEKQQSYLVDLLQKFIGSSQNGMHSLETRVHGLELSLDEISYDLAVSSG 508
+ S I+ QLLQ+E+QQ+ L+++LQ+FIG S + M +LE RV GLE +++++ DL++SSG
Sbjct: 506 NWSAIQRQLLQLERQQTNLMNMLQEFIGGSHDSMVTLEGRVRGLERIVEDMARDLSISSG 565
Query: 509 R 509
R
Sbjct: 566 R 566
>AT2G07170.1 | Symbols: | ARM repeat superfamily protein |
chr2:2975565-2978692 FORWARD LENGTH=820
Length = 820
Score = 196 bits (499), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 108/279 (38%), Positives = 167/279 (59%), Gaps = 7/279 (2%)
Query: 7 IKQRVLGCLTKLSDRDTHSLAVAELESIARNLDAATVPAFLSCIYSTDATDKSPVRKQCV 66
+K++V+ L KL+DRDT+ V ELE +L V FLSCI TD+ KS VRK+C+
Sbjct: 27 LKKKVVIALNKLADRDTYQRGVDELEKTVEHLAPDKVSCFLSCILDTDSEQKSAVRKECI 86
Query: 67 RLVAFLSEAHGNALSPFLSKILASIIRRLRDPDSSVRSACVDSV----SALSCHVTKQ-- 120
RL+ L+ H + P+L K+++SI++RL+DPDS VR AC++++ S +SC+ +
Sbjct: 87 RLMGTLARFHEGLVGPYLGKMVSSIVKRLKDPDSVVRDACIETMGVLASKMSCYEDQNFG 146
Query: 121 PFASFLKPLAEALFTEQDQNXXXXXXXXXXXXIDGAPDPDPAKLGKLLPRFEKLLKRDGF 180
F S +KPL EA+ +Q++ ID +P+ A + ++L R KLL F
Sbjct: 147 VFVSLVKPLFEAI-GDQNKYVQSGAALCLARVIDSSPEAPVAIIQRMLMRTVKLLNNSHF 205
Query: 181 RAKPALLTLIGSIVEAGGASGQAVLKNLIPCLVESLSSNDWAVRKASAEALVVLATVERE 240
AKPA++ L SI+ AGGA+ ++VL + + ++L + DW RKA++ AL+ +A +
Sbjct: 206 IAKPAVIELNRSIILAGGATSKSVLSSAMSSFQDALKNKDWTTRKAASVALMEIAATGEK 265
Query: 241 FLSEFKAECLKVFENRRFDKVKLVREVMNQMLESWKQIP 279
FL KA C+ E+ RFDKVK VR+ + L+ WK +P
Sbjct: 266 FLGPLKASCICSLESCRFDKVKPVRDSVILALKYWKGVP 304
>AT1G50890.1 | Symbols: | ARM repeat superfamily protein |
chr1:18862295-18865752 REVERSE LENGTH=821
Length = 821
Score = 160 bits (405), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/286 (32%), Positives = 157/286 (54%), Gaps = 16/286 (5%)
Query: 7 IKQRVLGCLTKLSDRDTHSLAVAELESIARNL--DAATVPAFLSCIYSTDATDKSPVRKQ 64
+KQR+L L++L DRDT+ +AV +LE I ++ +P L C++ + + K+PV+++
Sbjct: 37 LKQRILTSLSRLGDRDTYQIAVDDLEKIVVSVPDSPEILPVLLHCLFDSSSDLKAPVKRE 96
Query: 65 CVRLVAFLSEAHGNALSPFLSKILASIIRRLRDPDSSVRSACVDSVSALSCHVTKQP--- 121
+RL++FL ++ + L+KI++ I++RL+D D+ VR AC D++ +LS K+
Sbjct: 97 SIRLLSFLCLSYTDLSFSQLAKIISHIVKRLKDADNGVRDACRDAIGSLSAQFLKEKEVE 156
Query: 122 ---------FASFLKPLAEALFTEQDQNXXXXXXXXXXXXIDGAPDPDPAKLGKLLPRFE 172
F KPL EA+ EQ+++ ID A +P A KL PR
Sbjct: 157 NGNYVGSSLVGLFAKPLFEAM-AEQNKSLQSGAAICMGKMIDSATEPPVAAFQKLCPRIS 215
Query: 173 KLLKRDGFRAKPALLTLIGSIVEAGGASGQAVLKNLIPCLVESLSSNDWAVRKASAEALV 232
KLL + K +LL ++GS+ + G + Q+ L++L+ + E L +W RKA+A+ L+
Sbjct: 216 KLLNSPNYITKASLLPVVGSLSQVGAIAPQS-LESLLHSIHECLGCTNWVTRKAAADVLI 274
Query: 233 VLATVEREFLSEFKAECLKVFENRRFDKVKLVREVMNQMLESWKQI 278
LA +++ L E RFDK+K VRE +++ L WK I
Sbjct: 275 SLAVHSSSLVADKTDSTLTALEACRFDKIKPVRESLSEALNVWKNI 320