Miyakogusa Predicted Gene - chr5.CM0357.270.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr5.CM0357.270.nc - phase: 0
(250 letters)
Database: TAIR8_pep
32,825 sequences; 13,166,001 total letters
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT3G22850.1 | Symbols: | similar to unknown protein [Arabidopsi...   376   e-105
AT5G43830.1 | Symbols: | similar to unknown protein [Arabidopsi...   368   e-102
AT4G27450.1 | Symbols: | similar to unknown protein [Arabidopsi...   236   9e-63
AT3G15450.1 | Symbols: | similar to unknown protein [Arabidopsi...   231   4e-61
AT5G19140.1 | Symbols: | auxin/aluminum-responsive protein, put...   220   5e-58
AT5G19140.2 | Symbols: | auxin/aluminum-responsive protein, put...   199   2e-51
AT3G15450.2 | Symbols: | similar to unknown protein [Arabidopsi...   161   4e-40
AT3G15450.3 | Symbols: | similar to unknown protein [Arabidopsi...   133   9e-32
>AT3G22850.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT5G43830.1); similar to hypothetical
protein [Vitis vinifera] (GB:CAN71784.1); contains
domain PTHR11772 (PTHR11772); contains domain
G3DSA:3.60.20.10 (G3DSA:3.60.20.10); contains domain
SSF56235 (SSF56235) | chr3:8089074-8090282 FORWARD
Length = 248
Score = 376 bits (966), Expect = e-105, Method: Compositional matrix adjust.
Identities = 178/250 (71%), Positives = 206/250 (82%), Gaps = 2/250 (0%)
Query: 1 MLAVFDKSVAKGPEALQSPQSNSVSALKDGFLAQHFSSVYPGSVIVNLGTSGTLAYSLHK 60
MLA+FDK+VAK PEALQ + SV ALKD FL HFSSVYPG+V +NLG+SG +A SL K
Sbjct: 1 MLAIFDKNVAKTPEALQGQEGGSVCALKDRFLPNHFSSVYPGAVTINLGSSGFIACSLEK 60
Query: 61 QNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLNKTANEVIIVIEAYRTLRDRGPYPA 120
QNPLLPRLFAVVDD+FC+FQGHI+NV LKQQYGL KTA EV IVIEAYRTLRDRGPY A
Sbjct: 61 QNPLLPRLFAVVDDMFCIFQGHIENVPILKQQYGLTKTATEVTIVIEAYRTLRDRGPYSA 120
Query: 121 AQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSDETEIVAKSCGK 180
QVVRDFQGKF F+L+D ++ F++ D DGSVP +WGTDA+G+LV+SD+ E V K CGK
Sbjct: 121 EQVVRDFQGKFGFMLYDCSTQNVFLAGDVDGSVPLYWGTDAEGHLVVSDDVETVKKGCGK 180
Query: 181 SSAPFPKGCFFSTSGGLSSFEHPLNEMKAVPRVDSSGEMCGATFKVDADAKKETTGMPRV 240
S APFPKGCFF++SGGL S+EHP NE+K VPRVDSSGE+CG TFKVD++AKKE MPRV
Sbjct: 181 SFAPFPKGCFFTSSGGLRSYEHPSNELKPVPRVDSSGEVCGVTFKVDSEAKKE--AMPRV 238
Query: 241 GSAANWSNNI 250
GS NWS I
Sbjct: 239 GSVQNWSKQI 248
>AT5G43830.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT3G22850.1); similar to
aluminum-induced protein-like protein [Thellungiella
halophila] (GB:AAM19711.1); contains domain
G3DSA:3.60.20.10 (G3DSA:3.60.20.10); contains domain
SSF56235 (SSF56235) | chr5:17639820-17641466 REVERSE
Length = 251
Score = 368 bits (945), Expect = e-102, Method: Compositional matrix adjust.
Identities = 176/251 (70%), Positives = 204/251 (81%), Gaps = 1/251 (0%)
Query: 1 MLAVFDKSVAKGPEALQSPQSN-SVSALKDGFLAQHFSSVYPGSVIVNLGTSGTLAYSLH 59
MLAVF+K+VA PEALQSP S+ S ALKDG LA HF+SV P SV +N G+SG +AYSL
Sbjct: 1 MLAVFEKTVANSPEALQSPHSSESAFALKDGSLATHFASVNPNSVTLNFGSSGFVAYSLD 60
Query: 60 KQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLNKTANEVIIVIEAYRTLRDRGPYP 119
+P +PRLFAVVDDIFCLFQGHI+N+ LKQQYGLNK NE IIVIEAYRTLRDRGPYP
Sbjct: 61 NPDPRVPRLFAVVDDIFCLFQGHIENLPFLKQQYGLNKITNEAIIVIEAYRTLRDRGPYP 120
Query: 120 AAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSDETEIVAKSCG 179
+VVRDF GKF F+LFDS KT F ++D DGSVPFFWGTDA+G+LV SD TE+V K C
Sbjct: 121 VDKVVRDFHGKFAFILFDSVKKTVFAAADADGSVPFFWGTDAEGHLVFSDNTEMVKKGCA 180
Query: 180 KSSAPFPKGCFFSTSGGLSSFEHPLNEMKAVPRVDSSGEMCGATFKVDADAKKETTGMPR 239
KS PFPKGCFF++SGGL SFEHP NE+K VPRVDSSG++CGATFKVDA+ K+E T MPR
Sbjct: 181 KSYGPFPKGCFFTSSGGLRSFEHPKNELKPVPRVDSSGDVCGATFKVDAETKREGTKMPR 240
Query: 240 VGSAANWSNNI 250
V S+ NW+ +I
Sbjct: 241 VDSSQNWAGHI 251
>AT4G27450.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT3G15450.1); similar to unnamed protein
product [Vitis vinifera] (GB:CAO39242.1); contains
domain G3DSA:3.60.20.10 (G3DSA:3.60.20.10); contains
domain SSF56235 (SSF56235) | chr4:13727671-13728689
REVERSE
Length = 250
Score = 236 bits (602), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 123/258 (47%), Positives = 161/258 (62%), Gaps = 20/258 (7%)
Query: 1 MLAVFDKSVAKGPEALQSPQSNSVSA---LKDGFLAQHFSSVYP-GSVIVNLGTSGTLAY 56
MLA+F ++ A PE L SP S S L + L F YP + ++ G + LAY
Sbjct: 1 MLAIFHEAFAHPPEELNSPASEKCSKQPKLPEETL-NDFLLRYPLNTFSMSFGQAAVLAY 59
Query: 57 -------SLHKQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLNKTANEVIIVIEAY 109
S+H+ RLF DDI+CLF G + N+ L +QYGL KT NE + VIEAY
Sbjct: 60 VRPSASFSIHQ------RLFCGFDDIYCLFFGSLNNLCQLNKQYGLTKTTNEAMFVIEAY 113
Query: 110 RTLRDRGPYPAAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSD 169
RTLRDRGPYPA QVV+D G F+FV++DS + + F + DG V +WG ADG++V+SD
Sbjct: 114 RTLRDRGPYPADQVVKDLDGSFSFVVYDSKAGSVFTALGSDGGVKLYWGIAADGSVVISD 173
Query: 170 ETEIVAKSCGKSSAPFPKGCFFSTSGGLSSFEHPLNEMKAVPRVDSSGEMCGATFKVDAD 229
+ +++ + C KS APFP GC F + GGL SFEHP+N++KA+PRVDS G +CGA FKV D
Sbjct: 174 DLDVIKEGCAKSFAPFPTGCMFHSEGGLMSFEHPMNKIKAMPRVDSEGVLCGANFKV--D 231
Query: 230 AKKETTGMPRVGSAANWS 247
+PR GS ANWS
Sbjct: 232 VYNRVNSIPRRGSEANWS 249
>AT3G15450.1 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT4G27450.1); similar to unknown
[Populus trichocarpa] (GB:ABK93866.1); contains domain
PTHR11772 (PTHR11772); contains domain G3DSA:3.60.20.10
(G3DSA:3.60.20.10); contains domain SSF56235 (SSF56235)
| chr3:5213057-5214005 FORWARD
Length = 253
Score = 231 bits (588), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 117/256 (45%), Positives = 162/256 (63%), Gaps = 17/256 (6%)
Query: 1 MLAVFDKSVAKGPEALQSPQSN--------SVSALKDGFLAQHFSSVYPGSVIVNLGTSG 52
MLA+F K+ A PE L SP S+ L D FL+ H ++ + +N G S
Sbjct: 1 MLAIFQKAFAHPPEELNSPASHFSGKTPKLPGETLSD-FLSHHQNNAFS----MNFGDSA 55
Query: 53 TLAYSLHKQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLN-KTANEVIIVIEAYRT 111
LAY+ ++ L RLF +D I+C+F G + N+ L +QYGL+ K +NE + VIEAYRT
Sbjct: 56 VLAYA-RQETSLRQRLFCGLDGIYCMFLGRLNNLCTLNRQYGLSGKNSNEAMFVIEAYRT 114
Query: 112 LRDRGPYPAAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSDET 171
LRDRGPYPA QV+R +G F FV++D+ + + F + DG +WG DG++V+SD+
Sbjct: 115 LRDRGPYPADQVLRGLEGSFAFVVYDTQTSSVFSALSSDGGESLYWGISGDGSVVMSDDI 174
Query: 172 EIVAKSCGKSSAPFPKGCFFSTSGGLSSFEHPLNEMKAVPRVDSSGEMCGATFKVDADAK 231
+I+ + C KS APFP GC F + GL SF+HP N MKA+PR+DS G +CGA+FKVDA +K
Sbjct: 175 QIIKQGCAKSFAPFPNGCMFHSETGLKSFDHPTNMMKAMPRIDSEGVLCGASFKVDACSK 234
Query: 232 KETTGMPRVGSAANWS 247
+PR GS ANW+
Sbjct: 235 --INSIPRRGSEANWA 248
>AT5G19140.1 | Symbols: | auxin/aluminum-responsive protein,
putative | chr5:6423400-6425787 FORWARD
Length = 234
Score = 220 bits (561), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 111/229 (48%), Positives = 148/229 (64%), Gaps = 4/229 (1%)
Query: 1 MLAVFDKSVAKGPEALQSPQSNSVSALKDG-FLAQHFSSVYPGSVIVNLGTSGTLAYSLH 59
ML +F ++ PE L + S + S G L F P +V V +G LAYS H
Sbjct: 1 MLGIFSGAIVSPPEELVAAGSRTPSPKTTGSTLVNRFVEKNPSAVSVQVGDYVQLAYSHH 60
Query: 60 KQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLNKTANEVIIVIEAYRTLRDRGPYP 119
++PL PR F D+IFCLFQG + N+ LKQQYGL K ANEV++VIEAY+TLRDR PYP
Sbjct: 61 NESPLRPRSFGAKDEIFCLFQGSLDNLGSLKQQYGLAKNANEVLLVIEAYKTLRDRAPYP 120
Query: 120 AAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSDETEIVAKSCG 179
A VV G F FV+FD + T F++SD G VP +WG ADG + +D+ +++ +CG
Sbjct: 121 ANHVVAHLSGDFAFVVFDKSTSTLFVASDQVGKVPLYWGITADGYVAFADDVDLLKGACG 180
Query: 180 KSSAPFPKGCFFSTS-GGLSSFEHPLNEMKAVPRVDSSGEMCGATFKVD 227
KS A FP+GC++ST+ GGL SFE+P N++ AVP + GE+ GATFKV+
Sbjct: 181 KSLASFPQGCYYSTALGGLRSFENPKNKITAVPA--NEGEIWGATFKVE 227
>AT5G19140.2 | Symbols: | auxin/aluminum-responsive protein,
putative | chr5:6423400-6425787 FORWARD
Length = 222
Score = 199 bits (506), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 140/229 (61%), Gaps = 16/229 (6%)
Query: 1 MLAVFDKSVAKGPEALQSPQSNSVSALKDG-FLAQHFSSVYPGSVIVNLGTSGTLAYSLH 59
ML +F ++ PE L + S + S G L F P +V V +G LAYS H
Sbjct: 1 MLGIFSGAIVSPPEELVAAGSRTPSPKTTGSTLVNRFVEKNPSAVSVQVGDYVQLAYSHH 60
Query: 60 KQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLNKTANEVIIVIEAYRTLRDRGPYP 119
++PL PR F D+IFCLFQG + N+ LKQQYGL K ANEV++VIEAY+TLRDR PYP
Sbjct: 61 NESPLRPRSFGAKDEIFCLFQGSLDNLGSLKQQYGLAKNANEVLLVIEAYKTLRDRAPYP 120
Query: 120 AAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSDETEIVAKSCG 179
A VV G F FV+FD + T F++SD G VP +WG ADG + +D+ +++
Sbjct: 121 ANHVVAHLSGDFAFVVFDKSTSTLFVASDQVGKVPLYWGITADGYVAFADDVDLL----- 175
Query: 180 KSSAPFPKGCFFSTS-GGLSSFEHPLNEMKAVPRVDSSGEMCGATFKVD 227
KGC++ST+ GGL SFE+P N++ AVP + GE+ GATFKV+
Sbjct: 176 -------KGCYYSTALGGLRSFENPKNKITAVPA--NEGEIWGATFKVE 215
>AT3G15450.2 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT4G27450.1); similar to unknown
[Populus trichocarpa] (GB:ABK93866.1); contains domain
N-terminal nucleophile aminohydrolases (Ntn hydrolases)
(SSF56235); contains domain no description
(G3DSA:3.60.20.10) | chr3:5213057-5213773 FORWARD
Length = 208
Score = 161 bits (407), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 84/197 (42%), Positives = 120/197 (60%), Gaps = 15/197 (7%)
Query: 1 MLAVFDKSVAKGPEALQSPQSN--------SVSALKDGFLAQHFSSVYPGSVIVNLGTSG 52
MLA+F K+ A PE L SP S+ L D FL+ H ++ + +N G S
Sbjct: 1 MLAIFQKAFAHPPEELNSPASHFSGKTPKLPGETLSD-FLSHHQNNAFS----MNFGDSA 55
Query: 53 TLAYSLHKQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLN-KTANEVIIVIEAYRT 111
LAY+ ++ L RLF +D I+C+F G + N+ L +QYGL+ K +NE + VIEAYRT
Sbjct: 56 VLAYA-RQETSLRQRLFCGLDGIYCMFLGRLNNLCTLNRQYGLSGKNSNEAMFVIEAYRT 114
Query: 112 LRDRGPYPAAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNLVLSDET 171
LRDRGPYPA QV+R +G F FV++D+ + + F + DG +WG DG++V+SD+
Sbjct: 115 LRDRGPYPADQVLRGLEGSFAFVVYDTQTSSVFSALSSDGGESLYWGISGDGSVVMSDDI 174
Query: 172 EIVAKSCGKSSAPFPKG 188
+I+ + C KS APFP G
Sbjct: 175 QIIKQGCAKSFAPFPNG 191
>AT3G15450.3 | Symbols: | similar to unknown protein [Arabidopsis
thaliana] (TAIR:AT4G27450.1); similar to unknown
[Populus trichocarpa] (GB:ABK93866.1); contains domain
N-terminal nucleophile aminohydrolases (Ntn hydrolases)
(SSF56235); contains domain no description
(G3DSA:3.60.20.10) | chr3:5213057-5213871 FORWARD
Length = 186
Score = 133 bits (335), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/174 (41%), Positives = 103/174 (59%), Gaps = 15/174 (8%)
Query: 1 MLAVFDKSVAKGPEALQSPQSNSV--------SALKDGFLAQHFSSVYPGSVIVNLGTSG 52
MLA+F K+ A PE L SP S+ L D FL+ H ++ + +N G S
Sbjct: 1 MLAIFQKAFAHPPEELNSPASHFSGKTPKLPGETLSD-FLSHHQNNAFS----MNFGDSA 55
Query: 53 TLAYSLHKQNPLLPRLFAVVDDIFCLFQGHIQNVAHLKQQYGLN-KTANEVIIVIEAYRT 111
LAY+ ++ L RLF +D I+C+F G + N+ L +QYGL+ K +NE + VIEAYRT
Sbjct: 56 VLAYA-RQETSLRQRLFCGLDGIYCMFLGRLNNLCTLNRQYGLSGKNSNEAMFVIEAYRT 114
Query: 112 LRDRGPYPAAQVVRDFQGKFTFVLFDSGSKTAFISSDDDGSVPFFWGTDADGNL 165
LRDRGPYPA QV+R +G F FV++D+ + + F + DG +WG DG++
Sbjct: 115 LRDRGPYPADQVLRGLEGSFAFVVYDTQTSSVFSALSSDGGESLYWGISGDGSV 168