Miyakogusa Predicted Gene
- Lj6g3v1537040.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1537040.1 Non Chatacterized Hit- tr|B9R7U1|B9R7U1_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,62.22,0.000000003,CBS-domain,NULL; seg,NULL; UNCHARACTERIZED,NULL;
ANCIENT CONSERVED DOMAIN PROTEIN-RELATED,NULL; no d,CUFF.59588.1
(302 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                       (bits)  Value
AT1G47330.1 | Symbols: | CBS domain-containing protein with a d...   276   9e-75
AT5G52790.1 | Symbols: | CBS domain-containing protein with a d...   270   7e-73
AT2G14520.1 | Symbols: | CBS domain-containing protein with a d...   255   3e-68
AT4G33700.1 | Symbols: | CBS domain-containing protein with a d...   249   2e-66
AT4G14240.1 | Symbols: | CBS domain-containing protein with a d...   246   1e-65
AT4G14240.2 | Symbols: | CBS domain-containing protein with a d...   246   2e-65
AT4G14230.1 | Symbols: | CBS domain-containing protein with a d...   246   2e-65
AT1G03270.1 | Symbols: | CBS domain-containing protein with a d...   237   7e-63
AT3G13070.1 | Symbols: | CBS domain-containing protein / transp...    48   9e-06
>AT1G47330.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr1:17351149-17353739 FORWARD LENGTH=527
Length = 527
Score = 276 bits (707), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 203/323 (62%), Gaps = 26/323 (8%)
Query: 1 MSPFVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLH 60
M+PFVRV PI+YP SK+LDW+ GKGH LL R ELKT VN H NEAGKGG L+
Sbjct: 131 MAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNEAGKGGDLTTD 190
Query: 61 ETTIIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTN 120
ET+II GAL+LT+KTAKDAMTPIS FSL++++ L++ TL IMS GHSR+P+Y T+
Sbjct: 191 ETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSRVPVYFRNPTH 250
Query: 121 IIGLILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVK-CE 179
IIGLILVKNL+ E P++ M++R++PRV + PLYDILN+F+KG SH+A V K +
Sbjct: 251 IIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHSHIAVVYKDLD 310
Query: 180 ENIRTVATDTEG----KTHRLCSSFVLDDCISISTDASNWHSHETEYY---SATLKNAMH 232
E ++ T G K + D C + + E E + + K+
Sbjct: 311 EQEQSPETSENGIERRKNKKTKDELFKDSC---RKPKAQFEVSEKEVFKIETGDAKSGKS 367
Query: 233 QEGDSEQLHRRS--------KQDTSTSF-----EN--MESLPTDEEVIGIITLEDIMEEL 277
+ G+ +Q ++ K+ SF EN + PT+EEV+G+IT+ED++EEL
Sbjct: 368 ENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVITMEDVIEEL 427
Query: 278 LQEDILDETDQYVNVHQNITIKL 300
LQE+ILDETD+YVN+H I + +
Sbjct: 428 LQEEILDETDEYVNIHNRIRVNM 450
>AT5G52790.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr5:21391740-21394327 REVERSE LENGTH=500
Length = 500
Score = 270 bits (691), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 156/300 (52%), Positives = 193/300 (64%), Gaps = 30/300 (10%)
Query: 1 MSPFVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLH 60
+S VR+ P++YP SKLLD + GK H+ LLGR ELK+LV +H NEAGKGG+L+
Sbjct: 132 LSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKGGELTHD 191
Query: 61 ETTIIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTN 120
ETTII+GALD++QK+AKDAMTP+S+ FSLDIN KLD T+GLI S GHSRIPIYS
Sbjct: 192 ETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIYSVNPNV 251
Query: 121 IIGLILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVKCEE 180
IIG ILVKNLI RP DET I+ + IRR+P+V N PLYDILN F+ G+SHMAAVV +
Sbjct: 252 IIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAAVVGTKN 311
Query: 181 NIRTVATDTEGKTHRLCSSFVLDDCISISTDASNWHSHETEYYSATLKNAMHQEGDSEQL 240
+ T+T + S D + +S A N S ET + S
Sbjct: 312 HTN---TNTPVHEKSINGSPNKDANVFLSIPALN--SSETSHQSPI-------------- 352
Query: 241 HRRSKQDTSTSFENMESLPTDEEVIGIITLEDIMEELLQEDILDETDQYVNVHQNITIKL 300
+ S S E DEEVIGIITLED+MEEL+QE+I DETDQYV +H+ ITI +
Sbjct: 353 ----RYIDSISDE-------DEEVIGIITLEDVMEELIQEEIYDETDQYVELHKRITINM 401
>AT2G14520.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr2:6182362-6184648 REVERSE LENGTH=423
Length = 423
Score = 255 bits (652), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/301 (47%), Positives = 195/301 (64%), Gaps = 24/301 (7%)
Query: 1 MSPFVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLH 60
++PFVRV P+A+P SKLLD++ G G AL R ELKTLV+LH NEAGKGG+L+
Sbjct: 131 VAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHD 190
Query: 61 ETTIIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTN 120
ETTIIAGAL+L++K AKDAMTPIS+TF +DIN+KLD + LI+ KGHSR+P+Y ++TN
Sbjct: 191 ETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTN 250
Query: 121 IIGLILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVK-CE 179
IIGL+LVKNL+ P +E +K +TIRR+PRV + PLYDILN+F+KG SHMA VV+ C+
Sbjct: 251 IIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQCD 310
Query: 180 ENIRTVATDTEGKTHRLCSSFVLDDCIS-ISTDASNWHS-HETEY-YSATLKNAMHQEGD 236
K H L S+ ++ ++ + D S ET+ +L+
Sbjct: 311 ------------KIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQKWKSFPNR 358
Query: 237 SEQLHRRSKQ-----DTSTSFENMESLPT---DEEVIGIITLEDIMEELLQEDILDETDQ 288
+ L RSK+ D N LP +E+ +GIIT+ED++EELLQE+I DETD
Sbjct: 359 ANSLGSRSKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETDH 418
Query: 289 Y 289
+
Sbjct: 419 H 419
>AT4G33700.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr4:16176547-16179188 REVERSE LENGTH=424
Length = 424
Score = 249 bits (635), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 188/305 (61%), Gaps = 31/305 (10%)
Query: 1 MSPFVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLH 60
++PFVRV P+A+P SKLLD++ G AL R ELKTLV+ H NEAGKGG+L+
Sbjct: 131 VAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNEAGKGGELTHD 190
Query: 61 ETTIIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTN 120
ETTIIAGAL+L++K KDAMTPIS+ F +DIN+KLD + LI+ KGHSR+P+Y + TN
Sbjct: 191 ETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSRVPVYYEQPTN 250
Query: 121 IIGLILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVKCEE 180
IIGL+LVKNL+ P +E P+K +TIRR+PRV + PLYDILN+F+KG SHMA VV+ +
Sbjct: 251 IIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLSHMAVVVRQCD 310
Query: 181 NIRT------------VATDTEG----KTHRLCSSFVLDDCISISTDASNWHSHETEYYS 224
I V D+EG + L + L S AS++ S
Sbjct: 311 KIHPLPSKNGSVKEARVDVDSEGTPTPQERMLRTKRSLQKWKSFPNRASSFKGG-----S 365
Query: 225 ATLKNAMHQEGDSEQLHRRSKQDTSTSFENMESLPTDEEVIGIITLEDIMEELLQEDILD 284
+ K + + D QL+ + L +EE +GIIT+ED++EELLQE+I D
Sbjct: 366 KSKKWSKDNDADILQLNGNP----------LPKLAEEEEAVGIITMEDVIEELLQEEIFD 415
Query: 285 ETDQY 289
ETD +
Sbjct: 416 ETDHH 420
>AT4G14240.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr4:8204712-8207273 REVERSE LENGTH=494
Length = 494
Score = 246 bits (628), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 195/306 (63%), Gaps = 35/306 (11%)
Query: 4 FVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLHETT 63
VR+ PIA+P K+LD V G + AL R +LK LV++H+ EAGKGG+L+ ETT
Sbjct: 157 LVRILMTLCYPIAFPIGKILDLVLGH-NDALFRRAQLKALVSIHSQEAGKGGELTHDETT 215
Query: 64 IIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTNIIG 123
II+GALDLT+KTA++AMTPI TFSLD+NSKLD +G I+++GHSR+P+YSG N+IG
Sbjct: 216 IISGALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIG 275
Query: 124 LILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVKCEENIR 183
L+LVK+L+ RP ET + + IRR+PRV + PLYDILN+F+KG SHMAAVVK +
Sbjct: 276 LLLVKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVK---- 331
Query: 184 TVATDTEGKTHRLCSSFVLDDCISISTDASNWHSHETEYYSATLKNAMHQEGDSEQ-LHR 242
++ S +L++ H+ E+ T + +EG+ + +
Sbjct: 332 --------GKSKVPPSTLLEE-----------HTDESNDSDLTAPLLLKREGNHDNVIVT 372
Query: 243 RSKQDTSTSFENMESLP----------TDEEVIGIITLEDIMEELLQEDILDETDQYVNV 292
K + + F+N ES P D EVIGIITLED+ EELLQE+I+DETD+YV+V
Sbjct: 373 IDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDV 432
Query: 293 HQNITI 298
H+ I +
Sbjct: 433 HKRIRV 438
>AT4G14240.2 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr4:8204712-8207273 REVERSE LENGTH=485
Length = 485
Score = 246 bits (627), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 195/306 (63%), Gaps = 35/306 (11%)
Query: 4 FVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLHETT 63
VR+ PIA+P K+LD V G + AL R +LK LV++H+ EAGKGG+L+ ETT
Sbjct: 148 LVRILMTLCYPIAFPIGKILDLVLGH-NDALFRRAQLKALVSIHSQEAGKGGELTHDETT 206
Query: 64 IIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTNIIG 123
II+GALDLT+KTA++AMTPI TFSLD+NSKLD +G I+++GHSR+P+YSG N+IG
Sbjct: 207 IISGALDLTEKTAQEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIG 266
Query: 124 LILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVKCEENIR 183
L+LVK+L+ RP ET + + IRR+PRV + PLYDILN+F+KG SHMAAVVK +
Sbjct: 267 LLLVKSLLTVRPETETLVSAVCIRRIPRVPADMPLYDILNEFQKGSSHMAAVVKVK---- 322
Query: 184 TVATDTEGKTHRLCSSFVLDDCISISTDASNWHSHETEYYSATLKNAMHQEGDSEQ-LHR 242
++ S +L++ H+ E+ T + +EG+ + +
Sbjct: 323 --------GKSKVPPSTLLEE-----------HTDESNDSDLTAPLLLKREGNHDNVIVT 363
Query: 243 RSKQDTSTSFENMESLP----------TDEEVIGIITLEDIMEELLQEDILDETDQYVNV 292
K + + F+N ES P D EVIGIITLED+ EELLQE+I+DETD+YV+V
Sbjct: 364 IDKANGQSFFQNNESGPHGFSHTSEAIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDV 423
Query: 293 HQNITI 298
H+ I +
Sbjct: 424 HKRIRV 429
>AT4G14230.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) |
chr4:8200850-8203130 REVERSE LENGTH=495
Length = 495
Score = 246 bits (627), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 192/308 (62%), Gaps = 37/308 (12%)
Query: 4 FVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLHETT 63
VR+ PI++P +K+LDWV G + L R +LK LV++H AGKGG+L+ ETT
Sbjct: 156 LVRILMVLSYPISFPIAKMLDWVLGH-NDPLFRRAQLKALVSIHGEAAGKGGELTHDETT 214
Query: 64 IIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTNIIG 123
II+GALDLT+KTA++AMTPI TFSLD+NSKLD + I ++GHSR+P+YS N+IG
Sbjct: 215 IISGALDLTEKTAQEAMTPIESTFSLDVNSKLDREAMDKIQARGHSRVPVYSDNPKNVIG 274
Query: 124 LILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAAVVKCEENIR 183
L+LVK+L+ RP T + + IRR+PRV N PLYDILN+F+KG SHMAAVVK
Sbjct: 275 LLLVKSLLTVRPETGTLVSAVGIRRIPRVPANMPLYDILNEFQKGSSHMAAVVKV----- 329
Query: 184 TVATDTEGKTHRLCSSFVLDDCISISTDASNWHSHETEYYSATLKNAMHQEGDSEQLHRR 243
+GK+ S+ ++ + SN S+ +E + L + +EG+ + + R
Sbjct: 330 ------KGKSKGHPSTLHEEN-----SGESNVSSNNSELTAPLL---LKREGNHDSVIVR 375
Query: 244 SKQDTSTSF-------------ENMESLPTDEEVIGIITLEDIMEELLQEDILDETDQYV 290
+ SF E +E D +VIGIITLED+ EELLQE+I+DETD+Y+
Sbjct: 376 IDKANGQSFISEAGRQGFSHTSEEIE----DGDVIGIITLEDVFEELLQEEIVDETDEYI 431
Query: 291 NVHQNITI 298
+VH+ I +
Sbjct: 432 DVHKRIRV 439
>AT1G03270.1 | Symbols: | CBS domain-containing protein with a
domain of unknown function (DUF21) | chr1:799191-802436
FORWARD LENGTH=499
Length = 499
Score = 237 bits (605), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 134/293 (45%), Positives = 185/293 (63%), Gaps = 9/293 (3%)
Query: 4 FVRVXXXXXXPIAYPFSKLLDWVFGKGHTALLGREELKTLVNLHANEAGKGGQLSLHETT 63
VR+ PIAYP K+LD V G T L R +LK LV++H+ EAGKGG+L+ ET
Sbjct: 155 LVRILMIICYPIAYPIGKVLDAVIGHNDT-LFRRAQLKALVSIHSQEAGKGGELTHEETM 213
Query: 64 IIAGALDLTQKTAKDAMTPISETFSLDINSKLDMHTLGLIMSKGHSRIPIYSGKQTNIIG 123
II+GALDL+QKTA++AMTPI TFSLD+N+KLD T+G I+S+GHSRIP+Y G NIIG
Sbjct: 214 IISGALDLSQKTAEEAMTPIESTFSLDVNTKLDWETIGKILSRGHSRIPVYLGNPKNIIG 273
Query: 124 LILVKNLIFCRPGDETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQSHMAA--VVKCEEN 181
L+LVK+L+ R E P+ ++IR++PRV + PLYDILN+F+KG SHMAA VK ++
Sbjct: 274 LLLVKSLLTVRAETEAPVSSVSIRKIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKDKDK 333
Query: 182 IRTVATDTEGKTHRLCSSFVLDDCISISTDASNWHSHETEYYSATLKNAMHQEGDSEQLH 241
+ + G+T + F +++ SH+ + + G + Q +
Sbjct: 334 KNNMQLLSNGETPKENMKFYQSS--NLTAPLLKHESHDVVVDIDKVPKHVKNRGRNFQQN 391
Query: 242 RRSKQDTSTSFENMESLPTDEEVIGIITLEDIMEELLQEDILDETDQYVNVHQ 294
+D E+ E D EVIGIITLED+ EELLQ +I+DETD Y++VH+
Sbjct: 392 GTVTRDLPCLLEDNE----DAEVIGIITLEDVFEELLQAEIVDETDVYIDVHK 440
>AT3G13070.1 | Symbols: | CBS domain-containing protein /
transporter associated domain-containing protein |
chr3:4191511-4195112 REVERSE LENGTH=661
Length = 661
Score = 47.8 bits (112), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/127 (22%), Positives = 67/127 (52%), Gaps = 7/127 (5%)
Query: 55 GQLSLHETTIIAGALDLTQKTAKDAMTPISETFSLDINSKL-DMHTLGLIMSKGHSRIPI 113
G + E +I L++ ++ MTP+ + ++D ++ L D H++ + + +SR+P+
Sbjct: 334 GAIEEEEQDMIENVLEIKDTHVREVMTPLVDVVAIDASASLVDFHSMWV--THQYSRVPV 391
Query: 114 YSGKQTNIIGLILVKNLI-FCRPGD---ETPIKYMTIRRVPRVGQNWPLYDILNQFKKGQ 169
+ + NI+G+ +L+ + + GD T + M + V + ++++L +F+ +
Sbjct: 392 FEQRIDNIVGIAYAMDLLDYVQKGDLLESTSVGDMAHKPAYFVPDSMSVWNLLREFRIRK 451
Query: 170 SHMAAVV 176
HMA V+
Sbjct: 452 VHMAVVL 458