Miyakogusa Predicted Gene

Lj4g3v2400520.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v2400520.1 Non Characterized Hit- tr|I1KN82|I1KN82_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.54142
PE,86.9,0,seg,NULL; DUF21,Domain of unknown function DUF21; no
description,NULL; CBS,Cystathionine beta-syntha,CUFF.50910.1
         (493 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G14240.1 | Symbols:  | CBS domain-containing protein with a d...   636   0.0  
AT4G14240.2 | Symbols:  | CBS domain-containing protein with a d...   612   e-175
AT4G14230.1 | Symbols:  | CBS domain-containing protein with a d...   593   e-169
AT1G03270.1 | Symbols:  | CBS domain-containing protein with a d...   564   e-161
AT4G33700.1 | Symbols:  | CBS domain-containing protein with a d...   402   e-112
AT2G14520.1 | Symbols:  | CBS domain-containing protein with a d...   385   e-107
AT5G52790.1 | Symbols:  | CBS domain-containing protein with a d...   377   e-104
AT1G47330.1 | Symbols:  | CBS domain-containing protein with a d...   366   e-101
AT3G13070.1 | Symbols:  | CBS domain-containing protein / transp...    60   3e-09
AT1G55930.1 | Symbols:  | CBS domain-containing protein / transp...    59   7e-09

>AT4G14240.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr4:8204712-8207273 REVERSE LENGTH=494
          Length = 494

 Score =  636 bits (1640), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/481 (68%), Positives = 371/481 (77%), Gaps = 9/481 (1%)

Query: 2   MNVVSALMVTRMLTRDHNSVDDVILGNESIPFGTISWYAYAGISCFLVLFAGIXXXXXXX 61
           M++++A+   R+L+    S  +   G E+IPFG+  W  YAGISCFLVLFAGI       
Sbjct: 1   MHLINAVAAARILSGIGQSNGNN--GGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLG 58

Query: 62  XXXXXXVDLEILERSGSPSEKKQAAIILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLV 121
                 V+LEIL+RSG+P+EKKQAA I PVVQKQHQLLVTLLLCNA+AME LPIYLDKL 
Sbjct: 59  LMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLF 118

Query: 122 NQFVAIILSVTFVLFFGEVIPQSICSRYGLAVGANLAWLVRILMVICYPVSYPVGKVLDY 181
           N++VAIILSVTFVL FGEVIPQ+IC+RYGLAVGAN  WLVRILM +CYP+++P+GK+LD 
Sbjct: 119 NEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDL 178

Query: 182 LLGHNEALFRRAQLKVLVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIEST 241
           +LGHN+ALFRRAQLK LVSIHSQEAGKGGELTHDETTIISGALDLTEKTA+EAMTPIEST
Sbjct: 179 VLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIEST 238

Query: 242 FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI 301
           FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKN+IGLLLVKSLLTVRPETET VSAV I
Sbjct: 239 FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCI 298

Query: 302 RRIPRVPSDMPLYDILNEFQKGSSHMAAXXXXXXXXXETPQIIDEENKSTDGDSMLTTPL 361
           RRIPRVP+DMPLYDILNEFQKGSSHMAA           P  + EE+     DS LT PL
Sbjct: 299 RRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPL 358

Query: 362 LQKQDLKSGNVVVDIDKPS-RLPSSNKLTGPQHSDGTTNGPPAENIEDGEVIGIITLEDV 420
           L K++    NV+V IDK + +    N  +GP     T+     E IEDGEVIGIITLEDV
Sbjct: 359 LLKREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTS-----EAIEDGEVIGIITLEDV 413

Query: 421 FEELLQEEIVDETDEYVDVHKRIXXXXXXXXXXXXXXPSMRRMTGQKGAAGGQSKPAQSP 480
           FEELLQEEIVDETDEYVDVHKRI              PS R++  QKG  GGQ+K  Q+ 
Sbjct: 414 FEELLQEEIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKG-TGGQNKQGQTN 472

Query: 481 K 481
           K
Sbjct: 473 K 473


>AT4G14240.2 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr4:8204712-8207273 REVERSE LENGTH=485
          Length = 485

 Score =  612 bits (1579), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 321/481 (66%), Positives = 362/481 (75%), Gaps = 18/481 (3%)

Query: 2   MNVVSALMVTRMLTRDHNSVDDVILGNESIPFGTISWYAYAGISCFLVLFAGIXXXXXXX 61
           M++++A+   R+L+    S  +   G E+IPFG+  W  YAGISCFLVLFAGI       
Sbjct: 1   MHLINAVAAARILSGIGQSNGNN--GGEAIPFGSFEWITYAGISCFLVLFAGIMSGLTLG 58

Query: 62  XXXXXXVDLEILERSGSPSEKKQAAIILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLV 121
                 V+LEIL+RS         A I PVVQKQHQLLVTLLLCNA+AME LPIYLDKL 
Sbjct: 59  LMSLGLVELEILQRS---------AAIFPVVQKQHQLLVTLLLCNAMAMEGLPIYLDKLF 109

Query: 122 NQFVAIILSVTFVLFFGEVIPQSICSRYGLAVGANLAWLVRILMVICYPVSYPVGKVLDY 181
           N++VAIILSVTFVL FGEVIPQ+IC+RYGLAVGAN  WLVRILM +CYP+++P+GK+LD 
Sbjct: 110 NEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWLVRILMTLCYPIAFPIGKILDL 169

Query: 182 LLGHNEALFRRAQLKVLVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIEST 241
           +LGHN+ALFRRAQLK LVSIHSQEAGKGGELTHDETTIISGALDLTEKTA+EAMTPIEST
Sbjct: 170 VLGHNDALFRRAQLKALVSIHSQEAGKGGELTHDETTIISGALDLTEKTAQEAMTPIEST 229

Query: 242 FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI 301
           FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKN+IGLLLVKSLLTVRPETET VSAV I
Sbjct: 230 FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNVIGLLLVKSLLTVRPETETLVSAVCI 289

Query: 302 RRIPRVPSDMPLYDILNEFQKGSSHMAAXXXXXXXXXETPQIIDEENKSTDGDSMLTTPL 361
           RRIPRVP+DMPLYDILNEFQKGSSHMAA           P  + EE+     DS LT PL
Sbjct: 290 RRIPRVPADMPLYDILNEFQKGSSHMAAVVKVKGKSKVPPSTLLEEHTDESNDSDLTAPL 349

Query: 362 LQKQDLKSGNVVVDIDKPS-RLPSSNKLTGPQHSDGTTNGPPAENIEDGEVIGIITLEDV 420
           L K++    NV+V IDK + +    N  +GP     T+     E IEDGEVIGIITLEDV
Sbjct: 350 LLKREGNHDNVIVTIDKANGQSFFQNNESGPHGFSHTS-----EAIEDGEVIGIITLEDV 404

Query: 421 FEELLQEEIVDETDEYVDVHKRIXXXXXXXXXXXXXXPSMRRMTGQKGAAGGQSKPAQSP 480
           FEELLQEEIVDETDEYVDVHKRI              PS R++  QKG  GGQ+K  Q+ 
Sbjct: 405 FEELLQEEIVDETDEYVDVHKRIRVAAAAAASSIARAPSSRKLLAQKG-TGGQNKQGQTN 463

Query: 481 K 481
           K
Sbjct: 464 K 464


>AT4G14230.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr4:8200850-8203130 REVERSE LENGTH=495
          Length = 495

 Score =  593 bits (1528), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 312/489 (63%), Positives = 360/489 (73%), Gaps = 19/489 (3%)

Query: 2   MNVVSALMVTRMLTRDHNSVDDVILGNESIPFGTISWYAYAGISCFLVLFAGIXXXXXXX 61
           M+ ++A++  RML     S     L +E+IPFG++ W  YAGISCFLVLFAGI       
Sbjct: 1   MHPINAVVAARMLAGISQSN---ALQSEAIPFGSLEWITYAGISCFLVLFAGIMSGLTLG 57

Query: 62  XXXXXXVDLEILERSGSPSEKKQAAIILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLV 121
                 V+LEIL+RSG+P EKKQ+A I PVVQKQHQLLVTLLL NA+AME LPIYLDK+ 
Sbjct: 58  LMSLGLVELEILQRSGTPKEKKQSAAIFPVVQKQHQLLVTLLLFNALAMEGLPIYLDKIF 117

Query: 122 NQFVAIILSVTFVLFFGEVIPQSICSRYGLAVGANLAWLVRILMVICYPVSYPVGKVLDY 181
           N++VAIILSVTFVLF GEVIPQ+IC+RYGLAVGANL WLVRILMV+ YP+S+P+ K+LD+
Sbjct: 118 NEYVAIILSVTFVLFVGEVIPQAICTRYGLAVGANLVWLVRILMVLSYPISFPIAKMLDW 177

Query: 182 LLGHNEALFRRAQLKVLVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIEST 241
           +LGHN+ LFRRAQLK LVSIH + AGKGGELTHDETTIISGALDLTEKTA+EAMTPIEST
Sbjct: 178 VLGHNDPLFRRAQLKALVSIHGEAAGKGGELTHDETTIISGALDLTEKTAQEAMTPIEST 237

Query: 242 FSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI 301
           FSLDVNSKLD EAM KI ARGHSRVPVYS NPKN+IGLLLVKSLLTVRPET T VSAV I
Sbjct: 238 FSLDVNSKLDREAMDKIQARGHSRVPVYSDNPKNVIGLLLVKSLLTVRPETGTLVSAVGI 297

Query: 302 RRIPRVPSDMPLYDILNEFQKGSSHMAAXXXXXXXXXETPQIIDEENKS----TDGDSML 357
           RRIPRVP++MPLYDILNEFQKGSSHMAA           P  + EEN      +  +S L
Sbjct: 298 RRIPRVPANMPLYDILNEFQKGSSHMAAVVKVKGKSKGHPSTLHEENSGESNVSSNNSEL 357

Query: 358 TTPLLQKQDLKSGNVVVDIDKPS--RLPSSNKLTGPQHSDGTTNGPPAENIEDGEVIGII 415
           T PLL K++    +V+V IDK +     S     G  H+        +E IEDG+VIGII
Sbjct: 358 TAPLLLKREGNHDSVIVRIDKANGQSFISEAGRQGFSHT--------SEEIEDGDVIGII 409

Query: 416 TLEDVFEELLQEEIVDETDEYVDVHKRI--XXXXXXXXXXXXXXPSMRRMTGQKGAAGGQ 473
           TLEDVFEELLQEEIVDETDEY+DVHKRI                PS RR+ G KG+ G +
Sbjct: 410 TLEDVFEELLQEEIVDETDEYIDVHKRIRVATVAAVAISSLARAPSGRRLLGPKGSGGPK 469

Query: 474 SKPAQSPKK 482
           +  A S  K
Sbjct: 470 TPKASSTPK 478


>AT1G03270.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) | chr1:799191-802436
           FORWARD LENGTH=499
          Length = 499

 Score =  564 bits (1454), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 293/447 (65%), Positives = 338/447 (75%), Gaps = 22/447 (4%)

Query: 8   LMVTRMLTRDHNSVDDVILGNESIPFGTISWYAYAGISCFLVLFAGIXXXXXXXXXXXXX 67
           ++ T  L R   S++  +   E I FG+  W+   G++CFLVLFAGI             
Sbjct: 3   VLSTLALVRAAYSLNSFVFEAEDIRFGSPWWFVVVGVACFLVLFAGIMSGLTLGLMSLGL 62

Query: 68  VDLEILERSGSPSEKKQAAIILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLVNQFVAI 127
           V+LEIL++SGS +EKKQAA ILPVV+KQHQLLVTLLLCNA AMEALPI LDK+ + FVA+
Sbjct: 63  VELEILQQSGSSAEKKQAAAILPVVKKQHQLLVTLLLCNAAAMEALPICLDKIFHPFVAV 122

Query: 128 ILSVTFVLFFGEVIPQSICSRYGLAVGANLAWLVRILMVICYPVSYPVGKVLDYLLGHNE 187
           +LSVTFVL FGE+IPQ+ICSRYGLAVGAN  WLVRILM+ICYP++YP+GKVLD ++GHN+
Sbjct: 123 LLSVTFVLAFGEIIPQAICSRYGLAVGANFLWLVRILMIICYPIAYPIGKVLDAVIGHND 182

Query: 188 ALFRRAQLKVLVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVN 247
            LFRRAQLK LVSIHSQEAGKGGELTH+ET IISGALDL++KTAEEAMTPIESTFSLDVN
Sbjct: 183 TLFRRAQLKALVSIHSQEAGKGGELTHEETMIISGALDLSQKTAEEAMTPIESTFSLDVN 242

Query: 248 SKLDWEAMGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRV 307
           +KLDWE +GKIL+RGHSR+PVY GNPKNIIGLLLVKSLLTVR ETE PVS+VSIR+IPRV
Sbjct: 243 TKLDWETIGKILSRGHSRIPVYLGNPKNIIGLLLVKSLLTVRAETEAPVSSVSIRKIPRV 302

Query: 308 PSDMPLYDILNEFQKGSSHMAAXX----------XXXXXXXETPQIIDEENKSTDGDSML 357
           PSDMPLYDILNEFQKGSSHMAA                   ETP+    EN      S L
Sbjct: 303 PSDMPLYDILNEFQKGSSHMAAVVKVKDKDKKNNMQLLSNGETPK----ENMKFYQSSNL 358

Query: 358 TTPLLQKQDLKSGNVVVDIDK-PSRLPSSNKLTGPQHSDGTTNGPPA--ENIEDGEVIGI 414
           T PLL+ +   S +VVVDIDK P  +   N+    Q +   T   P   E+ ED EVIGI
Sbjct: 359 TAPLLKHE---SHDVVVDIDKVPKHV--KNRGRNFQQNGTVTRDLPCLLEDNEDAEVIGI 413

Query: 415 ITLEDVFEELLQEEIVDETDEYVDVHK 441
           ITLEDVFEELLQ EIVDETD Y+DVHK
Sbjct: 414 ITLEDVFEELLQAEIVDETDVYIDVHK 440


>AT4G33700.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr4:16176547-16179188 REVERSE LENGTH=424
          Length = 424

 Score =  402 bits (1032), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 218/421 (51%), Positives = 275/421 (65%), Gaps = 28/421 (6%)

Query: 37  SWYAYAGISCFLVLFAGIXXXXXXXXXXXXXVDLEILERSGSPSEKKQAAIILPVVQKQH 96
           +++ +  +  FLVLFAG+             VDLE+L +SG+P  +K AA ILPVV+ QH
Sbjct: 11  NFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAAKILPVVKNQH 70

Query: 97  QLLVTLLLCNAVAMEALPIYLDKLVNQFVAIILSVTFVLFFGEVIPQSICSRYGLAVGAN 156
            LLVTLL+CNA AME LPI+LD LV  + AI++SVT +L FGE+IPQSICSRYGLA+GA 
Sbjct: 71  LLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGAT 130

Query: 157 LAWLVRILMVICYPVSYPVGKVLDYLLGHNEA-LFRRAQLKVLVSIHSQEAGKGGELTHD 215
           +A  VR+L+ IC PV++P+ K+LD+LLGH  A LFRRA+LK LV  H  EAGKGGELTHD
Sbjct: 131 VAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHGNEAGKGGELTHD 190

Query: 216 ETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKN 275
           ETTII+GAL+L+EK  ++AMTPI   F +D+N+KLD + M  IL +GHSRVPVY   P N
Sbjct: 191 ETTIIAGALELSEKMVKDAMTPISDIFVIDINAKLDRDLMNLILEKGHSRVPVYYEQPTN 250

Query: 276 IIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAXXXXXX 335
           IIGL+LVK+LLT+ P+ E PV  V+IRRIPRVP  +PLYDILNEFQKG SHMA       
Sbjct: 251 IIGLVLVKNLLTINPDEEIPVKNVTIRRIPRVPEILPLYDILNEFQKGLSHMAVVVRQCD 310

Query: 336 XXXETPQI----------IDEENKSTDGDSMLTTPLLQKQDLKSGNVVVDIDKPSRLPS- 384
                P            +D E   T  + ML T    K+ L+          P+R  S 
Sbjct: 311 KIHPLPSKNGSVKEARVDVDSEGTPTPQERMLRT----KRSLQKWKSF-----PNRASSF 361

Query: 385 -----SNKLTGPQHSD-GTTNGPPAENI-EDGEVIGIITLEDVFEELLQEEIVDETDEYV 437
                S K +    +D    NG P   + E+ E +GIIT+EDV EELLQEEI DETD + 
Sbjct: 362 KGGSKSKKWSKDNDADILQLNGNPLPKLAEEEEAVGIITMEDVIEELLQEEIFDETDHHF 421

Query: 438 D 438
           +
Sbjct: 422 E 422


>AT2G14520.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr2:6182362-6184648 REVERSE LENGTH=423
          Length = 423

 Score =  385 bits (990), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 208/416 (50%), Positives = 272/416 (65%), Gaps = 19/416 (4%)

Query: 37  SWYAYAGISCFLVLFAGIXXXXXXXXXXXXXVDLEILERSGSPSEKKQAAIILPVVQKQH 96
           S++ +  +   LVLFAG+             VDLE+L +SG+P ++  AA ILPVV+ QH
Sbjct: 11  SFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAAKILPVVKNQH 70

Query: 97  QLLVTLLLCNAVAMEALPIYLDKLVNQFVAIILSVTFVLFFGEVIPQSICSRYGLAVGAN 156
            LL TLL+CNA AMEALPI+LD LV  + AI++SVT +L FGE+IPQS+CSR+GLA+GA 
Sbjct: 71  LLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVCSRHGLAIGAT 130

Query: 157 LAWLVRILMVICYPVSYPVGKVLDYLLGHNE-ALFRRAQLKVLVSIHSQEAGKGGELTHD 215
           +A  VR+L+ IC PV++P+ K+LD+LLGH   ALFRRA+LK LV +H  EAGKGGELTHD
Sbjct: 131 VAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHGNEAGKGGELTHD 190

Query: 216 ETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVYSGNPKN 275
           ETTII+GAL+L+EK A++AMTPI  TF +D+N+KLD + M  IL +GHSRVPVY     N
Sbjct: 191 ETTIIAGALELSEKMAKDAMTPISDTFVIDINAKLDRDLMNLILDKGHSRVPVYYEQRTN 250

Query: 276 IIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAXXXXX- 334
           IIGL+LVK+LLT+ P+ E  V  V+IRRIPRVP  +PLYDILNEFQKG SHMA       
Sbjct: 251 IIGLVLVKNLLTINPDEEIQVKNVTIRRIPRVPETLPLYDILNEFQKGHSHMAVVVRQCD 310

Query: 335 -----XXXXETPQIIDEENKSTDGD-SMLTTPLLQKQDLKSGNVVVDIDKPSRLPSSNKL 388
                       + ++E     D + S   T L +++ L+          P+R  S    
Sbjct: 311 KIHPLQSNDAANETVNEVRVDVDYERSPQETKLKRRRSLQKWKSF-----PNRANSLGSR 365

Query: 389 TGPQHSDGTT-----NGPPAENI-EDGEVIGIITLEDVFEELLQEEIVDETDEYVD 438
           +     D        N  P   + E+ + +GIIT+EDV EELLQEEI DETD + +
Sbjct: 366 SKRWSKDNDADILQLNEHPLPKLDEEEDAVGIITMEDVIEELLQEEIFDETDHHFE 421


>AT5G52790.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr5:21391740-21394327 REVERSE LENGTH=500
          Length = 500

 Score =  377 bits (967), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 199/417 (47%), Positives = 273/417 (65%), Gaps = 29/417 (6%)

Query: 31  IPFGTISWYAYAGISCFLVLFAGIXXXXXXXXXXXXXVDLEILERSGSPSEKKQAAIILP 90
           +P     ++ Y  +   LV+FAG+             V+LE++ ++G P ++K A  ILP
Sbjct: 6   VPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNAEKILP 65

Query: 91  VVQKQHQLLVTLLLCNAVAMEALPIYLDKLVNQFVAIILSVTFVLFFGEVIPQSICSRYG 150
           +V+ QH LL TLL+ NA+AMEALPI++D L+  + AI++SVT +L FGE+IPQ++CSRYG
Sbjct: 66  LVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAVCSRYG 125

Query: 151 LAVGANLAWLVRILMVICYPVSYPVGKVLDYLLG-HNEALFRRAQLKVLVSIHSQEAGKG 209
           L++GA L++LVR+++++ +P+SYP+ K+LD LLG  +  L  RA+LK LV +H  EAGKG
Sbjct: 126 LSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHGNEAGKG 185

Query: 210 GELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVY 269
           GELTHDETTIISGALD+++K+A++AMTP+   FSLD+N KLD + MG I + GHSR+P+Y
Sbjct: 186 GELTHDETTIISGALDMSQKSAKDAMTPVSQIFSLDINFKLDEKTMGLIASAGHSRIPIY 245

Query: 270 SGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAA 329
           S NP  IIG +LVK+L+ VRPE ET +  + IRR+P+V  ++PLYDILN FQ G SHMAA
Sbjct: 246 SVNPNVIIGFILVKNLIKVRPEDETSIRDLPIRRMPKVDLNLPLYDILNIFQTGRSHMAA 305

Query: 330 XX-XXXXXXXETPQIIDEENKSTDGDS--MLTTPLLQKQDLKSGNVVVDIDKPSRLPSSN 386
                      TP      N S + D+   L+ P L   +    + +  ID  S      
Sbjct: 306 VVGTKNHTNTNTPVHEKSINGSPNKDANVFLSIPALNSSETSHQSPIRYIDSISD----- 360

Query: 387 KLTGPQHSDGTTNGPPAENIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRI 443
                               ED EVIGIITLEDV EEL+QEEI DETD+YV++HKRI
Sbjct: 361 --------------------EDEEVIGIITLEDVMEELIQEEIYDETDQYVELHKRI 397


>AT1G47330.1 | Symbols:  | CBS domain-containing protein with a
           domain of unknown function (DUF21) |
           chr1:17351149-17353739 FORWARD LENGTH=527
          Length = 527

 Score =  366 bits (940), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 202/442 (45%), Positives = 276/442 (62%), Gaps = 29/442 (6%)

Query: 31  IPFGTISWYAYAGISCFLVLFAGIXXXXXXXXXXXXXVDLEILERSGSPSEKKQAAIILP 90
           IP    ++  Y  I   LV FAG+             VDLE+L +SG P ++  A  I P
Sbjct: 5   IPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAGKIFP 64

Query: 91  VVQKQHQLLVTLLLCNAVAMEALPIYLDKLVNQFVAIILSVTFVLFFGEVIPQSICSRYG 150
           VV+ QH LL TLL+ N++AMEALPI+LDK+V  ++AI+LSVT +L FGE++PQ++C+RYG
Sbjct: 65  VVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVCTRYG 124

Query: 151 LAVGANLAWLVRILMVICYPVSYPVGKVLDYLLGHNEA-LFRRAQLKVLVSIHSQEAGKG 209
           L VGA +A  VR+L+V+ +P+SYP+ KVLD++LG     L RRA+LK  V+ H  EAGKG
Sbjct: 125 LKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHGNEAGKG 184

Query: 210 GELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKILARGHSRVPVY 269
           G+LT DET+II+GAL+LTEKTA++AMTPI + FSL++++ L+ E +  I++ GHSRVPVY
Sbjct: 185 GDLTTDETSIITGALELTEKTAKDAMTPISNAFSLELDTPLNLETLNTIMSVGHSRVPVY 244

Query: 270 SGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAA 329
             NP +IIGL+LVK+LL V    E P+  +S+R+IPRV   MPLYDILNEFQKG SH+A 
Sbjct: 245 FRNPTHIIGLILVKNLLAVDARKEVPLRKMSMRKIPRVSETMPLYDILNEFQKGHSHIAV 304

Query: 330 XXXXXXXXXETPQ-----IIDEENKSTDGDSMLTTPLLQKQDLK-SGNVVVDIDKPSRLP 383
                    ++P+     I   +NK T  +    +    K   + S   V  I+      
Sbjct: 305 VYKDLDEQEQSPETSENGIERRKNKKTKDELFKDSCRKPKAQFEVSEKEVFKIETGDAKS 364

Query: 384 SSNKLTGPQHSDGTTN--GPPAENIEDG--------------------EVIGIITLEDVF 421
             ++    Q   G T+    PA+    G                    EV+G+IT+EDV 
Sbjct: 365 GKSENGEEQQGSGKTSLLAAPAKKRHRGCSFCILDIENTPIPDFPTNEEVVGVITMEDVI 424

Query: 422 EELLQEEIVDETDEYVDVHKRI 443
           EELLQEEI+DETDEYV++H RI
Sbjct: 425 EELLQEEILDETDEYVNIHNRI 446


>AT3G13070.1 | Symbols:  | CBS domain-containing protein /
           transporter associated domain-containing protein |
           chr3:4191511-4195112 REVERSE LENGTH=661
          Length = 661

 Score = 60.5 bits (145), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 96/214 (44%), Gaps = 33/214 (15%)

Query: 134 VLFFGEVIPQSICSRYGLAVGA----NLAWLVRILMVICYPVSYPVGKVLDYL------- 182
           +L   E+ P+S+       V       +AWL  +L        YPVG+++ YL       
Sbjct: 256 ILLLTEITPKSVAVHNAQEVARIVVRPVAWLSLVL--------YPVGRIVTYLSMGILKI 307

Query: 183 ---LGHNEALFRRAQLKVLVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIE 239
               G +E      +LK+++    + A   G +  +E  +I   L++ +    E MTP+ 
Sbjct: 308 LGLKGRSEPYVTEDELKLML----RGAELSGAIEEEEQDMIENVLEIKDTHVREVMTPLV 363

Query: 240 STFSLDVNSKL-DWEAMGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETE----T 294
              ++D ++ L D+ +M   +   +SRVPV+     NI+G+     LL    + +    T
Sbjct: 364 DVVAIDASASLVDFHSMW--VTHQYSRVPVFEQRIDNIVGIAYAMDLLDYVQKGDLLEST 421

Query: 295 PVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMA 328
            V  ++ +    VP  M ++++L EF+    HMA
Sbjct: 422 SVGDMAHKPAYFVPDSMSVWNLLREFRIRKVHMA 455


>AT1G55930.1 | Symbols:  | CBS domain-containing protein /
           transporter associated domain-containing protein |
           chr1:20918895-20922133 FORWARD LENGTH=653
          Length = 653

 Score = 58.9 bits (141), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 94/214 (43%), Gaps = 33/214 (15%)

Query: 134 VLFFGEVIPQSICSRYGLAVGA----NLAWLVRILMVICYPVSYPVGKVLDYL------- 182
           +L   E+ P+S+       V       +AWL  IL        YPVG+V+ YL       
Sbjct: 251 ILLLTEITPKSVAVHNAQEVARIVVRPVAWLSLIL--------YPVGRVVTYLSMGILKI 302

Query: 183 ---LGHNEALFRRAQLKVLVSIHSQEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIE 239
               G +E      +LK+++    + A   G +  +E  +I   L++ +    E MTP+ 
Sbjct: 303 LGLKGRSEPYVTEDELKLML----RGAELSGAIEEEEQDMIENVLEIKDTHVREVMTPLV 358

Query: 240 STFSLDVNSKL-DWEAMGKILARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETE----T 294
              ++D +  L D+      +   +SRVPV+     NI+G+     LL   P+ +    T
Sbjct: 359 DVVAIDGSGSLVDFHNFW--VTHQYSRVPVFEQRIDNIVGIAYAMDLLDYVPKGKLLEST 416

Query: 295 PVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMA 328
            V  ++ +    VP  M ++++L EF+    HMA
Sbjct: 417 TVVDMAHKPAFFVPDSMSVWNLLREFRIRKVHMA 450