Miyakogusa Predicted Gene

chr1.CM0094.140.nc
Show Alignment: 

BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr1.CM0094.140.nc - phase: 0 
         (620 letters)

Database: TAIR8_pep 
           32,825 sequences; 13,166,001 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G00650.1 | Symbols: FLA, FRI | FRI (FRIGIDA) | chr4:269026-27...   197   1e-50
AT5G48385.1 | Symbols:  | similar to hydroxyproline-rich glycopr...   100   3e-21
AT3G22440.1 | Symbols:  | hydroxyproline-rich glycoprotein famil...    86   6e-17
AT4G14900.1 | Symbols:  | hydroxyproline-rich glycoprotein famil...    70   5e-12
AT1G31814.1 | Symbols: FRL2 | FRL2 (FRIGIDA LIKE 2) | chr1:11412...    52   1e-06
AT5G27230.1 | Symbols:  | similar to hydroxyproline-rich glycopr...    45   1e-04
AT5G16320.1 | Symbols: FRL1 | FRL1 (FRIGIDA LIKE 1) | chr5:53445...    42   8e-04

>AT4G00650.1 | Symbols: FLA, FRI | FRI (FRIGIDA) |
           chr4:269026-270363 FORWARD
          Length = 318

 Score =  197 bits (502), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 105/268 (39%), Positives = 167/268 (62%), Gaps = 29/268 (10%)

Query: 40  QLAKSVNELTDLATAIQAFKSRYDELQRHLNYIERTIDTRTKE----LQSRT-------- 87
           Q  KS++EL   + A++ FK ++D+LQ+H+  IE  ID++ +     L +R         
Sbjct: 54  QFLKSIDELAAFSVAVETFKRQFDDLQKHIESIENAIDSKLESNGVVLAARNNNFHQPML 113

Query: 88  --PKPAVEKVTVVNGDVSRAELKTKAEGDVKGEENELVSLCQTMCSRGLRRYILSHLAEV 145
             P+  V   T V       E+  +     +GE      +C+ MCS+GLR+YI +++++ 
Sbjct: 114 SPPRNNVSVETTVTVSQPSQEIVPETSNKPEGER-----ICELMCSKGLRKYIYANISDQ 168

Query: 146 KMLGEQVPEALKSAPKPSKLVFECIGRFFLQGSKAYTKDSPMVTARLASVLVLEYYLL-- 203
             L E++P ALK A +P+K V +CIG+F+LQG +A+TK+SPM +AR  S+L+LE +LL  
Sbjct: 169 AKLMEEIPSALKLAKEPAKFVLDCIGKFYLQGRRAFTKESPMSSARQVSLLILESFLLMP 228

Query: 204 ---SGRVDSEAQEEDPSLNKEASQAAVSWRKRLIVEGGVANACEIDARGLVLFIACYGIP 260
               G+V  E+  +D     EA  AAV+WRKRL+ EGG+A A ++DARGL+L +AC+G+P
Sbjct: 229 DRGKGKVKIESWIKD-----EAETAAVAWRKRLMTEGGLAAAEKMDARGLLLLVACFGVP 283

Query: 261 IDFLNEDVWNLVRLSNPREISKALRQSQ 288
            +F + D+ +L+R+S   EI+ AL++SQ
Sbjct: 284 SNFRSTDLLDLIRMSGSNEIAGALKRSQ 311


>AT5G48385.1 | Symbols:  | similar to hydroxyproline-rich
           glycoprotein family protein [Arabidopsis thaliana]
           (TAIR:AT3G22440.1); similar to unknown [Populus
           trichocarpa] (GB:ABK94872.1); contains InterPro domain
           Frigida-like (InterPro:IPR012474) |
           chr5:19626697-19628938 FORWARD
          Length = 558

 Score =  100 bits (249), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 87/321 (27%), Positives = 154/321 (47%), Gaps = 28/321 (8%)

Query: 121 ELVSLCQTMCSRGLRRYILSHLAEVKMLGEQVPEALKSAPKPSKLVFECIGRFF-LQGSK 179
           +L+ LC  M S GL +++  +   +  L E++P A ++A  P+ LV + +  F+ ++   
Sbjct: 174 QLLKLCGDMDSTGLHKFVSDNRKNLASLKEEIPMAFRAAANPASLVLDSLEGFYPMEAPT 233

Query: 180 A-YTKDSPMVTARLASVLVLEYY--LLSGRVDSEAQEEDPSLN--KEASQAAVSWRKRL- 233
           A   KD+ ++  R   ++++E    LLSG +D        S N    A   A  W   L 
Sbjct: 234 ADGKKDANLLGMRRTCIMLMECLSILLSG-LDRNCLAVVLSQNVKHRAKTIAEGWNPLLE 292

Query: 234 IVEGGVANACEIDARGLVLFIACYGIPIDFLNEDVWNLVRLSNPREISKALRQSQALLMR 293
            ++    N   ++A   +  +A + I  DF  +++  L+ + + R  +  L +S  L  +
Sbjct: 293 SLDMDACNGNSLEAHAFLQLLATFAIVADFKEDELLKLIPMVSRRRQAAELCRSLGLAEK 352

Query: 294 VTDIAEVMMKRGKVVDALDLAYTFGFEEKFSPKTTLTSFLYKSDEVWKKAKQDARDFPIL 353
           +  + EV++  GK +DA++LA+ F   E+FSP + L S+L ++     + +      P +
Sbjct: 353 MPGVIEVLVNSGKQIDAVNLAFAFELTEQFSPVSLLKSYLIEARRSSPQGR-PGNASPAV 411

Query: 354 LKEANEKYLAALKSIVNSLEGHKVDLEELLPGWELKEKILILENDISDINKIFDDTLVNK 413
             E NE+ L  LK+++  +E H   LEE  P   L ++IL LE                K
Sbjct: 412 QDEFNERELIGLKTVIKCIEEH--SLEEQYPVEPLHKRILQLE----------------K 453

Query: 414 RKMDKNNSSIKMKVPEAKRTR 434
            K DK  ++  MK P+ KR R
Sbjct: 454 AKADKKRATEPMK-PQPKRPR 473


>AT3G22440.1 | Symbols:  | hydroxyproline-rich glycoprotein family
           protein | chr3:7959861-7961893 FORWARD
          Length = 532

 Score = 86.3 bits (212), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 123/493 (24%), Positives = 207/493 (41%), Gaps = 69/493 (13%)

Query: 46  NELTDLATAI-QAFKSRYDELQRHLNYIERTIDTRTKELQSR--TPKPAVEKVTVVNGDV 102
            EL+D  T++ Q    + + L++ +  ++    T  + L+ R  T   +VE V    G+ 
Sbjct: 36  QELSDHFTSLEQNLMKKSEALKQMIETLDNQTQTSLESLKRREVTIDHSVEIVAGKVGER 95

Query: 103 SRAELKT--KAE-----------GDVKGEE---NELVSLCQTMCSRGLRRYILSHLAEVK 146
           +RA L++  KA            GDV  EE   + L SLC  M +RG   ++ +   E++
Sbjct: 96  ARAALESLEKARDGCGDGSNDDSGDVDDEEGLLSALKSLCLKMDARGFWNFVTARKKELE 155

Query: 147 MLGEQVPEALKSAPKPSKLVFECIGRFFLQGSKAYTKDSPMVTARLASVLVLEYY---LL 203
            L  ++P AL     P+ LV E I   F   ++    D        A V++LE     ++
Sbjct: 156 NLRSKIPAALVDCVDPAMLVLEAISEVFPVDTRG---DKVSNDYGWACVVILESLTPVIV 212

Query: 204 SGRVDSEAQEEDPSLNKEASQAAVSWRKRLIVEGGVANACEIDARGLVLFIACYGIPIDF 263
              +        PS+ ++A + A +W+K L   G + N    D    +  +  +GI    
Sbjct: 213 DPVIGKSRLLVTPSVKEKAKEIAETWKKSLEERGRIENVKTPDVHTFLQHLVTFGI---V 269

Query: 264 LNEDVW---NLVRLSNPREISKALRQSQALLMRVTDIAEVMMKRGKVVDALDLAYTFGFE 320
            +ED+     LV  S  R+    L  S  L  ++ D+ E ++ RG+ +DA+   Y  G  
Sbjct: 270 KSEDLALYRKLVVGSAWRKQMPKLAVSVGLGDQMPDMIEELISRGQQLDAVHFTYEVGLV 329

Query: 321 EKFSPKTTLTSFLYKSDEVWKKAKQDARDFPILLKEANEKYLAALKSIVNSLEGHKVDLE 380
           +KF P   L ++L  + +      +D+ +          K  +ALK+++  +E +K  LE
Sbjct: 330 DKFPPVPLLKAYLRDAKKSAASIMEDSSNTGRATHLVARKEQSALKAVLKCIEEYK--LE 387

Query: 381 ELLPGWELKEKILILENDISDINKIFDDTLVNKRKMDKNNSSIKMKVPEAKRTRF----- 435
           E  P   LK+++  LE            T   KRK           VP  KRTR      
Sbjct: 388 EEFPPENLKKRLDQLEK-----------TKTEKRK--------PAAVPANKRTRASYNGP 428

Query: 436 --PVKDQPVTSPSLARLHEQRFASHMDGSLTPNFVDGRLYGHPNYPSAAYAHHIGSVSGS 493
             P K   +T+  ++     R  SH     +P +     Y  P+ P+  Y++       S
Sbjct: 429 MPPAKAGRITNAYVSSFPFIRSPSH-----SPQYASPAAY--PSPPTTVYSNRSPPYPYS 481

Query: 494 ---LPESYHGSAV 503
              +P SY GS +
Sbjct: 482 PEIIPGSYQGSPI 494


>AT4G14900.1 | Symbols:  | hydroxyproline-rich glycoprotein family
           protein | chr4:8521754-8523602 REVERSE
          Length = 532

 Score = 69.7 bits (169), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 100/410 (24%), Positives = 166/410 (40%), Gaps = 60/410 (14%)

Query: 114 DVKGEENELVSLCQTMCSRGLRRYILSHLAEVKMLGEQVPEALKSAPKPSKLVFECIGRF 173
           D  G  + L SLC  M +RG   ++++   E++ L  Q+P AL     P KLV E +   
Sbjct: 119 DGDGLLSALKSLCLKMDARGFWGFVIARKKELENLRSQIPVALVDCVDPPKLVLEAVSEV 178

Query: 174 F----LQGSKAYTKDSPMVTARLASVLVLEYYLLSGRVDSEAQEE----DPSLNKEASQA 225
           F      G +  + D        A V++LE  L+   VD    +      PS+ ++A + 
Sbjct: 179 FPVDKRGGGEKVSND-----FGWACVVILE-SLIPVMVDPVMGKSRLLVTPSVKEKAKEI 232

Query: 226 AVSWRKRLIVEGGVANACEIDARGLVLFIACYGIPIDFLNEDVWNLVRL----SNPREIS 281
           A +W+  L   GG+ N    D    +  +  +GI    + +D   L R     S  R+  
Sbjct: 233 AETWKASLEERGGIENVKTPDVHTFLQHLVTFGI----VKKDDLALYRKLVVGSAWRKQM 288

Query: 282 KALRQSQALLMRVTDIAEVMMKRGKVVDALDLAYTFGFEEKFSPKTTLTSFLYKSDEVWK 341
             L  S  L  ++ D+ E ++ RG+ +DA+   +  G    F P   L ++L  + +   
Sbjct: 289 PKLAVSVGLGDQMPDMIEELIIRGQQLDAVHFTFEVGLVHLFPPVPLLKAYLRDAKKATA 348

Query: 342 KAKQDARDFPILLKEANEKYLAALKSIVNSLEGHKVDLEELLPGWELKEKILILENDISD 401
               D+ +          K  +AL++++  +E +K  LEE  P   LK+++  LE     
Sbjct: 349 LITDDSNNSGRSAHLVARKEQSALRAVLKCIEEYK--LEEEFPPENLKKRLDQLE----- 401

Query: 402 INKIFDDTLVNKRKMDKNNSSIKMKVPEAKRTRF-------PVKDQPVTSPSLARLHEQR 454
                      K K +K   ++   +P  KRTR        P K   +T+  ++      
Sbjct: 402 -----------KTKTEKRKPAV---IPANKRTRASYSGPMPPAKAGRITNAYVSSFPPPP 447

Query: 455 FASHMDGSLTPNFVDGRLYGHPNY---PSAAYAHHIGSVSGSLPESYHGS 501
                  S +P       YG P Y   P   Y++       S PE+ HGS
Sbjct: 448 PTFIRSQSHSPQ------YGVPAYTTSPPTIYSNRSPPYQYS-PEAVHGS 490


>AT1G31814.1 | Symbols: FRL2 | FRL2 (FRIGIDA LIKE 2) |
           chr1:11412966-11414387 REVERSE
          Length = 473

 Score = 52.0 bits (123), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/402 (19%), Positives = 165/402 (41%), Gaps = 41/402 (10%)

Query: 41  LAKSVNELTDLATAIQAFKSRYDELQRH-------LNYIERTIDTRTKELQSRTPKPAVE 93
           +A S+N++ +     Q  K  +D+LQ H        N     ID+    LQS        
Sbjct: 7   IAASINQIDEKK---QKLKKAFDDLQAHRSLLSPSFNLSWSEIDSHFSSLQSSLFNRLQS 63

Query: 94  KVTVVN-GDVSRAELKTKAEGDVKGEENELVSLCQTMCSRGLRRYILSHLAEVKMLGEQV 152
            VT  N G++   E  T    +      EL   C+    +GL  Y++ +  +   + E++
Sbjct: 64  AVTSSNSGNI---ETPTAVTTETPVLWPELRKFCEKNDGKGLGNYMIENSRKRLSINEEL 120

Query: 153 PEALKSAPKPSKLVFECIGRFFLQGSKAYTKDSPMVTARLASVLVLEYYLLSGRVDSEAQ 212
           P A++ +  P+ LV + I     +GS   +  S   +AR   V  +   LL   ++  A 
Sbjct: 121 PNAIRCSENPAPLVLDAI-----EGSYHCSSPSSSSSARAIDVKRIFVLLLEALIEINAN 175

Query: 213 EEDPSLNKEASQAAVSWRKRLIVEGGVANACEIDARGLVLFIACYGIPIDFLNEDVWNLV 272
             +  L + A   A  W+  +   G   +    +A G +  +A + +   F  E++ + +
Sbjct: 176 LTN-DLRERARTIAYDWKPNI---GNKPS----EALGFLHLVAAFELGSLFSTEEICDYI 227

Query: 273 RL-SNPREISKALRQSQALLMRVTDIAEVMMKRGKVVDALDLAYTFGFEEKFSPKTTLTS 331
            L S  ++ +   ++      R+  + +  +  G+++ A+   Y      +F P + L +
Sbjct: 228 FLISKYKQATTICKKIGLDRNRIGVLVQKFLDTGRLLVAIRFIYENEMVGEFEPVSILKT 287

Query: 332 FLYKSDEVWKKAKQDARDFPILLKEANEKYLAALKSIVNSLEGHKVDLEELLPGWELKEK 391
            L  S E  K+   +      +  EA +K L+AL++++  ++   ++ E           
Sbjct: 288 SLKNSREAAKRVCAEGNYSLKVQNEATDKELSALRAVIKVVKEKNIESE----------- 336

Query: 392 ILILENDISDINKIFDDTLVNKRKMDKNNSSIKMKVPEAKRT 433
              +E  + +  K  +D    +++  K NS    + P+ ++ 
Sbjct: 337 --FMEEKLEECVKELEDQKAQRKRATKFNSPANPQQPQEQKV 376


>AT5G27230.1 | Symbols:  | similar to hydroxyproline-rich
           glycoprotein family protein [Arabidopsis thaliana]
           (TAIR:AT3G22440.1); similar to unknown [Picea
           sitchensis] (GB:ABK24390.1); contains InterPro domain
           Frigida-like (InterPro:IPR012474) | chr5:9584258-9587841
           FORWARD
          Length = 948

 Score = 45.4 bits (106), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/161 (21%), Positives = 79/161 (49%), Gaps = 5/161 (3%)

Query: 221 EASQAAVSWRKRLIVEGGVANACEIDARGLVLFIACYGIPIDFLNEDVWNLVRLSNPREI 280
           +A + AV W+ ++      +   +++    + F+  +GI  +F  +D+  L+  S  + +
Sbjct: 630 DAKKLAVYWKDKI----AKSKRDQLEVICFLQFLGIFGIVSEFKADDLLGLLDNSYWQTV 685

Query: 281 SKALRQSQALLMRVTDIAEVMMKRGKVVDALDLAYTFGFEEKFSPKTTLTS-FLYKSDEV 339
           S  L Q   L   +    + ++K G  + A+D  Y+FG   +F P + + +  L  + E 
Sbjct: 686 SPDLCQFLGLDDAIPGFIQNLIKTGHRIKAIDYIYSFGMVHRFQPVSAIINDSLRITKES 745

Query: 340 WKKAKQDARDFPILLKEANEKYLAALKSIVNSLEGHKVDLE 380
            +K+ ++A++       A ++ + AL++ +  +  HK++ E
Sbjct: 746 AEKSYREAKNESTTQVAAIDRQVRALRAAIKCISCHKLESE 786


>AT5G16320.1 | Symbols: FRL1 | FRL1 (FRIGIDA LIKE 1) |
           chr5:5344510-5345922 FORWARD
          Length = 470

 Score = 42.4 bits (98), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 57/281 (20%), Positives = 127/281 (45%), Gaps = 20/281 (7%)

Query: 119 ENELVSLCQTMCSRGLRRYILSHLAEVKMLGEQVPEALKSAPKPSKLVFECIGRFFLQGS 178
           E EL +LC+ +   GL +Y++    +   L ++V  A++ +P  + +V + I     +GS
Sbjct: 98  EPELRALCEKIDGIGLIKYLIRIWDDETPLNQEVSAAIRYSPDTASMVLDAI-----EGS 152

Query: 179 KAYTKDSPMVTARLASVLVLEYYLLSGRVDSEAQEEDPSLNKEASQAAVSWRKRLIVEGG 238
             YT  S   +  +  V VL   L+   ++  A     + N+ A + A  W+ ++ V+  
Sbjct: 153 N-YTPSSSGRSFDVRRVFVL---LMEVLIEINANITVDTRNR-AKKLAYHWKSKVGVK-- 205

Query: 239 VANACEIDARGLVLFIACYGIPIDFLNEDVWNLV-RLSNPREISKALRQSQALLMRVTDI 297
                  +A   +  +A + +  +F  E++ + V  ++  ++ +    +      RV  +
Sbjct: 206 -----PFEALVFLHLVAAFELGSEFDTEELSDYVFMIAKYKQATLVCNKIGVDRKRVGKL 260

Query: 298 AEVMMKRGKVVDALDLAYTFGFEEKFSPKTTLTSFLYKSDEVWKKAKQDARDFPILLKEA 357
            + ++  GK + A+   Y  G  ++F P   L S++    E   +   +         EA
Sbjct: 261 IKTLLDSGKPILAVKFMYECGMTDEFEPIPVLKSYIKDCREAALRVCVEDNYSLKSQNEA 320

Query: 358 NEKYLAALKSIVNSLEGHKVDLEELLPGWELKEKILILEND 398
           ++K ++ALK ++  ++    +LE      +++E++  LE +
Sbjct: 321 SDKEVSALKPLIKIIKDQ--NLESEFTQEKVEERVEELEKN 359