Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC147496.2 - phase: 0 /pseudo
         (550 letters)

Database: ara_mips 
           26,719 sequences; 11,318,596 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

At2g06890 putative retroelement integrase                             178  8e-45
At4g07850 putative polyprotein                                        171  1e-42
At2g06170 putative Ty3-gypsy-like retroelement pol polyprotein        118  1e-26
At4g08100 putative polyprotein                                        117  1e-26
At4g04230 putative transposon protein                                  64  2e-10
At4g13320 hypothetical protein                                         45  1e-04
At1g65280                                                              35  0.083
At5g60530 late embryonic abundant protein - like                       34  0.18
At3g07280 unknown protein                                              34  0.18
At5g27120 SAR DNA-binding protein - like                               33  0.31
At5g53800 unknown protein                                              32  0.70
At1g80980 hypothetical protein                                         32  0.70
At1g80700 hypothetical protein                                         32  0.70
At1g44910 splicing factor like protein                                 32  0.70
At4g03850 putative transposon protein                                  32  0.91
At5g40010 putative protein                                             32  1.2
At2g39320 hypothetical protein                                         32  1.2
At1g45545 hypothetical protein                                         32  1.2
At1g21160 transcription factor, putative                               32  1.2
At5g37350 unknown protein                                              31  1.6

>At2g06890 putative retroelement integrase
          Length = 1215

 Score =  178 bits (451), Expect = 8e-45
 Identities = 109/243 (44%), Positives = 145/243 (58%), Gaps = 19/243 (7%)

Query: 306 EEAEEIPS-GDLFMIRRFLGNQAKEEESNQRETLFHTRCLVQGKVCFLIIDGGSRTNVAS 364
           +E EE+P+ G+L + RR L  Q K +E  QR+ LFHTRC V GKVC LIIDGGS TNVAS
Sbjct: 200 KENEELPAQGELLVARRTLSVQTKTDEQEQRKNLFHTRCHVHGKVCSLIIDGGSCTNVAS 259

Query: 365 TRLVSKMELETKPHPKPYKLQWLNENVEILVDKQVEVCFKIGKYEDDVLCDVVPMEASHL 424
             +V K+ L+           WLN++ ++ V  QV V   IGKYED++LCDV+PMEA H+
Sbjct: 260 ETMVKKLGLK-----------WLNDSGKMRVKNQVVVPIVIGKYEDEILCDVLPMEAGHI 308

Query: 425 LLGRP*QFDRSVLHDGRTNKYSFMHSGQKISLAPLSPSEVRDDQKKRKEKYEKEKRKIKR 484
           LLGRP Q DR V+HDG TN++SF   G K  L  ++P EV  DQ   K+K    K ++ +
Sbjct: 309 LLGRPWQSDRKVMHDGFTNRHSFEFKGGKTILVSMTPHEVYQDQIHLKQK----KEQVVK 364

Query: 485 KEREFA*KKKKRRVSRNSHEQTAPIFTTLQKVALLTN-TQNFPSCTKFLLQECEDVFPKK 543
           +   FA  K     S  S +Q   +F   + +  LTN     PS    LLQ+ +DVFP+ 
Sbjct: 365 QPNFFA--KSGEVKSAYSSKQPMLLFVFKEALTSLTNFAPVLPSEMTSLLQDYKDVFPED 422

Query: 544 VPQ 546
            P+
Sbjct: 423 NPK 425


>At4g07850 putative polyprotein
          Length = 1138

 Score =  171 bits (432), Expect = 1e-42
 Identities = 88/151 (58%), Positives = 105/151 (69%), Gaps = 3/151 (1%)

Query: 289 DLGE*NVEVENHEGYVEEEAEEIPSGDLFMIRRFLGNQAKEEESNQRETLFHTRCLVQGK 348
           D GE   E E  E   E + EE P G+L +  R L    K EE  QRE LFHTRCL++GK
Sbjct: 303 DSGEVESEDEKPE---ESDVEEAPKGELLVTMRVLSVLNKAEEQAQRENLFHTRCLIKGK 359

Query: 349 VCFLIIDGGSRTNVASTRLVSKMELETKPHPKPYKLQWLNENVEILVDKQVEVCFKIGKY 408
           VC LIIDGGS TNVAS  +V K+ LE  PHPKPYKLQWLNE+ E+ V +QV+V   IGKY
Sbjct: 360 VCSLIIDGGSCTNVASETMVQKLGLEEFPHPKPYKLQWLNESGEMAVTRQVQVPLAIGKY 419

Query: 409 EDDVLCDVVPMEASHLLLGRP*QFDRSVLHD 439
           ED++LCD++P+EASH+LLGRP Q DR    D
Sbjct: 420 EDEILCDILPLEASHVLLGRPWQSDRKDYQD 450


>At2g06170 putative Ty3-gypsy-like retroelement pol polyprotein
          Length = 587

 Score =  118 bits (295), Expect = 1e-26
 Identities = 64/168 (38%), Positives = 96/168 (57%), Gaps = 6/168 (3%)

Query: 296 EVENHEGYVEEEAEEIPSGDLF--MIRRFLGNQAKEEESNQRETLFHTRCLVQGKVCFLI 353
           E+E  + Y E E  E  S ++   +++R L      +E  QR  LF TRC +  KVC LI
Sbjct: 183 ELEEEDEYAEVEFAEEESNEMINLVLQRIL---LSSKEEGQRRNLFRTRCSINDKVCNLI 239

Query: 354 IDGGSRTNVASTRLVSKMELETKPHPKPYKLQWLNENVEILVDKQVEVCFKIGK-YEDDV 412
           +D GS  N+ S +LV  ++L T  H KPY L W+++  +  V     V   IGK Y+++V
Sbjct: 240 VDIGSSENLVSQKLVEYLKLPTTLHQKPYSLGWVSKGSQFCVSLSCRVPISIGKHYKEEV 299

Query: 413 LCDVVPMEASHLLLGRP*QFDRSVLHDGRTNKYSFMHSGQKISLAPLS 460
           LCDV+ M+  H++LGR  Q+D  + + G+ N   F  +G KI +AP+S
Sbjct: 300 LCDVLNMDVCHIILGRSWQYDNDITYRGKDNVLMFTWNGHKIVMAPVS 347


>At4g08100 putative polyprotein
          Length = 1054

 Score =  117 bits (294), Expect = 1e-26
 Identities = 57/119 (47%), Positives = 79/119 (65%)

Query: 367 LVSKMELETKPHPKPYKLQWLNENVEILVDKQVEVCFKIGKYEDDVLCDVVPMEASHLLL 426
           +V K+ LE   HP+PY LQW NE  E+ V +QV+V   IGKY D+++CD++ M+ASH+LL
Sbjct: 388 MVEKLGLEVLKHPRPYSLQWRNETGEMSVKEQVKVPLSIGKYHDEIMCDILHMDASHILL 447

Query: 427 GRP*QFDRSVLHDGRTNKYSFMHSGQKISLAPLSPSEVRDDQKKRKEKYEKEKRKIKRK 485
           GRP Q DR VL DG TN+ +F H+G+K +L P++  EV  DQ   K++  K    I  K
Sbjct: 448 GRPWQSDRKVLQDGFTNRQTFEHNGRKTTLIPMTLHEVYLDQLSMKQRAIKPTEPIDTK 506


>At4g04230 putative transposon protein
          Length = 315

 Score = 64.3 bits (155), Expect = 2e-10
 Identities = 31/58 (53%), Positives = 40/58 (68%)

Query: 419 MEASHLLLGRP*QFDRSVLHDGRTNKYSFMHSGQKISLAPLSPSEVRDDQKKRKEKYE 476
           M   HLLLGRP QFDR+  H+ RTN YSF ++ +K +LAPLSP EV D Q    +++E
Sbjct: 1   MRVGHLLLGRPWQFDRATCHNRRTNHYSFTYNDRKYNLAPLSPLEVHDLQIHMNKEHE 58


>At4g13320 hypothetical protein
          Length = 216

 Score = 45.1 bits (105), Expect = 1e-04
 Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 8/108 (7%)

Query: 338 LFHTRCLVQGKVCFLIIDGGSRTNVASTRLVSKMELET-KPHPKPYKLQWLNENVEILVD 396
           +F T+C++  + C L++ GG+  N+ S  LV +++L+T K +P    +    E  + + +
Sbjct: 98  VFRTQCVINDEACRLVLYGGN--NIISKGLVKQLKLKTLKKYPSVRVMATRRE--DKVAE 153

Query: 397 KQVEVCFKIGK-YEDDVLCDVVPM--EASHLLLGRP*QFDRSVLHDGR 441
           +   V   IG  Y+D V C VV M  E   LL G P  +     H+GR
Sbjct: 154 ETCRVPVSIGDFYKDKVTCYVVNMEEEEDQLLFGGPWLYRVQATHNGR 201


>At1g65280 
          Length = 605

 Score = 35.4 bits (80), Expect = 0.083
 Identities = 14/37 (37%), Positives = 27/37 (72%)

Query: 468 QKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHE 504
           +K+RK + E++++KI+RKER+    KKK++  +  +E
Sbjct: 56  EKRRKREKERKRKKIERKERKRRDMKKKKKTKKREYE 92


>At5g60530 late embryonic abundant protein - like
          Length = 439

 Score = 34.3 bits (77), Expect = 0.18
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 468 QKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTA 507
           +K++K+K EKEK+  +RKE+E   K++K +  ++  E  A
Sbjct: 95  EKEKKDKLEKEKKDKERKEKERKEKERKAKEKKDKEESEA 134



 Score = 30.8 bits (68), Expect = 2.0
 Identities = 13/41 (31%), Positives = 26/41 (62%)

Query: 465 RDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQ 505
           ++ +KK KEK  K+K++ ++K++E   KK K R  +   ++
Sbjct: 61  KEQEKKDKEKAAKDKKEKEKKDKEEKEKKDKERKEKEKKDK 101



 Score = 30.8 bits (68), Expect = 2.0
 Identities = 14/38 (36%), Positives = 25/38 (64%)

Query: 468 QKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQ 505
           +K++K+K EKEK+  +RKE+E   K +K +  +   E+
Sbjct: 77  EKEKKDKEEKEKKDKERKEKEKKDKLEKEKKDKERKEK 114



 Score = 30.4 bits (67), Expect = 2.7
 Identities = 19/79 (24%), Positives = 39/79 (49%)

Query: 427 GRP*QFDRSVLHDGRTNKYSFMHSGQKISLAPLSPSEVRDDQKKRKEKYEKEKRKIKRKE 486
           G   Q D+    +G++N        Q+      +  + ++ +KK KE+ EK+ ++ K KE
Sbjct: 38  GNEVQVDKGKGDNGKSNGNGPKDKEQEKKDKEKAAKDKKEKEKKDKEEKEKKDKERKEKE 97

Query: 487 REFA*KKKKRRVSRNSHEQ 505
           ++   +K+K+   R   E+
Sbjct: 98  KKDKLEKEKKDKERKEKER 116


>At3g07280 unknown protein
          Length = 481

 Score = 34.3 bits (77), Expect = 0.18
 Identities = 19/56 (33%), Positives = 32/56 (56%)

Query: 467 DQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTAPIFTTLQKVALLTNT 522
           ++K+ KEK EKEK K K KER+   +K K R  ++  ++ +      +   +L+NT
Sbjct: 39  EKKEGKEKREKEKSKDKHKERKERKEKHKDRKDKDRDKEKSRTSEDRKAAGVLSNT 94



 Score = 29.6 bits (65), Expect = 4.5
 Identities = 15/60 (25%), Positives = 32/60 (53%)

Query: 463 EVRDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTAPIFTTLQKVALLTNT 522
           E R+ +K + +  E+++RK K K+R+   + K++  +    +    +  T  +  L+TNT
Sbjct: 45  EKREKEKSKDKHKERKERKEKHKDRKDKDRDKEKSRTSEDRKAAGVLSNTGDREKLVTNT 104


>At5g27120 SAR DNA-binding protein - like
          Length = 533

 Score = 33.5 bits (75), Expect = 0.31
 Identities = 15/46 (32%), Positives = 29/46 (62%)

Query: 452 QKISLAPLSPSEVRDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRR 497
           +K    P +  E    +KK+K K+E+E+ ++  K++E + KKKK++
Sbjct: 485 KKTEAEPETAEEPAKKEKKKKRKHEEEETEMPAKKKEKSEKKKKKK 530


>At5g53800 unknown protein
          Length = 339

 Score = 32.3 bits (72), Expect = 0.70
 Identities = 18/38 (47%), Positives = 24/38 (62%), Gaps = 1/38 (2%)

Query: 460 SPSEVRDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRR 497
           S SE  D ++   E  E+ +RK KRKERE   K++KRR
Sbjct: 114 SESEYSDSEESESED-ERRRRKRKRKEREEEEKERKRR 150


>At1g80980 hypothetical protein
          Length = 214

 Score = 32.3 bits (72), Expect = 0.70
 Identities = 20/79 (25%), Positives = 38/79 (47%)

Query: 468 QKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTAPIFTTLQKVALLTNTQNFPS 527
           +KKRKE+ +KEK + ++K  +     K         ++   I  T++++ L T   +  +
Sbjct: 119 EKKRKEEEKKEKEEAEQKALQVEAATKSHEELMEMRQRLGKIEETIKEIVLETKKPSGNA 178

Query: 528 CTKFLLQECEDVFPKKVPQ 546
            TK    +   + PK+V Q
Sbjct: 179 PTKTQEDQSTKLSPKEVSQ 197


>At1g80700 hypothetical protein
          Length = 214

 Score = 32.3 bits (72), Expect = 0.70
 Identities = 20/79 (25%), Positives = 38/79 (47%)

Query: 468 QKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTAPIFTTLQKVALLTNTQNFPS 527
           +KKRKE+ +KEK + ++K  +     K         ++   I  T++++ L T   +  +
Sbjct: 119 EKKRKEEEKKEKEEAEQKALQVEAATKSHEELMEMRQRLGKIEETIKEIVLETKKPSGNA 178

Query: 528 CTKFLLQECEDVFPKKVPQ 546
            TK    +   + PK+V Q
Sbjct: 179 PTKTQEDQSTKLSPKEVSQ 197


>At1g44910 splicing factor like protein
          Length = 958

 Score = 32.3 bits (72), Expect = 0.70
 Identities = 20/44 (45%), Positives = 27/44 (60%), Gaps = 5/44 (11%)

Query: 465 RDDQKKRKEKY--EKEKRKIKRKEREFA*KKKKRRVSRNSHEQT 506
           RD++K RKEK   EKEKRK K KER    +K++ R      E++
Sbjct: 813 RDEEKVRKEKERDEKEKRKDKDKERR---EKEREREKEKGKERS 853



 Score = 29.3 bits (64), Expect = 5.9
 Identities = 12/35 (34%), Positives = 24/35 (68%)

Query: 465 RDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVS 499
           RD+++KRK+K ++ + K + +E+E   ++ KR  S
Sbjct: 824 RDEKEKRKDKDKERREKEREREKEKGKERSKREES 858


>At4g03850 putative transposon protein
          Length = 334

 Score = 32.0 bits (71), Expect = 0.91
 Identities = 36/117 (30%), Positives = 53/117 (44%), Gaps = 20/117 (17%)

Query: 416 VVPMEASH---LLLGRP*QFDRSVLHDGRTNKYSFMHSGQKISLA---PLSPSEVRDDQK 469
           V+ ME  H   L+LGRP       + D R  K S ++ G+ I L      +P E ++D+K
Sbjct: 153 VLDMEVEHKDPLILGRPFLASVGAVIDVREGKIS-LNLGKHIMLQFDINKTPQESKEDEK 211

Query: 470 KRK-------EKYEKEKRKIKRKEREFA*KKKKRRVSRNSH--EQTAPIFTTLQKVA 517
                     EKYE EK K  +K  +    K++  + R +H  E+       LQK A
Sbjct: 212 TSGDDRVIPGEKYETEKVKELKKRSD----KQEETIERLAHSVEELRSKLNQLQKEA 264


>At5g40010 putative protein
          Length = 514

 Score = 31.6 bits (70), Expect = 1.2
 Identities = 17/47 (36%), Positives = 30/47 (63%), Gaps = 3/47 (6%)

Query: 463 EVRDDQKKR---KEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQT 506
           E +++ K+R   +EK +KE+ +IKRK+RE    KK+ +  +  +E T
Sbjct: 465 EEKEEAKRRIEDEEKKKKEEEEIKRKKREEKKIKKEEKEEKEENETT 511


>At2g39320 hypothetical protein
          Length = 189

 Score = 31.6 bits (70), Expect = 1.2
 Identities = 13/39 (33%), Positives = 27/39 (68%), Gaps = 5/39 (12%)

Query: 465 RDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSH 503
           +D + K+K+K +K+K K++++++E     KK + +RN H
Sbjct: 151 KDKEDKKKDKEDKKKAKVQKEKKE-----KKEKKNRNHH 184



 Score = 28.9 bits (63), Expect = 7.7
 Identities = 12/40 (30%), Positives = 26/40 (65%)

Query: 466 DDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQ 505
           +++K+RK+  ++EK+K K  +++    KKK +V +   E+
Sbjct: 136 EEEKERKDMEKEEKKKDKEDKKKDKEDKKKAKVQKEKKEK 175


>At1g45545 hypothetical protein
          Length = 752

 Score = 31.6 bits (70), Expect = 1.2
 Identities = 17/76 (22%), Positives = 34/76 (44%)

Query: 465 RDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTAPIFTTLQKVALLTNTQN 524
           R++ +K KE+ ++ K  +   ER+    KK+   SR S +        LQ+       ++
Sbjct: 520 REELRKAKEESDEAKTGLSAVERQLMESKKEMEASRASEKLALAAIKALQETEYANKIED 579

Query: 525 FPSCTKFLLQECEDVF 540
             S  K ++   E+ +
Sbjct: 580 ISSSPKSIIISVEEYY 595


>At1g21160 transcription factor, putative
          Length = 1088

 Score = 31.6 bits (70), Expect = 1.2
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 463 EVRDDQKKRKE----KYEKEKRKIKRKEREFA*KKKKRRVSRNSHEQTAPIFTTLQK 515
           E  D +KK +E    K E+E+R  + +ERE    ++KR++ +   +Q   I T  QK
Sbjct: 234 EAEDGKKKEEEERLRKEEEERRIEEEREREAEEIRQKRKIRKMEKKQEGLILTAKQK 290


>At5g37350 unknown protein
          Length = 531

 Score = 31.2 bits (69), Expect = 1.6
 Identities = 13/42 (30%), Positives = 29/42 (68%)

Query: 459 LSPSEVRDDQKKRKEKYEKEKRKIKRKEREFA*KKKKRRVSR 500
           L P + +  +K+ K+K ++EKR+ ++ +   + KK+K++VS+
Sbjct: 485 LGPEDKKAARKEHKKKVKEEKRESRKTKTPKSVKKRKKKVSK 526


  Database: ara_mips
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,978,382
  Number of sequences in database:  6832
  
  Database: /data/blast2/ara_mips_chr2
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,737,135
  Number of sequences in database:  4184
  
  Database: /data/blast2/ara_mips_chr3
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,236,886
  Number of sequences in database:  5377
  
  Database: /data/blast2/ara_mips_chr4
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,748,816
  Number of sequences in database:  4030
  
  Database: /data/blast2/ara_mips_chr5
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,569,679
  Number of sequences in database:  6098
  
  Database: /data/blast2/ara_mips_chl
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 25,951
  Number of sequences in database:  85
  
  Database: /data/blast2/ara_mips_mit
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 21,747
  Number of sequences in database:  113
  
Lambda     K      H
   0.358    0.160    0.569 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,770,819
Number of Sequences: 26719
Number of extensions: 435270
Number of successful extensions: 5060
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 28
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 4605
Number of HSP's gapped (non-prelim): 256
length of query: 550
length of database: 11,318,596
effective HSP length: 104
effective length of query: 446
effective length of database: 8,539,820
effective search space: 3808759720
effective search space used: 3808759720
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.8 bits)
S2: 63 (28.9 bits)
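
Note on the statistics above: the figures in this footer can be cross-checked against the hit list with the standard Karlin-Altschul relations. The Python sketch below is illustrative only and is not part of the BLAST output. The function names are hypothetical, and it assumes the effective lengths are obtained by simple subtraction of the effective HSP length, which does reproduce the printed search space here. Lambda and K are printed to only three significant figures, so recomputed bit scores and E-values agree with the report only approximately.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 550
DB_LEN = 11_318_596
NUM_SEQS = 26_719
EFF_HSP_LEN = 104
GAPPED_LAMBDA = 0.267   # rounded as printed
GAPPED_K = 0.0410       # rounded as printed


def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Effective query length times effective database length.

    Assumption: each length is shortened by the effective HSP length
    (once for the query, once per database sequence).
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """E-value as search_space * 2^(-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    space = effective_search_space(QUERY_LEN, DB_LEN, NUM_SEQS, EFF_HSP_LEN)
    print("effective search space:", space)
    # Top hit At2g06890 has raw score 451 in the report.
    print("bits: %.1f" % bit_score(451))
    print("E-value: %.0e" % expect_value(451, space))
```

For the top hit (At2g06890, raw score 451) this gives roughly 178 bits and an E-value close to the reported 8e-45. The same check works for any other hit by substituting the raw score shown in parentheses on its Score line.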

