Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC146721.2 - phase: 1 /pseudo
         (407 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

CA921489 weakly similar to PIR|T47566|T475 hypothetical protein ...   181  5e-46
TC85019 weakly similar to GP|9293890|dbj|BAB01793.1 emb|CAB70981...   179  2e-45
AJ548369 weakly similar to PIR|T48469|T484 hypothetical protein ...   122  2e-28
BE942775                                                               77  2e-14
AW696166 GP|15025223|gb Uncharacterized conserved protein {Clost...    72  5e-13
AW687065 similar to GP|9293890|dbj| emb|CAB70981.1~gene_id:MVE11...    59  3e-09
BG586161 similar to GP|21554129|gb| putative thaumatin {Arabidop...    44  9e-05
BG587051                                                               42  4e-04
TC83706 weakly similar to PIR|T48283|T48283 ankyrin-like protein...    41  0.001
TC91163 similar to PIR|T44832|T44832 probable emulsan repeating ...    35  0.040
TC88966 similar to GP|21305941|gb|AAM45816.1 NADH dehydrogenase ...    33  0.15
TC83751 weakly similar to GP|21311555|gb|AAM46778.1 15.8 kDa ole...    31  0.76
TC87068 similar to PIR|G96703|G96703 unknown protein  30164-3299...    30  1.3
TC84611 similar to PIR|T48213|T48213 hypothetical protein T20L15...    29  2.9
TC77007 similar to PIR|T10804|T10804 tonoplast intrinsic protein...    29  3.8
TC84209 similar to GP|4204311|gb|AAD10692.1| lcl|prt_seq No defi...    28  6.4
TC86560 similar to GP|8843737|dbj|BAA97285.1 myosin heavy chain-...    28  8.4
BQ140905                                                               28  8.4
AL383749 similar to GP|11907611|gb| Cdc24 {Eremothecium gossypii...    28  8.4

>CA921489 weakly similar to PIR|T47566|T475 hypothetical protein F24B22.30 -
           Arabidopsis thaliana, partial (10%)
          Length = 809

 Score =  181 bits (459), Expect = 5e-46
 Identities = 97/207 (46%), Positives = 133/207 (63%), Gaps = 1/207 (0%)
 Frame = -3

Query: 194 AALQMQGELQWFKAVESTVPPMCKEAKNADGLTPHELFTKNHEHLLNEGRQWAKDIASSF 253
           +A QMQ E   FK VE    P+ KE KN +G T  ++F + H+ LL EG+ W KD ++S 
Sbjct: 729 SAFQMQREYSGFKEVEKWDHPLHKEVKNQEGKTAWQVFKEEHKALLEEGKNWMKDTSNSC 550

Query: 254 TIVGTLIITIMFAAAFTVPGGNNQDKGTPIFLGKNAFSFFIVTDSLSLIASASSVLMFIG 313
            +V TLI TI FAAA TVPGGNNQDKG PIFL  N F  F+V+D+L+L +S +S+LMF+ 
Sbjct: 549 MLVATLIATIAFAAAITVPGGNNQDKGIPIFLSDNTFMVFVVSDALALFSSMASLLMFLA 370

Query: 314 ILTSRYAEEDFNTSLPAKLLFGLFTIFLSVVFMMCSFCSALALMLK-GYRWIIITAIASS 372
           IL +RY EEDF  +LP +L+ G+ ++F +VV  M +F +AL+++LK    W  I     +
Sbjct: 369 ILNARYTEEDFMMALPERLILGMASLFFAVVTTMVAFGAALSMLLKERLTWAPIPIALLA 190

Query: 373 VIPILVFMFSLLRLFSEVCISFLRSYF 399
            +PI +F    L LF E+ IS   S F
Sbjct: 189 CVPIALFAKLQLPLFIEMVISTYESQF 109


>TC85019 weakly similar to GP|9293890|dbj|BAB01793.1
           emb|CAB70981.1~gene_id:MVE11.3~similar to unknown
           protein {Arabidopsis thaliana}, partial (10%)
          Length = 663

 Score =  179 bits (453), Expect = 2e-45
 Identities = 101/215 (46%), Positives = 135/215 (61%), Gaps = 1/215 (0%)
 Frame = +2

Query: 143 AILNRQDKVFKLIYEMEGQKELKTTK-DIFENNLLHLAAELGPSSYRGCRSNAALQMQGE 201
           AI NRQ+KVF L+ EM     L     D   N   HLAA +  +S     + AA QM+ E
Sbjct: 23  AIKNRQEKVFNLLREMPIICNLLVLALDESNNTTSHLAARV--ASQAESIACAAFQMKRE 196

Query: 202 LQWFKAVESTVPPMCKEAKNADGLTPHELFTKNHEHLLNEGRQWAKDIASSFTIVGTLII 261
           L WFK VE    P+ K+ KN DG T  ++F + H+ LL EG+ W KD ++S  +V TLI 
Sbjct: 197 LHWFKEVEKLDHPLHKDVKNNDGKTAWQVFKEEHKTLLEEGKNWMKDTSNSCMLVATLIA 376

Query: 262 TIMFAAAFTVPGGNNQDKGTPIFLGKNAFSFFIVTDSLSLIASASSVLMFIGILTSRYAE 321
           TI FAAA TVPGGNNQDKG PIFL    F  FIV+D+L+L +S  S+LMF+ I+  RYA+
Sbjct: 377 TITFAAAITVPGGNNQDKGIPIFLSDKTFMLFIVSDALALFSSMVSLLMFLSIIHGRYAK 556

Query: 322 EDFNTSLPAKLLFGLFTIFLSVVFMMCSFCSALAL 356
           EDF  +LP +L+ G+  +F +V   M +F +AL++
Sbjct: 557 EDFVVALPKRLILGMAALFFAVGTTMIAFGAALSM 661


>AJ548369 weakly similar to PIR|T48469|T484 hypothetical protein T1E3.90 -
           Arabidopsis thaliana, partial (18%)
          Length = 579

 Score =  122 bits (306), Expect = 2e-28
 Identities = 64/175 (36%), Positives = 99/175 (56%), Gaps = 1/175 (0%)
 Frame = +2

Query: 109 AAKYGIIEFINSMREANPDLLWAMDKYKRGIFAHAILNRQDKVFKLIYEMEGQKE-LKTT 167
           A K G   F+  +  + PDL+  +D   R IF  A+ +    +F LI+E+   K+ +   
Sbjct: 50  ATKVGNFHFVAELLRSEPDLIRDVDDKNRSIFHIAVQHCHSSIFSLIHELGSFKDSIIDL 229

Query: 168 KDIFENNLLHLAAELGPSSYRGCRSNAALQMQGELQWFKAVESTVPPMCKEAKNADGLTP 227
           +D   NN+LH AA+L P S     S AALQM  E+ WF+ V+  + P+  + KN++G TP
Sbjct: 230 EDDERNNILHYAAKLAPPSQLNLISGAALQMTHEILWFEEVKELMSPIEIKKKNSNGKTP 409

Query: 228 HELFTKNHEHLLNEGRQWAKDIASSFTIVGTLIITIMFAAAFTVPGGNNQDKGTP 282
            E+F + H+ LL +   W + I +   ++ TLI T +F A F +PGG N++ GTP
Sbjct: 410 DEIFAEEHKELLTKAESWIESITNYCILISTLIFTGVFTATFNIPGGFNKNTGTP 574


>BE942775 
          Length = 111

 Score = 76.6 bits (187), Expect = 2e-14
 Identities = 36/36 (100%), Positives = 36/36 (100%)
 Frame = +3

Query: 73  NHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQ 108
           NHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQ
Sbjct: 3   NHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQ 110


>AW696166 GP|15025223|gb Uncharacterized conserved protein {Clostridium
           acetobutylicum}, partial (1%)
          Length = 661

 Score = 71.6 bits (174), Expect = 5e-13
 Identities = 45/151 (29%), Positives = 72/151 (46%), Gaps = 5/151 (3%)
 Frame = +3

Query: 107 LQAAKYGIIEFINSMREANPDLLWAMDKYKRGIFAHAILNRQDKVFKLIYEMEGQK---- 162
           L AAK GI+E +N +    P  +      K  +   A+  RQ  + + +  ++  K    
Sbjct: 117 LVAAKNGIVEMVNEILIKVPSAIHNTTSRKENVLLVAVKYRQPLIVETLRMIKHSKPELW 296

Query: 163 -ELKTTKDIFENNLLHLAAELGPSSYRGCRSNAALQMQGELQWFKAVESTVPPMCKEAKN 221
             L    D  EN +LHLAAE          + +ALQM  +++WF+ ++S VP       N
Sbjct: 297 NNLILAMDEDENTVLHLAAEALGGDKPWQIAGSALQMMWDIKWFQYIKSLVPQHFIFRNN 476

Query: 222 ADGLTPHELFTKNHEHLLNEGRQWAKDIASS 252
           + G T  E+F K H+ L+ +  +W KD + S
Sbjct: 477 SSGKTSREIFKKTHKGLIKDSSEWLKDTSES 569


>AW687065 similar to GP|9293890|dbj| emb|CAB70981.1~gene_id:MVE11.3~similar
           to unknown protein {Arabidopsis thaliana}, partial (2%)
          Length = 650

 Score = 58.9 bits (141), Expect = 3e-09
 Identities = 29/61 (47%), Positives = 39/61 (63%)
 Frame = +3

Query: 208 VESTVPPMCKEAKNADGLTPHELFTKNHEHLLNEGRQWAKDIASSFTIVGTLIITIMFAA 267
           VE    P+ KE KN +G T  ++F + H+ LL EG+ W KD ++S  +V TLI TI FAA
Sbjct: 465 VEKLDHPLHKEVKNQEGKTAWQVFKEEHKALLEEGKNWMKDTSNSCMLVATLIATIAFAA 644

Query: 268 A 268
           A
Sbjct: 645 A 647



 Score = 31.2 bits (69), Expect = 0.76
 Identities = 18/35 (51%), Positives = 21/35 (59%)
 Frame = +3

Query: 172 ENNLLHLAAELGPSSYRGCRSNAALQMQGELQWFK 206
           +N   HLAA L  +S     S +A QMQ ELQWFK
Sbjct: 3   QNTTSHLAARL--ASQVESISGSAFQMQRELQWFK 101


>BG586161 similar to GP|21554129|gb| putative thaumatin {Arabidopsis
           thaliana}, partial (6%)
          Length = 754

 Score = 44.3 bits (103), Expect = 9e-05
 Identities = 38/142 (26%), Positives = 65/142 (45%), Gaps = 4/142 (2%)
 Frame = +1

Query: 247 KDIASSFTIVGTLIITIMFAAAFTVPGGNNQDKGTPIFLGKNAFSFFIVTDSLSLIASAS 306
           KD+  +  +V TLIIT   AA F VPG   +  G    L    F  FI+  ++SL +S S
Sbjct: 235 KDMVETLILVSTLIITASVAACFAVPG---EADGKANNLCHAMFQAFIIFITISLFSSIS 405

Query: 307 SVLMFIGILTSRYAEEDFNTSLPAKLLFGLFTIFLSVVFMMCSFCSALALMLKGYRWI-- 364
           S+++             F+  +   +L G+  I LS+ F+     + L  ++    W+  
Sbjct: 406 SIIILFWATLGLTELVKFSLKIVMPIL-GIALISLSLAFI-----AGLYTVISELTWLAN 567

Query: 365 --IITAIASSVIPILVFMFSLL 384
             ++  +   V+ IL++M   L
Sbjct: 568 VFLVMTLIFVVVEILLYMLLFL 633


>BG587051 
          Length = 759

 Score = 42.0 bits (97), Expect = 4e-04
 Identities = 25/65 (38%), Positives = 36/65 (54%), Gaps = 2/65 (3%)
 Frame = +1

Query: 144 ILNRQDKVFKLIYEMEGQKELKTT--KDIFENNLLHLAAELGPSSYRGCRSNAALQMQGE 201
           +L+R   +F L++++   K +  T   D   N LLHLAA+L P +     S AA QM  E
Sbjct: 541 VLHRHASIFNLVHQIGHIKGIIVTYENDDDRNTLLHLAAKLAPRNQLELVSGAAFQMCVE 720

Query: 202 LQWFK 206
           L WF+
Sbjct: 721 LLWFE 735


>TC83706 weakly similar to PIR|T48283|T48283 ankyrin-like protein -
           Arabidopsis thaliana, partial (34%)
          Length = 746

 Score = 40.8 bits (94), Expect = 0.001
 Identities = 30/109 (27%), Positives = 52/109 (47%), Gaps = 9/109 (8%)
 Frame = +1

Query: 251 SSFTIVGTLIITIMFAAAFTVPGGNNQD---------KGTPIFLGKNAFSFFIVTDSLSL 301
           +S T+V  LI T+ FAA FTVPG   Q+          G         F  F++ DS +L
Sbjct: 100 NSNTVVAVLIATVAFAAIFTVPGQYPQNTKNLAPGMSPGEANIAPNIEFLIFVIFDSTAL 279

Query: 302 IASASSVLMFIGILTSRYAEEDFNTSLPAKLLFGLFTIFLSVVFMMCSF 350
             S + V++   ++      +   T++  KL++ +  + +SV F+  S+
Sbjct: 280 FISLAVVIVQTSVVVIEREAKKQMTAVINKLMW-IACVLISVAFLAMSY 423


>TC91163 similar to PIR|T44832|T44832 probable emulsan repeating unit
           polymerase [imported] - Acinetobacter lwoffii, partial
           (3%)
          Length = 606

 Score = 35.4 bits (80), Expect = 0.040
 Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
 Frame = +1

Query: 332 LLFGLFTIFLSVV--FMMCSFCSALALMLKGYRWIIITAIASSVIPILVFMFSLLRLFSE 389
           +LF  F  F  VV  F+  SFCSAL L     R  +++A+ S+   +LV   S   LF  
Sbjct: 313 VLFSFFVPFFIVVSFFLFFSFCSALLLC----RRFVVSAVGSAAARLLVCGSSASCLFLV 480

Query: 390 VCISFLRSYFL 400
           V +  L+S FL
Sbjct: 481 VLLCRLKSGFL 513


>TC88966 similar to GP|21305941|gb|AAM45816.1 NADH dehydrogenase subunit 2
           {Neoheterandria umbratilis}, partial (4%)
          Length = 915

 Score = 33.5 bits (75), Expect = 0.15
 Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 23/92 (25%)
 Frame = -2

Query: 244 QWAKDIASSFTIVGTLIITIMFAAAFTVPGGNNQDK---------------GTPIFLGK- 287
           +W KD+ SS ++  +LI T+ F+ A   PGG  Q                  T I +G+ 
Sbjct: 821 EWLKDMKSSISLTASLIATLTFSLATNPPGGVVQASVGDSNECGKILISTINTTICVGEA 642

Query: 288 -------NAFSFFIVTDSLSLIASASSVLMFI 312
                  + +  F++ +++  IAS S +L+ +
Sbjct: 641 ILATRSHDKYLAFLICNTICFIASLSVILVLV 546


>TC83751 weakly similar to GP|21311555|gb|AAM46778.1 15.8 kDa oleosin
           {Theobroma cacao}, partial (29%)
          Length = 860

 Score = 31.2 bits (69), Expect = 0.76
 Identities = 17/55 (30%), Positives = 30/55 (53%)
 Frame = +1

Query: 333 LFGLFTIFLSVVFMMCSFCSALALMLKGYRWIIITAIASSVIPILVFMFSLLRLF 387
           LFGLFT+F+     +   C    + + G+  I+   +   + PILV +F++L +F
Sbjct: 91  LFGLFTLFIVCTISLFLTCLTFVVTIMGF--ILFAPMIILLSPILVPVFAVLFVF 249


>TC87068 similar to PIR|G96703|G96703 unknown protein  30164-32998
           [imported] - Arabidopsis thaliana, partial (24%)
          Length = 711

 Score = 30.4 bits (67), Expect = 1.3
 Identities = 20/61 (32%), Positives = 29/61 (46%), Gaps = 12/61 (19%)
 Frame = -3

Query: 333 LFGLFTIFLSVVFMMCSFCSALALMLKG------------YRWIIITAIASSVIPILVFM 380
           L G+  IF +++F M SF    AL L+             Y W +I  I  +  PI+VF+
Sbjct: 412 LNGIVFIF*ALIFFMTSFSQP*ALFLQQIIQNSWNIILCIY*WQVIWLIIQTSFPIIVFL 233

Query: 381 F 381
           F
Sbjct: 232 F 230


>TC84611 similar to PIR|T48213|T48213 hypothetical protein T20L15.190 -
           Arabidopsis thaliana, partial (52%)
          Length = 946

 Score = 29.3 bits (64), Expect = 2.9
 Identities = 17/66 (25%), Positives = 34/66 (50%)
 Frame = +1

Query: 54  NLIKFTMLSGIKKIYGIKRNHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQAAKYG 113
           NL      + ++ I   KRN  ++++IMR++   ++KI     +H+  +  + L   K G
Sbjct: 79  NLESVMFRNALQGIDSSKRNALIIKQIMRQIITSLKKIHDTGIVHR-DVKPSNLVVTKKG 255

Query: 114 IIEFIN 119
            I+ I+
Sbjct: 256 QIKLID 273


>TC77007 similar to PIR|T10804|T10804 tonoplast intrinsic protein  delta
           type - upland cotton, complete
          Length = 1041

 Score = 28.9 bits (63), Expect = 3.8
 Identities = 15/60 (25%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
 Frame = +3

Query: 331 KLLFGLFTIFLSVVFMMCSFCSALALMLKGYRWIIITAIASSVIP---ILVFMFSLLRLF 387
           +LLF +   FL ++ ++ +F   ++ +L    W+++  + SS++    +L F+  LL +F
Sbjct: 249 QLLFAMVLHFLLLLQLVPTFLVVMSTLLSPLDWLLVDRLPSSLVSSTGLLSFLDPLLHVF 428


>TC84209 similar to GP|4204311|gb|AAD10692.1| lcl|prt_seq No definition line
           found {Arabidopsis thaliana}, partial (36%)
          Length = 701

 Score = 28.1 bits (61), Expect = 6.4
 Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 8/57 (14%)
 Frame = -1

Query: 44  SS*LGSLICV--------NLIKFTMLSGIKKIYGIKRNHYLVREIMRRLCEKIEKIS 92
           +S  GS +C+         LI     +  K+ +  KRNH +VRE+ R+   K +K++
Sbjct: 416 NSLFGSSVCIIAFSQKNHELIGMNQTARKKRCFP-KRNHDIVRELFRKFLNKADKVA 249


>TC86560 similar to GP|8843737|dbj|BAA97285.1 myosin heavy chain-like
            {Arabidopsis thaliana}, partial (28%)
          Length = 2450

 Score = 27.7 bits (60), Expect = 8.4
 Identities = 18/49 (36%), Positives = 26/49 (52%), Gaps = 4/49 (8%)
 Frame = -3

Query: 289  AFSFFIVTDSLSLIASA----SSVLMFIGILTSRYAEEDFNTSLPAKLL 333
            +FSFF  + + SL+ASA    SSVL FI   + R +       L  K++
Sbjct: 1608 SFSFFSASSNASLVASASFFNSSVLFFISSTSRRASSVSDVNRLSVKII 1462


>BQ140905 
          Length = 308

 Score = 27.7 bits (60), Expect = 8.4
 Identities = 19/53 (35%), Positives = 24/53 (44%), Gaps = 2/53 (3%)
 Frame = -2

Query: 276 NQDKGTPIFLGKNAFSFFIVTDSLSL--IASASSVLMFIGILTSRYAEEDFNT 326
           + DKG PIF   N FS F  ++ L L  +         I +L  R   ED NT
Sbjct: 298 HDDKGAPIFYTNNNFSTFFCSNRLDLFVVLYEHKKSPKIILLAKRRKRED*NT 140


>AL383749 similar to GP|11907611|gb| Cdc24 {Eremothecium gossypii}, partial
           (2%)
          Length = 305

 Score = 27.7 bits (60), Expect = 8.4
 Identities = 12/27 (44%), Positives = 15/27 (55%), Gaps = 5/27 (18%)
 Frame = +3

Query: 6   CRCCN-----KSCKVLSSNNDVVSSIW 27
           C CCN     KSCK+ S  N+   S+W
Sbjct: 222 CTCCNNCKLFKSCKLFSIYNNCYYSLW 302


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.328    0.139    0.417 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,616,198
Number of Sequences: 36976
Number of extensions: 193841
Number of successful extensions: 1550
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 1537
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1547
length of query: 407
length of database: 9,014,727
effective HSP length: 98
effective length of query: 309
effective length of database: 5,391,079
effective search space: 1665843411
effective search space used: 1665843411
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 59 (27.3 bits)


Medicago: description of AC146721.2