Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC148816.5 + phase: 0 /pseudo
         (211 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BE248682 similar to GP|18568269|gb putative gag-pol polyprotein ...   121  2e-28
TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imp...   120  5e-28
CB891696                                                               59  2e-09
BQ148771                                                               52  1e-07
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot...    52  2e-07
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p...    49  2e-06
BG647708 weakly similar to GP|13786450|gb| putative reverse tran...    42  2e-04
BF006686                                                               40  5e-04
BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-li...    35  0.021
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non...    35  0.027
BG584442                                                               34  0.047
TC91262 similar to GP|20804797|dbj|BAB92481. putative amino acid...    34  0.047
BG646188 similar to PIR|T48467|T48 aspartyl aminopeptidase-like ...    31  0.40
AW774658 similar to GP|2808681|emb| Hcr9-4B {Lycopersicon hirsut...    27  0.54
CB892942 similar to GP|13569548|gb unknown {Arabidopsis thaliana...    29  1.2
TC82948                                                                28  2.0
AW560239 weakly similar to GP|14495231|dbj hypothetical protein~...    27  4.4
TC86890 weakly similar to GP|4406801|gb|AAD20110.1| unknown prot...    27  5.7
TC88914 weakly similar to PIR|F96672|F96672 Similar to Flavonol ...    26  9.8
TC91200 weakly similar to GP|10177640|dbj|BAB10787. contains sim...    26  9.8

>BE248682 similar to GP|18568269|gb putative gag-pol polyprotein {Zea mays},
           partial (1%)
          Length = 441

 Score =  121 bits (303), Expect = 2e-28
 Identities = 57/81 (70%), Positives = 65/81 (79%)
 Frame = +3

Query: 1   VSHLQYADDTLCIGKASVQNLWTMKAILRGFQMVSGLKINFSKSSLVGINVSEDFMAMAC 60
           VSHLQYADDTLCIG  +V NLWT+KA+L+GF+M SGLK+NF KSSL+GINV  DFM  AC
Sbjct: 159 VSHLQYADDTLCIGMPTVDNLWTLKALLQGFEMASGLKVNFHKSSLIGINVPRDFMEAAC 338

Query: 61  DFLNCSAGSITFKYLGLPVGA 81
            FLNC   SI F YLGLP G+
Sbjct: 339 RFLNCREESIPFIYLGLPGGS 401


>TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imported] -
           Arabidopsis thaliana, partial (6%)
          Length = 951

 Score =  120 bits (300), Expect = 5e-28
 Identities = 58/134 (43%), Positives = 88/134 (65%)
 Frame = +2

Query: 1   VSHLQYADDTLCIGKASVQNLWTMKAILRGFQMVSGLKINFSKSSLVGINVSEDFMAMAC 60
           VSHLQ+A+DTL +   +  N+  ++A L  F  +SGLK+NF KS LV +N++  +++ A 
Sbjct: 542 VSHLQFANDTLLLETKNWANIRALRAALVIF*AMSGLKVNFHKSGLVCVNIAPSWLSEAA 721

Query: 61  DFLNCSAGSITFKYLGLPVGANMRSMSTWEPLVETIGGRLNTWSTRYISFGGRIVLLNSV 120
             L+   G + F YLG+P+  N R +S WEP+V  I  RL  W++R++SFGGR+VLL SV
Sbjct: 722 SVLSWKVGKVPFLYLGMPIEGNSRRLSFWEPIVNRIKARLTGWNSRFLSFGGRLVLLKSV 901

Query: 121 LNSMPIFYLSFLKM 134
           L S+ ++ L   K+
Sbjct: 902 LTSLSVYALPSSKL 943


>CB891696 
          Length = 638

 Score = 58.5 bits (140), Expect = 2e-09
 Identities = 45/144 (31%), Positives = 76/144 (52%), Gaps = 1/144 (0%)
 Frame = +1

Query: 18  VQNLWTMKAILRGFQMVSGLKINFSKSSLVGINVSEDFMAMACDFLNCSAGSITFKYLGL 77
           V+N+ TMK I+  F++ S L +NF KS L+ +NV   F      ++ C    + FKYLG+
Sbjct: 4   VENILTMKTIVSYFELASSLWVNFLKSGLINLNVIGHF*GW*NIYIKCKVH*VIFKYLGI 183

Query: 78  PVGANMRSMSTWEPLVETIGGRLNT-WSTRYISFGGRIVLLNSVLNSMPIFYLSFLKMLV 136
            VG N   ++  E L++ +   L + W+T+ +        +++   S  I Y S +K+ V
Sbjct: 184 LVGENPCRVNM*ELLLKLLTN*LGSWWNTK*LWTQNGFSQIHAK*ISQNI-YFSLMKIPV 360

Query: 137 GVWKRIVRIQR*FLWGGVGGGKKI 160
            V + I +++  FL G +   KKI
Sbjct: 361 KV*ELISQLKTQFL*GNLKVTKKI 432


>BQ148771 
          Length = 680

 Score = 52.4 bits (124), Expect = 1e-07
 Identities = 25/105 (23%), Positives = 53/105 (49%)
 Frame = -3

Query: 100 LNTWSTRYISFGGRIVLLNSVLNSMPIFYLSFLKMLVGVWKRIVRIQR*FLWGGVGGGKK 159
           L  W   ++S   R+ L  SV+ ++P++ +    +     + I ++QR F+WG     ++
Sbjct: 573 LANWKANHLSLARRVTLAKSVIEAVPLYPMMTTIIPKACIEEIQKLQRKFVWGDTEVSRR 394

Query: 160 ISWVNWKSVCQQKENGGVRVKDIRVMNISLLAKWRWRLIDGREAL 204
              V W+++ + K   G+ ++ + VMN + + K  W +  G  +L
Sbjct: 393 YHAVGWETMSKPKTIYGLGLRRLDVMNKACIMKLGWSIYSGSNSL 259


>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
           dioxygenase1 {Pisum sativum}, partial (43%)
          Length = 1865

 Score = 52.0 bits (123), Expect = 2e-07
 Identities = 22/41 (53%), Positives = 31/41 (74%)
 Frame = -3

Query: 169 CQQKENGGVRVKDIRVMNISLLAKWRWRLIDGREALWK*VL 209
           C  +  GG+ V+DIR++N+SLLAKW WRL+  + +LWK VL
Sbjct: 969 CLPRCKGGLGVRDIRLVNVSLLAKWWWRLLQDQSSLWKEVL 847


>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
           polyprotein [imported] - Arabidopsis thaliana, partial
           (1%)
          Length = 831

 Score = 48.5 bits (114), Expect = 2e-06
 Identities = 19/56 (33%), Positives = 38/56 (66%)
 Frame = +1

Query: 41  FSKSSLVGINVSEDFMAMACDFLNCSAGSITFKYLGLPVGANMRSMSTWEPLVETI 96
           FS+ + + +N+ E F+  + +FL C+   + F +LGLP+GAN +  ST +P+++++
Sbjct: 256 FSRVNFMALNLEESFVEASPNFLLCNVNEVPFCFLGLPIGANPKRSSTRKPVLDSL 423


>BG647708 weakly similar to GP|13786450|gb| putative reverse transcriptase
           {Oryza sativa}, partial (9%)
          Length = 708

 Score = 41.6 bits (96), Expect = 2e-04
 Identities = 22/70 (31%), Positives = 35/70 (49%), Gaps = 1/70 (1%)
 Frame = +1

Query: 1   VSHLQYADDTLCIGKASVQNLWTMKAILRGFQMVSGLKINFSKSSL-VGINVSEDFMAMA 59
           ++HL +ADD+L   +A++    T+  +L  +Q  SG  +NF KS +    NV      M 
Sbjct: 145 ITHLLFADDSLLFARANLTEAATIMQVLHSYQSASGQLVNFEKSEVSYSQNVPNQEKEMI 324

Query: 60  CDFLNCSAGS 69
           C  +    GS
Sbjct: 325 CQQIAIKTGS 354


>BF006686 
          Length = 325

 Score = 40.4 bits (93), Expect = 5e-04
 Identities = 16/29 (55%), Positives = 21/29 (72%)
 Frame = +3

Query: 89  WEPLVETIGGRLNTWSTRYISFGGRIVLL 117
           WEPL+E +   L +W  + +SFGGRIVLL
Sbjct: 237 WEPLLEHVNKMLKSWGNKLLSFGGRIVLL 323


>BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-like protein
           {Arabidopsis thaliana}, partial (18%)
          Length = 789

 Score = 35.0 bits (79), Expect = 0.021
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
 Frame = -3

Query: 1   VSHLQYADDTLCIGKASVQNLWTMKAILRGFQMVSGLKINFSKSSLV-GINVSEDFMAMA 59
           ++HL +ADDT+  GK++  +   + +I+  ++  SG  IN +KS++      S+  +   
Sbjct: 253 INHLLFADDTMFFGKSNASSCAILLSIMDKYRAASGRCIN*TKSAITFSSKTSQAIIDRV 74

Query: 60  CDFLNCSAGSITFKYLG 76
              L  +    T KYLG
Sbjct: 73  KGELKIAKEGGTGKYLG 23


>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
           retroelement reverse transcriptase {Oryza sativa
           (japonica cultivar-group)}, partial (2%)
          Length = 1262

 Score = 34.7 bits (78), Expect = 0.027
 Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
 Frame = +1

Query: 177 VRVKDIRV--MNISLLAKWRWRLIDGREALWK*VL 209
           +R+K + V   N+SLL KW WRL+  +E LW  VL
Sbjct: 19  LRMKGLGVGAFNLSLLGKWCWRLLVDKEGLWHRVL 123


>BG584442 
          Length = 775

 Score = 33.9 bits (76), Expect = 0.047
 Identities = 21/79 (26%), Positives = 36/79 (44%), Gaps = 1/79 (1%)
 Frame = +1

Query: 115 VLLNSVLNSMPIFYLSFLKMLVGVWKRIVRIQR*FLWGGVGGGKK-ISWVNWKSVCQQKE 173
           V++   L S+  + +S   +L      I +I   F W  VG  +K + W++ + +   K 
Sbjct: 430 VMIKYALQSISSYVMSIFLLLNSQVDEIEKIMNTFSWVHVGENRKGMHWMS*EKLFVHKN 609

Query: 174 NGGVRVKDIRVMNISLLAK 192
            GG+   D    NI +L K
Sbjct: 610 YGGMGFTDFTTFNIPMLGK 666


>TC91262 similar to GP|20804797|dbj|BAB92481. putative amino acid or GABA
           permease {Oryza sativa (japonica cultivar-group)},
           partial (36%)
          Length = 904

 Score = 33.9 bits (76), Expect = 0.047
 Identities = 14/32 (43%), Positives = 22/32 (68%)
 Frame = +1

Query: 108 ISFGGRIVLLNSVLNSMPIFYLSFLKMLVGVW 139
           I+  G I++L+ ++NS+PI +LSFL  L   W
Sbjct: 520 IAIHGGILVLHGIINSLPISWLSFLGQLAAFW 615


>BG646188 similar to PIR|T48467|T48 aspartyl aminopeptidase-like protein -
           Arabidopsis thaliana, partial (30%)
          Length = 776

 Score = 30.8 bits (68), Expect = 0.40
 Identities = 24/69 (34%), Positives = 36/69 (51%), Gaps = 1/69 (1%)
 Frame = +3

Query: 78  PVGANMRSMSTWEPLVETIGGRL-NTWSTRYISFGGRIVLLNSVLNSMPIFYLSFLKMLV 136
           P  A++++ S     V+T GG L +TW  R +S  GR++L  S          SF+  LV
Sbjct: 393 PKTASLKASSYMMVNVQTYGGGLWHTWFDRDLSVAGRVILKRS--------DKSFVHKLV 548

Query: 137 GVWKRIVRI 145
            V + I+RI
Sbjct: 549 KVSRPILRI 575


>AW774658 similar to GP|2808681|emb| Hcr9-4B {Lycopersicon hirsutum}, partial
           (4%)
          Length = 665

 Score = 27.3 bits (59), Expect(2) = 0.54
 Identities = 12/19 (63%), Positives = 14/19 (73%)
 Frame = -1

Query: 2   SHLQYADDTLCIGKASVQN 20
           SHLQ+ADDTL +G  S  N
Sbjct: 383 SHLQFADDTLLLGVKSWAN 327



 Score = 21.6 bits (44), Expect(2) = 0.54
 Identities = 8/24 (33%), Positives = 17/24 (70%)
 Frame = -3

Query: 24  MKAILRGFQMVSGLKINFSKSSLV 47
           +++IL  F+ +SGLK+N  +  ++
Sbjct: 321 LRSILVIFENMSGLKVNLREEVII 250


>CB892942 similar to GP|13569548|gb unknown {Arabidopsis thaliana}, partial
           (45%)
          Length = 767

 Score = 29.3 bits (64), Expect = 1.2
 Identities = 15/32 (46%), Positives = 19/32 (58%), Gaps = 2/32 (6%)
 Frame = -1

Query: 174 NGGVRVKDIRVMNISLLA--KWRWRLIDGREA 203
           + G RV  +RV   SLL   KW WR+ +G EA
Sbjct: 401 SSGTRVLCVRVQKTSLLQVKKWTWRVREGGEA 306


>TC82948 
          Length = 705

 Score = 28.5 bits (62), Expect = 2.0
 Identities = 11/44 (25%), Positives = 25/44 (56%)
 Frame = +3

Query: 159 KISWVNWKSVCQQKENGGVRVKDIRVMNISLLAKWRWRLIDGRE 202
           K+  V+W+ VC+  + G + ++ +  +N +L  K  W ++  +E
Sbjct: 255 KVVKVSWEKVCRPIKEGSLGIRSLSKLNEALNLKLCWDMMISKE 386


>AW560239 weakly similar to GP|14495231|dbj hypothetical protein~similar to
           Arabidopsis thaliana chromosome 1  T6A9.6, partial (10%)
          Length = 630

 Score = 27.3 bits (59), Expect = 4.4
 Identities = 15/44 (34%), Positives = 25/44 (56%)
 Frame = -3

Query: 166 KSVCQQKENGGVRVKDIRVMNISLLAKWRWRLIDGREALWK*VL 209
           + + Q+ E+G     DIR+ N+ + A  +W L+D  EA  K +L
Sbjct: 574 EKILQEWESGNTTF-DIRIPNMMITAYCKWGLLDKAEAYIKRLL 446


>TC86890 weakly similar to GP|4406801|gb|AAD20110.1| unknown protein
           {Arabidopsis thaliana}, partial (54%)
          Length = 2446

 Score = 26.9 bits (58), Expect = 5.7
 Identities = 10/22 (45%), Positives = 16/22 (72%)
 Frame = +1

Query: 175 GGVRVKDIRVMNISLLAKWRWR 196
           GGVR+K + +++I LL +  WR
Sbjct: 790 GGVRIKTVPLLSIGLLCEVTWR 855


>TC88914 weakly similar to PIR|F96672|F96672 Similar to Flavonol
           3-O-Glucosyltransferase [imported] - Arabidopsis
           thaliana, partial (20%)
          Length = 1051

 Score = 26.2 bits (56), Expect = 9.8
 Identities = 10/27 (37%), Positives = 16/27 (59%)
 Frame = -3

Query: 125 PIFYLSFLKMLVGVWKRIVRIQR*FLW 151
           PI    + K L  +WK I++I+  +LW
Sbjct: 647 PILKTFYAKFLSQIWKPILKIKENYLW 567


>TC91200 weakly similar to GP|10177640|dbj|BAB10787. contains similarity to
           heparanase~gene_id:MGG23.2 {Arabidopsis thaliana},
           partial (15%)
          Length = 805

 Score = 26.2 bits (56), Expect = 9.8
 Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 8/64 (12%)
 Frame = +3

Query: 44  SSLVGINVSEDFMAMA--------CDFLNCSAGSITFKYLGLPVGANMRSMSTWEPLVET 95
           SS++G N+ +DF+           CD+  CS G  +   L L     + ++  + PL   
Sbjct: 477 SSVIG-NIDDDFICATLDWWPPQKCDYGTCSWGLASLLNLDLNNKIFLNAVKAFSPLKLR 653

Query: 96  IGGR 99
           +GG+
Sbjct: 654 LGGK 665


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.330    0.142    0.465 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,787,115
Number of Sequences: 36976
Number of extensions: 115890
Number of successful extensions: 768
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 764
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 768
length of query: 211
length of database: 9,014,727
effective HSP length: 92
effective length of query: 119
effective length of database: 5,612,935
effective search space: 667939265
effective search space used: 667939265
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 56 (26.2 bits)


Medicago: description of AC148816.5