Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0074.12
         (369 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q9SQW9 Putative retroelement pol polyprotein [Arabidop...    46  0.001
UniRef100_Q7XPJ7 OSJNBa0087O24.13 protein [Oryza sativa]               43  0.012
UniRef100_Q94GG1 Putative polyprotein [Oryza sativa]                   43  0.016
UniRef100_Q6CBR5 Similarity [Yarrowia lipolytica]                      43  0.016
UniRef100_Q8LAQ8 Hypothetical protein [Arabidopsis thaliana]           42  0.021
UniRef100_Q6L3V6 Hypothetical protein [Solanum demissum]               42  0.027
UniRef100_Q8LLX2 Putative retroelement [Oryza sativa]                  42  0.035
UniRef100_Q8S653 Putative retroelement [Oryza sativa]                  42  0.035
UniRef100_Q8H7X6 Putative retroelement [Oryza sativa]                  39  0.23
UniRef100_Q9FWC7 Putative plant disease resistance polyprotein [...    38  0.51
UniRef100_Q9S9R5 F28J9.14 [Arabidopsis thaliana]                       37  0.67
UniRef100_Q9FYC9 Hypothetical protein F22J12_60 [Arabidopsis tha...    37  0.67
UniRef100_Q7XZZ8 Putative polyprotein [Oryza sativa]                   37  0.67
UniRef100_UPI00004345B9 UPI00004345B9 UniRef100 entry                  37  0.87
UniRef100_UPI00004345B8 UPI00004345B8 UniRef100 entry                  37  0.87
UniRef100_Q852F5 Putative polyprotein [Oryza sativa]                   37  0.87
UniRef100_Q6C914 Similarities with tr|Q9XE85 Sorghum bicolor Pol...    37  0.87
UniRef100_UPI00004193CA UPI00004193CA UniRef100 entry                  37  1.1
UniRef100_Q84ZV5 Polyprotein [Glycine max]                             37  1.1
UniRef100_Q7XEK0 Hypothetical protein [Oryza sativa]                   37  1.1

>UniRef100_Q9SQW9 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1661

 Score = 46.2 bits (108), Expect = 0.001
 Identities = 37/144 (25%), Positives = 64/144 (43%), Gaps = 17/144 (11%)

Query: 22  FYSPPSQPNTSPYYCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYFLQNPYPWETR 81
           F +    P ++P+     P +      ++    + G N + WLF  +  FL N    E +
Sbjct: 207 FQTQTFPPQSAPHQ----PRFEAAPRRTVDYPAYEGGNADDWLFRLEQCFLSNRTLEEEK 262

Query: 82  FQLLAIHLGGKALTWFRLS--HQQIHSWEEFKVAFRLYFVYNRPNGFSTTTVKYGFEKSS 139
            +     L G ++TW+R S   +QI++W EF+  F L F   RP+  S+           
Sbjct: 263 LEKAVSCLTGASVTWWRCSKDREQIYTWREFQEKFMLRF---RPSRGSSAV-------DH 312

Query: 140 LVEIQRSSEQVSPHRVVFSETTLE 163
           L+ + R +  V  +R  F E T++
Sbjct: 313 LLNV-RQTGTVEEYRERFEELTVD 335


>UniRef100_Q7XPJ7 OSJNBa0087O24.13 protein [Oryza sativa]
          Length = 1311

 Score = 43.1 bits (100), Expect = 0.012
 Identities = 31/97 (31%), Positives = 42/97 (42%), Gaps = 8/97 (8%)

Query: 23  YSPPS--QPNTSPY---YCHNVPSYYPQTHS-SLQLSEFNGENPEYWLFMADCYFLQNPY 76
           + PP   Q N  P+   Y H   S  P   S ++    F G+ PE W+  A+ YF     
Sbjct: 257 FHPPHHHQYNPEPHHTNYAHRPHSADPAKRSRNVDFPTFEGDYPESWIRKAEKYFSLYQT 316

Query: 77  PWETRFQLLAIHLGGKALTWFRLSHQQIH--SWEEFK 111
           P E +  L  +H+ G+A  W   S       SW EFK
Sbjct: 317 PEEDKVLLAEVHISGRADQWIESSAVPTASLSWPEFK 353


>UniRef100_Q94GG1 Putative polyprotein [Oryza sativa]
          Length = 869

 Score = 42.7 bits (99), Expect = 0.016
 Identities = 21/73 (28%), Positives = 38/73 (51%), Gaps = 4/73 (5%)

Query: 53  SEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQIH---SWEE 109
           ++F+G+NP+ W   ++ YF     P+ET      +H  G A  W + +++++H   +W E
Sbjct: 96  AKFDGDNPKLWKTNSEKYFSMYQVPYETWSSFATLHFIGNAALWLQ-TYEELHCVENWSE 154

Query: 110 FKVAFRLYFVYNR 122
             VA    F  +R
Sbjct: 155 LSVAVHSKFGKDR 167


>UniRef100_Q6CBR5 Similarity [Yarrowia lipolytica]
          Length = 1136

 Score = 42.7 bits (99), Expect = 0.016
 Identities = 34/96 (35%), Positives = 45/96 (46%), Gaps = 11/96 (11%)

Query: 138 SSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDVHLLPEVVSRVELGSAETEIES 197
           S++     SS  V+    V SE T    A +P EA+S     PEV S  E  S       
Sbjct: 654 SNVTSSANSSSDVTSSADVSSEVTTS--ADIPSEATSSAETTPEVTSSAEATS------- 704

Query: 198 HESSTSED-SSLITTPVDSTPYATRSDAITPDVNKT 232
            E +TS D SS +T   ++TP AT S   TP+V  +
Sbjct: 705 -EVTTSADISSEVTASAEATPEATSSAEATPEVTSS 739


>UniRef100_Q8LAQ8 Hypothetical protein [Arabidopsis thaliana]
          Length = 272

 Score = 42.4 bits (98), Expect = 0.021
 Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 24  SPPSQPNTSPYYCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQ 83
           S  ++ N+   + H +  Y       +++  F G N   WL  A+ YF    +    + Q
Sbjct: 88  STAAEDNSIDTHPHRMSRYIQLMKPKIEMPVFEGPNVNSWLTRAERYFEFGSFTNAEKIQ 147

Query: 84  LLAIHLGGKALTWFRLSHQQ-IHSWEEFK 111
           L+ + + G AL WF L ++     W +FK
Sbjct: 148 LVYMSVEGPALCWFNLENRNPFVDWNDFK 176


>UniRef100_Q6L3V6 Hypothetical protein [Solanum demissum]
          Length = 490

 Score = 42.0 bits (97), Expect = 0.027
 Identities = 61/276 (22%), Positives = 112/276 (40%), Gaps = 28/276 (10%)

Query: 59  NPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFR--LSHQQIHSWEEFKVAFRL 116
           NP+ W+F A+ YF    +       L  ++L G+AL WFR    ++Q   W+ FK    L
Sbjct: 14  NPDDWIFRAERYFAYLGFLENDWIPLPFLYLDGEALDWFRWMYRNKQFWDWKHFKEKLSL 73

Query: 117 YFVYNRPNGFSTTTVKYGFEKSSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDV 176
            F        + TT +     S +  I    ++V  +  V S        QL    +S  
Sbjct: 74  CF-------RARTTPESALAHSQITTILTLLQKVQDNFSVMSN-------QLDNTHNSIS 119

Query: 177 HLLPEVVSRVELGSAETEIESHESSTSEDSSLITTPVDSTPYATRSDAITPDVNKTFVLK 236
           ++     + V+  + E  I+S  +  SE+  + +   D       +  + P++  T +  
Sbjct: 120 NITGLSPTIVQEAAIEDAIDSATTRVSEE--IASALGDGQSSIANNFEVHPEMFDTLI-- 175

Query: 237 TTDLVLRNPRQVFGNLLEGKFVQKNNEAQFEKHFDKLLMQFHNSSTTVDVQKIGVPCPKV 296
            T+L+L     V G+   G   Q  +E+  +   + L+ +F  +   V    +G+  P  
Sbjct: 176 DTNLLLAGTTFVIGDEHSGHVDQVFDESSHQ--IEDLVHEFGMAFDLVADNLVGLQIP-- 231

Query: 297 FEKMLVTRNTRICVDNTFEKLLETYINLLLDWQRMR 332
              +  TR+    +    ++ LET  N++ D    R
Sbjct: 232 ---LQATRSVEPPLVEE-DECLETKTNIVFDGSLQR 263


>UniRef100_Q8LLX2 Putative retroelement [Oryza sativa]
          Length = 813

 Score = 41.6 bits (96), Expect = 0.035
 Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 55  FNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQIH---SWEEFK 111
           F+GENP+ W   A+ YF      +ET  Q   +H  G A  W + +++++H   SW E  
Sbjct: 117 FSGENPKLWKKNAEKYFGMYNVAYETWAQFATLHFTGNAALWLQ-TYEELHSVESWAELC 175

Query: 112 VAFRLYF 118
           VA    F
Sbjct: 176 VAVNSKF 182


>UniRef100_Q8S653 Putative retroelement [Oryza sativa]
          Length = 1043

 Score = 41.6 bits (96), Expect = 0.035
 Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 55  FNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQIH---SWEEFK 111
           F+GENP+ W   A+ YF      +ET  Q   +H  G A  W + +++++H   SW E  
Sbjct: 117 FSGENPKLWKKNAEKYFGMYNVAYETWAQFATLHFTGNAALWLQ-TYEELHSVESWAELC 175

Query: 112 VAFRLYF 118
           VA    F
Sbjct: 176 VAVNSKF 182


>UniRef100_Q8H7X6 Putative retroelement [Oryza sativa]
          Length = 1021

 Score = 38.9 bits (89), Expect = 0.23
 Identities = 26/84 (30%), Positives = 39/84 (45%), Gaps = 16/84 (19%)

Query: 35  YCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYF-LQNPYP--WETRFQLLAIHLGG 91
           Y H VP         L   +F+G +PE W    + YF + N +P  W    ++  I+  G
Sbjct: 226 YNHRVPK--------LDFPKFDGTDPEDWRMRCEHYFDVNNTFPGLW---VRIATIYFSG 274

Query: 92  KALTWFR--LSHQQIHSWEEFKVA 113
           +A +W R   +H +   WE F VA
Sbjct: 275 RAASWLRSTKAHVRFPIWENFCVA 298


>UniRef100_Q9FWC7 Putative plant disease resistance polyprotein [Oryza sativa]
          Length = 894

 Score = 37.7 bits (86), Expect = 0.51
 Identities = 16/71 (22%), Positives = 32/71 (44%), Gaps = 2/71 (2%)

Query: 50  LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSH--QQIHSW 107
           +    F+G +   WL M + YF         +   + +H+ G A  W+       +++SW
Sbjct: 122 MDFPRFDGSDVRIWLNMCETYFDMYQITQNFKVSAVVLHMSGNAAQWYHSYKLVNEVNSW 181

Query: 108 EEFKVAFRLYF 118
           ++F++A    F
Sbjct: 182 DQFRMAVATEF 192


>UniRef100_Q9S9R5 F28J9.14 [Arabidopsis thaliana]
          Length = 525

 Score = 37.4 bits (85), Expect = 0.67
 Identities = 32/120 (26%), Positives = 51/120 (41%), Gaps = 9/120 (7%)

Query: 50  LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFR-LSHQQI---- 104
           L    FNG+  + WL + + +F     P E +  L++IH  G A  W + +SH  +    
Sbjct: 139 LDFPRFNGDKIQEWLLLVEQFFEIYQTPDEFKVCLVSIHFDGLANAWHQSISHSVMWEHV 198

Query: 105 -HSWEEFKVAFRLYFVYNRPNGFSTTTVKYGFEKSSLVEIQRSSEQVSPHRVVFSETTLE 163
            H W  +K+  ++   YN     S   +    E   + E     E +S  RV F E  L+
Sbjct: 199 RHDWWSYKLLLQVR--YNEHVDDSIAKLTQLQETEGIEEYHARFELIST-RVNFGEDYLK 255


>UniRef100_Q9FYC9 Hypothetical protein F22J12_60 [Arabidopsis thaliana]
          Length = 221

 Score = 37.4 bits (85), Expect = 0.67
 Identities = 29/120 (24%), Positives = 47/120 (39%), Gaps = 11/120 (9%)

Query: 10  SILSAQRDPRSIFYSPPSQPNTS-PYYCHNVPSYYPQTH----SSLQLSEFNGENPEYWL 64
           S+    R    +  SP SQ   S P +     S Y          +    FNG+  + WL
Sbjct: 24  SMAPHDRSSSMVLGSPESQSGCSDPNHDVRSDSQYHYRRLRRLGKVDFPRFNGDGIKDWL 83

Query: 65  FMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSHQQI------HSWEEFKVAFRLYF 118
           F  + +FL +  P E +  + +IH      TW +   Q I      H W  +K+  ++ +
Sbjct: 84  FQIEQFFLIDHTPEELKVDIASIHFDDIDATWHQSIVQSIMWRHVRHDWWNYKLLLQVRY 143


>UniRef100_Q7XZZ8 Putative polyprotein [Oryza sativa]
          Length = 1246

 Score = 37.4 bits (85), Expect = 0.67
 Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 4/70 (5%)

Query: 33  PYYCHNV-PSYYPQTH---SSLQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIH 88
           P Y H V P+ Y +       L+L+ FNGE+   WL   + +F     P +    L + H
Sbjct: 279 PRYLHGVIPNPYTEMSLRSQRLELTLFNGEDAVGWLQQCEKFFEMTGTPVDQWVNLASGH 338

Query: 89  LGGKALTWFR 98
           L G+A  WFR
Sbjct: 339 LVGRAGKWFR 348


>UniRef100_UPI00004345B9 UPI00004345B9 UniRef100 entry
          Length = 5078

 Score = 37.0 bits (84), Expect = 0.87
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 18/109 (16%)

Query: 132  KYGFEKSSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDVHLLPEVVSRVELGSA 191
            K   EK  + +I++  E++ P  +     T  D ++ P   SS++H++ E         A
Sbjct: 2290 KVDIEKHVVEKIEKVKEEIKPESIPIETKTAIDASKRP-SISSEIHVVTE---------A 2339

Query: 192  ETEIESHESSTSEDSSLITTPVDSTPYATRSDAITPDVNKTFVLKTTDL 240
            ET++     S SE        +D T   TRS +I  D+  +  ++T  +
Sbjct: 2340 ETKVIDKSKSPSE--------IDETEDKTRSPSIAGDIEDSKEVETASI 2380


>UniRef100_UPI00004345B8 UPI00004345B8 UniRef100 entry
          Length = 4971

 Score = 37.0 bits (84), Expect = 0.87
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 18/109 (16%)

Query: 132  KYGFEKSSLVEIQRSSEQVSPHRVVFSETTLEDGAQLPFEASSDVHLLPEVVSRVELGSA 191
            K   EK  + +I++  E++ P  +     T  D ++ P   SS++H++ E         A
Sbjct: 1825 KVDIEKHVVEKIEKVKEEIKPESIPIETKTAIDASKRP-SISSEIHVVTE---------A 1874

Query: 192  ETEIESHESSTSEDSSLITTPVDSTPYATRSDAITPDVNKTFVLKTTDL 240
            ET++     S SE        +D T   TRS +I  D+  +  ++T  +
Sbjct: 1875 ETKVIDKSKSPSE--------IDETEDKTRSPSIAGDIEDSKEVETASI 1915


>UniRef100_Q852F5 Putative polyprotein [Oryza sativa]
          Length = 1155

 Score = 37.0 bits (84), Expect = 0.87
 Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 2/71 (2%)

Query: 50  LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRLSH--QQIHSW 107
           +    F+G +   WL M + YF         +     +H+ G A  W+       +++SW
Sbjct: 122 MDFPRFDGSDVRIWLDMCETYFDMYQITQNFKVSAAVLHMSGNAAQWYHSYKLVNEVNSW 181

Query: 108 EEFKVAFRLYF 118
           ++F++A    F
Sbjct: 182 DQFRMAVATEF 192


>UniRef100_Q6C914 Similarities with tr|Q9XE85 Sorghum bicolor Polyprotein [Yarrowia
           lipolytica]
          Length = 678

 Score = 37.0 bits (84), Expect = 0.87
 Identities = 30/108 (27%), Positives = 47/108 (42%), Gaps = 10/108 (9%)

Query: 32  SPYYCHNVPSYYPQTHSSLQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGG 91
           S Y  H +P +   T  ++ +S+ N +    WL   + +    P P   R  LL   L G
Sbjct: 54  SIYPVHTIPIF---TGDAVLVSQ-NAQLAHDWLIAVENFLCTFPVPHHARPHLLGSRLFG 109

Query: 92  KALTWFRLSHQQ--IHSWEEFKVAFRLYFVYNRPNGFSTTTVKYGFEK 137
            A  W+R S  +  + +W EFK  F  Y+       F++ T  + F K
Sbjct: 110 SAGLWWRQSMAKNILSNWHEFKSNFASYWCPE----FNSQTESHFFHK 153


>UniRef100_UPI00004193CA UPI00004193CA UniRef100 entry
          Length = 1507

 Score = 36.6 bits (83), Expect = 1.1
 Identities = 31/119 (26%), Positives = 52/119 (43%), Gaps = 16/119 (13%)

Query: 119  VYNRPNGFSTTTVKYGFEKSSLVEIQRS------SEQVSPHRVVFSETTL-----EDGAQ 167
            VY+   G + TTV   F +S+   ++R       S   S H  +F+E +      E+   
Sbjct: 1326 VYSSSPGSTETTV---FPRSTTTSVRREEPTTFHSRPASTHTTLFTEDSTTSGLTEESTA 1382

Query: 168  LPFEASSDVHLLPEVVSRVELGSAETEIESHESSTSEDSSLITTPVDSTPYATRSDAIT 226
             P   +S    LP  ++  +LG  E    +H S+ S  ++L  TP  ST    + ++ T
Sbjct: 1383 FPGSPASTQTGLPATLTTADLGLVEASTPTHSSTGSLHTTL--TPASSTSTGLQEESTT 1439



 Score = 34.3 bits (77), Expect = 5.7
 Identities = 28/105 (26%), Positives = 46/105 (43%), Gaps = 12/105 (11%)

Query: 127 STTTVKYGFEKSSLVEIQRSSEQVSPHRVVFSETTL-----EDGAQLPFEASSDVHLLPE 181
           STTT  +G E ++       S   S H  +F+E +      E+    P   +S    LP 
Sbjct: 506 STTTSVHGEEPTTF-----HSRPASTHTTLFTEDSTTSGLTEESTAFPGSPASTQTGLPA 560

Query: 182 VVSRVELGSAETEIESHESSTSEDSSLITTPVDSTPYATRSDAIT 226
            ++  +LG  E    +H S+ S  ++L  TP  ST    + ++ T
Sbjct: 561 TLTTADLGLVEASTPTHSSTGSLHTTL--TPASSTSAGLQEESTT 603


>UniRef100_Q84ZV5 Polyprotein [Glycine max]
          Length = 1552

 Score = 36.6 bits (83), Expect = 1.1
 Identities = 19/71 (26%), Positives = 33/71 (45%), Gaps = 2/71 (2%)

Query: 50  LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFRL--SHQQIHSW 107
           L    F+G+N   W+F A+ +F     P   R  + ++HL    + W+++    +   SW
Sbjct: 100 LDFPRFDGKNVMDWIFKAEQFFDYYATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSSW 159

Query: 108 EEFKVAFRLYF 118
           + F  A  L F
Sbjct: 160 QAFTRALELDF 170


>UniRef100_Q7XEK0 Hypothetical protein [Oryza sativa]
          Length = 1611

 Score = 36.6 bits (83), Expect = 1.1
 Identities = 18/49 (36%), Positives = 25/49 (50%)

Query: 50  LQLSEFNGENPEYWLFMADCYFLQNPYPWETRFQLLAIHLGGKALTWFR 98
           L++S F GE+P  WL   + +F     P +    L   HL G+A  WFR
Sbjct: 232 LEISLFTGEDPVDWLKQCEKFFEITGTPVDQWVNLAVAHLYGRAAKWFR 280


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.318    0.134    0.395 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 600,302,556
Number of Sequences: 2790947
Number of extensions: 24036212
Number of successful extensions: 70104
Number of sequences better than 10.0: 80
Number of HSP's better than 10.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 69
Number of HSP's that attempted gapping in prelim test: 70007
Number of HSP's gapped (non-prelim): 145
length of query: 369
length of database: 848,049,833
effective HSP length: 129
effective length of query: 240
effective length of database: 488,017,670
effective search space: 117124240800
effective search space used: 117124240800
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)


Lotus: description of TM0074.12