Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0592.7
         (116 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BG644690 weakly similar to GP|18542179|gb putative pol protein {...    98  5e-22
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi...    89  3e-19
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported...    82  4e-17
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia...    64  7e-12
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid...    47  4e-11
BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse ...    60  1e-10
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ...    48  5e-07
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2...    31  0.061
BQ143821                                                               27  0.88
TC82700 similar to GP|23617236|dbj|BAC20903. hypothetical protei...    25  5.7
BE320454                                                               25  5.7
TC86822 weakly similar to GP|18086336|gb|AAL57631.1 At1g78060/F2...    24  7.5
TC87401 similar to PIR|B84587|B84587 probable glutaredoxin [impo...    24  7.5
BG589079                                                               24  9.8
TC86679 similar to GP|17065046|gb|AAL32677.1 Unknown protein {Ar...    24  9.8
TC90643 weakly similar to GP|19386797|dbj|BAB86176. hypothetical...    24  9.8
TC81375 weakly similar to GP|21537367|gb|AAM61708.1 unknown {Ara...    24  9.8
TC88072 similar to PIR|T06580|T06580 subtilisin-like proteinase ...    24  9.8

>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
           partial (22%)
          Length = 629

 Score = 97.8 bits (242), Expect = 5e-22
 Identities = 47/108 (43%), Positives = 73/108 (67%)
 Frame = -2

Query: 6   FTQKEGIGYFDTYAPVARITTIRVLLALASLYKFVIHQMDVKTAFLNGELDEEVYMKQPE 65
           + QKEGI Y + ++PVAR+  IR+L+A A+   F ++QMDVK+AF+NG+L EEV++KQP 
Sbjct: 388 YNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFINGDLKEEVFVKQPP 209

Query: 66  GFVIKGQVQKVCKLTKSLYELKQTPKKKWHQKFDQVVLAHEYKINEFD 113
           GF        V +L K+LY LKQ P + W+++  + +L + +K  + D
Sbjct: 208 GFEDAEVPNHVFRLNKTLYGLKQAP-RAWYERLSKFLLKNGFKRGKID 68


>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
           partial (9%)
          Length = 675

 Score = 88.6 bits (218), Expect = 3e-19
 Identities = 45/91 (49%), Positives = 61/91 (66%)
 Frame = +1

Query: 6   FTQKEGIGYFDTYAPVARITTIRVLLALASLYKFVIHQMDVKTAFLNGELDEEVYMKQPE 65
           F+Q  G  Y +T++PV +  TIR++L +A  YK+ I Q+D+  AFLNG L EEVYM QP+
Sbjct: 268 FSQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAFLNGFLQEEVYMSQPQ 447

Query: 66  GFVIKGQVQKVCKLTKSLYELKQTPKKKWHQ 96
           GF    +   VCKL KSLY LKQ P + W++
Sbjct: 448 GFEAANK-SLVCKLNKSLYGLKQAP-RAWYE 534


>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
           Arabidopsis thaliana, partial (17%)
          Length = 618

 Score = 81.6 bits (200), Expect = 4e-17
 Identities = 44/108 (40%), Positives = 66/108 (60%)
 Frame = -1

Query: 6   FTQKEGIGYFDTYAPVARITTIRVLLALASLYKFVIHQMDVKTAFLNGELDEEVYMKQPE 65
           FT   G  Y +T+APVA++ TIR++L+LA    + + QMDVK AFL GEL++EVYM  P 
Sbjct: 351 FTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVKNAFLQGELEDEVYMYPPP 172

Query: 66  GFVIKGQVQKVCKLTKSLYELKQTPKKKWHQKFDQVVLAHEYKINEFD 113
           G     +   V +L K++Y LKQ+P + W+ K    +    ++ +E D
Sbjct: 171 GLEHLVKRGNVLRLKKAIYGLKQSP-RAWYNKLSTTLNGRGFRKSELD 31


>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
           partial (7%)
          Length = 780

 Score = 64.3 bits (155), Expect = 7e-12
 Identities = 34/76 (44%), Positives = 51/76 (66%), Gaps = 1/76 (1%)
 Frame = -2

Query: 18  YAPVARITTIRVLLALASLYKFVIHQMDVKTAFLNGELDEEVYMKQPEGFVIKGQVQK-V 76
           + P+ ++ TI  LL++ ++    +  +DVKTAFL G+L E++YM QPEGF    +V K V
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGF--S*EVGKMV 381

Query: 77  CKLTKSLYELKQTPKK 92
            KL KS+Y LKQ P++
Sbjct: 380 GKLKKSMYGLKQGPRQ 333


>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
           thaliana}, partial (14%)
          Length = 778

 Score = 46.6 bits (109), Expect(2) = 4e-11
 Identities = 20/35 (57%), Positives = 29/35 (82%)
 Frame = +2

Query: 41  IHQMDVKTAFLNGELDEEVYMKQPEGFVIKGQVQK 75
           ++Q+DVK+AFL GEL+EEV++ QP+G+V KG   K
Sbjct: 359 VYQLDVKSAFLYGELNEEVFVDQPQGYVKKGDKLK 463



 Score = 35.0 bits (79), Expect(2) = 4e-11
 Identities = 15/33 (45%), Positives = 25/33 (75%)
 Frame = +3

Query: 6   FTQKEGIGYFDTYAPVARITTIRVLLALASLYK 38
           ++Q+ G+ Y + +APVAR  TIR+++ALA+  K
Sbjct: 249 YSQQYGVDYTEVFAPVARWDTIRMVIALAAQIK 347


>BG587174 similar to PIR|A47759|A4775 retrovirus-related reverse
           transcriptase homolog - rape retrotransposon copia-like
           (fragment), partial (84%)
          Length = 249

 Score = 60.1 bits (144), Expect = 1e-10
 Identities = 26/46 (56%), Positives = 37/46 (79%)
 Frame = -1

Query: 54  ELDEEVYMKQPEGFVIKGQVQKVCKLTKSLYELKQTPKKKWHQKFD 99
           EL+E++YM QPEGF+  G+   VCKL KSLY LKQ+P ++W+++FD
Sbjct: 249 ELEEKIYMTQPEGFLFPGKEDHVCKLRKSLYGLKQSP-RQWYKRFD 115


>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana, partial
           (7%)
          Length = 763

 Score = 48.1 bits (113), Expect = 5e-07
 Identities = 24/48 (50%), Positives = 34/48 (70%)
 Frame = +2

Query: 6   FTQKEGIGYFDTYAPVARITTIRVLLALASLYKFVIHQMDVKTAFLNG 53
           + +++GI + + +APV RI TI +LLALA+     IH +DVK AFLNG
Sbjct: 179 YVKQQGIDFDEVFAPVVRIETI*LLLALAATNGC*IHHIDVKIAFLNG 322


>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
           [imported] - Arabidopsis thaliana, partial (10%)
          Length = 744

 Score = 31.2 bits (69), Expect = 0.061
 Identities = 13/33 (39%), Positives = 22/33 (66%)
 Frame = +2

Query: 75  KVCKLTKSLYELKQTPKKKWHQKFDQVVLAHEY 107
           KVC+L KS+Y LKQ   ++W+ K  + +++  Y
Sbjct: 17  KVCELQKSIYGLKQA-SRQWYSKLSESLISFGY 112


>BQ143821 
          Length = 810

 Score = 27.3 bits (59), Expect = 0.88
 Identities = 12/35 (34%), Positives = 18/35 (51%)
 Frame = +3

Query: 76  VCKLTKSLYELKQTPKKKWHQKFDQVVLAHEYKIN 110
           VC   K +Y L   P K W Q    ++L ++ K+N
Sbjct: 180 VCVRLKHIYILHDQPTKCWCQDLSHMML*YQIKVN 284


>TC82700 similar to GP|23617236|dbj|BAC20903. hypothetical protein~similar
           to Arabidopsis thaliana chromosome 4  At4g13970, partial
           (39%)
          Length = 1066

 Score = 24.6 bits (52), Expect = 5.7
 Identities = 9/21 (42%), Positives = 13/21 (61%)
 Frame = +2

Query: 94  WHQKFDQVVLAHEYKINEFDK 114
           WH K D+V L    ++N+F K
Sbjct: 134 WHDKLDRVALIPFARVNDFVK 196


>BE320454 
          Length = 338

 Score = 24.6 bits (52), Expect = 5.7
 Identities = 14/47 (29%), Positives = 22/47 (46%)
 Frame = -1

Query: 13  GYFDTYAPVARITTIRVLLALASLYKFVIHQMDVKTAFLNGELDEEV 59
           G F+ ++  A  TT  + L   S +    +Q     AFLN  +DE +
Sbjct: 275 GSFEPHSISASFTTCSLSLVWFSFFAKAPYQTSNPAAFLNKPIDESM 135


>TC86822 weakly similar to GP|18086336|gb|AAL57631.1 At1g78060/F28K19_32
           {Arabidopsis thaliana}, partial (8%)
          Length = 704

 Score = 24.3 bits (51), Expect = 7.5
 Identities = 14/34 (41%), Positives = 19/34 (55%)
 Frame = +1

Query: 28  RVLLALASLYKFVIHQMDVKTAFLNGELDEEVYM 61
           RVL    SL K  I ++ +K    NG+LDE + M
Sbjct: 457 RVLN*YHSLMKCSIEKLSIK*RITNGKLDELIAM 558


>TC87401 similar to PIR|B84587|B84587 probable glutaredoxin [imported] -
           Arabidopsis thaliana, partial (63%)
          Length = 914

 Score = 24.3 bits (51), Expect = 7.5
 Identities = 14/38 (36%), Positives = 21/38 (54%)
 Frame = +2

Query: 53  GELDEEVYMKQPEGFVIKGQVQKVCKLTKSLYELKQTP 90
           GEL    Y  QPEG +I+  + ++C L     +L+ TP
Sbjct: 590 GELRPCQYYSQPEGNLIRLPICRICWL-----KLESTP 688


>BG589079 
          Length = 385

 Score = 23.9 bits (50), Expect = 9.8
 Identities = 11/33 (33%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
 Frame = -1

Query: 65  EGFVIKGQVQKVCKLTKSLYELKQTPKKK-WHQ 96
           +GF++     K+C L+K L  +K  P+++ W Q
Sbjct: 292 KGFLVARVN*KICMLSKHLLSMKCIPEER*WKQ 194


>TC86679 similar to GP|17065046|gb|AAL32677.1 Unknown protein {Arabidopsis
           thaliana}, partial (48%)
          Length = 1445

 Score = 23.9 bits (50), Expect = 9.8
 Identities = 12/36 (33%), Positives = 20/36 (55%)
 Frame = -2

Query: 72  QVQKVCKLTKSLYELKQTPKKKWHQKFDQVVLAHEY 107
           Q+Q + +   SLY+L   P K+  Q  DQ +L+  +
Sbjct: 433 QIQNLLQ*LFSLYDLPLKPLKERSQLHDQFLLSQNH 326


>TC90643 weakly similar to GP|19386797|dbj|BAB86176. hypothetical
           protein~similar to Arabidopsis thaliana protein
           T1F15.13, partial (21%)
          Length = 771

 Score = 23.9 bits (50), Expect = 9.8
 Identities = 9/25 (36%), Positives = 18/25 (72%), Gaps = 1/25 (4%)
 Frame = +3

Query: 85  ELKQTPKKKWHQKFDQVV-LAHEYK 108
           +L++ PK+KW + + QV  ++H Y+
Sbjct: 90  QLRKVPKQKWTEMWRQVKNISHHYE 164


>TC81375 weakly similar to GP|21537367|gb|AAM61708.1 unknown {Arabidopsis
           thaliana}, partial (30%)
          Length = 797

 Score = 23.9 bits (50), Expect = 9.8
 Identities = 8/23 (34%), Positives = 13/23 (55%)
 Frame = -2

Query: 83  LYELKQTPKKKWHQKFDQVVLAH 105
           ++++ Q     WHQ  DQV+  H
Sbjct: 298 VHQMLQLSCNHWHQNLDQVIPLH 230


>TC88072 similar to PIR|T06580|T06580 subtilisin-like proteinase (EC
           3.4.21.-) p69f - tomato, partial (27%)
          Length = 1097

 Score = 23.9 bits (50), Expect = 9.8
 Identities = 11/27 (40%), Positives = 15/27 (54%), Gaps = 4/27 (14%)
 Frame = +1

Query: 93  KWH----QKFDQVVLAHEYKINEFDKC 115
           KW+    QKF   ++A  YKI+  D C
Sbjct: 313 KWYCCIDQKFSP*LVASRYKISNHDNC 393


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.328    0.143    0.426 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,099,692
Number of Sequences: 36976
Number of extensions: 33600
Number of successful extensions: 212
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 210
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 210
length of query: 116
length of database: 9,014,727
effective HSP length: 92
effective length of query: 24
effective length of database: 5,612,935
effective search space: 134710440
effective search space used: 134710440
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 50 (23.9 bits)


Lotus: description of TM0592.7