Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0279a.6
         (1582 letters)

Database: LJGI 
           28,460 sequences; 14,692,800 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela...    89  7e-18
BP063694                                                               77  2e-14
AV780009                                                               71  1e-12
AV766088                                                               67  2e-11
TC8302                                                                 62  7e-10
BP033602                                                               62  7e-10
AV766665                                                               58  1e-08
TC9039                                                                 55  7e-08
TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela...    47  2e-05
BP063902                                                               44  3e-04
BP075943                                                               39  0.005
TC17418 similar to UP|Q94AX2 (Q94AX2) AT5g39790/MKM21_80, partia...    38  0.011
TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR...    35  0.093
BP058111                                                               35  0.12
BI418821                                                               34  0.16
AV421607                                                               34  0.21
TC9337 similar to UP|PME_PRUPE (Q43062) Pectinesterase PPE8B pre...    34  0.21
TC18909                                                                34  0.21
TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, ...    33  0.27
AV773507                                                               33  0.27

>TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
            polyprotein from transposon TNT 1-94 [Contains: Protease
            ; Reverse transcriptase ; Endonuclease] , partial (9%)
          Length = 747

 Score = 88.6 bits (218), Expect = 7e-18
 Identities = 41/82 (50%), Positives = 59/82 (71%)
 Frame = +2

Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
            A KILGM+IHR++  +++WLSQ +Y++ VL RF+M   NP+STPL  ++KLS    P + 
Sbjct: 203  ANKILGMQIHRDRKDRRIWLSQKNYLQKVLRRFNMQDCNPISTPLPVNYKLSSSMIPSSE 382

Query: 1360 SEIEGMSKISHASAVGCLMYVM 1381
            +E   MS++ +ASAVG LMY M
Sbjct: 383  AERMEMSRVPYASAVGSLMYAM 448


>BP063694 
          Length = 511

 Score = 77.4 bits (189), Expect = 2e-14
 Identities = 42/118 (35%), Positives = 61/118 (51%), Gaps = 1/118 (0%)
 Frame = -2

Query: 417 KLWHMRLGHLSERGMMELYKRNLLKGVRSCTIGLC-KYCVLGKQCRVRFKTGQHKTKGIL 475
           +LWH RLGH++   + +L+K   L         +C   C + K  R+ F     +   +L
Sbjct: 363 ELWHSRLGHVNFDIIKQLHKHGCLDVSSILPKPICCTSCQMAKSKRLVFHDNNKRASAVL 184

Query: 476 DYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVFAKFKLWKAEVENQ 533
           D +H D+ GP+   S+ GF YFV F DDFSR  W Y LK KS+       +K  +EN+
Sbjct: 183 DLIHCDLRGPSPVASIDGFSYFVIFVDDFSRFTWFYPLKRKSDFSDVLLRFKVFMENR 10


>AV780009 
          Length = 529

 Score = 71.2 bits (173), Expect = 1e-12
 Identities = 42/100 (42%), Positives = 59/100 (59%), Gaps = 1/100 (1%)
 Frame = -1

Query: 1408 EAVKWILRYLKGTADRGIMFSREQGVVPLVV-GYVDSDYAGDLDDRRSTTGYVFTLAGGP 1466
            +A   +LRY+KG   +G+ FS +    PL +  Y DSD+AG  D RRS TGY   L    
Sbjct: 529  QAATRVLRYVKGAPAQGLFFSADS---PLKLQAYSDSDWAGCPDTRRSVTGYSIFLGTSL 359

Query: 1467 ICWKSSVQSIVAMSTTEAEYMAVAEAVKEALWLTGLVKKL 1506
            I W++  Q+ V+ S++EAEY A+A  V E  WL+ L + L
Sbjct: 358  ISWRTKKQTTVSRSSSEAEYRALAATVCEVQWLSYLFQFL 239


>AV766088 
          Length = 501

 Score = 67.0 bits (162), Expect = 2e-11
 Identities = 37/106 (34%), Positives = 53/106 (49%)
 Frame = -3

Query: 415 ATKLWHMRLGHLSERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGI 474
           A  LWH RLGH S   +  L     +         +C+ CV GK  R+ F +  + T   
Sbjct: 316 AQSLWHSRLGHPSSSALRYLRSNKFISYELLNYSPVCESCVFGKHVRLPFVSSNNVTVMP 137

Query: 475 LDYVHSDVWGPTKEPSVGGFRYFVTFTDDFSRKVWVYFLKYKSEVF 520
            D +HSD+W  +   S  G R++V F DDF+  +W + L  KS+VF
Sbjct: 136 FDILHSDLW-TSPVLSSAGHRFYVLFLDDFTDFLWTFPLSNKSQVF 2


>TC8302 
          Length = 494

 Score = 62.0 bits (149), Expect = 7e-10
 Identities = 42/145 (28%), Positives = 68/145 (45%), Gaps = 2/145 (1%)
 Frame = +3

Query: 102 QRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSLPGSYDHLVTTLT 161
           +RLY+L+M E   +  H+   N + A L+    T+ + ++  +LL SLP SYD LV  + 
Sbjct: 3   KRLYTLRMSESTSMSDHIDHMNTLFAQLSASNFTIGENERAELLLESLPDSYDQLVINIK 182

Query: 162 YGK--DSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKK 219
                + + L  +S  LL+   +R + E+   SS               K     +  + 
Sbjct: 183 NNNIVNHLPLMMLSEVLLEEDSQR*NKEDR*DSS---------------KPMEALTMTRG 317

Query: 220 RSKSKDRKTAECYSCKQIGHWKRDC 244
           RSKS+ +   +CY C Q  H K+DC
Sbjct: 318 RSKSRKKINLKCYHCGQR*HLKKDC 392


>BP033602 
          Length = 533

 Score = 62.0 bits (149), Expect = 7e-10
 Identities = 35/90 (38%), Positives = 50/90 (54%)
 Frame = +3

Query: 414 DATKLWHMRLGHLSERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKG 473
           D   LWH RLGH + + +  L+  +L K V +C+   C+ CVL K  R  + +  +    
Sbjct: 264 DQIMLWHNRLGHPNFQYLRHLFP-DLFKNV-NCSSLECESCVLAKNQRAPYYSQPYHASR 437

Query: 474 ILDYVHSDVWGPTKEPSVGGFRYFVTFTDD 503
               +HSDVWGP+K  +  G R+FVTF DD
Sbjct: 438 PFYLIHSDVWGPSKITTQFGKRWFVTFIDD 527


>AV766665 
          Length = 601

 Score = 58.2 bits (139), Expect = 1e-08
 Identities = 34/90 (37%), Positives = 49/90 (53%), Gaps = 4/90 (4%)
 Frame = +3

Query: 1403 GKQHWEAVKW----ILRYLKGTADRGIMFSREQGVVPLVVGYVDSDYAGDLDDRRSTTGY 1458
            GK  W ++K     I  YLK  + RG +F +E      + G+  +DY G + DR ST GY
Sbjct: 333  GKNRWSSLKTFTTRIF*YLKANSRRGPLFQKEGK--SSMDGFTYADYLGSIVDRLSTMGY 506

Query: 1459 VFTLAGGPICWKSSVQSIVAMSTTEAEYMA 1488
               L+G  + W+S  Q+I+A S+ EAE  A
Sbjct: 507  YMFLSGNLVTWRSKQQNIIARSSGEAELRA 596


>TC9039 
          Length = 1218

 Score = 55.5 bits (132), Expect = 7e-08
 Identities = 37/149 (24%), Positives = 67/149 (44%), Gaps = 6/149 (4%)
 Frame = +2

Query: 102 QRLYSLKMQEGGDLQAHVYAFNNILADLTRLGVTVDDEDKTIILLCSLPGSYDHLVTTLT 161
           Q L S+K Q  G+++ ++   +NI + L  L + + D+    ++L SLP  +     +  
Sbjct: 467 QNLISMKYQGKGNIREYIMGMSNIASKLKALKLELSDDLLIHLVLLSLPAQFSQFKISYN 646

Query: 162 YGKDSITLDSISSTLLQHAQRRRSVEEGGGSSGEGLFVKGGQDRGRGK------GKAVDS 215
             K+  +L+ + S  +Q  +R +   +         FV   +D+G+ K       +A D+
Sbjct: 647 CPKEKWSLNELISFCVQEEERLKQERKESAH-----FVSTSKDKGKRKKTVEPKNEAADA 811

Query: 216 GKKKRSKSKDRKTAECYSCKQIGHWKRDC 244
              K+ K  D     CY C   GH K+ C
Sbjct: 812 PAPKKQKEDDT----CYFCNVSGHMKKKC 886



 Score = 36.6 bits (83), Expect = 0.032
 Identities = 18/55 (32%), Positives = 29/55 (52%)
 Frame = +3

Query: 311  YVYLGDDKPCIIKGM*QVKIALDDGGVRTLSQVRYVPEVTKNLISLGTLHENGYS 365
            Y+Y+GD K   ++ +   ++ L  G    L     VP   +NLIS+  L ++GYS
Sbjct: 957  YIYVGDGKTVEVEAIGHFRLLLCTGFYLDLKDTFVVPSFRRNLISVSNLDKSGYS 1121


>TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
            polyprotein from transposon TNT 1-94 [Contains: Protease
            ; Reverse transcriptase ; Endonuclease] , partial (6%)
          Length = 498

 Score = 47.4 bits (111), Expect = 2e-05
 Identities = 20/44 (45%), Positives = 28/44 (63%)
 Frame = +3

Query: 1461 TLAGGPICWKSSVQSIVAMSTTEAEYMAVAEAVKEALWLTGLVK 1504
            T AGG + W S +Q  VA+ST EAE++A  EA  E LW+   ++
Sbjct: 9    TFAGGAVAWPSRLQKCVALSTAEAEFIAATEACHELLWMKNFLQ 140


>BP063902 
          Length = 494

 Score = 43.5 bits (101), Expect = 3e-04
 Identities = 29/95 (30%), Positives = 45/95 (46%)
 Frame = -3

Query: 372 RDILRVSKGAMTVMRAKRTAGNIYKLLGSTVMGDVASVETDDDATKLWHMRLGHLSERGM 431
           R ILR  K  M ++R   + G++Y +  S+    +AS         +WH RLGH +   +
Sbjct: 294 RPILRHLKTGMPLLRCN-SLGDLYPVTRSSPFAGLAS--------SVWHHRLGHPASSAL 142

Query: 432 MELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKT 466
             L    L+    S +  +C  CVLGK  R+ F +
Sbjct: 141 NHLRNNKLIFCEPSRSSSVCDSCVLGKHVRLPFSS 37


>BP075943 
          Length = 547

 Score = 39.3 bits (90), Expect = 0.005
 Identities = 38/145 (26%), Positives = 64/145 (43%), Gaps = 2/145 (1%)
 Frame = -3

Query: 1300 AKKILGMEIHREKGAKKLWLSQNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTA 1359
            AK  LG+EI R      + L+Q  Y   ++S        P      +H   S      T 
Sbjct: 413  AKYFLGLEIARSTSG--IVLNQRKYALQLISDSGHFGFQP---RFYSHGTNSQTLGTNTG 249

Query: 1360 SEIEGMSKISHASAVGCLMYVMVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKG 1419
            + +  +   S+   VG L+Y+   TRPD+  A +Q+ +F+S P   H + +   L+YL G
Sbjct: 248  TPLTDIG--SYRRIVGRLLYLNT-TRPDITFAVNQLSQFLSAPTDIHEQQLTGFLKYL*G 78

Query: 1420 TADRGIMFSREQGV--VPLVVGYVD 1442
            +   G+ +        +P ++G  D
Sbjct: 77   SPGSGLFYPASSSTL*LPSLIGGPD 3


>TC17418 similar to UP|Q94AX2 (Q94AX2) AT5g39790/MKM21_80, partial (20%)
          Length = 739

 Score = 38.1 bits (87), Expect = 0.011
 Identities = 40/154 (25%), Positives = 66/154 (41%), Gaps = 1/154 (0%)
 Frame = -1

Query: 1321 QNSYVEGVLSRFDMSKANPVSTPLANHFKLSLEQCPKTASEIEGMSKISHASAVGCLMYV 1380
            Q  YV+ +L +F M+ +  +ST +       +E   +   +   +    + S +G + Y+
Sbjct: 739  QKIYVDDILKKFKMTNSKYISTTIGGK---EIEAGRRNGGK--RVDSTYYKSLIGSVRYL 575

Query: 1381 MVCTRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYLKGTADRGIMFSREQGVVPLVVGY 1440
                R D+        +FM +P   H +  +  LRY+KGT   GI          LV+  
Sbjct: 574  NT-VRSDIVCGVGLRSRFM-EP*DCH*QGAQRSLRYIKGTLKDGIFM--------LVITM 425

Query: 1441 VDSDYAGDLDDRRSTT-GYVFTLAGGPICWKSSV 1473
             +S     LD +  T  GY F L  G I W S++
Sbjct: 424  *NS-----LDTQIVTRPGYAFHLGRGAISWSSTI 338


>TC18003 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
           RSZp22 [validated] -                Arabidopsis thaliana
           {Arabidopsis thaliana;}, partial (42%)
          Length = 587

 Score = 35.0 bits (79), Expect = 0.093
 Identities = 24/70 (34%), Positives = 31/70 (44%)
 Frame = +1

Query: 186 VEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCP 245
           VE    S G G    GG+  GRG+G             +D K   CY C + GH+ R+C 
Sbjct: 88  VELSHNSKGGG----GGRGGGRGRG------------GEDLK---CYECGEPGHFARECR 210

Query: 246 NRSGKSGNSS 255
           +R G  G  S
Sbjct: 211 SRGGSRGLGS 240


>BP058111 
          Length = 570

 Score = 34.7 bits (78), Expect = 0.12
 Identities = 19/63 (30%), Positives = 29/63 (45%)
 Frame = -1

Query: 418 LWHMRLGHLSERGMMELYKRNLLKGVRSCTIGLCKYCVLGKQCRVRFKTGQHKTKGILDY 477
           LWH+RLGH S + +  L K       +  T   C  C + KQ ++ F      +  I D 
Sbjct: 186 LWHLRLGHTSSKKLAILQKNFPFITCKKIT-SPCDTCHMAKQKKLSFPNSVTLSSEIFDL 10

Query: 478 VHS 480
           +H+
Sbjct: 9   IHT 1


>BI418821 
          Length = 614

 Score = 34.3 bits (77), Expect = 0.16
 Identities = 22/74 (29%), Positives = 27/74 (35%), Gaps = 6/74 (8%)
 Frame = +2

Query: 189 GGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKT------AECYSCKQIGHWKR 242
           GG   GE     GG   G G     D+G   R   +          A CY+C   GH  R
Sbjct: 332 GGWRGGERRNGGGGGGGGGGCYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLAR 511

Query: 243 DCPNRSGKSGNSSS 256
           DC   +  SG   +
Sbjct: 512 DCNRSNNNSGGGGA 553



 Score = 30.8 bits (68), Expect = 1.8
 Identities = 17/59 (28%), Positives = 21/59 (34%)
 Frame = +2

Query: 200 KGGQDRGRGKGKAVDSGKKKRSKSKDRKTAECYSCKQIGHWKRDCPNRSGKSGNSSSAA 258
           +GG+ R  G G     G              CY+C   GH  RDC   +   G    AA
Sbjct: 341 RGGERRNGGGGGGGGGG--------------CYNCGDTGHLARDCHRSNNNGGGGGGAA 475



 Score = 30.0 bits (66), Expect = 3.0
 Identities = 25/81 (30%), Positives = 31/81 (37%), Gaps = 21/81 (25%)
 Frame = +2

Query: 189 GGGSSGEGLFVKG-----------GQDRGRGKGKAV-----DSGKKKRSKSKDRKT---- 228
           GGG  G G +  G             + G G G A      D+G   R  ++        
Sbjct: 368 GGGGGGGGCYNCGDTGHLARDCHRSNNNGGGGGGAACYNCGDAGHLARDCNRSNNNSGGG 547

Query: 229 -AECYSCKQIGHWKRDCPNRS 248
            A CY+C   GH  RDC NRS
Sbjct: 548 GAGCYNCGDTGHLARDC-NRS 607


>AV421607 
          Length = 245

 Score = 33.9 bits (76), Expect = 0.21
 Identities = 11/21 (52%), Positives = 14/21 (66%)
 Frame = +3

Query: 231 CYSCKQIGHWKRDCPNRSGKS 251
           CY C + GHW RDCP+ +  S
Sbjct: 54  CYKCGRPGHWSRDCPSSAPNS 116


>TC9337 similar to UP|PME_PRUPE (Q43062) Pectinesterase PPE8B precursor
            (Pectin methylesterase) (PE) , partial (54%)
          Length = 1145

 Score = 33.9 bits (76), Expect = 0.21
 Identities = 14/34 (41%), Positives = 20/34 (58%)
 Frame = +3

Query: 450  LCKYCVLGKQCRVRFKTGQHKTKGILDYVHSDVW 483
            +C  CVLGK  R+ F + +  T    D +HSD+W
Sbjct: 1026 VCDSCVLGKHVRLPFSSSETITLRPFDILHSDLW 1127


>TC18909 
          Length = 621

 Score = 33.9 bits (76), Expect = 0.21
 Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 1/87 (1%)
 Frame = +2

Query: 1384 TRPDLAQAASQVCKFMSKPGKQHWEAVKWILRYL-KGTADRGIMFSREQGVVPLVVGYVD 1442
            T P ++ + ++VC F+S+  ++H  A K IL    K     G               + D
Sbjct: 326  TSPQISFSVNKVC*FLSESLEEHGTAGKCILNVT*KAL*TMGFFIQTN------FPAFSD 487

Query: 1443 SDYAGDLDDRRSTTGYVFTLAGGPICW 1469
            +D+A ++DDRRST   +    G    W
Sbjct: 488  ADWASNVDDRRSTFWEIVFTLGPKFRW 568


>TC17853 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, partial
           (58%)
          Length = 881

 Score = 33.5 bits (75), Expect = 0.27
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
 Frame = +2

Query: 201 GGQDRGRGKGKAVDSGKKKRSK----SKDRKTAECYSCKQIGHWKRDCPN 246
           G + R RG+ +  D G + RS+    S+     EC+ C + GH+ R+CP+
Sbjct: 320 GDRYRERGRDRD-DRGDRDRSRGYGGSRGSNGGECFKCGKPGHFARECPS 466


>AV773507 
          Length = 496

 Score = 33.5 bits (75), Expect = 0.27
 Identities = 19/62 (30%), Positives = 28/62 (44%), Gaps = 2/62 (3%)
 Frame = +2

Query: 201 GGQDRGRG-KGKAVDSGKKKRSKSKDRKTAE-CYSCKQIGHWKRDCPNRSGKSGNSSSAA 258
           GG D  +G +G    SG +    + DR   + C+ C + GH  RDCP   G  G      
Sbjct: 254 GGDDADQGYRGGGYSSGGRGSYGAGDRVGQDDCFECGRPGHRARDCPLAGGGGGRGRGGG 433

Query: 259 NV 260
           ++
Sbjct: 434 SL 439


  Database: LJGI
    Posted date:  Jul 30, 2004 11:16 AM
  Number of letters in database: 14,692,800
  Number of sequences in database:  28,460
  
Lambda     K      H
   0.341    0.149    0.500 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 29,589,527
Number of Sequences: 28460
Number of extensions: 452495
Number of successful extensions: 3903
Number of sequences better than 10.0: 57
Number of HSP's better than 10.0 without gapping: 3704
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3841
length of query: 1582
length of database: 4,897,600
effective HSP length: 102
effective length of query: 1480
effective length of database: 1,994,680
effective search space: 2952126400
effective search space used: 2952126400
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (22.0 bits)
S2: 62 (28.5 bits)


Lotus: description of TM0279a.6