KMC003705A_c01

Fasta Sequence
>KMC003705A_C01 KMC003705A_c01
gattcgtcgttcattctctcactgtctcttCTTCTTCGCAGATTCGATTCTCCATCACCC
AGTCTGCGGAACAATCTGGGTTTTAGGGATTTACAGTTACCATCACTATGCTAGGTTCAT
TGCCTATTACTGCCTCTCCCGGCTCCTCCGCTCGATGGAAACCTCGGTCCGCTTCCCGAG
TCTCAGGTGATCGAGGAGAGACCCTCCGATGGGGAGCAGAGACCGGCGACACTAGCAGCA
ACAGTTGCCACACACACTAGAACCATTGGAATAATTCATCCACCTCCGGACATTAGAACC
ATTGTTGATAAAACCTCGCAGTTTGTGGCTAAAAACGGTCCGGAATTCGAGAAGAGGATT
GTTGCGAATAACGCGGGGAATGCCAAGTTTAATTTCCTTCACTGTCCGATCCGTATCATG
CTTATTATCAACATCGCTTGGCTGAATTTCGTGCTCAGAATCAGTCTTCTACCCAGCAAC
CTGGTGACTTGGCTGGAGATTCGGATGTTCCTGAATCAACCCCATCAGCACCAGCCCCTG
ATAGTAATGGTGTAGTAGAAGCAGCAGGAGAAAAGCCTGATATTTCTGCCCAGTCTAGAC
CAGTAAGGAAAGTGCTTGACCCGCCTGAGGCTGAGCAATACACGGTTAGGCTTCgctgaa
ggaataacaggggaagagctggatattataaagcttacagcgctgtggctcgaaatggga
atcttttttgacgg


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003705A_C01 KMC003705A_c01
         (734 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||G86280 protein T5E21.13 [imported] - Arabidopsis thaliana g...   127  1e-46
gb|AAL91182.1| splicing factor, putative [Arabidopsis thaliana]       127  1e-46
ref|NP_172917.1| splicing factor, putative; protein id: At1g1465...   127  1e-46
ref|NP_172916.1| splicing factor, putative; protein id: At1g1464...    94  2e-27
gb|EAA05190.1| ebiP8201 [Anopheles gambiae str. PEST]                  73  3e-18

>pir||G86280 protein T5E21.13 [imported] - Arabidopsis thaliana
            gi|7527720|gb|AAF63169.1|AC010657_5 T5E21.13 [Arabidopsis
            thaliana]
          Length = 1776

 Score =  127 bits (318), Expect(3) = 1e-46
 Identities = 66/97 (68%), Positives = 75/97 (77%), Gaps = 6/97 (6%)
 Frame = +1

Query: 127  LLPLPAPPLDGNLGPLPESQV----IEERPSDGEQRPATLA--ATVATHTRTIGIIHPPP 288
            +LPL APP DG LGPLP SQ+    +EER    EQ  + LA  A VATHTRTIGIIHPPP
Sbjct: 998  ILPLEAPPTDGKLGPLPPSQLTDQEVEERELQAEQNNSNLAPPAAVATHTRTIGIIHPPP 1057

Query: 289  DIRTIVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
            DIRTIV+KT+QFV+KNG EFEKRI+ +N  NAKFNFL
Sbjct: 1058 DIRTIVEKTAQFVSKNGLEFEKRIIVSNEKNAKFNFL 1094

 Score = 94.4 bits (233), Expect(2) = 2e-27
 Identities = 50/96 (52%), Positives = 63/96 (65%), Gaps = 5/96 (5%)
 Frame = +1

Query: 127 LLPLPAPPLDGNLGPLPESQV----IEERPSDGEQRPATLAA-TVATHTRTIGIIHPPPD 291
           +LPL APP DGNLGPLP SQ+    I+E    GEQ  +      VATHT  IGII+PPP+
Sbjct: 154 ILPLEAPPADGNLGPLPPSQLTDEEIKENEFQGEQNNSIQTPIAVATHTNPIGIIYPPPE 213

Query: 292 IRTIVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           IR IV+ T+QFV++NG  F  ++    A NA F+FL
Sbjct: 214 IRKIVETTAQFVSQNGLAFGNKVKTEKANNANFSFL 249

 Score = 70.5 bits (171), Expect(3) = 1e-46
 Identities = 39/87 (44%), Positives = 49/87 (55%)
 Frame = +3

Query: 393  FPSLSDPYHAYYQHRLAEFRAQNQSSTQQPGDLAGDSDVPESTPSAPAPDSNGVVEAAGE 572
            F   SDPYHA+YQH+L E+RAQN+   Q   D  G +D    T +A         EA   
Sbjct: 1093 FLKSSDPYHAFYQHKLTEYRAQNKDGAQGTDDSDGTTDPQLDTGAADES------EAGDT 1146

Query: 573  KPDISAQSRPVRKVLDPPEAEQYTVRL 653
            +PD+ AQ R   K L+ PE E+YTVRL
Sbjct: 1147 QPDLQAQFRIPSKPLEAPEPEKYTVRL 1173

 Score = 50.4 bits (119), Expect(2) = 2e-27
 Identities = 36/115 (31%), Positives = 52/115 (44%), Gaps = 1/115 (0%)
 Frame = +3

Query: 393 FPSLSDPYHAYYQHRLAEFRAQNQSSTQQPGDLAGDSDVPESTPSAPAPDSNGVVEAAGE 572
           F    +PYH +Y++++ E+    +   Q  G    D++ P+    + A            
Sbjct: 248 FLKSDNPYHGFYRYKVTEYSCHIRDGAQ--GTDVDDTEDPKLDDESDA------------ 293

Query: 573 KPDISAQSRPVRKVLDPPEAEQYTVRLR*RNNRGRAGYYK-AYSAVARNGNLF*R 734
           KPD+ AQ R  RK+L+ PE E+YTVRL            K     VARNG  F R
Sbjct: 294 KPDLQAQFRAPRKILEAPEPEKYTVRLPEGIMEAELDIIKHTAQFVARNGQSFLR 348

 Score = 32.3 bits (72), Expect = 7.1
 Identities = 12/33 (36%), Positives = 21/33 (63%)
 Frame = +1

Query: 301 IVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           I+  T+QFVA+NG  F + ++     N++F F+
Sbjct: 331 IIKHTAQFVARNGQSFLRELMRREVNNSQFQFM 363

 Score = 32.3 bits (72), Expect(3) = 1e-46
 Identities = 15/15 (100%), Positives = 15/15 (100%)
 Frame = +1

Query: 658  EGITGEELDIIKLTA 702
            EGITGEELDIIKLTA
Sbjct: 1175 EGITGEELDIIKLTA 1189

>gb|AAL91182.1| splicing factor, putative [Arabidopsis thaliana]
          Length = 785

 Score =  127 bits (318), Expect(3) = 1e-46
 Identities = 66/97 (68%), Positives = 75/97 (77%), Gaps = 6/97 (6%)
 Frame = +1

Query: 127 LLPLPAPPLDGNLGPLPESQV----IEERPSDGEQRPATLA--ATVATHTRTIGIIHPPP 288
           +LPL APP DG LGPLP SQ+    +EER    EQ  + LA  A VATHTRTIGIIHPPP
Sbjct: 7   ILPLEAPPTDGKLGPLPPSQLTDQEVEERELQAEQNNSNLAPPAAVATHTRTIGIIHPPP 66

Query: 289 DIRTIVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           DIRTIV+KT+QFV+KNG EFEKRI+ +N  NAKFNFL
Sbjct: 67  DIRTIVEKTAQFVSKNGLEFEKRIIVSNEKNAKFNFL 103

 Score = 70.5 bits (171), Expect(3) = 1e-46
 Identities = 39/87 (44%), Positives = 49/87 (55%)
 Frame = +3

Query: 393 FPSLSDPYHAYYQHRLAEFRAQNQSSTQQPGDLAGDSDVPESTPSAPAPDSNGVVEAAGE 572
           F   SDPYHA+YQH+L E+RAQN+   Q   D  G +D    T +A         EA   
Sbjct: 102 FLKSSDPYHAFYQHKLTEYRAQNKDGAQGTDDSDGTTDPQLDTGAADES------EAGDT 155

Query: 573 KPDISAQSRPVRKVLDPPEAEQYTVRL 653
           +PD+ AQ R   K L+ PE E+YTVRL
Sbjct: 156 QPDLQAQFRIPSKPLEAPEPEKYTVRL 182

 Score = 32.3 bits (72), Expect(3) = 1e-46
 Identities = 15/15 (100%), Positives = 15/15 (100%)
 Frame = +1

Query: 658 EGITGEELDIIKLTA 702
           EGITGEELDIIKLTA
Sbjct: 184 EGITGEELDIIKLTA 198

>ref|NP_172917.1| splicing factor, putative; protein id: At1g14650.1, supported by
           cDNA: gi_19698892 [Arabidopsis thaliana]
          Length = 785

 Score =  127 bits (318), Expect(3) = 1e-46
 Identities = 66/97 (68%), Positives = 75/97 (77%), Gaps = 6/97 (6%)
 Frame = +1

Query: 127 LLPLPAPPLDGNLGPLPESQV----IEERPSDGEQRPATLA--ATVATHTRTIGIIHPPP 288
           +LPL APP DG LGPLP SQ+    +EER    EQ  + LA  A VATHTRTIGIIHPPP
Sbjct: 7   ILPLEAPPTDGKLGPLPPSQLTDQEVEERELQAEQNNSNLAPPAAVATHTRTIGIIHPPP 66

Query: 289 DIRTIVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           DIRTIV+KT+QFV+KNG EFEKRI+ +N  NAKFNFL
Sbjct: 67  DIRTIVEKTAQFVSKNGLEFEKRIIVSNEKNAKFNFL 103

 Score = 70.5 bits (171), Expect(3) = 1e-46
 Identities = 39/87 (44%), Positives = 49/87 (55%)
 Frame = +3

Query: 393 FPSLSDPYHAYYQHRLAEFRAQNQSSTQQPGDLAGDSDVPESTPSAPAPDSNGVVEAAGE 572
           F   SDPYHA+YQH+L E+RAQN+   Q   D  G +D    T +A         EA   
Sbjct: 102 FLKSSDPYHAFYQHKLTEYRAQNKDGAQGTDDSDGTTDPQLDTGAADES------EAGDT 155

Query: 573 KPDISAQSRPVRKVLDPPEAEQYTVRL 653
           +PD+ AQ R   K L+ PE E+YTVRL
Sbjct: 156 QPDLQAQFRIPSKPLEAPEPEKYTVRL 182

 Score = 32.3 bits (72), Expect(3) = 1e-46
 Identities = 15/15 (100%), Positives = 15/15 (100%)
 Frame = +1

Query: 658 EGITGEELDIIKLTA 702
           EGITGEELDIIKLTA
Sbjct: 184 EGITGEELDIIKLTA 198

>ref|NP_172916.1| splicing factor, putative; protein id: At1g14640.1 [Arabidopsis
           thaliana]
          Length = 735

 Score = 94.4 bits (233), Expect(2) = 2e-27
 Identities = 50/96 (52%), Positives = 63/96 (65%), Gaps = 5/96 (5%)
 Frame = +1

Query: 127 LLPLPAPPLDGNLGPLPESQV----IEERPSDGEQRPATLAA-TVATHTRTIGIIHPPPD 291
           +LPL APP DGNLGPLP SQ+    I+E    GEQ  +      VATHT  IGII+PPP+
Sbjct: 7   ILPLEAPPADGNLGPLPPSQLTDEEIKENEFQGEQNNSIQTPIAVATHTNPIGIIYPPPE 66

Query: 292 IRTIVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           IR IV+ T+QFV++NG  F  ++    A NA F+FL
Sbjct: 67  IRKIVETTAQFVSQNGLAFGNKVKTEKANNANFSFL 102

 Score = 50.4 bits (119), Expect(2) = 2e-27
 Identities = 36/115 (31%), Positives = 52/115 (44%), Gaps = 1/115 (0%)
 Frame = +3

Query: 393 FPSLSDPYHAYYQHRLAEFRAQNQSSTQQPGDLAGDSDVPESTPSAPAPDSNGVVEAAGE 572
           F    +PYH +Y++++ E+    +   Q  G    D++ P+    + A            
Sbjct: 101 FLKSDNPYHGFYRYKVTEYSCHIRDGAQ--GTDVDDTEDPKLDDESDA------------ 146

Query: 573 KPDISAQSRPVRKVLDPPEAEQYTVRLR*RNNRGRAGYYK-AYSAVARNGNLF*R 734
           KPD+ AQ R  RK+L+ PE E+YTVRL            K     VARNG  F R
Sbjct: 147 KPDLQAQFRAPRKILEAPEPEKYTVRLPEGIMEAELDIIKHTAQFVARNGQSFLR 201

 Score = 32.3 bits (72), Expect = 7.1
 Identities = 12/33 (36%), Positives = 21/33 (63%)
 Frame = +1

Query: 301 IVDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           I+  T+QFVA+NG  F + ++     N++F F+
Sbjct: 184 IIKHTAQFVARNGQSFLRELMRREVNNSQFQFM 216

>gb|EAA05190.1| ebiP8201 [Anopheles gambiae str. PEST]
          Length = 675

 Score = 73.2 bits (178), Expect(2) = 3e-18
 Identities = 36/68 (52%), Positives = 46/68 (66%)
 Frame = +1

Query: 196 ERPSDGEQRPATLAATVATHTRTIGIIHPPPDIRTIVDKTSQFVAKNGPEFEKRIVANNA 375
           E+  D E+   TL+  +      +GII+PPP++R IVDKT+ FVA+NGPEFE RI  N  
Sbjct: 2   EKEVDVEKPAPTLSGPI------VGIIYPPPEVRNIVDKTASFVARNGPEFESRIRQNEL 55

Query: 376 GNAKFNFL 399
           GN KFNFL
Sbjct: 56  GNPKFNFL 63

 Score = 40.8 bits (94), Expect(2) = 3e-18
 Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 4/116 (3%)
 Frame = +3

Query: 393 FPSLSDPYHAYYQHRLAEFR-AQNQSSTQQPGDLAGDS-DVPES-TPSAPAPDSNGVVEA 563
           F S  DPYHAYYQH++ E R  +  SS+   G  AG +  +P++  P+A       +++A
Sbjct: 62  FLSPGDPYHAYYQHKVQEIREGRTDSSSGAAGGQAGSAGGLPKAQVPNATQQKQQELLKA 121

Query: 564 AGEKPDISAQSRP-VRKVLDPPEAEQYTVRLR*RNNRGRAGYYKAYSAVARNGNLF 728
             E+  +     P    + DPP      + +                 VARNG LF
Sbjct: 122 VTEQQFVPKDPPPEFEFIADPPSISALDLDI----------VKLTAQFVARNGRLF 167

 Score = 33.1 bits (74), Expect = 4.2
 Identities = 28/92 (30%), Positives = 40/92 (43%), Gaps = 11/92 (11%)
 Frame = +1

Query: 157 GNLGPLPESQVIEERPSDGEQRPATLAATVATHTRT-------IGIIHPPPDIRT----I 303
           G+ G LP++QV    P+  +Q+   L   V                I  PP I      I
Sbjct: 97  GSAGGLPKAQV----PNATQQKQQELLKAVTEQQFVPKDPPPEFEFIADPPSISALDLDI 152

Query: 304 VDKTSQFVAKNGPEFEKRIVANNAGNAKFNFL 399
           V  T+QFVA+NG  F   ++     N +F+FL
Sbjct: 153 VKLTAQFVARNGRLFLTNLMNREQRNCQFDFL 184

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 671,718,885
Number of Sequences: 1393205
Number of extensions: 15993266
Number of successful extensions: 59406
Number of sequences better than 10.0: 109
Number of HSP's better than 10.0 without gapping: 54398
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 59225
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 34906576228
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
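
The figures in the summary above can be cross-checked against each other. The sketch below uses the standard Karlin-Altschul conversion from raw score S to bit score, (lambda * S - ln K) / ln 2, with the gapped Lambda and K, and recomputes the effective database length and search space from the reported effective HSP length. Two points are assumptions rather than anything stated in the report: that the translated query length is the nucleotide length divided by three (rounded down), and that the effective lengths are obtained by subtracting the effective HSP length once per sequence. All names are illustrative.

```python
import math

# Values copied from the statistics block and search summary above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 120
QUERY_NUCLEOTIDES = 734


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to bits using the gapped parameters."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def effective_db_length() -> int:
    """Database letters minus one effective HSP length per database sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def effective_search_space() -> int:
    """Effective query length (in residues) times effective database length."""
    # Assumption: the translated query length is nucleotides // 3.
    query_residues = QUERY_NUCLEOTIDES // 3
    return (query_residues - EFFECTIVE_HSP_LENGTH) * effective_db_length()


# Raw scores appear in parentheses after each bit score in the report.
for raw in (318, 233, 171, 119, 72):
    print(f"raw {raw:>3} -> {bit_score(raw):.1f} bits")

print("effective length of database:", effective_db_length())
print("effective search space:", effective_search_space())
```

Under these assumptions the recomputed database length and search space agree with the reported 281,504,647 and 34906576228, and the bit scores agree with the reported values after rounding.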


EST assemble image


no.  clone         accession  position (start, end)
1    MPDL007f08_f  AV776876     1  588
2    MFBL021e04_f  BP042310    31  581
3    GNf063c09     BP072038   182  735




Lotus japonicus
Kazusa DNA Research Institute