KMC005095A_c01

Fasta Sequence
>KMC005095A_C01 KMC005095A_c01
TAAATGTTTGAAAATCTTTCTATAAATGATTTTAAAGTAGTGATCAACTATGTGGAAGGT
TATGTGTGTAGCTTCCGACTCGAGATTCGAGATTCGAGATTCTATGGAAATAGAAGCTGA
TCTAGATATGCTATATGTATTAGAAGAAAATTAGATACACTTAGGGGGGATGTGAGTGAA
GTGAAAAGTCTATGGTATATGGAAGTTGAATATTTGAGAGTTGAAACAAACTTGGGTCAT
CCATTTGTATGAACAAGAAGAAGATCCAATGTTGTAATTTTTATATAAGACATGTATTGT
ATGTCATGGCTTAAAGCAGTTTAATATTTAATGTTGAACTCTGCTGCAACACTTTCGATT
GTTTGAATGCACTCCACTACACTTGCCTTCTTTGTCATTGATGCCATCAAGGCTTCATGA
CTCTCCTTGTTATTTTTCAAGAGACTAGCAGCAAACATAACTGCCCATCTAGTCAGATTT
TGTTGCTGATCTTTGCTGAGTTGAGGCTTGGTTCTGTTTATAAATCTCTGGAGAGTGAAA
AGATCAGCTGATTGACCAATCACCTTGTCATATATTAAACCTTCTGCTGCCAGTCCAGCC
ATTGACACAACAGCCAATCTATCTATCTCTTTAGCATTAAGCTGGCCACTATAGATAAGC
TTTTCAAGCCTCTGATCAATGAGATTGACATGCTCTTTCCCAATATCCAGCGAATACCCC
AAAATGGGTACACCAAGCAGATAAGCAATCAAAAAGTGTGCTGCTTCATG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005095A_C01 KMC005095A_c01
         (770 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565523.1| expressed protein; protein id: At2g21960.1, sup...   230  2e-59
gb|AAM64802.1| unknown [Arabidopsis thaliana]                         228  7e-59
pir||C84607 hypothetical protein At2g21960 [imported] - Arabidop...   221  9e-57
ref|ZP_00071560.1| hypothetical protein [Trichodesmium erythraeu...    96  4e-19
gb|ZP_00107031.1| hypothetical protein [Nostoc punctiforme]            85  1e-15

>ref|NP_565523.1| expressed protein; protein id: At2g21960.1, supported by cDNA:
           33232., supported by cDNA: gi_14334629 [Arabidopsis
           thaliana] gi|14334630|gb|AAK59493.1| unknown protein
           [Arabidopsis thaliana] gi|20198006|gb|AAD20413.2|
           expressed protein [Arabidopsis thaliana]
           gi|23296622|gb|AAN13134.1| unknown protein [Arabidopsis
           thaliana]
          Length = 332

 Score =  230 bits (586), Expect = 2e-59
 Identities = 111/142 (78%), Positives = 134/142 (94%)
 Frame = -1

Query: 770 HEAAHFLIAYLLGVPILGYSLDIGKEHVNLIDQRLEKLIYSGQLNAKEIDRLAVVSMAGL 591
           HEAAHFL+AYL+G+PILGYSLDIGKEHVNLID+RL KLIYSG+L++KE+DRLA V+MAGL
Sbjct: 191 HEAAHFLVAYLIGLPILGYSLDIGKEHVNLIDERLAKLIYSGKLDSKELDRLAAVAMAGL 250

Query: 590 AAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVMFAASLLKNNKESHEA 411
           AAEGL YDKVIGQSADLF+LQRFINR++P++S +QQQNLTRWAV+++ASLLKNNK  HEA
Sbjct: 251 AAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQNLTRWAVLYSASLLKNNKTIHEA 310

Query: 410 LMASMTKKASVVECIQTIESVA 345
           LMA+M+K ASV+ECIQTIE+ +
Sbjct: 311 LMAAMSKNASVLECIQTIETAS 332

>gb|AAM64802.1| unknown [Arabidopsis thaliana]
          Length = 332

 Score =  228 bits (581), Expect = 7e-59
 Identities = 110/142 (77%), Positives = 133/142 (93%)
 Frame = -1

Query: 770 HEAAHFLIAYLLGVPILGYSLDIGKEHVNLIDQRLEKLIYSGQLNAKEIDRLAVVSMAGL 591
           HEAAHFL+AYL+G+PILGYSLDIGKEHVNLID+RL KLIYSG+L++KE+DRLA V+MAGL
Sbjct: 191 HEAAHFLVAYLIGLPILGYSLDIGKEHVNLIDERLAKLIYSGKLDSKELDRLAAVAMAGL 250

Query: 590 AAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVMFAASLLKNNKESHEA 411
           AAEGL YDKVIGQSADLF+LQRFINR++P++S +QQQNLTRWA +++ASLLKNNK  HEA
Sbjct: 251 AAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQNLTRWAXLYSASLLKNNKTIHEA 310

Query: 410 LMASMTKKASVVECIQTIESVA 345
           LMA+M+K ASV+ECIQTIE+ +
Sbjct: 311 LMAAMSKNASVLECIQTIETAS 332

>pir||C84607 hypothetical protein At2g21960 [imported] - Arabidopsis thaliana
          Length = 344

 Score =  221 bits (563), Expect = 9e-57
 Identities = 111/154 (72%), Positives = 134/154 (86%), Gaps = 12/154 (7%)
 Frame = -1

Query: 770 HEAAHFL------------IAYLLGVPILGYSLDIGKEHVNLIDQRLEKLIYSGQLNAKE 627
           HEAAHFL            +AYL+G+PILGYSLDIGKEHVNLID+RL KLIYSG+L++KE
Sbjct: 191 HEAAHFLGTLEKFKSFDSKVAYLIGLPILGYSLDIGKEHVNLIDERLAKLIYSGKLDSKE 250

Query: 626 IDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVMFAA 447
           +DRLA V+MAGLAAEGL YDKVIGQSADLF+LQRFINR++P++S +QQQNLTRWAV+++A
Sbjct: 251 LDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQNLTRWAVLYSA 310

Query: 446 SLLKNNKESHEALMASMTKKASVVECIQTIESVA 345
           SLLKNNK  HEALMA+M+K ASV+ECIQTIE+ +
Sbjct: 311 SLLKNNKTIHEALMAAMSKNASVLECIQTIETAS 344

>ref|ZP_00071560.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 229

 Score = 96.3 bits (238), Expect = 4e-19
 Identities = 56/149 (37%), Positives = 90/149 (59%), Gaps = 10/149 (6%)
 Frame = -1

Query: 770 HEAAHFLIAYLLGVPILGYSLDIGKEH---------VNLIDQRLEKLIYSGQLNAKEIDR 618
           HEA HFL+AYLL +PI GY+L+  +           V   DQ+L   +YSG ++++ +DR
Sbjct: 77  HEAGHFLVAYLLEIPISGYALNAWEAFRQGQSSQGGVRFDDQKLAAQLYSGVISSQLVDR 136

Query: 617 LAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTK-PQLSKDQQQNLTRWAVMFAASL 441
              V MAG+AAE L+Y    G + D   +   + + K P  SK +Q     WA + A +L
Sbjct: 137 YCTVWMAGIAAENLVYGNAEGGAEDRTKITAILRQLKRPGESKLKQS----WASLQARNL 192

Query: 440 LKNNKESHEALMASMTKKASVVECIQTIE 354
           L+N++ +++AL+ +MT+++SV +C QTI+
Sbjct: 193 LENHQSAYKALVKAMTERSSVSDCYQTIK 221

>gb|ZP_00107031.1| hypothetical protein [Nostoc punctiforme]
          Length = 225

 Score = 85.1 bits (209), Expect = 1e-15
 Identities = 53/143 (37%), Positives = 80/143 (55%), Gaps = 9/143 (6%)
 Frame = -1

Query: 770 HEAAHFLIAYLLGVPILGYSLDI---------GKEHVNLIDQRLEKLIYSGQLNAKEIDR 618
           HEA HFL+AYLLG+P+ GY+L           G+  V+  D  L   +  G+++A+ +DR
Sbjct: 77  HEAGHFLVAYLLGIPVTGYTLSAWEAWKQGQPGQGGVSFDDGELASQLEVGKISAQMLDR 136

Query: 617 LAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVMFAASLL 438
              V MAG+AAE L++D   G S D   L   +  T    S+   Q   R+  + A +LL
Sbjct: 137 YCTVWMAGIAAETLVFDNAEGGSDDKSKLIGVL--TVLGFSESVYQQKLRFHALQAKTLL 194

Query: 437 KNNKESHEALMASMTKKASVVEC 369
           + N  S+EAL+ +M ++ASV +C
Sbjct: 195 QENWSSYEALVNAMRQRASVEDC 217

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 598,616,749
Number of Sequences: 1393205
Number of extensions: 11771007
Number of successful extensions: 26726
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 25992
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26689
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37815044670
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
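
The bit scores and E-values in the report can be approximately recomputed from the raw scores (the numbers in parentheses after each bit score) using the standard Karlin-Altschul relations, bits = (lambda * S - ln K) / ln 2 and E = K * N * exp(-lambda * S). The sketch below assumes that the gapped Lambda and K printed above apply and that N is the "effective search space used"; because the report prints these parameters to only a few digits, the results agree with the report only to within rounding. The function names are illustrative.

```python
import math

# Gapped Karlin-Altschul parameters and search space, copied from the report.
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 37815044670


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return K * SEARCH_SPACE * math.exp(-LAMBDA * raw_score)


# Raw scores of the five reported hits.
for raw in (586, 581, 563, 238, 209):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.1e}")
```

For the top hit (raw score 586) this gives roughly 230 bits and an E-value on the order of 2e-59, in line with the reported values.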


EST assemble image


   clone        accession  position
1  MWM039c05_f  AV765284     1  314
2  MPD096e02_f  AV776290   174  604
3  MPD013a05_f  AV770855   176  442
4  MFB003b10_f  BP034092   218  783
5  MWM044f06_f  AV765367   272  543
6  SPD099g04_f  BP051925   276  677
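
The position columns can be used to check how the six ESTs tile the assembly. The following sketch merges the intervals and computes the maximum read depth. It assumes the two position numbers are 1-based, inclusive start and end coordinates in the assembly, which the record does not state explicitly; `merge_intervals` and `max_depth` are illustrative helper names.

```python
# EST clone -> (start, end), copied from the table above.
ESTS = {
    "MWM039c05_f": (1, 314),
    "MPD096e02_f": (174, 604),
    "MPD013a05_f": (176, 442),
    "MFB003b10_f": (218, 783),
    "MWM044f06_f": (272, 543),
    "SPD099g04_f": (276, 677),
}


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


def max_depth(intervals):
    """Largest number of intervals covering any single position."""
    events = []
    for start, end in intervals:
        events.append((start, 1))     # coverage begins at start
        events.append((end + 1, -1))  # and stops after the inclusive end
    depth = best = 0
    for _, delta in sorted(events):
        depth += delta
        best = max(best, depth)
    return best


print(merge_intervals(ESTS.values()))  # [(1, 783)]
print(max_depth(ESTS.values()))        # 6
```

Under that assumption the ESTs form a single contiguous block with no gaps, and all six overlap in the region 276-314.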




Lotus japonicus
Kazusa DNA Research Institute