KMC000744A_c01

Fasta Sequence
>KMC000744A_C01 KMC000744A_c01
gCCAAAAGAAAATAGAGTAATAGACAAATTGCTATGATAAAATCATTCTACTACAATATT
GTTTACACAAAAGACCTTTGGTGACTCAAACAAACAGTCACTTAAATTACATGTTTCAAC
CGTTAGAAGTCATAAAAATACTGTTATACCAGCTTTCTCAATTGCAGCCTCCAACTAGGG
GGAATATCCTAAGGAGTAGGACTGGAAAAAAATTAAATATCCACCAAATGCATAAATGTA
CAAAATTCATTATGTATTGTAATTGCGCCATCTGTTTGGTACCTTCAACTGCTGGAAGGT
ATTTTGTGATACATATAGAAACCAGAGTATACACTAACTGCTCATTTCTCCCAAAATAAA
AAAATTGCAGCAATCATTCCTTATGCTGAGTATAGTAGAGTTGAATTATGTAATTTTGGT
ATCTTTGGATTCTACAATATGATTTATATGCCAAAAGCTGAAATATTGGTTGATGTAAAG
GTAAAGAAATCATATCATCATTTCATTATGGAAAAGTTCCAGTCATTCGGATATGGTCAA
GGGTCCTTCTAAGTCCATACTTGAGCACAGCCTGGTTACCCATTCTAAATCCATCTCCAT
AAGCTGATGAGCTATCTGTCTCAATCTGGTGTTTAAAAAACTCACCAAGTTTCTTCTCAA
AGTCGGACCGTTTGCCCTTCTTTGATGATGACCAGGAGGAAGGTGATGATGAGGATGGAT
TGATAGAGGAAGAAGAAAGAGGACGATGCAGGATCAGTTCCAGAGACAGAATATATTTCT
GAATCTAGCCACATATGAGCCAAGACTTTGGAAATTCCTTCTTC
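
The record above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of any Kazusa or NCBI tooling: `read_fasta` and `gc_fraction` are hypothetical helper names, and the demo text holds only the first sequence line of the record, not the whole contig.

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping record ID -> uppercase sequence."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The ID is the first whitespace-delimited token after '>'.
            header = line[1:].split()
            current = header[0] if header else ""
            records[current] = []
        elif current is not None:
            # Sequence lines are concatenated; case is normalised to upper.
            records[current].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def gc_fraction(seq):
    """Fraction of G and C letters in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# Demo input: header plus the FIRST sequence line only (60 letters).
DEMO = """>KMC000744A_C01 KMC000744A_c01
gCCAAAAGAAAATAGAGTAATAGACAAATTGCTATGATAAAATCATTCTACTACAATATT
"""

demo_records = read_fasta(DEMO)
demo_seq = demo_records["KMC000744A_C01"]
print(len(demo_seq), round(gc_fraction(demo_seq), 3))
```

Run on the full record instead of the one-line demo, `len()` should give 824, the query length that BLASTX reports below.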


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000744A_C01 KMC000744A_c01
         (824 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||C85435 hypothetical protein AT4g36860 [imported] - Arabidop...   121  2e-31
ref|NP_195404.2| putative protein; protein id: At4g36860.1, supp...   121  2e-31
gb|AAL32677.1| Unknown protein [Arabidopsis thaliana] gi|2138714...   119  7e-31
gb|AAL31691.1|AC092390_12 unknown protein, 5' partial [Oryza sat...   105  2e-26
ref|NP_181513.1| hypothetical protein; protein id: At2g39830.1 [...    98  6e-23

>pir||C85435 hypothetical protein AT4g36860 [imported] - Arabidopsis thaliana
           gi|4006886|emb|CAB16816.1| putative protein [Arabidopsis
           thaliana] gi|7270635|emb|CAB80352.1| putative protein
           [Arabidopsis thaliana]
          Length = 542

 Score =  121 bits (303), Expect(2) = 2e-31
 Identities = 60/72 (83%), Positives = 65/72 (89%)
 Frame = -2

Query: 724 INPSSSSPSSWSSSKKGKRSDFEKKLGEFFKHQIETDSSSAYGDGFRMGNQAVLKYGLRR 545
           I  SSSS    +SSKKG+RSDFEKKLGEFFKHQIE+DSSSAYGDGFR GNQAVLK+GLRR
Sbjct: 468 IASSSSSAVVSASSKKGERSDFEKKLGEFFKHQIESDSSSAYGDGFRQGNQAVLKHGLRR 527

Query: 544 TLDHIRMTGTFP 509
           TLDHIR+TGTFP
Sbjct: 528 TLDHIRLTGTFP 539

 Score = 37.0 bits (84), Expect(2) = 2e-31
 Identities = 20/30 (66%), Positives = 23/30 (76%), Gaps = 1/30 (3%)
 Frame = -1

Query: 824 EEGISKVLAHMWLDSEIYSVSG-TDPASSS 738
           EEGI +VLAHMWL+SE Y+ S   D ASSS
Sbjct: 443 EEGICQVLAHMWLESETYAGSTLVDIASSS 472

>ref|NP_195404.2| putative protein; protein id: At4g36860.1, supported by cDNA:
           gi_17065045 [Arabidopsis thaliana]
          Length = 351

 Score =  121 bits (303), Expect(2) = 2e-31
 Identities = 60/72 (83%), Positives = 65/72 (89%)
 Frame = -2

Query: 724 INPSSSSPSSWSSSKKGKRSDFEKKLGEFFKHQIETDSSSAYGDGFRMGNQAVLKYGLRR 545
           I  SSSS    +SSKKG+RSDFEKKLGEFFKHQIE+DSSSAYGDGFR GNQAVLK+GLRR
Sbjct: 277 IASSSSSAVVSASSKKGERSDFEKKLGEFFKHQIESDSSSAYGDGFRQGNQAVLKHGLRR 336

Query: 544 TLDHIRMTGTFP 509
           TLDHIR+TGTFP
Sbjct: 337 TLDHIRLTGTFP 348

 Score = 37.0 bits (84), Expect(2) = 2e-31
 Identities = 20/30 (66%), Positives = 23/30 (76%), Gaps = 1/30 (3%)
 Frame = -1

Query: 824 EEGISKVLAHMWLDSEIYSVSG-TDPASSS 738
           EEGI +VLAHMWL+SE Y+ S   D ASSS
Sbjct: 252 EEGICQVLAHMWLESETYAGSTLVDIASSS 281

>gb|AAL32677.1| Unknown protein [Arabidopsis thaliana] gi|21387149|gb|AAM47978.1|
           unknown protein [Arabidopsis thaliana]
          Length = 553

 Score =  119 bits (299), Expect(2) = 7e-31
 Identities = 59/72 (81%), Positives = 65/72 (89%)
 Frame = -2

Query: 724 INPSSSSPSSWSSSKKGKRSDFEKKLGEFFKHQIETDSSSAYGDGFRMGNQAVLKYGLRR 545
           I  SSSS    +SSKKG+RSDFE+KLGEFFKHQIE+DSSSAYGDGFR GNQAVLK+GLRR
Sbjct: 482 IASSSSSAVVSASSKKGERSDFEEKLGEFFKHQIESDSSSAYGDGFRQGNQAVLKHGLRR 541

Query: 544 TLDHIRMTGTFP 509
           TLDHIR+TGTFP
Sbjct: 542 TLDHIRLTGTFP 553

 Score = 37.0 bits (84), Expect(2) = 7e-31
 Identities = 20/30 (66%), Positives = 23/30 (76%), Gaps = 1/30 (3%)
 Frame = -1

Query: 824 EEGISKVLAHMWLDSEIYSVSG-TDPASSS 738
           EEGI +VLAHMWL+SE Y+ S   D ASSS
Sbjct: 457 EEGICQVLAHMWLESETYAGSTLVDIASSS 486

>gb|AAL31691.1|AC092390_12 unknown protein, 5' partial [Oryza sativa]
          Length = 223

 Score =  105 bits (263), Expect(2) = 2e-26
 Identities = 51/77 (66%), Positives = 63/77 (81%)
 Frame = -2

Query: 739 LSSSSINPSSSSPSSWSSSKKGKRSDFEKKLGEFFKHQIETDSSSAYGDGFRMGNQAVLK 560
           ++S + + SSSS SS  SSKKG ++DFEKKLGEFFKHQIETD S  YGDGFR G +AV +
Sbjct: 146 IASIAASSSSSSSSSAPSSKKGVQTDFEKKLGEFFKHQIETDPSDVYGDGFRDGIKAVER 205

Query: 559 YGLRRTLDHIRMTGTFP 509
           YGLR+TLDH+++TG FP
Sbjct: 206 YGLRKTLDHMKLTGVFP 222

 Score = 35.8 bits (81), Expect(2) = 2e-26
 Identities = 18/27 (66%), Positives = 21/27 (77%)
 Frame = -1

Query: 824 EEGISKVLAHMWLDSEIYSVSGTDPAS 744
           EEGI +VLAHMWL+SEI S S +  AS
Sbjct: 122 EEGICQVLAHMWLESEITSGSSSIIAS 148

>ref|NP_181513.1| hypothetical protein; protein id: At2g39830.1 [Arabidopsis
           thaliana] gi|7487693|pir||T01013 hypothetical protein
           At2g39830 [imported] - Arabidopsis thaliana
           gi|2642165|gb|AAB87132.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 434

 Score = 98.2 bits (243), Expect(2) = 6e-23
 Identities = 51/76 (67%), Positives = 58/76 (76%)
 Frame = -2

Query: 736 SSSSINPSSSSPSSWSSSKKGKRSDFEKKLGEFFKHQIETDSSSAYGDGFRMGNQAVLKY 557
           S+SS+  SSSS  S   +KKG +S+ EKKLGEFFKHQI  D+S AYG GFR  N A  KY
Sbjct: 361 STSSVATSSSSSFS---NKKGGKSNVEKKLGEFFKHQIAHDASPAYGGGFRAANAAACKY 417

Query: 556 GLRRTLDHIRMTGTFP 509
           GLRRTLDHIR+TGTFP
Sbjct: 418 GLRRTLDHIRLTGTFP 433

 Score = 32.0 bits (71), Expect(2) = 6e-23
 Identities = 16/31 (51%), Positives = 23/31 (73%), Gaps = 2/31 (6%)
 Frame = -1

Query: 824 EEGISKVLAHMWLDSEIYSVSGTD--PASSS 738
           EEGI +VL++MWL+SE+ S   T   P++SS
Sbjct: 334 EEGICQVLSYMWLESEVLSDPSTRNLPSTSS 364
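
The negative frames and descending query coordinates in the alignments above mean the hits lie on the reverse strand of the query. The sketch below shows one way to reproduce a frame number and a translated segment. It assumes the usual BLASTX convention (frame -1 starts at the last base of the query) and the standard genetic code. The function names are hypothetical, and `SEGMENT_509_544` is query positions 509 to 544 copied by hand from the FASTA record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse-complement a DNA string (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate from the first base; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def blast_frame(start, end, query_length):
    """Frame implied by 1-based HSP query coordinates (assumed convention)."""
    if start <= end:
        return (start - 1) % 3 + 1
    # Reverse strand: count the offset from the 3' end of the query.
    return -((query_length - start) % 3 + 1)


# Forward-strand query positions 509..544, copied from the FASTA record.
SEGMENT_509_544 = "TGGAAAAGTTCCAGTCATTCGGATATGGTCAAGGGT"

peptide = translate(reverse_complement(SEGMENT_509_544))
print(blast_frame(544, 509, 824), peptide)
```

The peptide can be compared with the `Query: 544 ... 509` line of the first hit, and `blast_frame(824, 738, 824)` with the `Frame = -1` segments.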

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 725,625,630
Number of Sequences: 1393205
Number of extensions: 16534304
Number of successful extensions: 82191
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 61084
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 75458
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 42857050626
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
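
Several of the figures in the statistics block above can be rederived from one another. The sketch below uses the standard Karlin-Altschul bit-score conversion, S' = (lambda * S - ln K) / ln 2, with the gapped lambda and K printed in the report, and recomputes the effective database length and search space. Two points are assumptions rather than documented behaviour: that the translated query length is the nucleotide length divided by 3 (rounded down), and that bit scores of 100 or more are printed truncated rather than rounded. Both are consistent with the numbers in this report. The function names are hypothetical.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report (rounded values).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters, db_sequences, hsp_length):
    """Database letters minus one effective HSP length per sequence."""
    return db_letters - db_sequences * hsp_length


def effective_search_space(query_nt, hsp_length, eff_db_length):
    """Effective query length (in residues) times effective database length.

    Assumption: for a translated search the query length in residues is
    taken as query_nt // 3.
    """
    return (query_nt // 3 - hsp_length) * eff_db_length


eff_db = effective_db_length(448_689_247, 1_393_205, 121)
space = effective_search_space(824, 121, eff_db)

# Raw scores are the values in parentheses on the "Score =" lines.
for raw in (303, 299, 263, 243, 84):
    print(raw, round(bit_score(raw), 1))
print(eff_db, space)
```

With these inputs the last line reproduces the report's "effective length of database" and "effective search space used" entries exactly. The bit scores agree with the "Score =" lines to within the rounding of lambda and K.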


EST assembly

No.  Clone        Accession  Start  End
1    MRL018b06_f  BP084622       1  353
2    MRL019e03_f  BP084692       2  468
3    GENLf051d04  BP065069      35  542
4    GENLf033g04  BP064094     387  827
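
The clone table above gives the start and end position of each EST on the contig. As a rough illustration of how those positions tile the contig, the sketch below merges the intervals and reports the maximum number of overlapping ESTs. It assumes the last two columns are 1-based inclusive start and end coordinates. The helper names are hypothetical.

```python
def parse_est_rows(text):
    """Return (clone, accession, start, end) tuples from whitespace-separated rows."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Data rows have five fields and begin with a row number.
        if len(fields) == 5 and fields[0].isdigit():
            rows.append((fields[1], fields[2], int(fields[3]), int(fields[4])))
    return rows


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def max_depth(intervals):
    """Largest number of intervals covering any single position."""
    events = []
    for start, end in intervals:
        events.append((start, 1))      # interval opens at start
        events.append((end + 1, -1))   # and closes just after end
    depth = best = 0
    for _, delta in sorted(events):
        depth += delta
        best = max(best, depth)
    return best


EST_TABLE = """clone accession position
1 MRL018b06_f BP084622 1 353
2 MRL019e03_f BP084692 2 468
3 GENLf051d04 BP065069 35 542
4 GENLf033g04 BP064094 387 827
"""

est_rows = parse_est_rows(EST_TABLE)
spans = [(start, end) for _, _, start, end in est_rows]
print(merge_intervals(spans), max_depth(spans))
```

The four ESTs merge into a single span ending at position 827, whereas the FASTA record and the BLASTX query are 824 letters long. The record does not explain the difference.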




Lotus japonicus
Kazusa DNA Research Institute