KMC000937A_c01

Fasta Sequence
>KMC000937A_C01 KMC000937A_c01
cagcgaaatgctccataatttcgtataaattctttcaatctcaataatggtgcaatatga
ggatggcctggctgactataAAATGATTGACATTGAATAGCTCTTTCAACTGCATCATAG
GCAAATCAATTTCCAAGCTACCATCCCTCCACCGACGAGCTGGTGTGGAGCCTTCCTCAG
GACCCAGATTAAAAGGGGGATGGTAAGGAACAATCTCTCCACTTCTATTCTTTGCCATCA
ATTCCTGAGCTTCAAAAAGGCCAGGAAAGGCGCAAGAAGCGGTGACTGCACTCCATATAA
CCACATGGGGTGAGGTCAAGTAATTAAGACATCTCGGAGGTTCATGCTTCCTTGGGGAGC
AAACTGTAATCCCAAGAACTCTACCTGTCATGTCATAAGCTTCTTGAAATGTAAGGTTGT
TTGTCAGATGCCTCAACATAATTTGCAACTGTCTGATCTCATGAACAGCACCACGTGTTG
CGACCCTCTTCACAACTGTGAAAATCCCTCCCATTTGATCGAAAAATTGCATTGAGTGCC
ATGAGTCTTCAAAGAAACTCTGAAGCTCAGGCCAAGACCTAGTAGCAACAACAGCACACA
TTATGGATCCCACACTTGAACCAGCAATTACCCTAGGCAAGAGTTTATGTTCTACCAGTG
TTTTAACCACGCCTACATGAGAAGCTCCAAGAGAAGCACCCCCACTTAACAGCAAAGCTG
TCCTCCCAAATGCATGTCTAGTTTCATGCATGAAAGCAAGCTTTTCTTCCAGTAATAGCT
CCTGTGAATCAGAGTCACAGACcatcctcaattgggttgacacttcatcaatgtactcct
tgattaacctggggacttgaagcctacccttgtgcagttcag


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000937A_C01 KMC000937A_c01
         (882 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_196024.1| expressed protein; protein id: At5g04040.1 [Ara...   491  e-141
ref|NP_191273.1| expressed protein; protein id: At3g57140.1 [Ara...   493  e-141
dbj|BAB61223.1| contains EST AU057376(S21389)~similar to Arabido...   460  e-132
gb|EAA29514.1| hypothetical protein [Neurospora crassa]               194  1e-48
emb|CAD60564.1| unnamed protein product [Podospora anserina]          191  1e-47

>ref|NP_196024.1| expressed protein; protein id: At5g04040.1 [Arabidopsis thaliana]
           gi|11282314|pir||T48431 hypothetical protein F8F6.250 -
           Arabidopsis thaliana gi|7406414|emb|CAB85524.1| putative
           protein [Arabidopsis thaliana]
           gi|22531263|gb|AAM97135.1| putative protein [Arabidopsis
           thaliana]
          Length = 825

 Score =  491 bits (1264), Expect(2) = e-141
 Identities = 240/274 (87%), Positives = 262/274 (95%), Gaps = 2/274 (0%)
 Frame = -3

Query: 880 ELHKGRLQVPRLIKEYIDEVSTQLRMVCDSDSQELLLEEKLAFMHETRHAFGRTALLLSG 701
           ELHKGRLQVPR IKEYIDEVSTQLRMVC+SDS+EL LEEKL+FMHETRHAFGRTALLLSG
Sbjct: 177 ELHKGRLQVPRHIKEYIDEVSTQLRMVCNSDSEELSLEEKLSFMHETRHAFGRTALLLSG 236

Query: 700 GASLGASHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFEDSWHSMQFF 521
           GASLGA HVGVV+TLVEHKLLPR+IAGSSVGSI+CAVVA+RSWPELQSFFE+S HS+QFF
Sbjct: 237 GASLGAFHVGVVRTLVEHKLLPRIIAGSSVGSIICAVVASRSWPELQSFFENSLHSLQFF 296

Query: 520 DQMGGIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHE 341
           DQ+GG+F++VKRV T+GA+H+IRQLQ MLR+LT+NLTFQEAYDMTGR+LGITVCSPRKHE
Sbjct: 297 DQLGGVFSIVKRVMTQGALHDIRQLQCMLRNLTSNLTFQEAYDMTGRILGITVCSPRKHE 356

Query: 340 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKNRSGEIVPYHPPFNLGPEEG--S 167
           PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAK+RSGEIVPYHPPFNL PE G  S
Sbjct: 357 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKDRSGEIVPYHPPFNLDPEVGTKS 416

Query: 166 TPARRWRDGSLEIDLPMMQLKELFNVNHFIVSQA 65
           +  RRWRDGSLE+DLPMMQLKELFNVNHFIVSQA
Sbjct: 417 SSGRRWRDGSLEVDLPMMQLKELFNVNHFIVSQA 450

 Score = 33.9 bits (76), Expect(2) = e-141
 Identities = 14/24 (58%), Positives = 17/24 (70%)
 Frame = -2

Query: 83  FYSQPGHPHIAPLLRLKEFIRNYG 12
           F     +PHIAPLLRLK+ +R YG
Sbjct: 445 FIVSQANPHIAPLLRLKDLVRAYG 468

>ref|NP_191273.1| expressed protein; protein id: At3g57140.1 [Arabidopsis thaliana]
           gi|11282313|pir||T47774 hypothetical protein F24I3.220 -
           Arabidopsis thaliana gi|6911884|emb|CAB72184.1| putative
           protein [Arabidopsis thaliana]
           gi|26450904|dbj|BAC42559.1| unknown protein [Arabidopsis
           thaliana] gi|29029050|gb|AAO64904.1| At3g57140
           [Arabidopsis thaliana]
          Length = 801

 Score =  493 bits (1270), Expect(2) = e-141
 Identities = 239/272 (87%), Positives = 256/272 (93%)
 Frame = -3

Query: 880 ELHKGRLQVPRLIKEYIDEVSTQLRMVCDSDSQELLLEEKLAFMHETRHAFGRTALLLSG 701
           ELHKGRL VPRLIKEYIDEVSTQLRMVCD D++EL LEEKL+FMHETRHA+GRTALLLSG
Sbjct: 178 ELHKGRLHVPRLIKEYIDEVSTQLRMVCDMDTEELSLEEKLSFMHETRHAYGRTALLLSG 237

Query: 700 GASLGASHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFEDSWHSMQFF 521
           GASLGA H+GVVKTLVEHKLLPR+IAGSSVGS+MCAVV TRSWPELQSFFE SWH++QFF
Sbjct: 238 GASLGAFHLGVVKTLVEHKLLPRIIAGSSVGSVMCAVVGTRSWPELQSFFEGSWHALQFF 297

Query: 520 DQMGGIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHE 341
           DQMGGIFT VKRV T+GAVHEIR LQ  LR+LTNNLTFQEAYD+TGR+LGITVCS RKHE
Sbjct: 298 DQMGGIFTTVKRVMTQGAVHEIRHLQWKLRNLTNNLTFQEAYDITGRILGITVCSLRKHE 357

Query: 340 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKNRSGEIVPYHPPFNLGPEEGSTP 161
           PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAK+R+GEIVPYHPPFNL PEEGS  
Sbjct: 358 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKDRTGEIVPYHPPFNLDPEEGSAS 417

Query: 160 ARRWRDGSLEIDLPMMQLKELFNVNHFIVSQA 65
            RRWRDGSLE+DLPM+QLKELFNVNHFIVSQA
Sbjct: 418 VRRWRDGSLEMDLPMIQLKELFNVNHFIVSQA 449

 Score = 31.6 bits (70), Expect(2) = e-141
 Identities = 13/24 (54%), Positives = 16/24 (66%)
 Frame = -2

Query: 83  FYSQPGHPHIAPLLRLKEFIRNYG 12
           F     +PHIAP LR+KEF+R  G
Sbjct: 444 FIVSQANPHIAPFLRMKEFVRACG 467

>dbj|BAB61223.1| contains EST AU057376(S21389)~similar to Arabidopsis thaliana
           chromosome 5, F8F6.250~unknown protein [Oryza sativa
           (japonica cultivar-group)] gi|20804680|dbj|BAB92368.1|
           P0512C01.22 [Oryza sativa (japonica cultivar-group)]
          Length = 1044

 Score =  460 bits (1183), Expect(2) = e-132
 Identities = 228/274 (83%), Positives = 252/274 (91%), Gaps = 2/274 (0%)
 Frame = -3

Query: 880 ELHKGRLQVPRLIKEYIDEVSTQLRMVCDSDSQELLLEEKLAFMHETRHAFGRTALLLSG 701
           ELHKGRLQVP+LIKEYI+EVSTQL+MVC+SDS +L LEEKLAFMHETRHAFGRTALLLSG
Sbjct: 179 ELHKGRLQVPKLIKEYIEEVSTQLKMVCNSDSDDLPLEEKLAFMHETRHAFGRTALLLSG 238

Query: 700 GASLGASHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFEDSWHSMQFF 521
           GASLG  HVGVVKTLVEHKLLPR+I+GSSVGSIMC++VATRSWPEL+SFFE+ WHS++FF
Sbjct: 239 GASLGCFHVGVVKTLVEHKLLPRIISGSSVGSIMCSIVATRSWPELESFFEE-WHSLKFF 297

Query: 520 DQMGGIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHE 341
           DQMGGIF VVKR+ T GAVH+IR LQ +LR+LT+NLTFQEAYDMTGR+L +TVCSPRKHE
Sbjct: 298 DQMGGIFPVVKRILTHGAVHDIRHLQTLLRNLTSNLTFQEAYDMTGRILVVTVCSPRKHE 357

Query: 340 PPRCLNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKNRSGEIVPYHPPFNLGPEE--GS 167
           PPRCLNYLTSPHV+IWSAVTASCAFPGLFEAQELMAK+R GE VP+H PF LG EE  G+
Sbjct: 358 PPRCLNYLTSPHVLIWSAVTASCAFPGLFEAQELMAKDRFGETVPFHAPFLLGLEERVGA 417

Query: 166 TPARRWRDGSLEIDLPMMQLKELFNVNHFIVSQA 65
           T  RRWRDGSLE DLPM QLKELFNVNHFIVSQA
Sbjct: 418 T-TRRWRDGSLESDLPMKQLKELFNVNHFIVSQA 450

 Score = 35.4 bits (80), Expect(2) = e-132
 Identities = 16/24 (66%), Positives = 17/24 (70%)
 Frame = -2

Query: 83  FYSQPGHPHIAPLLRLKEFIRNYG 12
           F     +PHIAPLLRLKE IR YG
Sbjct: 445 FIVSQANPHIAPLLRLKEIIRAYG 468

>gb|EAA29514.1| hypothetical protein [Neurospora crassa]
          Length = 802

 Score =  194 bits (493), Expect = 1e-48
 Identities = 113/268 (42%), Positives = 168/268 (62%), Gaps = 7/268 (2%)
 Frame = -3

Query: 850 RLIKEYIDEVSTQLRMVCDSDSQELLLE----EKLAFMHETRHAFGRTALLLSGGASLGA 683
           +LI++Y+D     +  + D  +Q L  +    + L  M   R +FGR+ALLLSGGA+ G 
Sbjct: 182 KLIEDYVDSAVKTIGALMDQSTQTLPADMETKDLLEGMLFARQSFGRSALLLSGGATFGM 241

Query: 682 SHVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFED-SWHSMQFFDQMG- 509
           SH+GV+K+L E  LLPR+I+G+S GSI+C+V+ TR   E+        +  +  F     
Sbjct: 242 SHIGVIKSLFEANLLPRIISGASAGSIVCSVLCTRKDEEVPDLIRTFPYGDLDVFKGPND 301

Query: 508 GIFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHEPPRC 329
           GI   ++R+ T+G+  +I  L  ++R +  +LTFQEAY+ T R+  I V +   +E PR 
Sbjct: 302 GISDSLRRLLTQGSWADITNLTRVMRSMLGDLTFQEAYNRTRRICNICVSTASIYELPRL 361

Query: 328 LNYLTSPHVVIWSAVTASCAFPGLFEAQELMAKN-RSGEIVPYHPPFNLGPEEGSTPARR 152
           LNY+T+P+V+IWSAV ASC+ P +F+A  L+ K+  +G  VP++P          TP +R
Sbjct: 362 LNYITAPNVMIWSAVAASCSVPLVFQAAPLLVKDPATGAHVPWNP----------TP-QR 410

Query: 151 WRDGSLEIDLPMMQLKELFNVNHFIVSQ 68
           W DGS++ DLPM +L E+FNVNHFIVSQ
Sbjct: 411 WIDGSVDNDLPMTRLAEMFNVNHFIVSQ 438

>emb|CAD60564.1| unnamed protein product [Podospora anserina]
          Length = 824

 Score =  191 bits (486), Expect = 1e-47
 Identities = 111/267 (41%), Positives = 165/267 (61%), Gaps = 7/267 (2%)
 Frame = -3

Query: 847 LIKEYIDEVSTQLRMVCDSDSQELLL----EEKLAFMHETRHAFGRTALLLSGGASLGAS 680
           LI+ Y+D     +  + +  +  +      ++ L  M   R +FGR+ALLLSGGA+ G S
Sbjct: 180 LIERYVDSAVKTIEALVEKSAYSIPAGMETQDLLEGMLYARQSFGRSALLLSGGATFGMS 239

Query: 679 HVGVVKTLVEHKLLPRVIAGSSVGSIMCAVVATRSWPELQSFFED-SWHSMQFFD-QMGG 506
           H+GV+K L E KLLPR+I+G+S GSI+CAV+ TR   E+ +  E   +  +  F+ +  G
Sbjct: 240 HIGVLKALYESKLLPRIISGASAGSIVCAVLCTRKDEEIPALVEAFPYGDLGVFEGEKDG 299

Query: 505 IFTVVKRVATRGAVHEIRQLQIMLRHLTNNLTFQEAYDMTGRVLGITVCSPRKHEPPRCL 326
           +   ++R+ T G   +I  L  ++R    ++TFQEAY+ T R+  I V S   +E PR L
Sbjct: 300 LSDHIRRLLTEGCWADISNLTRVMRSWLGDVTFQEAYNRTRRICNICVSSASIYELPRLL 359

Query: 325 NYLTSPHVVIWSAVTASCAFPGLFEAQELMAKN-RSGEIVPYHPPFNLGPEEGSTPARRW 149
           NY+T+P+V+IWSAV ASC+ P +F+A  L+ K+  +G  VP++P          TP + W
Sbjct: 360 NYITAPNVMIWSAVAASCSVPLVFQAASLLVKDPATGAHVPWNP----------TP-QHW 408

Query: 148 RDGSLEIDLPMMQLKELFNVNHFIVSQ 68
            DGS++ DLPM +L E+FNVNHFIVSQ
Sbjct: 409 IDGSVDNDLPMTRLAEMFNVNHFIVSQ 435
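
The "Frame = -3" and "Frame = -2" lines in the alignments above mean that the query was translated from its reverse complement. The following is a minimal sketch in plain Python, assuming the standard genetic code; the function names are illustrative and are not part of BLAST. It translates the last 60 letters of the query (positions 823-882) in frame -3, which should reproduce the start of the first Query line, beginning at position 880.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3; unknown codons become 'X'."""
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Positions 823-882 of the query, copied from the FASTA record above.
# Negative frames are counted from the 3' end, so this tail is enough to
# reproduce the beginning of the frame -3 translation.
QUERY_TAIL = (
    "cttcatcaatgtactcct"
    "tgattaacctggggacttgaagcctacccttgtgcagttcag"
)

print(translate_frame(QUERY_TAIL, -3))
```

Frame -3 skips the last two letters of the query (positions 881 and 882), which is why the first aligned codon ends at position 880.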

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 778,501,928
Number of Sequences: 1393205
Number of extensions: 17022800
Number of successful extensions: 42290
Number of sequences better than 10.0: 159
Number of HSP's better than 10.0 without gapping: 40504
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42229
length of database: 448,689,247
effective HSP length: 122
effective length of database: 278,718,237
effective search space used: 47660818527
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
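
The statistics above can be cross-checked against the scores reported in the alignments. The sketch below assumes the usual Karlin-Altschul conversion from raw score S to bit score, (Lambda * S - ln K) / ln 2, and assumes that the effective database length is the database length minus one effective HSP length per sequence. The constant and function names are illustrative. Because Lambda and K are printed in rounded form, the results agree with the report only to within about one bit.

```python
import math

# Values copied from the statistics block above.
UNGAPPED_LAMBDA = 0.318
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 122
SEARCH_SPACE = 47_660_818_527


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value to bits (no K term is involved)."""
    return lam * raw / math.log(2)


def effective_db_length(letters=DB_LETTERS, sequences=DB_SEQUENCES,
                        hsp_length=EFFECTIVE_HSP_LENGTH):
    """Database length after removing one HSP length per sequence."""
    return letters - sequences * hsp_length


for raw in (1264, 1270, 1183, 493, 486, 76):
    print(raw, round(bit_score(raw), 1))

print("X1 bits:", round(dropoff_bits(16, UNGAPPED_LAMBDA), 1))
print("X2 bits:", round(dropoff_bits(38, GAPPED_LAMBDA), 1))
print("effective database length:", effective_db_length())
print("implied effective query length:", SEARCH_SPACE / effective_db_length())
```

The last line divides the reported search space by the effective database length, which gives the effective query length (in residues) that the search used.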


EST assemble image


#  clone        accession  start  end
1  GENLf046a01  BP064751       1  467
2  MWL073d12_f  AV769895     303  882
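
As a small consistency check on the two clones listed above, the sketch below assumes that the two position values are 1-based, inclusive start and end coordinates on the 882-letter contig. The variable and function names are illustrative. It computes how many positions the two ESTs share and whether together they cover the whole contig.

```python
# (clone, accession, start, end), copied from the listing above.
CLONES = [
    ("GENLf046a01", "BP064751", 1, 467),
    ("MWL073d12_f", "AV769895", 303, 882),
]
CONTIG_LENGTH = 882  # "(882 letters)" in the BLASTX report


def overlap_length(a, b):
    """Number of contig positions shared by two clones (0 if disjoint)."""
    start = max(a[2], b[2])
    end = min(a[3], b[3])
    return max(0, end - start + 1)


def covered_positions(clones):
    """Set of contig positions covered by at least one clone."""
    covered = set()
    for _, _, start, end in clones:
        covered.update(range(start, end + 1))
    return covered


print("overlap:", overlap_length(CLONES[0], CLONES[1]))
print("covered:", len(covered_positions(CLONES)), "of", CONTIG_LENGTH)
```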




Lotus japonicus
Kazusa DNA Research Institute