KMC005722A_c01

Fasta Sequence
>KMC005722A_C01 KMC005722A_c01
TCGGTTGCTCTCTGTACACGTGGGGGTTGCGATTTTACAACCAAAGCCGCTTTTGCACAA
TCTGGAGGCGCTTCTGCCATGTTGCTCATCAATGATGAAGAAGATCTCTTTGAGATGGCT
TGCTCCAATAGCACCGGAGGAAACATTTCAATTCCAGTTGTGTTGATTACGAAATCAGCA
GGAGAAGCTTTCAACAAATCTTTAGCATCTGGAAGGAAAGTGGAAGTTTTGTTATATGCT
CCACCAAGCCCACTTGTAGATTTCTCATTTGCATTTCTGTGGTTGATGGCTGTTGGAACA
ATTGTGTGCGCTTCACTATGGTCAGATATAACTACTCCCAAGAAGTCTGGTGAACACTAT
AATGAGTTGTTTCCTAAGGAATCTTCAAATGTCGAGGGAGCGAAAGATGTTTCTGATAAA
GAAATTCTTAACATCAATTCAATGTCTGCTGTTGTATTTATTCATATCAACATCTGTCAC
TCTTGATCTATTATTCTTCTTCATGTCATCTTGGTTTAATCTGGGTGCTTGTTGTACTTT
TCTGCATCGCTGGTGTCGAGGGGATGCACAATTGTATTATAAGCCTCACGTTAAGAAAAT
GCG
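
The BLASTX alignments in the Nr search section compare translations of this sequence against protein entries. As a rough sketch of what "Frame = +1" means, the snippet below translates the first 60-base line of the sequence with the standard genetic code. The names `translate`, `CODON_TABLE` and `first_line` are illustrative only and are not part of this record or of BLAST itself.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second and third base.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, frame=1):
    """Translate a forward reading frame (1, 2 or 3); unknown codons become 'X'."""
    seq = dna.upper()
    start = frame - 1
    return "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X")
        for pos in range(start, len(seq) - 2, 3)
    )


# First line of the FASTA sequence above (bases 1-60 of the contig).
first_line = "TCGGTTGCTCTCTGTACACGTGGGGGTTGCGATTTTACAACCAAAGCCGCTTTTGCACAA"
print(translate(first_line, frame=1))  # SVALCTRGGCDFTTKAAFAQ
```

The printed peptide matches the start of the frame +1 Query line in the first alignment.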


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005722A_C01 KMC005722A_c01
         (603 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||H86192 hypothetical protein [imported] - Arabidopsis thalia...   176  1e-47
ref|NP_172073.1| unknown protein; protein id: At1g05820.1 [Arabi...   176  3e-46
ref|NP_181835.1| unknown protein; protein id: At2g43105.1 [Arabi...   147  3e-43
dbj|BAB92664.1| vacuolar sorting receptor-like protein [Oryza sa...   120  3e-32
ref|NP_564815.1| expressed protein; protein id: At1g63690.1, sup...   115  6e-29

>pir||H86192 hypothetical protein [imported] - Arabidopsis thaliana
           gi|6850311|gb|AAF29388.1|AC009999_8 Contains similarity
           to a vacuolar sorting receptor homolog from Arabidopsis
           thaliana gb|U79959
          Length = 536

 Score =  176 bits (446), Expect(2) = 1e-47
 Identities = 83/155 (53%), Positives = 119/155 (76%), Gaps = 1/155 (0%)
 Frame = +1

Query: 1   SVALCTRGGCDFTTKAAFAQSGGASAMLLINDEEDLFEMACSNS-TGGNISIPVVLITKS 177
           S+AL  RG C FT KA  AQ+GGA+A++LIND+E+L EM C    T  N+SIP+++IT S
Sbjct: 103 SIALSVRGECAFTVKAQVAQAGGAAALVLINDKEELDEMVCGEKDTSLNVSIPILMITTS 162

Query: 178 AGEAFNKSLASGRKVEVLLYAPPSPLVDFSFAFLWLMAVGTIVCASLWSDITTPKKSGEH 357
           +G+A  KS+   +KVE+LLYAP SP+VD++  FLWLM+VGT+  AS+WS +T+PKK+ E 
Sbjct: 163 SGDALKKSIMQNKKVELLLYAPKSPIVDYAVVFLWLMSVGTVFVASVWSHVTSPKKNDEQ 222

Query: 358 YNELFPKESSNVEGAKDVSDKEILNINSMSAVVFI 462
           Y+EL PK+SSNV+  K  +++E L+I++M AV+F+
Sbjct: 223 YDELSPKKSSNVDATKGGAEEETLDISAMGAVIFV 257

 Score = 35.4 bits (80), Expect(2) = 1e-47
 Identities = 17/43 (39%), Positives = 26/43 (59%)
 Frame = +3

Query: 474 LSLLIYYSSSCHLGLIWVLVVLFCIAGVEGMHNCIISLTLRKC 602
           L LL ++ SS     I +L + F I G++GMHN  ++L  R+C
Sbjct: 264 LVLLFFFMSSW---FILILTIFFVIGGMQGMHNINVTLITRRC 303

>ref|NP_172073.1| unknown protein; protein id: At1g05820.1 [Arabidopsis thaliana]
          Length = 441

 Score =  176 bits (446), Expect(2) = 3e-46
 Identities = 83/155 (53%), Positives = 119/155 (76%), Gaps = 1/155 (0%)
 Frame = +1

Query: 1   SVALCTRGGCDFTTKAAFAQSGGASAMLLINDEEDLFEMACSNS-TGGNISIPVVLITKS 177
           S+AL  RG C FT KA  AQ+GGA+A++LIND+E+L EM C    T  N+SIP+++IT S
Sbjct: 103 SIALSVRGECAFTVKAQVAQAGGAAALVLINDKEELDEMVCGEKDTSLNVSIPILMITTS 162

Query: 178 AGEAFNKSLASGRKVEVLLYAPPSPLVDFSFAFLWLMAVGTIVCASLWSDITTPKKSGEH 357
           +G+A  KS+   +KVE+LLYAP SP+VD++  FLWLM+VGT+  AS+WS +T+PKK+ E 
Sbjct: 163 SGDALKKSIMQNKKVELLLYAPKSPIVDYAVVFLWLMSVGTVFVASVWSHVTSPKKNDEQ 222

Query: 358 YNELFPKESSNVEGAKDVSDKEILNINSMSAVVFI 462
           Y+EL PK+SSNV+  K  +++E L+I++M AV+F+
Sbjct: 223 YDELSPKKSSNVDATKGGAEEETLDISAMGAVIFV 257

 Score = 30.8 bits (68), Expect(2) = 3e-46
 Identities = 15/20 (75%), Positives = 15/20 (75%)
 Frame = +2

Query: 464 ISTSVTLDLLFFFMSSWFNL 523
           IS S  L LLFFFMSSWF L
Sbjct: 258 ISASTFLVLLFFFMSSWFIL 277

>ref|NP_181835.1| unknown protein; protein id: At2g43105.1 [Arabidopsis thaliana]
           gi|25408833|pir||F84861 hypothetical protein At2g43070
           [imported] - Arabidopsis thaliana
           gi|20197149|gb|AAM14939.1| unknown protein [Arabidopsis
           thaliana] gi|20197629|gb|AAM15159.1| unknown protein
           [Arabidopsis thaliana]
          Length = 543

 Score =  147 bits (371), Expect(2) = 3e-43
 Identities = 78/154 (50%), Positives = 106/154 (68%), Gaps = 1/154 (0%)
 Frame = +1

Query: 4   VALCTRGGCDFTTKAAFAQSGGASAMLLINDEEDLFEMAC-SNSTGGNISIPVVLITKSA 180
           +AL  RG C FT KA  A++ GASA+L+IND+EDL EM C    T  N+SIPV++I+KS+
Sbjct: 108 IALSIRGNCAFTEKAKHAEAAGASALLVINDKEDLDEMGCMEKDTSLNVSIPVLMISKSS 167

Query: 181 GEAFNKSLASGRKVEVLLYAPPSPLVDFSFAFLWLMAVGTIVCASLWSDITTPKKSGEHY 360
           G+A NKS+   + VE+LLYAP  P VD +   L LMAVGT+V ASLWS++T P ++ E Y
Sbjct: 168 GDALNKSMVDNKNVELLLYAPKRPAVDLTAGLLLLMAVGTVVVASLWSELTDPDQANESY 227

Query: 361 NELFPKESSNVEGAKDVSDKEILNINSMSAVVFI 462
           + +  K+ S+    KD  +KEIL+I+   AV FI
Sbjct: 228 S-ILAKDVSSAGTRKDDPEKEILDISVTGAVFFI 260

 Score = 49.7 bits (117), Expect(2) = 3e-43
 Identities = 22/43 (51%), Positives = 30/43 (69%)
 Frame = +3

Query: 474 LSLLIYYSSSCHLGLIWVLVVLFCIAGVEGMHNCIISLTLRKC 602
           L LL Y+ SS     +WVL + FCI G++GMHN I+++ LRKC
Sbjct: 267 LLLLFYFMSSW---FVWVLTIFFCIGGMQGMHNIIMAVILRKC 306

>dbj|BAB92664.1| vacuolar sorting receptor-like protein [Oryza sativa (japonica
           cultivar-group)]
          Length = 569

 Score =  120 bits (301), Expect(3) = 3e-32
 Identities = 65/161 (40%), Positives = 98/161 (60%), Gaps = 7/161 (4%)
 Frame = +1

Query: 1   SVALCTRGGCDFTTKAAFAQSGGASAMLLINDEE------DLFEMACS-NSTGGNISIPV 159
           S+A+  RG C F  KA  A+SGGA+A+LLINDE+      DL +M C+ N T  NI IPV
Sbjct: 116 SIAVAERGECTFLEKAKTAESGGAAALLLINDEDGQVLRVDLQKMVCTQNDTVPNIGIPV 175

Query: 160 VLITKSAGEAFNKSLASGRKVEVLLYAPPSPLVDFSFAFLWLMAVGTIVCASLWSDITTP 339
           V++++SAG      +  G KV++L+YAP  P  D +  FLWLMAVG++ CAS+WS +   
Sbjct: 176 VMVSQSAGRKILSGMDGGAKVDILMYAPEKPSFDGAIPFLWLMAVGSVACASVWSFVVVG 235

Query: 340 KKSGEHYNELFPKESSNVEGAKDVSDKEILNINSMSAVVFI 462
            +           +++   G ++ +D EI+ + + +A+VFI
Sbjct: 236 DED----------KNAPTLGGEEAADSEIVELQTKTALVFI 266

 Score = 36.6 bits (83), Expect(3) = 3e-32
 Identities = 13/27 (48%), Positives = 21/27 (77%)
 Frame = +3

Query: 522 WVLVVLFCIAGVEGMHNCIISLTLRKC 602
           W+LVVLFC++G++G+H    +L +R C
Sbjct: 286 WLLVVLFCLSGLQGLHYVASTLIVRTC 312

 Score = 23.5 bits (49), Expect(3) = 3e-32
 Identities = 9/21 (42%), Positives = 13/21 (61%)
 Frame = +2

Query: 452 LYLFISTSVTLDLLFFFMSSW 514
           L   ++ S+ L  LFFF S+W
Sbjct: 263 LVFIVTASLVLLFLFFFKSTW 283

>ref|NP_564815.1| expressed protein; protein id: At1g63690.1, supported by cDNA:
           2571., supported by cDNA: gi_17065461 [Arabidopsis
           thaliana] gi|17065462|gb|AAL32885.1| Unknown protein
           [Arabidopsis thaliana]
          Length = 540

 Score =  115 bits (287), Expect(2) = 6e-29
 Identities = 61/154 (39%), Positives = 94/154 (60%), Gaps = 1/154 (0%)
 Frame = +1

Query: 4   VALCTRGGCDFTTKAAFAQSGGASAMLLINDEEDLFEMACS-NSTGGNISIPVVLITKSA 180
           V +  RG C FT KA  A++ GASA+L+IN++++L++M C  + T  +I IP V++ + A
Sbjct: 107 VVIVERGNCRFTAKANNAEAAGASALLIINNQKELYKMVCEPDETDLDIQIPAVMLPQDA 166

Query: 181 GEAFNKSLASGRKVEVLLYAPPSPLVDFSFAFLWLMAVGTIVCASLWSDITTPKKSGEHY 360
           G +  K LA+  KV   LY+P  P VD +  FLWLMA+GTI+CAS WS  +  + + EH 
Sbjct: 167 GASLQKMLANSSKVSAQLYSPRRPAVDVAEVFLWLMAIGTILCASYWSAWSAREAAIEH- 225

Query: 361 NELFPKESSNVEGAKDVSDKEILNINSMSAVVFI 462
           ++L       +    D     ++ INS+SA+ F+
Sbjct: 226 DKLLKDAIDEIPNTND-GGSGVVEINSISAIFFV 258

 Score = 34.3 bits (77), Expect(2) = 6e-29
 Identities = 13/24 (54%), Positives = 19/24 (79%)
 Frame = +3

Query: 525 VLVVLFCIAGVEGMHNCIISLTLR 596
           +LVV+FCI GVEG+  C+++L  R
Sbjct: 279 LLVVVFCIGGVEGLQTCLVALLSR 302

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 523,426,681
Number of Sequences: 1393205
Number of extensions: 11171642
Number of successful extensions: 41744
Number of sequences better than 10.0: 103
Number of HSP's better than 10.0 without gapping: 40429
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41703
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23711793746
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
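
The length and dropoff figures above can be cross-checked with a little arithmetic. The sketch below assumes the usual BLAST conventions: the effective database length is the total letters minus one effective HSP length per sequence, and a raw dropoff score converts to bits as lambda * raw / ln 2. The effective query length of 83 is not printed in the report; it is only inferred here from the reported search space. All variable names are illustrative.

```python
import math

# Values copied from the report above.
total_letters = 448_689_247
num_sequences = 1_393_205
effective_hsp_length = 117
reported_effective_db_length = 285_684_262
reported_search_space = 23_711_793_746

# Assumed convention: subtract one effective HSP length per database sequence.
effective_db_length = total_letters - num_sequences * effective_hsp_length

# Inferred, not reported: the effective query length implied by the search space.
implied_query_length, remainder = divmod(reported_search_space, effective_db_length)


def dropoff_bits(raw_score, lam):
    """Convert a raw dropoff score to bits, rounded as in the report."""
    return round(lam * raw_score / math.log(2), 1)


print(effective_db_length == reported_effective_db_length)  # True
print(implied_query_length, remainder)                      # 83 0
print(dropoff_bits(16, 0.318))  # X1, ungapped lambda: 7.3
print(dropoff_bits(38, 0.267))  # X2, gapped lambda: 14.6
print(dropoff_bits(64, 0.267))  # X3, gapped lambda: 24.7
```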


EST assemble image


No.  clone        accession  position (start end)
1    MWM003a11_f  AV764689   1 603
2    MWM003b10_f  AV764691   1 596




Lotus japonicus
Kazusa DNA Research Institute