KCC002171A_c01

Fasta Sequence
>KCC002171A_C01 KCC002171A_c01
cggcgacacgagcggccaaggagacagcggtgctggcctaggcactcgggactcagctgc
aggcgtgtgggtgcacatctCGGTGACGAGTGCGGTGGTGCCAGTGACGGCTGCGCGGCA
GACGCGCCGGAGGAATGAAGCTTGGGTGCCGACAGCTCCTGCTGCCGTCCGGTTTGGTTT
GCGGCGCGTGTCTGGATGACAAAGGGGATGTTGCAACGTCACCGCTTCACGGATGAGGAC
TTAAACGATTGACGCGCGTTGACGTGTGTGCTTGGCAAATGCTTCTCCGCTTCGCCGACC
TTCGTGTTGTTTCGGTGGCGTGCCTAGTGATGCAACTCTTAAGTTGGCCTTCGGCAGCTG
CTTGTTACCGGGCTGGGAGCGCTTGTGCGGGTTGCGGCGTGTGGCCCTGTTCGCGCTTGT
GGTTGCGGTATCCGGATACTGCTAGGGGCGCTGCATGCAGCTTGCTGAGTGACTGAGCTT
GCCGACACCAGGCCTGTTGCTGCTGCAGTTTGGCTTGTTTGATGTGAGATGTGCCCCCTG
CAATGAGTCGCGCAGTTCGCTTGCACTGCTGTTGCGGGAATGTCCGTGGCGCGGCTCTTG
GGGAAAGTGAGTCGTCTGGTTGATTGGTAGATTGCGCCTGTTTAAGTGCCTTTAGTAGTT
AAGGTGATATAAATAGTATCCCGGCATTGGTTGGGTGTTCATGTGGGTATGGTGCGCTCG
TGCTCTGTGACGTCAGTCGGTTTGTCGTGTTGATGCTCCCATGTGGGTTCGTAG


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC002171A_C01 KCC002171A_c01
         (774 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||A35419 neutrophil protein - pig (fragment) gi|164673|gb|AAA...    43  0.005
ref|NP_727652.1| CG32644-PB [Drosophila melanogaster] gi|2283312...    43  0.006
emb|CAB46679.1| proteophosphoglycan [Leishmania major]                 42  0.008
pir||T46707 proteophosphoglycan, membrane-associated [imported] ...    42  0.008
ref|NP_176045.1| proline-rich protein family [Arabidopsis thaliana]    41  0.019

>pir||A35419 neutrophil protein - pig (fragment) gi|164673|gb|AAA63449.1|
           neutrophil protein
          Length = 284

 Score = 43.1 bits (100), Expect = 0.005
 Identities = 42/169 (24%), Positives = 67/169 (38%), Gaps = 1/169 (0%)
 Frame = -3

Query: 643 KQAQSTNQPDDSLSPRAAPRTFPQQQCKRTARLIAGGTSHIKQAKLQQQQAWCRQAQSLS 464
           ++ +S  QP+    P+  P+  PQ Q +   +          QA+ Q Q     Q Q LS
Sbjct: 15  EEKESEPQPEPMSQPQPLPQPQPQSQPQPQPQAQT-------QAQCQSQSQPALQPQPLS 67

Query: 463 KLHAAPLAVSGYRNHK-REQGHTPQPAQALPAR*QAAAEGQLKSCITRHATETTRRSAKR 287
           +    PLAV        +EQGH P   +  P       E  ++  +T H    ++   + 
Sbjct: 68  QPETIPLAVLQPPPQAIQEQGHLPPERKEFPVESAKLTEVTVEPVLTVHPESKSKTKTRS 127

Query: 286 RSICQAHTSTRVNRLSPHP*SGDVATSPLSSRHAPQTKPDGSRSCRHPS 140
           RS  +A   T  +R      S   ++S  +S  +  +   GS S R  S
Sbjct: 128 RSRGRARNKTSKSRSRSS--SSSSSSSSSTSSSSGSSSSSGSSSSRSSS 174
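
The reported frame can be cross-checked against the query coordinates. The sketch below assumes the usual BLASTX convention that query positions are 1-based nucleotide coordinates and that reverse frames are counted from the 3' end of the query; the function names are illustrative and are not part of BLAST itself.

```python
def blastx_frame(query_len, q_from, q_to):
    """Return the translation frame implied by BLASTX query coordinates.

    Coordinates are assumed to be 1-based, inclusive nucleotide positions.
    """
    if q_from <= q_to:
        # Forward strand: the frame follows from the offset of the first base.
        return (q_from - 1) % 3 + 1
    # Reverse strand: coordinates run from high to low, so the frame follows
    # from the distance between the highest base and the 3' end of the query.
    return -((query_len - q_from) % 3 + 1)


def codons_spanned(q_from, q_to):
    """Number of complete codons covered by the nucleotide range."""
    return (abs(q_from - q_to) + 1) // 3


QUERY_LEN = 774  # from the "(774 letters)" line of this report

# First alignment above: Query 643 down to 140
print(blastx_frame(QUERY_LEN, 643, 140))  # -3
print(codons_spanned(643, 140))           # 168
```

The same calculation gives -2 for the alignments that start at query position 737 or 602, and -1 for the one that starts at 372, in agreement with the Frame lines in this report.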

>ref|NP_727652.1| CG32644-PB [Drosophila melanogaster] gi|22833125|gb|AAN09644.1|
           CG32644-PB [Drosophila melanogaster]
          Length = 582

 Score = 42.7 bits (99), Expect = 0.006
 Identities = 55/244 (22%), Positives = 89/244 (35%), Gaps = 14/244 (5%)
 Frame = -2

Query: 737 TDVTEHERTIPT*TPNQCRDTIYITLT---------TKGT*TGAIYQSTRRLTFPKSRAT 585
           T  TE   T  T  P+   +  + T T         T+GT T     +T + T P S +T
Sbjct: 252 TTTTEASTTTTTIEPSTSTNAAFTTTTSTEASTTTTTEGTTTSTEQTTTTKTTSP-STST 310

Query: 584 DIPATAVQANCATHCRGHISHQTSQTAAATGLVSASSVTQQAACSAPSSIRIP----QPQ 417
           +I  T  +   +T        +T+++   T   SAS++T+Q+  + P++   P    +P 
Sbjct: 311 EILTTTTELTTSTE-----PARTTESTTTTIADSASTITEQSNTTEPTTTAEPTTTTEPT 365

Query: 416 ARTGPHAAT-RTSAPSPVTSSCRRPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQSFK 240
             T P   T  T+   P T++   PT          N T         P+T      + +
Sbjct: 366 TTTEPTTTTENTTTTEPTTTT--EPTTTTEPTTTTENTTTTEPTTTTEPTTTTEPTTTTE 423

Query: 239 SSSVKR*RCNIPFVIQTRAANQTGRQQELSAPKLHSSGASAAQPSLAPPHSSPRCAPTRL 60
           S++      N      T     T  Q         S+ +S  +PS     SS     T +
Sbjct: 424 STTTTERVTNTDLTTTTTLLTTTKSQTSTDP----STTSSTTEPSTTTNRSSSSTTTTTV 479

Query: 59  QLSP 48
              P
Sbjct: 480 TTEP 483

>emb|CAB46679.1| proteophosphoglycan [Leishmania major]
          Length = 873

 Score = 42.4 bits (98), Expect = 0.008
 Identities = 42/173 (24%), Positives = 74/173 (42%)
 Frame = -2

Query: 602  PKSRATDIPATAVQANCATHCRGHISHQTSQTAAATGLVSASSVTQQAACSAPSSIRIPQ 423
            P + ++  P+++  A  A+      S  +S  +A     S+SS    ++ SAPS+     
Sbjct: 639  PSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSA-----SSSSAPSSSSSSAPSASSSSA 693

Query: 422  PQARTGPHAATRTSAPSPVTSSCRRPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQSF 243
            P + +   +A+ +SAPS   SS   P+          +++    +    PS+  ++  S 
Sbjct: 694  PSSSSSAPSASSSSAPS---SSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSSAPSA 750

Query: 242  KSSSVKR*RCNIPFVIQTRAANQTGRQQELSAPKLHSSGASAAQPSLAPPHSS 84
             SSS      + P      A++ +      SAP   SS A ++  S AP  SS
Sbjct: 751  SSSSAPSSSSSAP-----SASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASS 798

 Score = 40.8 bits (94), Expect = 0.024
 Identities = 42/175 (24%), Positives = 75/175 (42%), Gaps = 2/175 (1%)
 Frame = -2

Query: 602  PKSRATDIPATAVQANCATHCRGHISHQTSQTAAATGLVSASSVTQQAACS--APSSIRI 429
            P + ++  P+++  A  A+      S  ++ +A+++   S+SS +  +A S  APSS   
Sbjct: 686  PSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSS 745

Query: 428  PQPQARTGPHAATRTSAPSPVTSSCRRPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQ 249
              P A +    ++ +SAPS  +SS           A   +++    +    PS+  ++  
Sbjct: 746  SAPSASSSSAPSSSSSAPSASSSS-----------APSSSSSAPSASSSSAPSSSSSSAP 794

Query: 248  SFKSSSVKR*RCNIPFVIQTRAANQTGRQQELSAPKLHSSGASAAQPSLAPPHSS 84
            S  SSS             + +++        SAP   SS A +A  S AP  SS
Sbjct: 795  SASSSSA-----------PSSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSS 838

 Score = 37.7 bits (86), Expect = 0.21
 Identities = 42/165 (25%), Positives = 70/165 (41%), Gaps = 6/165 (3%)
 Frame = -2

Query: 560  ANCATHCRGHISHQTSQTAAATGLVSASSVTQQAACSAPSSIRIPQPQARTGPHAATRTS 381
            ++CAT        ++S +A +    SASS    ++ SAPS+     P + +   +A+ +S
Sbjct: 605  SDCATENACKPETESSSSAPSASSSSASS----SSSSAPSASSSSAPSSSSSAPSASSSS 660

Query: 380  APS------PVTSSCRRPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQSFKSSSVKR* 219
            APS      P  SS   P+      A   +++    +    PS   ++  S  SS+    
Sbjct: 661  APSSSSSSAPSASSSSAPS-SSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSAS 719

Query: 218  RCNIPFVIQTRAANQTGRQQELSAPKLHSSGASAAQPSLAPPHSS 84
              + P    + +++        SAP   SS A +A  S AP  SS
Sbjct: 720  SSSAP----SSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSS 760

 Score = 32.7 bits (73), Expect = 6.6
 Identities = 35/149 (23%), Positives = 61/149 (40%)
 Frame = -2

Query: 530  ISHQTSQTAAATGLVSASSVTQQAACSAPSSIRIPQPQARTGPHAATRTSAPSPVTSSCR 351
            I+ Q    A+     +A     +++ SAPS+       + +   +A+ +SAPS   SS  
Sbjct: 596  ITAQPELLASDCATENACKPETESSSSAPSASSSSASSSSSSAPSASSSSAPS---SSSS 652

Query: 350  RPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQSFKSSSVKR*RCNIPFVIQTRAANQT 171
             P+          +++    +    PS+  ++  S  SSS      + P      A++ +
Sbjct: 653  APSASSSSAPSSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSAP-----SASSSS 707

Query: 170  GRQQELSAPKLHSSGASAAQPSLAPPHSS 84
                  SAP   SS A ++  S AP  SS
Sbjct: 708  APSSSSSAPSASSSSAPSSSSSSAPSASS 736

>pir||T46707 proteophosphoglycan, membrane-associated [imported] - Leishmania
           major (fragment) gi|5420389|emb|CAB46680.1|
           proteophosphoglycan [Leishmania major]
          Length = 383

 Score = 42.4 bits (98), Expect = 0.008
 Identities = 42/175 (24%), Positives = 77/175 (44%), Gaps = 2/175 (1%)
 Frame = -2

Query: 602 PKSRATDIPATAVQANCATHCRGHISHQTSQTAAATGLVSASSVTQQAACS--APSSIRI 429
           P + ++  P+++  A  A+      S  ++ +A+++   S+SS +  +A S  APSS   
Sbjct: 11  PSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSS 70

Query: 428 PQPQARTGPHAATRTSAPSPVTSSCRRPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQ 249
             P A +    ++ +SAPS  +SS    +      A   +++    +    PS   ++  
Sbjct: 71  SAPSASSSSAPSSSSSAPSASSSSAPSSSSS----APSASSSSAPSSSSSAPSASSSSAP 126

Query: 248 SFKSSSVKR*RCNIPFVIQTRAANQTGRQQELSAPKLHSSGASAAQPSLAPPHSS 84
           S  SS+      + P    + +++        SAP   SS A +A  S AP  SS
Sbjct: 127 SSSSSAPSASSSSAP----SSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSS 177

 Score = 34.3 bits (77), Expect = 2.3
 Identities = 35/153 (22%), Positives = 67/153 (42%), Gaps = 8/153 (5%)
 Frame = -2

Query: 518 TSQTAAATGLVSASSVTQQAACSAPSSIRIPQPQARTGPHAATRTSAPS------PVTSS 357
           ++ +++++   ++SS    ++ SAPS+     P + +   +A+ +SAPS      P  SS
Sbjct: 2   SAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASS 61

Query: 356 CRRPT*ELHH*ARHRNNTKVGEAEKHLPSTHVNARQSFKSSSVKR*RCNIPFVIQT--RA 183
              P+      A   +++    +    PS   ++  S  SS+      + P    +   A
Sbjct: 62  SSAPS-SSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSA 120

Query: 182 ANQTGRQQELSAPKLHSSGASAAQPSLAPPHSS 84
           ++ +      SAP   SS A ++  S AP  SS
Sbjct: 121 SSSSAPSSSSSAPSASSSSAPSSSSSSAPSASS 153

 Score = 34.3 bits (77), Expect = 2.3
 Identities = 32/137 (23%), Positives = 56/137 (40%)
 Frame = -2

Query: 476 SVTQQAACSAPSSIRIPQPQARTGPHAATRTSAPSPVTSSCRRPT*ELHH*ARHRNNTKV 297
           S    ++ SAPS+     P + +   +A+ +SAPS   SS   P+          +++  
Sbjct: 1   SSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPS---SSSSAPSASSSSAPSSSSSSAP 57

Query: 296 GEAEKHLPSTHVNARQSFKSSSVKR*RCNIPFVIQTRAANQTGRQQELSAPKLHSSGASA 117
             +    PS+  ++  S  SSS      + P    + A + +      S+    SS +SA
Sbjct: 58  SASSSSAPSSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSA 117

Query: 116 AQPSLAPPHSSPRCAPT 66
              S +   SS   AP+
Sbjct: 118 PSASSSSAPSSSSSAPS 134

 Score = 33.1 bits (74), Expect = 5.1
 Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
 Frame = -2

Query: 602 PKSRATDIPATAVQANCATHCRGHISHQTSQTAAATGLV---SASSVTQQAACSAPSSIR 432
           P + ++  P+++  A  A+      S  +S  +A++      S+SS    ++ SAPSS  
Sbjct: 118 PSASSSSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSSSSAPSASSSSAPSSSS 177

Query: 431 IPQPQARTGPHAATRTSAPSPVTSS 357
              P A +    ++ +SAPS  +SS
Sbjct: 178 SSAPSASSSSAPSSSSSAPSASSSS 202

>ref|NP_176045.1| proline-rich protein family [Arabidopsis thaliana]
          Length = 185

 Score = 41.2 bits (95), Expect = 0.019
 Identities = 36/120 (30%), Positives = 42/120 (35%)
 Frame = -1

Query: 372 PGNKQLPKANLRVASLGTPPKQHEGRRSGEAFAKHTRQRASIV*VLIREAVTLQHPLCHP 193
           P N    +    VA    PP     RR       H RQ   +     R A   Q P   P
Sbjct: 52  PANHLRRRTTTAVAGQPQPPSPENRRRRNHHHNDHRRQPPPLP--ENRAATAGQPPSPSP 109

Query: 192 DTRRKPNRTAAGAVGTQASFLRRVCRAAVTGTTALVTEMCTHTPAAESRVPRPAPLSPWP 13
           D  R   RT   AV  Q    RR   AA  GTT +  +           +P P+P SP P
Sbjct: 110 DNHRHHRRTTTAAVAGQPPHHRRTTAAA--GTTTIAGQPPPPESPPPESLPPPSPESPSP 167



EST assemble image


No.  Clone       Accession  Start  End
1    HC093b01_r  AV638957       1  481
2    LC029a12_r  AV620950     283  774
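
As a rough check on the assembly, the two clone positions can be treated as 1-based, inclusive intervals on the contig. The sketch below computes how far the two ESTs overlap and how much of the contig they cover together; the interval interpretation and the helper names are assumptions made for illustration.

```python
# Clone positions as listed above, read as (start, end) on the contig.
ESTS = {
    "HC093b01_r": (1, 481),
    "LC029a12_r": (283, 774),
}


def overlap_length(a, b):
    """Length of the overlap between two 1-based inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def covered_length(intervals):
    """Total number of positions covered by a collection of intervals."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(intervals):
        if cur_end is not None and start <= cur_end + 1:
            # Overlapping or adjacent: extend the current merged interval.
            cur_end = max(cur_end, end)
        else:
            if cur_end is not None:
                total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
    if cur_end is not None:
        total += cur_end - cur_start + 1
    return total


print(overlap_length(*ESTS.values()))  # 199
print(covered_length(ESTS.values()))   # 774
```

Under this reading the two ESTs share 199 positions (283 to 481) and together span all 774 letters of the query.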




Chlamydomonas reinhardtii
Kazusa DNA Research Institute