KMC001161A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC001161A_C01 KMC001161A_c01
aacaaaacaattttaaattaaaaaatgaaagaaaactaacacgaaaccccaatttttttt
tggtttaaaaaataatagcaAGCTCATAATCAACCTTGGCTCTATAAAATTAATGGCACG
CAGCTAACTCGACAAAGAAATGACTCTAAACATTCAAACGACTCGAAAGAGAAGGAGAAA
GATGGAAAATTTCTGGTTTCTCTTCAAAAAAAGTTCCTGGGAGCAGATGCAGCTATGCTA
AAAAAAACACTGACGCTGTCCCTTTTACTTCCATGTGTTCGGTTCCTCCTGCAACCTAAA
CTGnAGGACCTGACAGAAGGCTAAGTAAACTTTCAAGAATCAGAATAACTGAATAGCCAT
TTTGAGTTnGTTCTGTTTCCATTTGGGCAACTTGTAAAATGCATCCTTGGACATCCCAAA
TTTCTCTTTGAACTCCGCAGATGATAGATAAGTCTCCCGTTTAGTCACATCAATGCCTGG
TACAGGATCTGGAGAAGATACTTTAAGACGTTCATACGGGTGAAAGGAGAGACCTTCCTC
GTCTTCTGCTTCACCCTCCTTCACATCTTCCTCTATGGTAAGAGACTCCACACGGCTGCT
CGCAGAATTCTCCTTGTCATTTTTATCAGGGTTTGATTTGGGGGTGACTGGACTCACTTT
AACTGACCGAGGTATCATAGTTTCTCGTGCTGAAGGTGGTTGTTCAAAAGAAGAAGTAAG
TGCAGCTATGGCTGCAGATTTTGGTGCCAGTATTGCAGAATCAGGTGTCTTAGATTTTGG
ATACAGCTTTCTGACTACTGGAGGTGGAGTTGAAAGGTTCCTAGCACCGGGATTCTCAAA
ATTAGCTGCTAATGCATTGAAGGCTGGAGACCTACCCCTCACACGAAcacgatcaggact
aacagacatgctgcgggaggaacgctgggatttatctgggacactactagaccttcctcc
ataagcg


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001161A_C01 KMC001161A_c01
         (967 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_194745.1| putative villin; protein id: At4g30160.1 [Arabi...   265  6e-70
ref|NP_200542.1| villin; protein id: At5g57320.1 [Arabidopsis th...   214  1e-54
sp|O81645|VIL3_ARATH Villin 3 gi|11358922|pir||T50668 villin 3 [...    88  2e-16
ref|NP_567048.1| villin 3 fragment; protein id: At3g57410.1, sup...    87  4e-16
pir||T45819 villin 3 homolog F28O9.260 - Arabidopsis thaliana (f...    87  4e-16

>ref|NP_194745.1| putative villin; protein id: At4g30160.1 [Arabidopsis thaliana]
            gi|25091517|sp|O65570|VIL4_ARATH Villin 4
            gi|7488222|pir||T14076 probable villin [imported] -
            Arabidopsis thaliana gi|3093294|emb|CAA73320.1| putative
            villin [Arabidopsis thaliana] gi|5730126|emb|CAB52460.1|
            putative villin [Arabidopsis thaliana]
            gi|7269916|emb|CAB81009.1| putative villin [Arabidopsis
            thaliana] gi|26449688|dbj|BAC41968.1| putative villin
            [Arabidopsis thaliana] gi|29029072|gb|AAO64915.1|
            At4g30160 [Arabidopsis thaliana]
          Length = 974

 Score =  265 bits (678), Expect = 6e-70
 Identities = 147/222 (66%), Positives = 170/222 (76%), Gaps = 14/222 (6%)
 Frame = -2

Query: 966  AYGGRSSSVPDKSQRSSRSMSVSPDRVRVRGRSPAFNALAANFENPGARNLSTPPPVVRK 787
            +YGGR+S VPDKSQ+ SRSMS SPDRVRVRGRSPAFNALAA FE+  ARNLSTPPPVVRK
Sbjct: 756  SYGGRAS-VPDKSQQRSRSMSFSPDRVRVRGRSPAFNALAATFESQNARNLSTPPPVVRK 814

Query: 786  LYPKSKTPDSAIL--APKSAAIAALTSSFEQPPSARETMIPRSVKVSPVTPKS-NPDKND 616
            LYP+S TPDS+    APKS+AIA+ ++ FE+ P  +E  IP+ VK SP TP+S  P+ N 
Sbjct: 815  LYPRSVTPDSSKFAPAPKSSAIASRSALFEKIP-PQEPSIPKPVKASPKTPESPAPESNS 873

Query: 615  K-----------ENSASSRVESLTIEEDVKEGEAEDEEGLSFHPYERLKVSSPDPVPGID 469
            K           E S SSR+ESLTI+ED KEG  EDEE L  HPY+RLK +S DPV  ID
Sbjct: 874  KEQEEKKENDKEEGSMSSRIESLTIQEDAKEG-VEDEEDLPAHPYDRLKTTSTDPVSDID 932

Query: 468  VTKRETYLSSAEFKEKFGMSKDAFYKLPKWKQNXLKMAIQLF 343
            VT+RE YLSS EFKEKFGM+K+AFYKLPKWKQN  KMA+QLF
Sbjct: 933  VTRREAYLSSEEFKEKFGMTKEAFYKLPKWKQNKFKMAVQLF 974

>ref|NP_200542.1| villin; protein id: At5g57320.1 [Arabidopsis thaliana]
            gi|8777365|dbj|BAA96955.1| villin [Arabidopsis thaliana]
          Length = 962

 Score =  214 bits (546), Expect = 1e-54
 Identities = 123/213 (57%), Positives = 147/213 (68%), Gaps = 5/213 (2%)
 Frame = -2

Query: 966  AYGGRSSSVPDKSQRSSRSMSVSPDRVRVRGRSPAFNALAANFENPGARNLSTPPPVV-- 793
            AY  RS+ VPDKSQ  SRSM+ SPDR RVRGRSPAFNALAANFE    RN STPPP+V  
Sbjct: 756  AYSSRST-VPDKSQPRSRSMTFSPDRARVRGRSPAFNALAANFEKLNIRNQSTPPPMVSP 814

Query: 792  --RKLYPKSKTPDSAILAPKSAAIAALTSSFEQP-PSARETMIPRSVKVSPVTPKSNPDK 622
              RKLYPKS  PD + +APKSA IAA T+ FE+P P+++E   P S   S  T ++   K
Sbjct: 815  MVRKLYPKSHAPDLSKIAPKSA-IAARTALFEKPTPTSQEP--PTSPSSSEATNQAEAPK 871

Query: 621  NDKENSASSRVESLTIEEDVKEGEAEDEEGLSFHPYERLKVSSPDPVPGIDVTKRETYLS 442
            +  E +    + S  I ED KE EAE+E  L   PYERLK  S DPV  +D+T+RE YL+
Sbjct: 872  STSETNEEEAMSS--INEDSKEEEAEEESSLPTFPYERLKTDSEDPVSDVDLTRREAYLT 929

Query: 441  SAEFKEKFGMSKDAFYKLPKWKQNXLKMAIQLF 343
            S EFKEKF M+K+ FYKLPKWKQN LKM++ LF
Sbjct: 930  SVEFKEKFEMTKNEFYKLPKWKQNKLKMSVNLF 962

>sp|O81645|VIL3_ARATH Villin 3 gi|11358922|pir||T50668 villin 3 [imported] - Arabidopsis
            thaliana gi|3415117|gb|AAC31607.1| villin 3 [Arabidopsis
            thaliana]
          Length = 966

 Score = 88.2 bits (217), Expect = 2e-16
 Identities = 69/201 (34%), Positives = 96/201 (47%), Gaps = 8/201 (3%)
 Frame = -2

Query: 921  SSRSMSVSPDRVRVRG-------RSPAFNALAANFENPGARNLSTPPPVVRKLYPKSKTP 763
            SS   + SP R R  G       R+ A  AL + F +  +   S  PP    L  ++   
Sbjct: 772  SSSGRTSSPSRDRSNGSQGGPRQRAEALAALTSAFNSSPS---SKSPPRRSGLTSQASQR 828

Query: 762  DSAILAPKSAAIAALTSSFEQPPSARETMIPRSVKVSPVTPKSNPDKNDKENSASSRVES 583
             +A+ A      A    S +  PSA       S      T ++   K ++E S ++  E+
Sbjct: 829  AAAVAALSQVLTAEKKKSPDTSPSAEAKDEETSFSEVEATEEATEAKEEEEVSPAA--EA 886

Query: 582  LTIEEDVKEGEAEDEE-GLSFHPYERLKVSSPDPVPGIDVTKRETYLSSAEFKEKFGMSK 406
               E   K+ ++E E  G++F  YERL+  S  PV GID  +RE YLS  EFK  FGM K
Sbjct: 887  SAEEAKPKQDDSEVETTGVTF-TYERLQAKSEKPVTGIDFKRREAYLSEVEFKTVFGMEK 945

Query: 405  DAFYKLPKWKQNXLKMAIQLF 343
            ++FYKLP WKQ+ LK    LF
Sbjct: 946  ESFYKLPGWKQDLLKKKFNLF 966

>ref|NP_567048.1| villin 3 fragment; protein id: At3g57410.1, supported by cDNA:
            gi_3415116 [Arabidopsis thaliana]
          Length = 965

 Score = 87.0 bits (214), Expect = 4e-16
 Identities = 69/201 (34%), Positives = 98/201 (48%), Gaps = 8/201 (3%)
 Frame = -2

Query: 921  SSRSMSVSPDRVRVRG-------RSPAFNALAANFENPGARNLSTPPPVVRKLYPKSKTP 763
            SS   + SP R R  G       R+ A  AL + F +  +   S  PP    L  ++   
Sbjct: 772  SSSGRTSSPSRDRSNGSQGGPRQRAEALAALTSAFNSSPS---SKSPPRRSGLTSQASQR 828

Query: 762  DSAILAPKSAAIAALTSSFEQPPSARETMIPRSVKVSPVTPKSNPDKNDKENSASSRVES 583
             +A+ A      A    S +  PSA E    ++      T ++   K ++E S ++  E+
Sbjct: 829  AAAVAALSQVLTAEKKKSPDTSPSA-EAKDEKAFSEVEATEEATEAKEEEEVSPAA--EA 885

Query: 582  LTIEEDVKEGEAEDEE-GLSFHPYERLKVSSPDPVPGIDVTKRETYLSSAEFKEKFGMSK 406
               E   K+ ++E E  G++F  YERL+  S  PV GID  +RE YLS  EFK  FGM K
Sbjct: 886  SAEEAKPKQDDSEVETTGVTF-TYERLQAKSEKPVTGIDFKRREAYLSEVEFKTVFGMEK 944

Query: 405  DAFYKLPKWKQNXLKMAIQLF 343
            ++FYKLP WKQ+ LK    LF
Sbjct: 945  ESFYKLPGWKQDLLKKKFNLF 965

>pir||T45819 villin 3 homolog F28O9.260 - Arabidopsis thaliana (fragment)
           gi|6735320|emb|CAB68147.1| villin 3 fragment
           [Arabidopsis thaliana]
          Length = 383

 Score = 87.0 bits (214), Expect = 4e-16
 Identities = 69/201 (34%), Positives = 98/201 (48%), Gaps = 8/201 (3%)
 Frame = -2

Query: 921 SSRSMSVSPDRVRVRG-------RSPAFNALAANFENPGARNLSTPPPVVRKLYPKSKTP 763
           SS   + SP R R  G       R+ A  AL + F +  +   S  PP    L  ++   
Sbjct: 190 SSSGRTSSPSRDRSNGSQGGPRQRAEALAALTSAFNSSPS---SKSPPRRSGLTSQASQR 246

Query: 762 DSAILAPKSAAIAALTSSFEQPPSARETMIPRSVKVSPVTPKSNPDKNDKENSASSRVES 583
            +A+ A      A    S +  PSA E    ++      T ++   K ++E S ++  E+
Sbjct: 247 AAAVAALSQVLTAEKKKSPDTSPSA-EAKDEKAFSEVEATEEATEAKEEEEVSPAA--EA 303

Query: 582 LTIEEDVKEGEAEDEE-GLSFHPYERLKVSSPDPVPGIDVTKRETYLSSAEFKEKFGMSK 406
              E   K+ ++E E  G++F  YERL+  S  PV GID  +RE YLS  EFK  FGM K
Sbjct: 304 SAEEAKPKQDDSEVETTGVTF-TYERLQAKSEKPVTGIDFKRREAYLSEVEFKTVFGMEK 362

Query: 405 DAFYKLPKWKQNXLKMAIQLF 343
           ++FYKLP WKQ+ LK    LF
Sbjct: 363 ESFYKLPGWKQDLLKKKFNLF 383

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 817,331,827
Number of Sequences: 1393205
Number of extensions: 17916664
Number of successful extensions: 64642
Number of sequences better than 10.0: 268
Number of HSP's better than 10.0 without gapping: 56904
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 63441
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 54910356336
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GENLf062e06 BP065675 1 514
2 MWM066b09_f AV765761 373 967




Lotus japonicus
Kazusa DNA Research Institute