KMC004981A_c01

Fasta Sequence
>KMC004981A_C01 KMC004981A_c01
aaaacaagaacaaaatcattgCTCGCAGATACCAAGATGGTTTTCAATTTCCATTAGCAT
TTGCACAATTGCTAGACCTTTCATGTTAAAAAAAGTACAAAAAAGGCAAAATCTTTTATC
CTATACAACAAAAAATTGAACTAGTTTTCCAGACCCCTAGCATCAATGGGTTTCAAACCA
CTGCATCACTTTTTTCCATGAAGGCAAAATTAATCACACACATGCTCTGACAGTTGATAT
TTAATCACTCCTTCCTGAATAAAGAACAAAATATATCGGTGGTGCATGTGCATTATTCAT
TGCGGATTGAATAATACTTATCCCAGAGAGTTTGTGATTTTGAGATTTCATTCATAAGAC
TCTTGAACTTGACACTGCAATCTCCTCCACTTATGCGATTCAAGGATGCAACAGCTCGGA
GTGCACTTCGGATCATGTCTTCATTTCGATCTACTTCTTGCTTGACAGCATCTTGCTTTG
GCTTAAAATTAATAGTCTTCTGGAGAGGATCCACCAATGAATCTAACACTGCTAGGACTG
CCGAAGGACAATTATCAGCTAGTTTCGAAAGTATCAGGTGACAAGGCATTTTAACATCGT
AATGATCATCCAAACCAGATTTAAGATAAGGAACAATGAATGATGAGGGGTTCAGTTGAT
CTAGACAACTATCCAGCAATGTGTCTACACATTCGAAAGCCGCCTTCCTCAACTCAAGCC
CATCATCCACAGTATGCTTGAAAGGACCAAGATCAACTGTCCTAATCAGTTCTTGCTTAA
CAATTGTTTGATCATAAAGAAGAGGCAACAGATCAGAAAGAAGCCCTTTGATGAGGTTTG
GCTTATTGTGTGCAAATGTGCTTAGAGCCAAGACAGCAGCCCGTCTaacatgcctgttat
tatctttgataagcatcagaaatgatgatatttcagggtatatgatctcatctatcttct
cagggc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004981A_C01 KMC004981A_c01
         (966 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_178360.2| unknown protein; protein id: At2g02560.1, suppo...   360  2e-98
pir||T00607 hypothetical protein At2g02560 [imported] - Arabidop...   360  2e-98
gb|AAO51125.1| similar to Homo sapiens (Human). Hypothetical pro...   200  3e-50
ref|NP_446456.1| TBP-interacting protein 120A [Rattus norvegicus...   197  3e-49
emb|CAD38737.1| hypothetical protein [Homo sapiens]                   196  3e-49

>ref|NP_178360.2| unknown protein; protein id: At2g02560.1, supported by cDNA:
            gi_20466781 [Arabidopsis thaliana]
            gi|20466782|gb|AAM20708.1| unknown protein [Arabidopsis
            thaliana]
          Length = 1219

 Score =  360 bits (923), Expect = 2e-98
 Identities = 178/223 (79%), Positives = 203/223 (90%)
 Frame = -3

Query: 964  PEKIDEIIYPEISSFLMLIKDNNRHVRRAAVLALSTFAHNKPNLIKGLLSDLLPLLYDQT 785
            PEK+DEII+P+ISSFLMLIKD +RHVRRAAV ALSTFAH KPNLIKGLL +LLPLLYDQT
Sbjct: 997  PEKLDEIIFPQISSFLMLIKDGDRHVRRAAVSALSTFAHYKPNLIKGLLPELLPLLYDQT 1056

Query: 784  IVKQELIRTVDLGPFKHTVDDGLELRKAAFECVDTLLDSCLDQLNPSSFIVPYLKSGLDD 605
            ++K+ELIRTVDLGPFKH VDDGLELRKAAFECV TL+DSCLDQ+NPSSFIVP+LKSGL+D
Sbjct: 1057 VIKKELIRTVDLGPFKHVVDDGLELRKAAFECVFTLVDSCLDQVNPSSFIVPFLKSGLED 1116

Query: 604  HYDVKMPCHLILSKLADNCPSAVLAVLDSLVDPLQKTINFKPKQDAVKQEVDRNEDMIRS 425
            HYD+KM CHLILS LAD CPSAVLAVLDSLV+PL KTI+FKPKQDAVKQE DRNEDMIRS
Sbjct: 1117 HYDLKMLCHLILSLLADKCPSAVLAVLDSLVEPLHKTISFKPKQDAVKQEHDRNEDMIRS 1176

Query: 424  ALRAVASLNRISGGDCSVKFKSLMNEISKSQTLWDKYYSIRNE 296
            ALRA++SL+RI+G D S KFK LM ++ +S  LW+K+ +IRNE
Sbjct: 1177 ALRAISSLDRINGVDYSHKFKGLMGDMKRSVPLWEKFQTIRNE 1219
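
Note on Frame = -3: the query was translated from the reverse strand, which is why the query coordinates above run downward from 964 to 296. The following is a minimal sketch of that translation, not part of the BLAST report. The function names are hypothetical, the standard genetic code is assumed, and only the last 16 nucleotides of the contig are used as input.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame 1..3 (forward) or -1..-3 (reverse strand)."""
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def aligned_residues(q_start, q_end):
    """Number of codons spanned by a pair of nucleotide query coordinates."""
    return (abs(q_start - q_end) + 1) // 3


# Last 16 nucleotides of KMC004981A_c01 (positions 951-966).
tail = "tctatcttctcagggc"
print(translate_frame(tail, -3))    # PEKI
print(aligned_residues(964, 296))   # 223
```

The four residues PEKI match the start of the Query line at position 964, and the 669 nucleotides between 964 and 296 correspond to the 223 aligned positions reported in the Identities line.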

>pir||T00607 hypothetical protein At2g02560 [imported] - Arabidopsis thaliana
            gi|3184283|gb|AAC18930.1| unknown protein [Arabidopsis
            thaliana]
          Length = 1217

 Score =  360 bits (923), Expect = 2e-98
 Identities = 178/223 (79%), Positives = 203/223 (90%)
 Frame = -3

Query: 964  PEKIDEIIYPEISSFLMLIKDNNRHVRRAAVLALSTFAHNKPNLIKGLLSDLLPLLYDQT 785
            PEK+DEII+P+ISSFLMLIKD +RHVRRAAV ALSTFAH KPNLIKGLL +LLPLLYDQT
Sbjct: 995  PEKLDEIIFPQISSFLMLIKDGDRHVRRAAVSALSTFAHYKPNLIKGLLPELLPLLYDQT 1054

Query: 784  IVKQELIRTVDLGPFKHTVDDGLELRKAAFECVDTLLDSCLDQLNPSSFIVPYLKSGLDD 605
            ++K+ELIRTVDLGPFKH VDDGLELRKAAFECV TL+DSCLDQ+NPSSFIVP+LKSGL+D
Sbjct: 1055 VIKKELIRTVDLGPFKHVVDDGLELRKAAFECVFTLVDSCLDQVNPSSFIVPFLKSGLED 1114

Query: 604  HYDVKMPCHLILSKLADNCPSAVLAVLDSLVDPLQKTINFKPKQDAVKQEVDRNEDMIRS 425
            HYD+KM CHLILS LAD CPSAVLAVLDSLV+PL KTI+FKPKQDAVKQE DRNEDMIRS
Sbjct: 1115 HYDLKMLCHLILSLLADKCPSAVLAVLDSLVEPLHKTISFKPKQDAVKQEHDRNEDMIRS 1174

Query: 424  ALRAVASLNRISGGDCSVKFKSLMNEISKSQTLWDKYYSIRNE 296
            ALRA++SL+RI+G D S KFK LM ++ +S  LW+K+ +IRNE
Sbjct: 1175 ALRAISSLDRINGVDYSHKFKGLMGDMKRSVPLWEKFQTIRNE 1217

>gb|AAO51125.1| similar to Homo sapiens (Human). Hypothetical protein [Dictyostelium
            discoideum]
          Length = 1238

 Score =  200 bits (508), Expect = 3e-50
 Identities = 103/222 (46%), Positives = 153/222 (68%)
 Frame = -3

Query: 961  EKIDEIIYPEISSFLMLIKDNNRHVRRAAVLALSTFAHNKPNLIKGLLSDLLPLLYDQTI 782
            E +D+ + P IS FL L+ D +  VRR+A+L+L+  AHNKPNLI+  LS  LP+LY+   
Sbjct: 1004 EVVDQYLAPNISQFLSLLHDGDLIVRRSALLSLNYIAHNKPNLIRNDLSVYLPILYNNAK 1063

Query: 781  VKQELIRTVDLGPFKHTVDDGLELRKAAFECVDTLLDSCLDQLNPSSFIVPYLKSGLDDH 602
            +K ELIR VDLGPFKH VDDG+E+RK AFEC+ TLLD+ +D+++ + FIV       D  
Sbjct: 1064 IKPELIREVDLGPFKHKVDDGIEIRKTAFECMYTLLDTSIDKIDVAPFIVSLCDGLKDTQ 1123

Query: 601  YDVKMPCHLILSKLADNCPSAVLAVLDSLVDPLQKTINFKPKQDAVKQEVDRNEDMIRSA 422
            YD+K+ CHL++ +LA++  +A+L  +  L++PL+  +  K  + AVKQ+++RNE+ IRSA
Sbjct: 1124 YDIKLLCHLMIIRLANSNGAALLENITLLLEPLRVILMTKVNETAVKQQIERNEECIRSA 1183

Query: 421  LRAVASLNRISGGDCSVKFKSLMNEISKSQTLWDKYYSIRNE 296
            LRAVAS++RI   D  VKF+  +    ++  L  ++ SI +E
Sbjct: 1184 LRAVASISRIPNSDSIVKFEEFVKNTIRTTPLAAQFNSILSE 1225

>ref|NP_446456.1| TBP-interacting protein 120A [Rattus norvegicus]
            gi|11281653|pir||T42735 TBP-interacting protein TIP120 -
            rat gi|1799570|dbj|BAA13432.1| TIP120 [Rattus norvegicus]
            gi|7688703|gb|AAF67492.1|AF157326_1 TIP120 protein [Homo
            sapiens]
          Length = 1230

 Score =  197 bits (500), Expect = 3e-49
 Identities = 104/223 (46%), Positives = 155/223 (68%)
 Frame = -3

Query: 964  PEKIDEIIYPEISSFLMLIKDNNRHVRRAAVLALSTFAHNKPNLIKGLLSDLLPLLYDQT 785
            P+ ID ++   I  FL  ++D + +VRR A++  ++ AHNKP+LI+ LL  +LP LY++T
Sbjct: 997  PQPIDPLLKNCIGDFLKTLEDPDLNVRRVALVTFNSAAHNKPSLIRDLLDSVLPHLYNET 1056

Query: 784  IVKQELIRTVDLGPFKHTVDDGLELRKAAFECVDTLLDSCLDQLNPSSFIVPYLKSGLDD 605
             V++ELIR V++GPFKHTVDDGL++RKAAFEC+ TLLDSCLD+L+   F+  +++ GL D
Sbjct: 1057 KVRKELIREVEMGPFKHTVDDGLDIRKAAFECMYTLLDSCLDRLDIFEFL-NHVEDGLKD 1115

Query: 604  HYDVKMPCHLILSKLADNCPSAVLAVLDSLVDPLQKTINFKPKQDAVKQEVDRNEDMIRS 425
            HYD+KM   L+L +L+  CPSAVL  LD LV+PL+ T   K K ++VKQE ++ +++ RS
Sbjct: 1116 HYDIKMLTFLMLVRLSTLCPSAVLQRLDRLVEPLRATCTTKVKANSVKQEFEKQDELKRS 1175

Query: 424  ALRAVASLNRISGGDCSVKFKSLMNEISKSQTLWDKYYSIRNE 296
            A+RAVA+L  I   + S       ++IS +  L   + SI+ +
Sbjct: 1176 AMRAVAALLTIPEAEKSPLMSEFQSQISSNPELAAIFESIQKD 1218

>emb|CAD38737.1| hypothetical protein [Homo sapiens]
          Length = 770

 Score =  196 bits (499), Expect = 3e-49
 Identities = 104/223 (46%), Positives = 155/223 (68%)
 Frame = -3

Query: 964  PEKIDEIIYPEISSFLMLIKDNNRHVRRAAVLALSTFAHNKPNLIKGLLSDLLPLLYDQT 785
            P+ ID ++   I  FL  ++D + +VRR A++  ++ AHNKP+LI+ LL  +LP LY++T
Sbjct: 537  PQPIDPLLKNCIGDFLKTLEDPDLNVRRVALVTFNSAAHNKPSLIRDLLDTVLPHLYNET 596

Query: 784  IVKQELIRTVDLGPFKHTVDDGLELRKAAFECVDTLLDSCLDQLNPSSFIVPYLKSGLDD 605
             V++ELIR V++GPFKHTVDDGL++RKAAFEC+ TLLDSCLD+L+   F+  +++ GL D
Sbjct: 597  KVRKELIREVEMGPFKHTVDDGLDIRKAAFECMYTLLDSCLDRLDIFEFL-NHVEDGLKD 655

Query: 604  HYDVKMPCHLILSKLADNCPSAVLAVLDSLVDPLQKTINFKPKQDAVKQEVDRNEDMIRS 425
            HYD+KM   L+L +L+  CPSAVL  LD LV+PL+ T   K K ++VKQE ++ +++ RS
Sbjct: 656  HYDIKMLTFLMLVRLSTLCPSAVLQRLDRLVEPLRATCTTKVKANSVKQEFEKQDELKRS 715

Query: 424  ALRAVASLNRISGGDCSVKFKSLMNEISKSQTLWDKYYSIRNE 296
            A+RAVA+L  I   + S       ++IS +  L   + SI+ +
Sbjct: 716  AMRAVAALLTIPEAEKSPLMSEFQSQISSNPELAAIFESIQKD 758

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 759,651,989
Number of Sequences: 1393205
Number of extensions: 15513547
Number of successful extensions: 35344
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 34095
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35306
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 54910356336
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
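
Note on the search-space figures: the reported effective database length is consistent with subtracting one effective HSP length per database sequence from the total letter count. The sketch below only re-derives the numbers printed above. The 198-residue effective query length is inferred from the reported search space and is an assumption, since the report does not state it.

```python
# Values copied from the statistics block of the report.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 123
reported_effective_db_length = 277_325_032
reported_search_space = 54_910_356_336

# Effective database length: remove one HSP length per sequence.
effective_db_length = db_letters - db_sequences * effective_hsp_length

# Effective query length implied by the reported search space (an inference).
implied_query_length, remainder = divmod(reported_search_space, effective_db_length)

print(effective_db_length)    # 277325032
print(implied_query_length)   # 198
print(remainder)              # 0
```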


EST assemble image


no.  clone         accession  start  end
1    MPDL007c01_f  AV776859       1  322
2    SPDL064f03_f  BP055987      22  533
3    MFBL003a02_f  BP041396      23  564
4    SPDL021g11_f  BP053331      34  502
5    SPDL100c02_f  BP058279      34  395
6    MRL046e12_f   BP085936      74  250
7    SPDL095a11_f  BP057943     440  969
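
Note on reading the assembly table: each row gives a clone, its accession, and the first and last contig positions it covers. The sketch below is one possible way to summarize those rows. The names are hypothetical and the coordinates are taken as listed, including the end position 969 of the last clone, which lies past the 966 letters of the query.

```python
# (clone, accession, start, end) rows copied from the table above.
ESTS = [
    ("MPDL007c01_f", "AV776859", 1, 322),
    ("SPDL064f03_f", "BP055987", 22, 533),
    ("MFBL003a02_f", "BP041396", 23, 564),
    ("SPDL021g11_f", "BP053331", 34, 502),
    ("SPDL100c02_f", "BP058279", 34, 395),
    ("MRL046e12_f", "BP085936", 74, 250),
    ("SPDL095a11_f", "BP057943", 440, 969),
]


def contig_span(ests):
    """Smallest start and largest end over all clones."""
    return min(e[2] for e in ests), max(e[3] for e in ests)


def depth_at(ests, position):
    """Number of clones whose interval contains the given position."""
    return sum(1 for _, _, start, end in ests if start <= position <= end)


def is_contiguous(ests):
    """True if the clone intervals leave no uncovered gap between them."""
    reach = None
    for _, _, start, end in sorted(ests, key=lambda e: e[2]):
        if reach is not None and start > reach + 1:
            return False
        reach = end if reach is None else max(reach, end)
    return True


print(contig_span(ESTS))      # (1, 969)
print(depth_at(ESTS, 100))    # 6
print(depth_at(ESTS, 600))    # 1
print(is_contiguous(ESTS))    # True
```

Six of the seven clones overlap near the 5' end, while positions beyond 564 are supported by a single clone.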




Lotus japonicus
Kazusa DNA Research Institute