Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005063A_C01 KMC005063A_c01
(512 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_177728.1| unknown protein; protein id: At1g76020.1 [Arabi... 174 7e-43
gb|AAF79818.1|AC007396_19 T4O12.23 [Arabidopsis thaliana] 174 7e-43
ref|NP_683315.1| unknown protein; protein id: At1g20225.1, suppo... 160 6e-39
pir||H86335 T20H2.2 protein - Arabidopsis thaliana gi|8778978|gb... 160 8e-39
gb|EAA36442.1| predicted protein [Neurospora crassa] 37 0.13
>ref|NP_177728.1| unknown protein; protein id: At1g76020.1 [Arabidopsis thaliana]
Length = 225
Score = 174 bits (440), Expect = 7e-43
Identities = 83/150 (55%), Positives = 112/150 (74%)
Frame = +1
Query: 52 LILQLLMALQVLIFSPAGADYIPPAKTDGFVYLNRRSFNFDSILIEAFYDPLCPDSRDSW 231
+I +L+ L ++ + A +PP + DGFVY F+ D+ILIEA++DP+CPDSRDSW
Sbjct: 1 MIRAVLLFLVFVVETRVQAQLVPPVRQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSW 60
Query: 232 PPLKQALHDYGSGVSLVVHLLPLPYHDNAYVASRALHVVNALNSSATFPLLELLFKDQEK 411
PPLKQALH YGS V+L++HLLPLPYHDNAYV SRALH+VN ++++ATF LLE FK Q
Sbjct: 61 PPLKQALHHYGSRVALLLHLLPLPYHDNAYVTSRALHIVNTVHANATFSLLEGFFKHQSL 120
Query: 412 FYGAQTRNLSRASIQEEFVKSATEVIGSSF 501
FY AQT+ LSR ++ E+ V+ T +G+S+
Sbjct: 121 FYNAQTQLLSRPAVVEKIVELGTVSLGNSY 150
>gb|AAF79818.1|AC007396_19 T4O12.23 [Arabidopsis thaliana]
Length = 263
Score = 174 bits (440), Expect = 7e-43
Identities = 83/150 (55%), Positives = 112/150 (74%)
Frame = +1
Query: 52 LILQLLMALQVLIFSPAGADYIPPAKTDGFVYLNRRSFNFDSILIEAFYDPLCPDSRDSW 231
+I +L+ L ++ + A +PP + DGFVY F+ D+ILIEA++DP+CPDSRDSW
Sbjct: 1 MIRAVLLFLVFVVETRVQAQLVPPVRQDGFVYPPGHRFDPDTILIEAYFDPVCPDSRDSW 60
Query: 232 PPLKQALHDYGSGVSLVVHLLPLPYHDNAYVASRALHVVNALNSSATFPLLELLFKDQEK 411
PPLKQALH YGS V+L++HLLPLPYHDNAYV SRALH+VN ++++ATF LLE FK Q
Sbjct: 61 PPLKQALHHYGSRVALLLHLLPLPYHDNAYVTSRALHIVNTVHANATFSLLEGFFKHQSL 120
Query: 412 FYGAQTRNLSRASIQEEFVKSATEVIGSSF 501
FY AQT+ LSR ++ E+ V+ T +G+S+
Sbjct: 121 FYNAQTQLLSRPAVVEKIVELGTVSLGNSY 150
>ref|NP_683315.1| unknown protein; protein id: At1g20225.1, supported by cDNA:
gi_17065543 [Arabidopsis thaliana]
gi|17065544|gb|AAL32926.1| Unknown protein [Arabidopsis
thaliana] gi|24899723|gb|AAN65076.1| Unknown protein
[Arabidopsis thaliana]
Length = 233
Score = 160 bits (406), Expect = 6e-39
Identities = 76/153 (49%), Positives = 111/153 (71%)
Frame = +1
Query: 49 ILILQLLMALQVLIFSPAGADYIPPAKTDGFVYLNRRSFNFDSILIEAFYDPLCPDSRDS 228
++I L+ + + + A IPPA+ DGF+Y R + D+ILIEA+ DP+CPD RD+
Sbjct: 1 MMIRTALVFVVFFVGTVVQAQLIPPARRDGFLYPPGRKIDRDTILIEAYIDPVCPDCRDA 60
Query: 229 WPPLKQALHDYGSGVSLVVHLLPLPYHDNAYVASRALHVVNALNSSATFPLLELLFKDQE 408
W PLK A+ YGS V+LV+HL+PLP+HDNA+VASRALH+V+ LN++ATF LLE +FK Q
Sbjct: 61 WEPLKLAIDHYGSRVALVLHLIPLPFHDNAFVASRALHIVDTLNANATFNLLEGIFKHQT 120
Query: 409 KFYGAQTRNLSRASIQEEFVKSATEVIGSSFYT 507
FY +QT+ +SR ++ EE +K T +G+S+++
Sbjct: 121 LFYNSQTQLMSRPAVVEELIKLGTVTLGNSYHS 153
>pir||H86335 T20H2.2 protein - Arabidopsis thaliana
gi|8778978|gb|AAF79893.1|AC022472_2 Contains similarity
to pigpen protein from Mus musculus gb|AF224264 and
contains protein of unknown function DUF78 PF|01918
domain. ESTs gb|N38077, gb|BE037702, gb|AV442191,
gb|AV441368, gb|Z17998, gb|AV527266, gb|AV520794,
gb|AI997847, gb|AV543000 come from this gene.
[Arabidopsis thaliana]
Length = 538
Score = 160 bits (405), Expect = 8e-39
Identities = 74/134 (55%), Positives = 103/134 (76%)
Frame = +1
Query: 106 ADYIPPAKTDGFVYLNRRSFNFDSILIEAFYDPLCPDSRDSWPPLKQALHDYGSGVSLVV 285
A IPPA+ DGF+Y R + D+ILIEA+ DP+CPD RD+W PLK A+ YGS V+LV+
Sbjct: 19 AQLIPPARRDGFLYPPGRKIDRDTILIEAYIDPVCPDCRDAWEPLKLAIDHYGSRVALVL 78
Query: 286 HLLPLPYHDNAYVASRALHVVNALNSSATFPLLELLFKDQEKFYGAQTRNLSRASIQEEF 465
HL+PLP+HDNA+VASRALH+V+ LN++ATF LLE +FK Q FY +QT+ +SR ++ EE
Sbjct: 79 HLIPLPFHDNAFVASRALHIVDTLNANATFNLLEGIFKHQTLFYNSQTQLMSRPAVVEEL 138
Query: 466 VKSATEVIGSSFYT 507
+K T +G+S+++
Sbjct: 139 IKLGTVTLGNSYHS 152
>gb|EAA36442.1| predicted protein [Neurospora crassa]
Length = 219
Score = 37.0 bits (84), Expect = 0.13
Identities = 27/111 (24%), Positives = 45/111 (40%), Gaps = 8/111 (7%)
Frame = +1
Query: 184 IEAFYDPLCPDSRDSW--------PPLKQALHDYGSGVSLVVHLLPLPYHDNAYVASRAL 339
+E F D +CP S + P L+ D GS V + P+H ++ + A
Sbjct: 31 VEIFLDYVCPFSAKIYNTLYTTLLPSLRSEHADLGSKVQFIFRHQIQPWHPSSTLTHEAG 90
Query: 340 HVVNALNSSATFPLLELLFKDQEKFYGAQTRNLSRASIQEEFVKSATEVIG 492
V L + + LFKDQ+ ++ N +R + K A++ G
Sbjct: 91 LAVQRLAPTKFWDFSAALFKDQKAYFDVSLVNETRNETYKRLAKLASQSAG 141
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 447,188,332
Number of Sequences: 1393205
Number of extensions: 9595452
Number of successful extensions: 27770
Number of sequences better than 10.0: 31
Number of HSP's better than 10.0 without gapping: 26375
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 27688
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 16232377112
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)