Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001303A_C01 KCC001303A_c01
(554 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAK59627.1| unknown protein [Arabidopsis thaliana] 141 7e-33
ref|NP_563902.1| chloroplast lumen pentapeptide protein, putativ... 141 7e-33
pir||F86257 Hypothetical protein [imported] - Arabidopsis thalia... 139 3e-32
ref|NP_484230.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2... 82 6e-15
ref|ZP_00110510.1| COG1357: Uncharacterized low-complexity prote... 80 2e-14
>gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
Length = 280
Score = 141 bits (355), Expect = 7e-33
Identities = 83/185 (44%), Positives = 108/185 (57%), Gaps = 4/185 (2%)
Frame = +1
Query: 10 SFHIVATPLHLSRLHTMALTMRRATVARPAVSSRTRTVTVQASASKH----MGAGVAAVA 177
S + +P H R L + + SS TR ++ S + A +AA
Sbjct: 20 SSSVSRSPYHFQRYLLRRLQLSSRSNLEIKDSSNTREGCCSSAESNKWKRILSAAMAAAV 79
Query: 178 LAATMSLAGPALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCR 357
+A++ + PA+A+LN +EA T GEFGIGSA QYG AD+ SN++ RR+NFTSAD R
Sbjct: 80 IASSSGV--PAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMR 137
Query: 358 NATFKGSNLQGAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLK 537
+ F GS GAY KAV Y+ NF A+LSD LMDR + EANL NA+L R+V TRSDL
Sbjct: 138 ESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLG 197
Query: 538 DAVIE 552
A IE
Sbjct: 198 GAKIE 202
>ref|NP_563902.1| chloroplast lumen pentapeptide protein, putative [Arabidopsis
thaliana] gi|23297125|gb|AAN13098.1| unknown protein
[Arabidopsis thaliana]
Length = 280
Score = 141 bits (355), Expect = 7e-33
Identities = 83/185 (44%), Positives = 109/185 (58%), Gaps = 4/185 (2%)
Frame = +1
Query: 10 SFHIVATPLHLSRLHTMALTMRRATVARPAVSSRTRTVTVQASAS----KHMGAGVAAVA 177
S + +P H R L + + SS TR ++ S + + A +AA
Sbjct: 20 SSSVSRSPYHFQRYLLRRLQLSSRSNLEIKDSSNTREGCCSSAESNTWKRILSAAMAAAV 79
Query: 178 LAATMSLAGPALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCR 357
+A++ + PA+A+LN +EA T GEFGIGSA QYG AD+ SN++ RR+NFTSAD R
Sbjct: 80 IASSSGV--PAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMR 137
Query: 358 NATFKGSNLQGAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLK 537
+ F GS GAY KAV Y+ NF A+LSD LMDR + EANL NA+L R+V TRSDL
Sbjct: 138 ESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLG 197
Query: 538 DAVIE 552
A IE
Sbjct: 198 GAKIE 202
>pir||F86257 Hypothetical protein [imported] - Arabidopsis thaliana
gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical
protein [Arabidopsis thaliana]
Length = 293
Score = 139 bits (350), Expect = 3e-32
Identities = 81/183 (44%), Positives = 109/183 (59%), Gaps = 2/183 (1%)
Frame = +1
Query: 10 SFHIVATPLHLSR--LHTMALTMRRATVARPAVSSRTRTVTVQASASKHMGAGVAAVALA 183
S + +P H R L + L+ R + + + T + + + A +AA +A
Sbjct: 10 SSSVSRSPYHFQRYLLRRLQLSSR----SNLEIKDSSNTSAESNTWKRILSAAMAAAVIA 65
Query: 184 ATMSLAGPALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNA 363
++ + PA+A+LN +EA T GEFGIGSA QYG AD+ SN++ RR+NFTSAD R +
Sbjct: 66 SSSGV--PAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTSADMRES 123
Query: 364 TFKGSNLQGAYFIKAVTYRTNFEDANLSDVLMDRATMVEANLKNAILQRTVFTRSDLKDA 543
F GS GAY KAV Y+ NF A+LSD LMDR + EANL NA+L R+V TRSDL A
Sbjct: 124 DFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGA 183
Query: 544 VIE 552
IE
Sbjct: 184 KIE 186
>ref|NP_484230.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25354788|pir||AB1830
hypothetical protein all0186 [imported] - Nostoc sp.
(strain PCC 7120) gi|17135164|dbj|BAB77710.1|
ORF_ID:all0186~hypothetical protein [Nostoc sp. PCC
7120]
Length = 168
Score = 81.6 bits (200), Expect = 6e-15
Identities = 41/95 (43%), Positives = 56/95 (58%)
Frame = +1
Query: 265 SAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQGAYFIKAVTYRTNFEDANL 444
+ + Y A+++ RDF+N DL NF +A+ R F+G+NL A K V + N +ANL
Sbjct: 34 NTINYNNANLENRDFANADLVGVNFVAAEMRGTNFQGANLTNAILTKGVLLKANLSEANL 93
Query: 445 SDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVI 549
+ L+DRAT+ ANLKNAI TRS DA I
Sbjct: 94 TGALVDRATLDNANLKNAIFTEATLTRSRFYDADI 128
>ref|ZP_00110510.1| COG1357: Uncharacterized low-complexity proteins [Nostoc
punctiforme]
Length = 168
Score = 80.1 bits (196), Expect = 2e-14
Identities = 42/95 (44%), Positives = 53/95 (55%)
Frame = +1
Query: 265 SAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQGAYFIKAVTYRTNFEDANL 444
+ + Y +++ RDFSN DL F +A+ R F+G+NL A K V + N E ANL
Sbjct: 34 NTINYNNINLENRDFSNADLAGVTFVAAEMRGTNFQGANLTNAILTKGVLLKANLEGANL 93
Query: 445 SDVLMDRATMVEANLKNAILQRTVFTRSDLKDAVI 549
S L+DR TM ANLKNAI TRS DA I
Sbjct: 94 SGALVDRVTMDGANLKNAIFTEATLTRSRFFDAEI 128