Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001501A_C01 KCC001501A_c01
(582 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_862421.1| putative hydroxyproline-rich protein [Micrococc... 45 7e-04
ref|ZP_00094438.1| COG4638: Phenylpropionate dioxygenase and rel... 44 0.001
ref|ZP_00139981.1| COG2010: Cytochrome c, mono- and diheme varia... 44 0.002
dbj|BAC57318.1| P0700D12.12 [Oryza sativa (japonica cultivar-gro... 41 0.014
ref|NP_566678.1| expressed protein [Arabidopsis thaliana] gi|928... 40 0.031
>ref|NP_862421.1| putative hydroxyproline-rich protein [Micrococcus sp. 28]
gi|18025409|gb|AAK62517.1| putative hydroxyproline-rich
protein [Micrococcus sp. 28]
Length = 406
Score = 45.1 bits (105), Expect = 7e-04
Identities = 34/119 (28%), Positives = 49/119 (40%), Gaps = 5/119 (4%)
Frame = +3
Query: 228 PLALCRALRVTRLLRPTATVCMPMEMPLRTESARRPPRPSGRS*TL*MRAHSAPLARTAS 407
P+A+ R TR + PLR + R P P S T + A PL+R++
Sbjct: 20 PVAMNSTTRTTRGTMAMLCKSVKPHRPLRPGTVRFPNTPCWGSRTAALTADPPPLSRSSG 79
Query: 408 RSAC----PSHSAWTSPESCSSR-WMLRP*RCPTSSPE*TPAX*WCRRPRRPRARSARC 569
C P+H + P S+R W RP P + P + W R +RP +RC
Sbjct: 80 PDQCRPSSPAHQQRSGPHGPSARPWPARPDDAPCAVPAWSSKHPWWRPRQRPEHPCSRC 138
>ref|ZP_00094438.1| COG4638: Phenylpropionate dioxygenase and related
ring-hydroxylating dioxygenases, large terminal subunit
[Novosphingobium aromaticivorans]
Length = 715
Score = 44.3 bits (103), Expect = 0.001
Identities = 55/171 (32%), Positives = 67/171 (39%), Gaps = 18/171 (10%)
Frame = +1
Query: 97 PVHLRQLSSPSNRPS*K-----DAAHSAFWSRPSAAAGWPCSPRRAFHHR----SPSVER 249
P H R+ S+P R + +A S +RPS G PC HH SP+ R
Sbjct: 57 PCHARRGSAPGGRVCRRRCGVDGSARSPCAARPSRNGGVPCR-----HHGPLGLSPASAR 111
Query: 250 CV*RGSCGQRQRCACQWKCRFARKVPDARPDRQDAHRHCE*GHTLHRWPERPPGRPARHI 429
+ R CG R +W R R P R A R G RP R RH
Sbjct: 112 GL-RPRCGHRWHGGARWYRRADRGRPAGGFPRFPAGRGNRVGLR-----HRPCAR--RHR 163
Query: 430 QHGQVRKAAAPDGCCGRRDVQPQV---------RSELLQXDGAGGHAGRAR 555
G R+ P GRR V+P+V R + DG GGHAGR R
Sbjct: 164 AAG--RRCRYPADAAGRRAVRPRVDLRAAIPDARRARRRPDGPGGHAGRNR 212
>ref|ZP_00139981.1| COG2010: Cytochrome c, mono- and diheme variants [Pseudomonas
aeruginosa UCBPP-PA14]
Length = 639
Score = 43.5 bits (101), Expect = 0.002
Identities = 30/87 (34%), Positives = 35/87 (39%)
Frame = +1
Query: 316 RKVPDARPDRQDAHRHCE*GHTLHRWPERPPGRPARHIQHGQVRKAAAPDGCCGRRDVQP 495
R++P RPD Q R H L +R P P RH H Q R+ P+G G R
Sbjct: 51 RQLPRPRPDLQGRLRPATAAHDLRLEGQRDPHEPLRHRAHAQDRRGDEPEGHLGER---- 106
Query: 496 QVRSELLQXDGAGGHAGRARGRRGVAS 576
Q LQ G H RG G S
Sbjct: 107 QELRRPLQYTGLPDHPPARRGDHGQRS 133
>dbj|BAC57318.1| P0700D12.12 [Oryza sativa (japonica cultivar-group)]
Length = 320
Score = 40.8 bits (94), Expect = 0.014
Identities = 46/159 (28%), Positives = 64/159 (39%), Gaps = 18/159 (11%)
Frame = +2
Query: 158 TLPSGRVHQ-QQRAGPARRAVPF--TTARPLSSVACNAAPAANGN---GVHANGNAA--- 310
+LP+ HQ + R P A+P + PL + A A + GV NG+
Sbjct: 11 SLPALPSHQPRSRLAPRSLALPGGRSCCGPLRAAAAGGGGGAKDDAQAGVTPNGSPVIKS 70
Query: 311 ---SHGKCPTPAQTVRTLIDIVNEGTLCTV------GPNGLPVGLPVTFSMDKSGKLQLQ 463
+HG P PA VR L++ LCTV G P G V FS D G
Sbjct: 71 ATFAHG-LPPPALAVRNLMEQARFAHLCTVMSGMHHRRTGYPFGSLVDFSNDSMGHPIFS 129
Query: 464 MDAAAVEMSNLKSGVNSCXLMVQAATQAARAVGAVSLHG 580
+ A+ NL S C L+VQ + + V++ G
Sbjct: 130 LSPLAIHTRNLLSDPR-CTLVVQVPGWSGLSNARVTIFG 167
>ref|NP_566678.1| expressed protein [Arabidopsis thaliana] gi|9280221|dbj|BAB01711.1|
gene_id:MXL8.5~unknown protein [Arabidopsis thaliana]
gi|17065156|gb|AAL32732.1| Unknown protein [Arabidopsis
thaliana] gi|27311937|gb|AAO00934.1| Unknown protein
[Arabidopsis thaliana]
Length = 317
Score = 39.7 bits (91), Expect = 0.031
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Frame = +2
Query: 326 PTPAQTVRTLIDIVNEGTLCTVGPNGLPVGLPVTFSMDKSGK--LQLQMDAAAVEMSNLK 499
P PA+ R+++++ + GTL T+ +G P+G+ V F++DK G L L + + S L
Sbjct: 59 PFPAEVSRSIMELSSVGTLSTLTHDGWPLGVGVRFAVDKDGTPVLCLNRSVSPDKRSALH 118
Query: 500 SGVNSCXLMVQAAT 541
+ C L T
Sbjct: 119 VQLEQCGLRTPQCT 132