
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148971.3 - phase: 0
(226 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_172444.2| Ulp1 protease family protein [Arabidopsis thali... 105 1e-21
gb|AAN46793.1| At1g09730/F21M12_12 [Arabidopsis thaliana] gi|221... 103 5e-21
gb|AAU90189.1| unknown protein [Oryza sativa (japonica cultivar-... 95 1e-18
ref|NP_195088.2| Ulp1 protease family protein [Arabidopsis thali... 85 1e-15
dbj|BAD37693.1| Ulp1 protease-like [Oryza sativa (japonica culti... 67 4e-10
gb|AAM91198.1| similar to protein-tyrosine phosphatase 2 [Arabid... 67 5e-10
ref|NP_973802.1| Ulp1 protease family protein [Arabidopsis thali... 67 5e-10
gb|AAD39580.1| T10O24.20 [Arabidopsis thaliana] 65 1e-09
ref|NP_176228.1| Ulp1 protease family protein [Arabidopsis thali... 60 4e-08
gb|AAC24055.1| Contains similarity to protein-tyrosine phosphata... 60 4e-08
ref|NP_916015.1| OSJNBb0021A09.14 [Oryza sativa (japonica cultiv... 60 6e-08
dbj|BAD87026.1| Ulp1 protease-like [Oryza sativa (japonica culti... 60 6e-08
emb|CAB88329.1| putative protein [Arabidopsis thaliana] gi|15228... 51 2e-05
gb|AAX23862.1| hypothetical protein At3g48480 [Arabidopsis thali... 51 2e-05
emb|CAF89300.1| unnamed protein product [Tetraodon nigroviridis] 51 2e-05
dbj|BAC98029.1| mKIAA0797 protein [Mus musculus] 50 4e-05
ref|NP_666115.2| SUMO/sentrin specific protease 6 [Mus musculus] 50 5e-05
ref|XP_539004.1| PREDICTED: similar to SENP6 protein [Canis fami... 50 5e-05
emb|CAH72480.1| SUSP1 [Homo sapiens] gi|55662693|emb|CAH74168.1|... 50 5e-05
ref|XP_518592.1| PREDICTED: SUMO1/sentrin specific protease 6 [P... 50 5e-05
>ref|NP_172444.2| Ulp1 protease family protein [Arabidopsis thaliana]
Length = 984
Score = 105 bits (261), Expect = 1e-21
Identities = 67/195 (34%), Positives = 99/195 (50%), Gaps = 36/195 (18%)
Query: 2 ADDFSSKFLQLRFISLEVSSLQMNLTNSVELFIDFCFSLPPFFFFQLPQQDNFYDCGLFL 61
+DD SS+F+ LRF+SLE LPQQ+N +DCGLFL
Sbjct: 604 SDDISSRFMNLRFVSLE-----------------------------LPQQENSFDCGLFL 634
Query: 62 LYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQNLIYDIFENGSLKAP 121
L+++E FL EAP+ F+PFKI S FL NWFP EASL+R+ IQ LI+++ EN S +
Sbjct: 635 LHYLELFLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLENRSREV- 693
Query: 122 PIDCRGKGPLSELPGVIEHKVEADSSGASCYPGI-WHGNLSNGSTETDIQFRPV--SPVR 178
+ E P + + + C P I +G+++ + I+ + S +R
Sbjct: 694 ---SNEQNQSCESPVAVNDDMGIEVLSERCSPLIDCNGDMTQTQDDQGIEMTLLERSSMR 750
Query: 179 AASCSRDPGIVFKDL 193
+ D G+V +DL
Sbjct: 751 HIQAANDSGMVLRDL 765
>gb|AAN46793.1| At1g09730/F21M12_12 [Arabidopsis thaliana]
gi|22135830|gb|AAM91101.1| At1g09730/F21M12_12
[Arabidopsis thaliana]
Length = 378
Score = 103 bits (256), Expect = 5e-21
Identities = 59/153 (38%), Positives = 87/153 (56%), Gaps = 7/153 (4%)
Query: 44 FFFQLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRS 103
F FQLPQQ+N +DCGLFLL+++E FL EAP+ F+PFKI S FL NWFP EASL+R+
Sbjct: 11 FLFQLPQQENSFDCGLFLLHYLELFLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRT 70
Query: 104 HIQNLIYDIFENGSLKAPPIDCRGKGPLSELPGVIEHKVEADSSGASCYPGI-WHGNLSN 162
IQ LI+++ EN S + + E P + + + C P I +G+++
Sbjct: 71 LIQKLIFELLENRSREV----SNEQNQSCESPVAVNDDMGIEVLSERCSPLIDCNGDMTQ 126
Query: 163 GSTETDIQFRPV--SPVRAASCSRDPGIVFKDL 193
+ I+ + S +R + D G+V +DL
Sbjct: 127 TQDDQGIEMTLLERSSMRHIQAANDSGMVLRDL 159
>gb|AAU90189.1| unknown protein [Oryza sativa (japonica cultivar-group)]
Length = 991
Score = 95.1 bits (235), Expect = 1e-18
Identities = 53/124 (42%), Positives = 67/124 (53%), Gaps = 29/124 (23%)
Query: 2 ADDFSSKFLQLRFISLEVSSLQMNLTNSVELFIDFCFSLPPFFFFQLPQQDNFYDCGLFL 61
A D S KFL LRFISLE LPQQDN +DCGLFL
Sbjct: 455 ASDCSDKFLNLRFISLE-----------------------------LPQQDNSFDCGLFL 485
Query: 62 LYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQNLIYDIFENGSLKAP 121
L++VE FL + P FNP KI F+ +L+ +WFP EASL+RS I+ LI+ + + S P
Sbjct: 486 LHYVELFLMDTPRSFNPLKIDSFANYLSDDWFPPAEASLKRSLIRKLIHKLLKEPSQDFP 545
Query: 122 PIDC 125
+ C
Sbjct: 546 KLVC 549
>ref|NP_195088.2| Ulp1 protease family protein [Arabidopsis thaliana]
Length = 783
Score = 85.1 bits (209), Expect = 1e-15
Identities = 47/132 (35%), Positives = 77/132 (57%), Gaps = 5/132 (3%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
+LPQQ+N +DCGLFLL++++ F+ +AP KFNP I++ + FL NWFP+ EASL+R +I
Sbjct: 484 ELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSANFLTRNWFPAKEASLKRRNIL 543
Query: 107 NLIYDIFENGSLKAPPIDCRGKGPLSELPGVIEHKVEADSSGASCYPGIWHGNL-SNGST 165
L+Y++ + P + + + P + + + E+++ C W + ST
Sbjct: 544 ELLYNLHKGHDPSILPANSKSEPPHCGVSNRNDQETESENVIECCN---WIKPFDGSSST 600
Query: 166 ETDI-QFRPVSP 176
TDI Q + SP
Sbjct: 601 VTDISQTKTCSP 612
>dbj|BAD37693.1| Ulp1 protease-like [Oryza sativa (japonica cultivar-group)]
Length = 522
Score = 67.0 bits (162), Expect = 4e-10
Identities = 39/102 (38%), Positives = 55/102 (53%), Gaps = 12/102 (11%)
Query: 48 LPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQN 107
+PQQDN YDCG+F+LY++ RF+EEAP + N S WF +EAS R +Q
Sbjct: 414 VPQQDNEYDCGVFVLYYMRRFIEEAPERLNN---KDSSNMFGEGWFQREEASALRKEMQA 470
Query: 108 LIYDIFE----NGSLKAP--PIDCRGKGP---LSELPGVIEH 140
L+ +FE N ++ P P+ + P LS P V +H
Sbjct: 471 LLLRLFEEAKDNNHMRDPTTPVSATAEHPVEVLSTEPAVPDH 512
>gb|AAM91198.1| similar to protein-tyrosine phosphatase 2 [Arabidopsis thaliana]
gi|20260164|gb|AAM12980.1| similar to protein-tyrosine
phosphatase 2 [Arabidopsis thaliana]
gi|22329476|ref|NP_172527.2| Ulp1 protease family
protein [Arabidopsis thaliana]
Length = 571
Score = 66.6 bits (161), Expect = 5e-10
Identities = 31/71 (43%), Positives = 45/71 (62%), Gaps = 4/71 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
Q+PQQ N +DCGLFLL+F+ RF+EEAP + + K ++ WF +EAS R I
Sbjct: 502 QVPQQKNDFDCGLFLLFFIRRFIEEAPQRLT----LQDLKMIHKKWFKPEEASALRIKIW 557
Query: 107 NLIYDIFENGS 117
N++ D+F G+
Sbjct: 558 NILVDLFRKGN 568
>ref|NP_973802.1| Ulp1 protease family protein [Arabidopsis thaliana]
Length = 570
Score = 66.6 bits (161), Expect = 5e-10
Identities = 31/71 (43%), Positives = 45/71 (62%), Gaps = 4/71 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
Q+PQQ N +DCGLFLL+F+ RF+EEAP + + K ++ WF +EAS R I
Sbjct: 501 QVPQQKNDFDCGLFLLFFIRRFIEEAPQRLT----LQDLKMIHKKWFKPEEASALRIKIW 556
Query: 107 NLIYDIFENGS 117
N++ D+F G+
Sbjct: 557 NILVDLFRKGN 567
>gb|AAD39580.1| T10O24.20 [Arabidopsis thaliana]
Length = 582
Score = 65.5 bits (158), Expect = 1e-09
Identities = 30/71 (42%), Positives = 45/71 (63%), Gaps = 4/71 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
++PQQ N +DCGLFLL+F+ RF+EEAP + + K ++ WF +EAS R I
Sbjct: 513 EVPQQKNDFDCGLFLLFFIRRFIEEAPQRLT----LQDLKMIHKKWFKPEEASALRIKIW 568
Query: 107 NLIYDIFENGS 117
N++ D+F G+
Sbjct: 569 NILVDLFRKGN 579
>ref|NP_176228.1| Ulp1 protease family protein [Arabidopsis thaliana]
Length = 604
Score = 60.5 bits (145), Expect = 4e-08
Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 4/67 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
Q+PQQ N +DCG F+L+F++RF+EEAP + + F K WF DEAS R I+
Sbjct: 535 QVPQQKNDFDCGPFVLFFIKRFIEEAPQRLKRKDLGMFDK----KWFRPDEASALRIKIR 590
Query: 107 NLIYDIF 113
N + ++F
Sbjct: 591 NTLIELF 597
>gb|AAC24055.1| Contains similarity to protein-tyrosine phosphatase 2 gb|L15420
from Dictyostelium discoideum. EST gb|N38718 comes from
this g [Arabidopsis thaliana] gi|7486914|pir||T02274
hypothetical protein T13D8.11 - Arabidopsis thaliana
Length = 547
Score = 60.5 bits (145), Expect = 4e-08
Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 4/67 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
Q+PQQ N +DCG F+L+F++RF+EEAP + + F K WF DEAS R I+
Sbjct: 478 QVPQQKNDFDCGPFVLFFIKRFIEEAPQRLKRKDLGMFDK----KWFRPDEASALRIKIR 533
Query: 107 NLIYDIF 113
N + ++F
Sbjct: 534 NTLIELF 540
>ref|NP_916015.1| OSJNBb0021A09.14 [Oryza sativa (japonica cultivar-group)]
Length = 506
Score = 59.7 bits (143), Expect = 6e-08
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
Q+P Q N YDCG+F+L+++ERF++EAP + + F + WF E S R I+
Sbjct: 408 QVPSQRNKYDCGIFMLHYIERFIQEAPERLTRENLCMFGR----KWFDPKETSGLRDRIR 463
Query: 107 NLIYDIFENGSL 118
L++D FE+ +
Sbjct: 464 ALMFDAFESARM 475
>dbj|BAD87026.1| Ulp1 protease-like [Oryza sativa (japonica cultivar-group)]
Length = 528
Score = 59.7 bits (143), Expect = 6e-08
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWFPSDEASLRRSHIQ 106
Q+P Q N YDCG+F+L+++ERF++EAP + + F + WF E S R I+
Sbjct: 430 QVPSQRNKYDCGIFMLHYIERFIQEAPERLTRENLCMFGR----KWFDPKETSGLRDRIR 485
Query: 107 NLIYDIFENGSL 118
L++D FE+ +
Sbjct: 486 ALMFDAFESARM 497
>emb|CAB88329.1| putative protein [Arabidopsis thaliana]
gi|15228382|ref|NP_190417.1| hypothetical protein
[Arabidopsis thaliana]
Length = 169
Score = 51.2 bits (121), Expect = 2e-05
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 42 PFFFFQLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWF 93
PF+ +PQQ N +CG F+LY++ RF+E+AP FN + FL +WF
Sbjct: 102 PFYVPMVPQQTNDVECGSFVLYYIHRFIEDAPENFN---VEDMPYFLKEDWF 150
>gb|AAX23862.1| hypothetical protein At3g48480 [Arabidopsis thaliana]
Length = 167
Score = 51.2 bits (121), Expect = 2e-05
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 42 PFFFFQLPQQDNFYDCGLFLLYFVERFLEEAPIKFNPFKITKFSKFLNSNWF 93
PF+ +PQQ N +CG F+LY++ RF+E+AP FN + FL +WF
Sbjct: 100 PFYVPMVPQQTNDVECGSFVLYYIHRFIEDAPENFN---VEDMPYFLKEDWF 148
>emb|CAF89300.1| unnamed protein product [Tetraodon nigroviridis]
Length = 160
Score = 51.2 bits (121), Expect = 2e-05
Identities = 24/70 (34%), Positives = 40/70 (56%), Gaps = 9/70 (12%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFN-PFKITKFSKFLNSNWFPSDEASLRRSHI 105
Q+PQQDN DCGL+LL + + FL+ + F P ++ NWFP + +R I
Sbjct: 98 QVPQQDNSSDCGLYLLQYAQSFLQNPVVHFELPLRL--------GNWFPRQQVRQKREEI 149
Query: 106 QNLIYDIFEN 115
++LI ++ ++
Sbjct: 150 RSLILELHQS 159
>dbj|BAC98029.1| mKIAA0797 protein [Mus musculus]
Length = 1174
Score = 50.4 bits (119), Expect = 4e-05
Identities = 25/72 (34%), Positives = 39/72 (53%), Gaps = 9/72 (12%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFN-PFKITKFSKFLNSNWFPSDEASLRRSHI 105
++PQQ+NF DCG+++L +VE F E + F P + NWFP +R I
Sbjct: 1081 KVPQQNNFSDCGVYVLQYVESFFENPVLNFELPMNLV--------NWFPPPRMKTKREEI 1132
Query: 106 QNLIYDIFENGS 117
+N+I + E+ S
Sbjct: 1133 RNIILKLQESQS 1144
>ref|NP_666115.2| SUMO/sentrin specific protease 6 [Mus musculus]
Length = 1132
Score = 50.1 bits (118), Expect = 5e-05
Identities = 25/72 (34%), Positives = 39/72 (53%), Gaps = 9/72 (12%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFN-PFKITKFSKFLNSNWFPSDEASLRRSHI 105
++PQQ+NF DCG+++L +VE F E + F P + NWFP +R I
Sbjct: 1039 KVPQQNNFSDCGVYVLQYVESFFENPVLNFELPMNL--------MNWFPPPRMKTKREEI 1090
Query: 106 QNLIYDIFENGS 117
+N+I + E+ S
Sbjct: 1091 RNIILKLQESQS 1102
>ref|XP_539004.1| PREDICTED: similar to SENP6 protein [Canis familiaris]
Length = 1204
Score = 50.1 bits (118), Expect = 5e-05
Identities = 25/72 (34%), Positives = 40/72 (54%), Gaps = 9/72 (12%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFN-PFKITKFSKFLNSNWFPSDEASLRRSHI 105
++PQQ+NF DCG+++L +VE F E + F P + +NWFP +R I
Sbjct: 1112 KVPQQNNFSDCGVYVLQYVESFFENPILNFELPMNL--------ANWFPPPRMRTKREEI 1163
Query: 106 QNLIYDIFENGS 117
+N+I + E+ S
Sbjct: 1164 RNIILKLQEDQS 1175
>emb|CAH72480.1| SUSP1 [Homo sapiens] gi|55662693|emb|CAH74168.1| SUSP1 [Homo sapiens]
gi|56205419|emb|CAI23575.1| SUSP1 [Homo sapiens]
gi|56204016|emb|CAI19523.1| SUSP1 [Homo sapiens]
Length = 1107
Score = 50.1 bits (118), Expect = 5e-05
Identities = 25/72 (34%), Positives = 40/72 (54%), Gaps = 9/72 (12%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFN-PFKITKFSKFLNSNWFPSDEASLRRSHI 105
++PQQ+NF DCG+++L +VE F E + F P + +NWFP +R I
Sbjct: 1015 KVPQQNNFSDCGVYVLQYVESFFENPILSFELPMNL--------ANWFPPPRMRTKREEI 1066
Query: 106 QNLIYDIFENGS 117
+N+I + E+ S
Sbjct: 1067 RNIILKLQEDQS 1078
>ref|XP_518592.1| PREDICTED: SUMO1/sentrin specific protease 6 [Pan troglodytes]
Length = 1316
Score = 50.1 bits (118), Expect = 5e-05
Identities = 25/72 (34%), Positives = 40/72 (54%), Gaps = 9/72 (12%)
Query: 47 QLPQQDNFYDCGLFLLYFVERFLEEAPIKFN-PFKITKFSKFLNSNWFPSDEASLRRSHI 105
++PQQ+NF DCG+++L +VE F E + F P + +NWFP +R I
Sbjct: 1224 KVPQQNNFSDCGVYVLQYVESFFENPILSFELPMNL--------ANWFPPPRMRTKREEI 1275
Query: 106 QNLIYDIFENGS 117
+N+I + E+ S
Sbjct: 1276 RNIILKLQEDQS 1287
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.324 0.141 0.439
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 399,688,730
Number of Sequences: 2540612
Number of extensions: 17208305
Number of successful extensions: 37514
Number of sequences better than 10.0: 87
Number of HSP's better than 10.0 without gapping: 81
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 37399
Number of HSP's gapped (non-prelim): 92
length of query: 226
length of database: 863,360,394
effective HSP length: 123
effective length of query: 103
effective length of database: 550,865,118
effective search space: 56739107154
effective search space used: 56739107154
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 73 (32.7 bits)
Medicago: description of AC148971.3