
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC141115.10 + phase: 0 /pseudo
(173 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAG14981.1| C2 domain-containing protein [Cicer arietinum] 120 2e-26
gb|AAR24766.1| At2g22125 [Arabidopsis thaliana] gi|38454166|gb|A... 115 5e-25
ref|NP_179803.3| C2 domain-containing protein [Arabidopsis thali... 115 5e-25
dbj|BAD38184.1| C2 domain-containing protein-like [Oryza sativa ... 100 2e-20
pir||C84609 hypothetical protein At2g22130 [imported] - Arabidop... 92 6e-18
dbj|BAC42703.1| unknown protein [Arabidopsis thaliana] 51 2e-05
ref|NP_177870.1| C2 domain-containing protein / armadillo/beta-c... 51 2e-05
ref|NP_175078.1| C2 domain-containing protein / armadillo/beta-c... 44 0.002
gb|AAM15466.1| unknown protein [Arabidopsis thaliana] gi|2019814... 41 0.016
gb|EAL87727.1| hypothetical protein Afu1g00130 [Aspergillus fumi... 32 7.3
gb|EAA77132.1| hypothetical protein FG09575.1 [Gibberella zeae P... 32 9.5
ref|XP_416549.1| PREDICTED: similar to DNA polymerase theta isof... 32 9.5
gb|AAC64636.1| sensory transduction histidine kinase [Mastigocla... 32 9.5
emb|CAE67628.1| Hypothetical protein CBG13182 [Caenorhabditis br... 32 9.5
>emb|CAG14981.1| C2 domain-containing protein [Cicer arietinum]
Length = 248
Score = 120 bits (301), Expect = 2e-26
Identities = 82/180 (45%), Positives = 100/180 (55%), Gaps = 45/180 (25%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+D+LFLL Q WSACP EVSR QS AAA AIP LQ LIQ GP F EKAEF+ LV
Sbjct: 62 LDSLFLLRQAWSACPAEVSRAQSIAAADAIPFLQYLIQSGPPRFQEKAEFLLQCLPGTLV 121
Query: 66 MIVKRGNNMRQCVGNQG---KITL-------------EANQEWDERFTYMVL*ECSSRTE 109
+I+KRGNNM+Q VGN KITL N EWDE F++ E + +
Sbjct: 122 VIIKRGNNMKQSVGNPSVYCKITLGNNPPRLTKVVSTGPNPEWDESFSWSF--ESPPKGQ 179
Query: 110 ASYL-LQKQA*SGK-------------------ADEHTLLPTSKSGQPRNLEVELKWSNK 149
++ + ++ GK A E+TLLP SKSG PRNLE+E +WSNK
Sbjct: 180 KLHISCKNKSKVGKSKFGKVTIQIDRVVMLGAVAGEYTLLPASKSGPPRNLEIEFQWSNK 239
>gb|AAR24766.1| At2g22125 [Arabidopsis thaliana] gi|38454166|gb|AAR20777.1|
At2g22125 [Arabidopsis thaliana]
Length = 309
Score = 115 bits (288), Expect = 5e-25
Identities = 82/180 (45%), Positives = 100/180 (55%), Gaps = 46/180 (25%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+DALFLL Q WSACP EVSR QS AAA AIPLLQ LIQ GP F EKAEF+ LV
Sbjct: 133 LDALFLLRQAWSACPAEVSRAQSVAAADAIPLLQYLIQSGPPRFQEKAEFLLQCLPGTLV 192
Query: 66 MIVKRGNNMRQCVGNQG---KITL-------------EANQEWDERFTYMVL*ECSSRTE 109
+ +KRGNNM+Q VGN KITL N EWDE F++ E + +
Sbjct: 193 VTIKRGNNMKQSVGNPSVFCKITLGNNPPRQTKVISTGPNPEWDESFSWSF--ESPPKGQ 250
Query: 110 ASYL-LQKQA*SGK-------------------ADEHTLLPTSKSGQPRNLEVELKWSNK 149
++ + ++ GK A E++LLP SKSG PRNLE+E +WSNK
Sbjct: 251 KLHISCKNKSKMGKSSFGKVTIQIDRVVMLGAVAGEYSLLPESKSG-PRNLEIEFQWSNK 309
>ref|NP_179803.3| C2 domain-containing protein [Arabidopsis thaliana]
Length = 309
Score = 115 bits (288), Expect = 5e-25
Identities = 82/180 (45%), Positives = 100/180 (55%), Gaps = 46/180 (25%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+DALFLL Q WSACP EVSR QS AAA AIPLLQ LIQ GP F EKAEF+ LV
Sbjct: 133 LDALFLLRQAWSACPAEVSRAQSVAAADAIPLLQYLIQSGPPRFQEKAEFLLQCLPGTLV 192
Query: 66 MIVKRGNNMRQCVGNQG---KITL-------------EANQEWDERFTYMVL*ECSSRTE 109
+ +KRGNNM+Q VGN KITL N EWDE F++ E + +
Sbjct: 193 VTIKRGNNMKQSVGNPSVFCKITLGNNPPRQTKVISTGPNPEWDESFSWSF--ESPPKGQ 250
Query: 110 ASYL-LQKQA*SGK-------------------ADEHTLLPTSKSGQPRNLEVELKWSNK 149
++ + ++ GK A E++LLP SKSG PRNLE+E +WSNK
Sbjct: 251 KLHISCKNKSKMGKSSFGKVTIQIDRVVMLGAVAGEYSLLPESKSG-PRNLEIEFQWSNK 309
>dbj|BAD38184.1| C2 domain-containing protein-like [Oryza sativa (japonica
cultivar-group)]
Length = 983
Score = 100 bits (248), Expect = 2e-20
Identities = 70/178 (39%), Positives = 92/178 (51%), Gaps = 41/178 (23%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+D+L+LL Q W AC E+ + QS AA+ AIPLLQ LIQ GP F EKAE + L
Sbjct: 806 LDSLYLLRQAWGACAAEIFKAQSVAASEAIPLLQYLIQSGPPRFQEKAELLLQCLPGTLT 865
Query: 66 MIVKRGNNMRQCVGNQG---KITL-------------EANQEWDERFTY---------MV 100
+ +KRGNN+RQ VGN K+TL A EWDE F + +
Sbjct: 866 VTIKRGNNLRQSVGNPSAFCKLTLGNNPPRLTKIVSTGATPEWDEAFAWAFDSPPKGQKL 925
Query: 101 L*ECSSRT--------EASYLLQKQA*SGK-ADEHTLLPTSKSGQPRNLEVELKWSNK 149
C + + + + + + G A E+TLLP SKSG RNLE+E +WSNK
Sbjct: 926 HISCKNNSKFGKKSFGKVTIQIDRVVMLGSVAGEYTLLPESKSGPNRNLEIEFQWSNK 983
>pir||C84609 hypothetical protein At2g22130 [imported] - Arabidopsis thaliana
Length = 2048
Score = 92.0 bits (227), Expect = 6e-18
Identities = 55/87 (63%), Positives = 60/87 (68%), Gaps = 10/87 (11%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+DALFLL Q WSACP EVSR QS AAA AIPLLQ LIQ GP F EKAEF+ LV
Sbjct: 1922 LDALFLLRQAWSACPAEVSRAQSVAAADAIPLLQYLIQSGPPRFQEKAEFLLQCLPGTLV 1981
Query: 66 MIVKRGNNMRQCVGNQG---KITLEAN 89
+ +KRGNNM+Q VGN KITL N
Sbjct: 1982 VTIKRGNNMKQSVGNPSVFCKITLGNN 2008
>dbj|BAC42703.1| unknown protein [Arabidopsis thaliana]
Length = 434
Score = 50.8 bits (120), Expect = 2e-05
Identities = 46/179 (25%), Positives = 77/179 (42%), Gaps = 42/179 (23%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+D L+LL W+ ++V++ Q+ AA AIP+LQ L++ P F +KA+ + L
Sbjct: 250 LDILYLLRHSWTNMSIDVAKSQAMIAAEAIPVLQMLMKTCPPRFHDKADSLLHCLPGCLT 309
Query: 66 MIVKRGNNMRQ---------------CVGNQGKITLEA-NQEWDERFTY---------MV 100
+ V R NN++Q C Q K+ + EW E FT+ +
Sbjct: 310 VNVMRANNLKQSMATTNAFCQLTIGNCPPRQTKVVSNSTTPEWKEGFTWAFDVPPKGQKL 369
Query: 101 L*ECSSRT--------EASYLLQKQA*SGKADEHTLL--PTSKSGQPRNLEVELKWSNK 149
C S++ + + K G+ L SK R+L++E+ WSN+
Sbjct: 370 HIICKSKSTFGKTTLGRVTIQIDKVVTEGEYSGSLSLNHENSKDASSRSLDIEIAWSNR 428
>ref|NP_177870.1| C2 domain-containing protein / armadillo/beta-catenin repeat family
protein [Arabidopsis thaliana] gi|12323397|gb|AAG51678.1|
unknown protein; 15069-22101 [Arabidopsis thaliana]
gi|25406506|pir||H96803 unknown protein T5M16.5
[imported] - Arabidopsis thaliana
Length = 2110
Score = 50.8 bits (120), Expect = 2e-05
Identities = 46/179 (25%), Positives = 77/179 (42%), Gaps = 42/179 (23%)
Query: 13 MDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAEFI-------LV 65
+D L+LL W+ ++V++ Q+ AA AIP+LQ L++ P F +KA+ + L
Sbjct: 1926 LDILYLLRHSWTNMSIDVAKSQAMIAAEAIPVLQMLMKTCPPRFHDKADSLLHCLPGCLT 1985
Query: 66 MIVKRGNNMRQ---------------CVGNQGKITLEA-NQEWDERFTY---------MV 100
+ V R NN++Q C Q K+ + EW E FT+ +
Sbjct: 1986 VNVMRANNLKQSMATTNAFCQLTIGNCPPRQTKVVSNSTTPEWKEGFTWAFDVPPKGQKL 2045
Query: 101 L*ECSSRT--------EASYLLQKQA*SGKADEHTLL--PTSKSGQPRNLEVELKWSNK 149
C S++ + + K G+ L SK R+L++E+ WSN+
Sbjct: 2046 HIICKSKSTFGKTTLGRVTIQIDKVVTEGEYSGSLSLNHENSKDASSRSLDIEIAWSNR 2104
>ref|NP_175078.1| C2 domain-containing protein / armadillo/beta-catenin repeat family
protein [Arabidopsis thaliana] gi|12320824|gb|AAG50555.1|
hypothetical protein [Arabidopsis thaliana]
gi|25405158|pir||E96505 hypothetical protein T7O23.25
[imported] - Arabidopsis thaliana
Length = 2114
Score = 43.9 bits (102), Expect = 0.002
Identities = 31/112 (27%), Positives = 53/112 (46%), Gaps = 24/112 (21%)
Query: 11 SWMDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQF-----GPVLFFEKAEFI-- 63
S MD ++ L Q W+ P E +R Q+ AA AIP+LQ +++ P F E+ +
Sbjct: 1930 SAMDTIYTLRQSWTTMPTETARSQAVLAADAIPVLQLMMKSKLKSPAPSSFHERGNSLLN 1989
Query: 64 -----LVMIVKRGNNMRQ-----------CVGNQGKITLEANQE-WDERFTY 98
L + +KRG+N+++ C + K+ ++ W E FT+
Sbjct: 1990 CLPGSLTVAIKRGDNLKRSNAFCRLIIDNCPTKKTKVVKRSSSPVWKESFTW 2041
>gb|AAM15466.1| unknown protein [Arabidopsis thaliana] gi|20198149|gb|AAM15432.1|
unknown protein [Arabidopsis thaliana]
Length = 109
Score = 40.8 bits (94), Expect = 0.016
Identities = 19/27 (70%), Positives = 23/27 (84%), Gaps = 1/27 (3%)
Query: 123 ADEHTLLPTSKSGQPRNLEVELKWSNK 149
A E++LLP SKSG PRNLE+E +WSNK
Sbjct: 84 AGEYSLLPESKSG-PRNLEIEFQWSNK 109
>gb|EAL87727.1| hypothetical protein Afu1g00130 [Aspergillus fumigatus Af293]
Length = 249
Score = 32.0 bits (71), Expect = 7.3
Identities = 14/45 (31%), Positives = 22/45 (48%)
Query: 11 SWMDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVL 55
SW DA+ L G ++CPV + + Y + + + GPVL
Sbjct: 143 SWFDAVLLASAGTTSCPVRLGNGSAIIQVYGLTKGMSALGIGPVL 187
>gb|EAA77132.1| hypothetical protein FG09575.1 [Gibberella zeae PH-1]
gi|46136119|ref|XP_389751.1| hypothetical protein
FG09575.1 [Gibberella zeae PH-1]
Length = 3213
Score = 31.6 bits (70), Expect = 9.5
Identities = 17/63 (26%), Positives = 34/63 (52%)
Query: 64 LVMIVKRGNNMRQCVGNQGKITLEANQEWDERFTYMVL*ECSSRTEASYLLQKQA*SGKA 123
L ++ R N+ +G+ T+E ++ + FTY++L + S + S+L++ A A
Sbjct: 1518 LASLIHRTGNIASELGHVFSQTMEKHENRYQYFTYLLLEQGESINDPSFLVRPSAVLRSA 1577
Query: 124 DEH 126
D+H
Sbjct: 1578 DDH 1580
>ref|XP_416549.1| PREDICTED: similar to DNA polymerase theta isoform 1; polymerase
(DNA-directed), theta [Gallus gallus]
Length = 3066
Score = 31.6 bits (70), Expect = 9.5
Identities = 23/75 (30%), Positives = 37/75 (48%), Gaps = 6/75 (8%)
Query: 6 HVKKLSWMDALFLLIQGWSACPVEVSRDQSN---AAAYAIP--LLQNLIQFGPVLFFE-K 59
H SW+ L +QGW V V DQ++ A++ +P +L+ G V FE +
Sbjct: 439 HRVSWSWLADGLLQMQGWQCQQVNVPEDQADKLLLASWGLPKAVLEKYHSLGVVQMFEWQ 498
Query: 60 AEFILVMIVKRGNNM 74
AE +++ V G N+
Sbjct: 499 AECLMLGQVLEGKNL 513
>gb|AAC64636.1| sensory transduction histidine kinase [Mastigocladus laminosus]
Length = 308
Score = 31.6 bits (70), Expect = 9.5
Identities = 22/92 (23%), Positives = 43/92 (45%), Gaps = 10/92 (10%)
Query: 2 QIQWHVKKLSWMDALFLLIQGWSACPVEVSRDQSNAAAYAIPLLQNLIQFGPVLFFEKAE 61
+++WH + LS + + L + SR ++ A +P ++ LI L +
Sbjct: 135 RVEWHPESLSLQECVDLAL----------SRIRTRVATDPLPQIKTLISPNLPLVKADGD 184
Query: 62 FILVMIVKRGNNMRQCVGNQGKITLEANQEWD 93
+++ +I K +N + QG+IT+ AN D
Sbjct: 185 WLVEVIAKLVDNACKFTPPQGQITITANPNSD 216
>emb|CAE67628.1| Hypothetical protein CBG13182 [Caenorhabditis briggsae]
Length = 700
Score = 31.6 bits (70), Expect = 9.5
Identities = 14/43 (32%), Positives = 23/43 (52%)
Query: 43 PLLQNLIQFGPVLFFEKAEFILVMIVKRGNNMRQCVGNQGKIT 85
P +I FG FF FI++ I GN+ +Q + ++G+ T
Sbjct: 20 PSCNKIISFGVYSFFIVVNFIVLCIPNTGNHYQQMIDDEGRTT 62
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.328 0.139 0.431
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 274,637,062
Number of Sequences: 2540612
Number of extensions: 9799176
Number of successful extensions: 20817
Number of sequences better than 10.0: 14
Number of HSP's better than 10.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 20788
Number of HSP's gapped (non-prelim): 24
length of query: 173
length of database: 863,360,394
effective HSP length: 119
effective length of query: 54
effective length of database: 561,027,566
effective search space: 30295488564
effective search space used: 30295488564
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 70 (31.6 bits)
Medicago: description of AC141115.10