
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0299.12
(418 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g47700 unknown protein 226 1e-59
At4g13490 putative protein 84 1e-16
At2g15260 hypothetical protein 80 2e-15
At1g51930 hypothetical protein 46 4e-05
At2g28920 hypothetical protein 45 6e-05
At5g41430 unknown protein 43 3e-04
At2g38970 putative retroelement pol polyprotein 42 5e-04
At5g03450 unknown protein 42 6e-04
At4g09120 putative protein 42 6e-04
At1g08050 unknown protein 42 8e-04
At4g18110 hypothetical protein 41 0.001
At5g40250 RING finger -like protein 41 0.001
At5g01880 unknown protein 40 0.003
At2g17450 pseudogene 40 0.003
At3g14320 putative zinc finger protein 39 0.005
At4g09130 putative protein 39 0.007
At5g51450 unknown protein 38 0.009
At4g25230 unknown protein 38 0.009
At4g09110 putative protein 38 0.009
At3g62690 RING-H2 zinc finger protein ATL5 38 0.009
>At2g47700 unknown protein
Length = 358
Score = 226 bits (577), Expect = 1e-59
Identities = 150/377 (39%), Positives = 195/377 (50%), Gaps = 71/377 (18%)
Query: 28 VSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWLYANG 87
VSCSICLE V D+G RS AKL CGHQFHLDCIGSAFN+KGAMQCPNCR +EKGQWLYANG
Sbjct: 36 VSCSICLESVLDDGTRSKAKLQCGHQFHLDCIGSAFNMKGAMQCPNCRNVEKGQWLYANG 95
Query: 88 G-RSYPEFNMDDWTYDEDLYDLSYSELQSFGVHWCPFGNLARLPSPFDH----------D 136
R +PEF+M+DW +EDLY LSY E+Q + VHWCPFG L++ + F+ +
Sbjct: 96 STRPFPEFSMEDWIPEEDLYGLSYPEMQ-YRVHWCPFGELSQAAASFEELEPATTTYHTE 154
Query: 137 MLGQHAIFAEHTAVSSGSHHCPYIAYVGP---IHPSSSNSGGTVSEASNFNHWNGSSVPN 193
G HA H+ Y+AYVGP P +S++ T + + WN S N
Sbjct: 155 FHGHHAAAVNHS----------YLAYVGPGPAATPRTSDNNST-----DDHPWNSHS--N 197
Query: 194 DMPASFTFPAVDLHYHGWEHHSPPFSTASSRLVAADQPSVSPGNQRPVRGGSDVPRSGSF 253
D F V YH HHSP FS ++ +V + S R +
Sbjct: 198 D---HFHQLPVAPQYH---HHSPSFSLPAAHVVDGEV-------------DSSAARGLPY 238
Query: 254 MHPFLVGHSSAARAGSSVASSLIPPYPGSNARARDRVQALQAYYQPQQPPNSTTMRTPIA 313
HPFL H S R+ ++ S Y GS+ + R++ A + + Q N T+ +P+
Sbjct: 239 AHPFLFSHRSNQRSSPAINS-----YQGSSTQMREQHHAYN-HQRQQHHANGPTLASPLI 292
Query: 314 SGTRRPSIHGGSAPLAPVATSLDQSGGFFLIPPSSSGRNFQEENHLPSRYHAWERDHLPS 373
S TRR G P P DQ+ GFF+ P G + + E + HAWERD P
Sbjct: 293 SMTRR-----GLPPPPPPPPMPDQNVGFFIYP----GGHHEPET---DQIHAWERDWFPH 340
Query: 374 LSL--NHVDRDSGWRAY 388
+ NH S W +
Sbjct: 341 FPVPSNHRTIPSLWHRH 357
>At4g13490 putative protein
Length = 226
Score = 84.3 bits (207), Expect = 1e-16
Identities = 35/62 (56%), Positives = 48/62 (76%), Gaps = 1/62 (1%)
Query: 25 FGSVSCSICLEVVADNGD-RSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWL 83
F C++CLE +A++ D R+ KL C H+FHLDC+GS+FNIK M+CP CR+IEKG+WL
Sbjct: 17 FDDDDCAVCLEPLANDADERTVVKLRCSHKFHLDCVGSSFNIKNKMECPCCRQIEKGKWL 76
Query: 84 YA 85
+A
Sbjct: 77 FA 78
>At2g15260 hypothetical protein
Length = 362
Score = 80.5 bits (197), Expect = 2e-15
Identities = 46/110 (41%), Positives = 60/110 (53%), Gaps = 9/110 (8%)
Query: 30 CSICLEVVADNGD--RSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWLYANG 87
CSIC + + D R+ L C H+FHLDCIGSA+N KG M+CPNCR IE G W +++G
Sbjct: 123 CSICRGALVNENDVQRTLVTLKCVHKFHLDCIGSAYNAKGFMECPNCRNIEPGHWQFSDG 182
Query: 88 GRSYPE---FNMDDWTYDEDLYDLSYSELQSFGVHWCPFGNLARLPSPFD 134
P N ++ D D S ++S CPFG L + PFD
Sbjct: 183 THFNPNGMIANNEEEEEDNDPGSFSQMIVKS---EVCPFGCLGQ-RYPFD 228
>At1g51930 hypothetical protein
Length = 132
Score = 45.8 bits (107), Expect = 4e-05
Identities = 31/71 (43%), Positives = 38/71 (52%), Gaps = 13/71 (18%)
Query: 7 EDEGNQEVDGGDGGGGKGFGSVSCSICLEVVADNGDRSWAKL-HCGHQFHLDCIGSAFNI 65
E+EG +E +GG GK F C ICLE D D +L +CGH FHL CI S
Sbjct: 65 EEEGGREEEGG----GKRF----CPICLEEYED--DHQIRRLRNCGHVFHLLCIDSWLTQ 114
Query: 66 KGAMQCPNCRK 76
K CP+CR+
Sbjct: 115 K--QNCPSCRR 123
>At2g28920 hypothetical protein
Length = 145
Score = 45.4 bits (106), Expect = 6e-05
Identities = 27/60 (45%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 16 GGDGGGGKGFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCR 75
GGDGG G G + C ICLE N D + C H FH+DCI S K + CP CR
Sbjct: 79 GGDGGDGDGVKADVCVICLEDFKVN-DVVRVLVRCKHVFHVDCIDSWCFYK--LTCPICR 135
>At5g41430 unknown protein
Length = 161
Score = 43.1 bits (100), Expect = 3e-04
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 23 KGFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCR 75
+GF + CSICLE + D + K C H FH CI S +K CPNCR
Sbjct: 110 EGFDEIGCSICLEELEDGHEIIRIK-KCRHVFHRSCIDSW--LKQNRSCPNCR 159
>At2g38970 putative retroelement pol polyprotein
Length = 692
Score = 42.4 bits (98), Expect = 5e-04
Identities = 21/50 (42%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 27 SVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIK-GAMQCPNCR 75
S +CSICL + + G + C H FH CI S N+K G CP CR
Sbjct: 69 SKTCSICLNKMKEGGGHALFTAECSHSFHFHCIAS--NVKHGNQVCPVCR 116
>At5g03450 unknown protein
Length = 630
Score = 42.0 bits (97), Expect = 6e-04
Identities = 27/89 (30%), Positives = 35/89 (38%), Gaps = 14/89 (15%)
Query: 3 LGNNEDEGNQEVDGGDGGGGKGFGS-------------VSCSICLEVVADNGDRSWAKLH 49
+ NE EG + + G GS +SCSIC+EV G L
Sbjct: 80 VSQNESEGIENQNRSQGVCSSSSGSQEDVEWKQGDTEGLSCSICMEVWTSGGQHQVCCLP 139
Query: 50 CGHQFHLDCIGSAF-NIKGAMQCPNCRKI 77
CGH + CI F + +CP C KI
Sbjct: 140 CGHLYGYSCINKWFQQRRSGGKCPLCNKI 168
>At4g09120 putative protein
Length = 345
Score = 42.0 bits (97), Expect = 6e-04
Identities = 27/74 (36%), Positives = 30/74 (40%), Gaps = 8/74 (10%)
Query: 24 GFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWL 83
G G V C+ICL D W C H FH +CI + CP CR L
Sbjct: 117 GKGGVECAICLSEFEDQETLRWMP-PCSHTFHANCID--VWLSSWSTCPVCRAN-----L 168
Query: 84 YANGGRSYPEFNMD 97
G SYP NMD
Sbjct: 169 SLKPGESYPYLNMD 182
>At1g08050 unknown protein
Length = 641
Score = 41.6 bits (96), Expect = 8e-04
Identities = 23/72 (31%), Positives = 29/72 (39%), Gaps = 3/72 (4%)
Query: 5 NNEDEGNQEVDGGDGGGGKGFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFN 64
N D + G F C+ICL + ++ C H FH DCI S N
Sbjct: 44 NRPDSSSSRPGSTPSSSGLRFSKGRCAICLYEIRKEDGKAIFTAECSHSFHFDCITS--N 101
Query: 65 IK-GAMQCPNCR 75
+K G CP CR
Sbjct: 102 VKHGNRICPLCR 113
>At4g18110 hypothetical protein
Length = 213
Score = 41.2 bits (95), Expect = 0.001
Identities = 22/47 (46%), Positives = 25/47 (52%), Gaps = 3/47 (6%)
Query: 30 CSICLEVVADNG-DRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCR 75
C ICLE +A +G +R KL C H FH DCI K CP CR
Sbjct: 155 CIICLEELASSGSERRIMKLLCSHSFHKDCILPWLRCK--RSCPTCR 199
>At5g40250 RING finger -like protein
Length = 376
Score = 40.8 bits (94), Expect = 0.001
Identities = 27/89 (30%), Positives = 38/89 (42%), Gaps = 7/89 (7%)
Query: 16 GGDGGGGKGFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCR 75
GG GG G C++CL ++ D+ C H FHL+CI + ++ CP CR
Sbjct: 129 GGGGGNGAAQEPFDCAVCLCEFSEK-DKLRLLPMCSHAFHLNCIDTW--LQSNSTCPLCR 185
Query: 76 KIEKGQWLYANGGRSYPEFNMDDWTYDED 104
G P F+ DD DE+
Sbjct: 186 ----GTLFSPGFSMENPMFDFDDIREDEE 210
>At5g01880 unknown protein
Length = 159
Score = 39.7 bits (91), Expect = 0.003
Identities = 21/57 (36%), Positives = 27/57 (46%), Gaps = 3/57 (5%)
Query: 19 GGGGKGFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCR 75
G G + C+ICL AD G+R C H FH+ CI + + CPNCR
Sbjct: 94 GSGEVKIAATECAICLGEFAD-GERVRVLPPCNHSFHMSCIDTW--LVSHSSCPNCR 147
>At2g17450 pseudogene
Length = 185
Score = 39.7 bits (91), Expect = 0.003
Identities = 21/59 (35%), Positives = 29/59 (48%), Gaps = 3/59 (5%)
Query: 19 GGGGKGFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKI 77
G + S C+ICL AD + L CGH FH++CI + CP+CR+I
Sbjct: 91 GAAAEEGDSTECAICLTDFADGEEIRVLPL-CGHSFHVECIDKW--LVSRSSCPSCRRI 146
>At3g14320 putative zinc finger protein
Length = 204
Score = 38.9 bits (89), Expect = 0.005
Identities = 19/49 (38%), Positives = 28/49 (56%), Gaps = 3/49 (6%)
Query: 28 VSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRK 76
+ C +CL +AD GD++ C H FH++CI S ++ CP CRK
Sbjct: 86 LECVVCLSELAD-GDKARVLPSCDHWFHVECIDSW--LQSNSTCPICRK 131
>At4g09130 putative protein
Length = 357
Score = 38.5 bits (88), Expect = 0.007
Identities = 25/74 (33%), Positives = 31/74 (41%), Gaps = 8/74 (10%)
Query: 24 GFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWL 83
G G V C+ICL D W C H FH +CI + + CP CR L
Sbjct: 114 GNGGVECAICLCEFEDEEPLRWMP-PCSHTFHANCIDEWLSSRST--CPVCRAN-----L 165
Query: 84 YANGGRSYPEFNMD 97
G S+P +MD
Sbjct: 166 SLKSGDSFPHPSMD 179
>At5g51450 unknown protein
Length = 577
Score = 38.1 bits (87), Expect = 0.009
Identities = 19/49 (38%), Positives = 25/49 (50%), Gaps = 7/49 (14%)
Query: 30 CSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFN--IKGAMQCPNCRK 76
C+IC E +A +LHC H FHL C+ S + + CP CRK
Sbjct: 337 CAICREPMAKA-----KRLHCNHLFHLGCLRSWLDQGLNEVYSCPTCRK 380
>At4g25230 unknown protein
Length = 578
Score = 38.1 bits (87), Expect = 0.009
Identities = 19/49 (38%), Positives = 25/49 (50%), Gaps = 7/49 (14%)
Query: 30 CSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFN--IKGAMQCPNCRK 76
C+IC E +A +LHC H FHL C+ S + + CP CRK
Sbjct: 337 CAICREPMAKA-----KRLHCNHLFHLGCLRSWLDQGLNEVYSCPTCRK 380
>At4g09110 putative protein
Length = 302
Score = 38.1 bits (87), Expect = 0.009
Identities = 24/69 (34%), Positives = 28/69 (39%), Gaps = 8/69 (11%)
Query: 24 GFGSVSCSICLEVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWL 83
G G V C+ICL D W C H FH +CI + + CP CR L
Sbjct: 117 GKGGVECAICLSEFVDKETLRWMP-PCSHTFHANCIDVWLSSQST--CPACRAN-----L 168
Query: 84 YANGGRSYP 92
G SYP
Sbjct: 169 SLKPGESYP 177
>At3g62690 RING-H2 zinc finger protein ATL5
Length = 257
Score = 38.1 bits (87), Expect = 0.009
Identities = 21/47 (44%), Positives = 27/47 (56%), Gaps = 5/47 (10%)
Query: 30 CSICL-EVVADNGDRSWAKLHCGHQFHLDCIGSAFNIKGAMQCPNCR 75
CS+CL E D+ R K CGH FH+DCI + F + + CP CR
Sbjct: 113 CSVCLSEFEEDDEGRVLPK--CGHVFHVDCIDTWFRSRSS--CPLCR 155
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.316 0.133 0.426
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,326,387
Number of Sequences: 26719
Number of extensions: 558303
Number of successful extensions: 2893
Number of sequences better than 10.0: 213
Number of HSP's better than 10.0 without gapping: 32
Number of HSP's successfully gapped in prelim test: 182
Number of HSP's that attempted gapping in prelim test: 2747
Number of HSP's gapped (non-prelim): 288
length of query: 418
length of database: 11,318,596
effective HSP length: 102
effective length of query: 316
effective length of database: 8,593,258
effective search space: 2715469528
effective search space used: 2715469528
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0299.12