
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC137838.2 - phase: 0
(137 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At5g19330 putative protein 152 4e-38
At5g13060 unknown protein 79 6e-16
At3g20170 unknown protein 32 0.079
At3g20200 protein kinase like protein 32 0.10
At1g23180 unknown protein 32 0.13
At2g23140 hypothetical protein 31 0.18
At2g05810 hypothetical protein 31 0.18
At2g14300 pseudogene; similar to MURA transposase of maize Muta... 30 0.30
At1g28390 putative protein 30 0.39
At4g34940 putative protein 28 1.5
At5g52000 importin alpha subunit 28 1.9
At5g24420 6-phosphogluconolactonase-like protein 28 1.9
At2g44900 hypothetical protein 27 2.5
At1g36170 acetyl-CoA carboxylase, putative, 5' partial 27 2.5
At1g36160 hypothetical protein, 3' partial 27 2.5
At1g71020 unknown protein 27 3.3
At1g61220 unknown protein 27 3.3
At5g27610 putative protein 27 4.3
At3g60350 Arm repeat containing protein - like 27 4.3
At3g43020 putative protein 27 4.3
>At5g19330 putative protein
Length = 636
Score = 152 bits (385), Expect = 4e-38
Identities = 85/135 (62%), Positives = 102/135 (74%), Gaps = 5/135 (3%)
Query: 4 QRRQGHCLSERKGQKRKLDE--ELPEDRQISSAPPTADERAALLVEVANQVTVLESTFTW 61
+RR+G ERKGQKRKL+E EDR+IS+ + D ALL EVA QV+VL S F+W
Sbjct: 6 ERREGRSFPERKGQKRKLEEGAAAVEDREISAV--STDGGQALLSEVAAQVSVLNSAFSW 63
Query: 62 NEADRAAAKRATHALADLAKN-EEVVNVIVEGGAIPALIKHLQAPPVTDCVQKPLPFEHE 120
E+DRAAAKRAT LA+LAKN E++VNVIV+GGA+PAL+ HLQAPP D P+EHE
Sbjct: 64 QESDRAAAKRATQVLAELAKNAEDLVNVIVDGGAVPALMTHLQAPPYNDGDLAEKPYEHE 123
Query: 121 VEKGSAFALGLLAVK 135
VEKGSAFALGLLA+K
Sbjct: 124 VEKGSAFALGLLAIK 138
Score = 26.9 bits (58), Expect = 3.3
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 12/45 (26%)
Query: 89 IVEGGAIPALIKHLQAPPVTDCVQKPLPFEHEVEKGSAFALGLLA 133
IV+ GA+ LI+ LQ+P V ++++ SAFALG LA
Sbjct: 321 IVQRGAVRPLIEMLQSPDV------------QLKEMSAFALGRLA 353
Score = 25.4 bits (54), Expect = 9.6
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 64 ADRAAAKRATHALADLA-KNEEVVNVIVEGGAIPALI 99
+D + A AL LA KN++ N IVE A+P LI
Sbjct: 211 SDSKVQRAAAGALRTLAFKNDDNKNQIVECNALPTLI 247
>At5g13060 unknown protein
Length = 709
Score = 79.3 bits (194), Expect = 6e-16
Identities = 50/129 (38%), Positives = 70/129 (53%), Gaps = 1/129 (0%)
Query: 5 RRQGHCLSERKGQKRKLDEELPEDRQISSAPPTADERAALLVEVANQVTVLESTFTWNEA 64
+RQ + KRKL ++ DE L+V + V VL S+F+ +
Sbjct: 6 KRQRTTRLAARNLKRKLSHNTDGAPIVTQLIDIDDEPIDLVVAIRRHVEVLNSSFSDPDF 65
Query: 65 DRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHLQAPPVTDCVQKPLPFEHEVEKG 124
D A K A +ADLAK +E V +IVE GAIPAL+++L++P V C P EH++EK
Sbjct: 66 DHEAVKEAAADIADLAKIDENVEIIVENGAIPALVRYLESPLVV-CGNVPKSCEHKLEKD 124
Query: 125 SAFALGLLA 133
A ALGL+A
Sbjct: 125 CALALGLIA 133
>At3g20170 unknown protein
Length = 475
Score = 32.3 bits (72), Expect = 0.079
Identities = 22/70 (31%), Positives = 33/70 (46%), Gaps = 3/70 (4%)
Query: 30 QISSAPPTADERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVVNVI 89
+I SA T E +LV ++E+ N A R +RA HA+ + ++
Sbjct: 174 EILSALTTIRESRRVLVHSGGLKFLVEAAKVGNLASR---ERACHAIGLIGVTRRARRIL 230
Query: 90 VEGGAIPALI 99
VE G IPAL+
Sbjct: 231 VEAGVIPALV 240
Score = 27.3 bits (59), Expect = 2.5
Identities = 23/61 (37%), Positives = 28/61 (45%), Gaps = 1/61 (1%)
Query: 44 LLVEVANQVTVLESTFTWNEA-DRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHL 102
L V N V + E A D A A+ L DLA V+VI GAIP LI+ L
Sbjct: 302 LAVAEGNAVLIAEQLVRILRAGDNEAKLAASDVLWDLAGYRHSVSVIRGSGAIPLLIELL 361
Query: 103 Q 103
+
Sbjct: 362 R 362
>At3g20200 protein kinase like protein
Length = 780
Score = 32.0 bits (71), Expect = 0.10
Identities = 27/116 (23%), Positives = 48/116 (41%), Gaps = 7/116 (6%)
Query: 14 RKGQ-KRKLDEELPEDRQISSAPPTADERAALLVEVANQVTVLESTFTWNEADRAAAKRA 72
++GQ K + + E + A A E A + E+ Q +E+ +++++ +
Sbjct: 383 QEGQLSEKSTKSIVEKERAHKAAKDASETAGKIAELETQRRAIEAAGSFSDSSLRYRRYV 442
Query: 73 THALADLAKNEEVVNVIVEGGAIPALIKHLQAPPV------TDCVQKPLPFEHEVE 122
+ + + + N I EGG P +L PV D VQ F+ EVE
Sbjct: 443 IGEIEEATNSFDKANKIGEGGYGPVYKGYLDHTPVAIKALKADAVQGRSQFQREVE 498
>At1g23180 unknown protein
Length = 452
Score = 31.6 bits (70), Expect = 0.13
Identities = 24/78 (30%), Positives = 39/78 (49%), Gaps = 9/78 (11%)
Query: 28 DRQISSAPPTADERAALLV---EVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEE 84
+++I S P ++ LL VA V +L AD AA RA ++AD + NE+
Sbjct: 22 EKEIKSEGPGKSQQLTLLPCVDGVARLVLIL------GLADELAATRAAESIADASINED 75
Query: 85 VVNVIVEGGAIPALIKHL 102
+ +E GA+ L++ L
Sbjct: 76 MRVSFMEAGAVKPLVQLL 93
>At2g23140 hypothetical protein
Length = 924
Score = 31.2 bits (69), Expect = 0.18
Identities = 28/99 (28%), Positives = 42/99 (42%), Gaps = 6/99 (6%)
Query: 11 LSERKGQKRKLDEELPEDRQISSAPPTADERAALLVEVANQVTVLES------TFTWNEA 64
LSE + Q +KL EEL + TA+ R + N++ + S
Sbjct: 615 LSEVETQVKKLVEELKSSSLDTQRQATAELRLLAKHNMDNRIVIGNSGAIVLLVELLYST 674
Query: 65 DRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHLQ 103
D A + A AL +L+ N+ I + GAI LI L+
Sbjct: 675 DSATQENAVTALLNLSINDNNKKAIADAGAIEPLIHVLE 713
>At2g05810 hypothetical protein
Length = 580
Score = 31.2 bits (69), Expect = 0.18
Identities = 24/70 (34%), Positives = 35/70 (49%), Gaps = 10/70 (14%)
Query: 37 TADERAALLVEVANQVTVLESTFTWNEADRAAAKRATH----ALADLAKNEEVVNVIVEG 92
TAD A + VTVL EA R+ +K+ A++++A EE+ + E
Sbjct: 272 TADPATAWAISAYGGVTVLI------EACRSGSKQVQEHIAGAISNIAAVEEIRTTLAEE 325
Query: 93 GAIPALIKHL 102
GAIP LI+ L
Sbjct: 326 GAIPVLIQLL 335
>At2g14300 pseudogene; similar to MURA transposase of maize Mutator
transposon
Length = 1230
Score = 30.4 bits (67), Expect = 0.30
Identities = 23/90 (25%), Positives = 40/90 (43%)
Query: 43 ALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHL 102
++L V N T + F N+ D AAK + + V I + + LI ++
Sbjct: 623 SVLYRVNNDETQFIAWFELNKRDPEAAKLLYEQIPNFYTINHVPPKIDDAYHLRILINNI 682
Query: 103 QAPPVTDCVQKPLPFEHEVEKGSAFALGLL 132
+AP D ++ H+ + + +ALGLL
Sbjct: 683 RAPKGFDDIKTVEGVVHKTYRDACYALGLL 712
>At1g28390 putative protein
Length = 470
Score = 30.0 bits (66), Expect = 0.39
Identities = 19/58 (32%), Positives = 31/58 (52%)
Query: 37 TADERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVVNVIVEGGA 94
TA +R +L V TV + + WN+ R + +R+ + LA + EE+ IV GG+
Sbjct: 316 TAKKRPDMLEVVECLKTVRQLSPAWNKLRRRSEERSENVLAVEEEKEEIHVRIVRGGS 373
>At4g34940 putative protein
Length = 664
Score = 28.1 bits (61), Expect = 1.5
Identities = 22/74 (29%), Positives = 38/74 (50%), Gaps = 7/74 (9%)
Query: 27 EDRQISSAPPTADERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVV 86
+D + P A+E +L + QV +L FT + DR+ A + +LA N+
Sbjct: 129 DDEYLGLPPIAANE--PILCLIWEQVAIL---FTGSLDDRSDAAASLVSLA--RDNDRYG 181
Query: 87 NVIVEGGAIPALIK 100
+I+E G +P+L+K
Sbjct: 182 RLIIEEGGVPSLLK 195
>At5g52000 importin alpha subunit
Length = 441
Score = 27.7 bits (60), Expect = 1.9
Identities = 14/42 (33%), Positives = 24/42 (56%), Gaps = 1/42 (2%)
Query: 65 DRAAAKRATHALADLAK-NEEVVNVIVEGGAIPALIKHLQAP 105
D K A AL L++ +E+ + ++E G +P L++ LQ P
Sbjct: 183 DEDVLKNACMALCHLSEGSEDGIQSVIEAGFVPKLVQILQLP 224
>At5g24420 6-phosphogluconolactonase-like protein
Length = 252
Score = 27.7 bits (60), Expect = 1.9
Identities = 15/55 (27%), Positives = 25/55 (45%)
Query: 60 TWNEADRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHLQAPPVTDCVQKP 114
T NE A A+ + A K + + V++ GG + + L PP D ++ P
Sbjct: 11 TKNELSEAMAEYTANLSAKFIKEKGLFTVVLSGGDLIDWLCKLVQPPYIDSIEWP 65
>At2g44900 hypothetical protein
Length = 930
Score = 27.3 bits (59), Expect = 2.5
Identities = 13/61 (21%), Positives = 28/61 (45%)
Query: 39 DERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVVNVIVEGGAIPAL 98
+E + + +++ F W +RA ALA+LA +++ + + G + AL
Sbjct: 510 EEHKNAIAQAGGVKALVDLIFRWPNGCDGVLERAAGALANLAADDKCSMEVAKAGGVHAL 569
Query: 99 I 99
+
Sbjct: 570 V 570
>At1g36170 acetyl-CoA carboxylase, putative, 5' partial
Length = 1865
Score = 27.3 bits (59), Expect = 2.5
Identities = 18/73 (24%), Positives = 34/73 (45%), Gaps = 5/73 (6%)
Query: 21 LDEELPEDRQISSAPPTADERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLA 80
L E D + + AP + + +V + NQ+++L+ + D A+ + LA +
Sbjct: 766 LRETKHNDYETAGAPLSGNMMHIAIVGINNQMSLLQ-----DSGDEDQAQERVNKLAKIL 820
Query: 81 KNEEVVNVIVEGG 93
K EEV + + G
Sbjct: 821 KEEEVSSSLCSAG 833
>At1g36160 hypothetical protein, 3' partial
Length = 2252
Score = 27.3 bits (59), Expect = 2.5
Identities = 18/73 (24%), Positives = 34/73 (45%), Gaps = 5/73 (6%)
Query: 21 LDEELPEDRQISSAPPTADERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLA 80
L E D + + AP + + +V + NQ+++L+ + D A+ + LA +
Sbjct: 1163 LRETKHNDYETAGAPLSGNMMHIAIVGINNQMSLLQ-----DSGDEDQAQERVNKLAKIL 1217
Query: 81 KNEEVVNVIVEGG 93
K EEV + + G
Sbjct: 1218 KEEEVSSSLCSAG 1230
>At1g71020 unknown protein
Length = 628
Score = 26.9 bits (58), Expect = 3.3
Identities = 23/72 (31%), Positives = 31/72 (42%), Gaps = 10/72 (13%)
Query: 49 ANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHLQAPPVT 108
A V L T + ++R A + T L+ LA N+ I+ AIP LI
Sbjct: 506 AGIVKPLVKMLTDSSSERMADEALT-ILSVLASNQVAKTAILRANAIPPLI--------- 555
Query: 109 DCVQKPLPFEHE 120
DC+QK P E
Sbjct: 556 DCLQKDQPRNRE 567
>At1g61220 unknown protein
Length = 873
Score = 26.9 bits (58), Expect = 3.3
Identities = 25/105 (23%), Positives = 40/105 (37%), Gaps = 8/105 (7%)
Query: 11 LSERKGQKRKLDEELPEDRQISSAPPTADERAALLVEVANQVTVLESTFT--------WN 62
L E ++ D E+ QIS D L+ + + L+S T W
Sbjct: 677 LDETTKTQQSSDILTQEEPQISGREDDGDIWEILMRTHSEVLNTLQSRLTKLQIVRHFWE 736
Query: 63 EADRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHLQAPPV 107
+D A A L+D + +V+N++ + I L Q PV
Sbjct: 737 RSDIKGAIAALRKLSDHSVQADVINILTDKTEILTLDLFSQLAPV 781
>At5g27610 putative protein
Length = 969
Score = 26.6 bits (57), Expect = 4.3
Identities = 21/80 (26%), Positives = 33/80 (41%), Gaps = 8/80 (10%)
Query: 7 QGHCLSE--RKGQKRKLDEELPEDRQISSAPPTADERAALLVEVANQVTVLESTFTWNEA 64
+ H SE RK KRK + LP D + PP + + V ++ L+ T + +
Sbjct: 122 EDHDASEVTRKHLKRKRPQVLPSDFREEVVPPHS------VASVEGCLSFLKQTQAYEKR 175
Query: 65 DRAAAKRATHALADLAKNEE 84
RA KR L + +
Sbjct: 176 QRATGKRTPRFLVAITHERD 195
>At3g60350 Arm repeat containing protein - like
Length = 928
Score = 26.6 bits (57), Expect = 4.3
Identities = 13/61 (21%), Positives = 27/61 (43%)
Query: 39 DERAALLVEVANQVTVLESTFTWNEADRAAAKRATHALADLAKNEEVVNVIVEGGAIPAL 98
+E + + +++ F W +RA ALA+LA +++ + G + AL
Sbjct: 501 EEHKNAIAQAGGVNALVDLIFRWPHGCDGVLERAAGALANLAADDKCSMEVARAGGVHAL 560
Query: 99 I 99
+
Sbjct: 561 V 561
>At3g43020 putative protein
Length = 366
Score = 26.6 bits (57), Expect = 4.3
Identities = 21/107 (19%), Positives = 44/107 (40%), Gaps = 4/107 (3%)
Query: 9 HCLSERKGQKRKLDEELPEDRQISSAPP----TADERAALLVEVANQVTVLESTFTWNEA 64
H + +K K + + R + + P + DE++ LL+E+ +V L + E
Sbjct: 35 HIVDRKKKLGVKFEAPKSKQRYSTCSKPELKLSDDEKSRLLIELQTKVEELSNRVMKLEK 94
Query: 65 DRAAAKRATHALADLAKNEEVVNVIVEGGAIPALIKHLQAPPVTDCV 111
R A +H L++ ++ V V + + P+T+ +
Sbjct: 95 TRKATSSQSHTLSEHVFGQQEVGQQVPNDNLDQAKDSSDSTPLTNTI 141
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.315 0.131 0.366
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,901,231
Number of Sequences: 26719
Number of extensions: 102546
Number of successful extensions: 367
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 340
Number of HSP's gapped (non-prelim): 42
length of query: 137
length of database: 11,318,596
effective HSP length: 89
effective length of query: 48
effective length of database: 8,940,605
effective search space: 429149040
effective search space used: 429149040
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 54 (25.4 bits)
Medicago: description of AC137838.2