
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147000.5 - phase: 0
(221 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g49890 unknown protein 191 3e-49
At2g36620 60S ribosomal protein L24 32 0.35
At2g17350 unknown protein 31 0.45
At3g05830 unknown protein 31 0.59
At2g36490 hypothetical protein 31 0.59
At5g53800 unknown protein 30 0.77
At5g52550 unknown protein 30 0.77
At5g59390 transcriptional regulator - like protein 30 1.3
At3g53020 60S ribosomal protein - like 30 1.3
At5g65180 unknown protein 29 1.7
At1g78790 unknown protein 29 1.7
At5g26770 unknown protein 29 2.2
At2g45890 hypothetical protein 29 2.2
At1g63105 putative protein 29 2.2
At1g36160 hypothetical protein, 3' partial 29 2.2
At4g12770 auxilin-like protein 28 2.9
At3g61260 putative DNA-binding protein 28 2.9
At3g26840 unknown protein 28 2.9
At1g30320 unknown protein 28 2.9
At3g57150 putative pseudouridine synthase (NAP57) 28 3.8
>At3g49890 unknown protein
Length = 220
Score = 191 bits (484), Expect = 3e-49
Identities = 101/222 (45%), Positives = 153/222 (68%), Gaps = 11/222 (4%)
Query: 1 MAEEELNNGYCSSSGDEDGDAAWKAAIDSVAGTSSYVTSFMNGFSATNNNDTKKKHNDNQ 60
MA+ EL+ G SS ED D W+AAI+S+A T+ Y G SAT T+ + +
Sbjct: 1 MAKRELSGGDSSS---EDEDPKWRAAINSIATTTVY------GASATKPAATQSHNYGDF 51
Query: 61 NPKSPKIKHYQLKALKLLDDILENTIEIVKEPIPVLDEDPNVDDCGIRLFRHSKPGIVFD 120
K K+ H Q+K LL++++E T++ V++P+ + ++ P +DCG+RLF+ GIVFD
Sbjct: 52 RLKPKKLTHGQIKVKNLLNEMVEKTLDFVEDPVNIPEDKPE-NDCGVRLFKRCATGIVFD 110
Query: 121 HADEPQPPMKRPKLVPGEDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKD 180
H DE + P K+P L P + ++ SK+F++R++SIAVDG+D++ AA +A KK+ ARL+AK+
Sbjct: 111 HVDEIRGPKKKPNLRPDKGVEGSSKEFKKRVKSIAVDGSDILTAAVEAAKKASARLDAKE 170
Query: 181 AAAKAKAKREEERIEKLKKIRGERWLPSMAKEMQAKIK-IKH 221
AAK KAK+EEERI +LKK+RGE+WLPS+ + M+ ++K IKH
Sbjct: 171 VAAKDKAKKEEERIAELKKVRGEKWLPSIERAMKKEMKRIKH 212
>At2g36620 60S ribosomal protein L24
Length = 164
Score = 31.6 bits (70), Expect = 0.35
Identities = 20/81 (24%), Positives = 41/81 (49%)
Query: 138 EDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEERIEKL 197
+D +++ K RRR + + A KK + E +DAA +A + +ERI+K
Sbjct: 63 KDAAQEAVKRRRRATKKPYSRSIVGATLEVIQKKRAEKPEVRDAAREAALREIKERIKKT 122
Query: 198 KKIRGERWLPSMAKEMQAKIK 218
K + + + +K+ ++++K
Sbjct: 123 KDEKKAKKVEYASKQQKSQVK 143
>At2g17350 unknown protein
Length = 117
Score = 31.2 bits (69), Expect = 0.45
Identities = 19/62 (30%), Positives = 32/62 (50%), Gaps = 1/62 (1%)
Query: 139 DIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEERIEKLK 198
DI ++ K R ++S + +A A+K L+R E + A +A+ K E E +EK K
Sbjct: 23 DIQDRIMK-EREMQSYIEEREREVAEREAAWKAELSRRETEIARQEARLKMERENLEKEK 81
Query: 199 KI 200
+
Sbjct: 82 SV 83
>At3g05830 unknown protein
Length = 336
Score = 30.8 bits (68), Expect = 0.59
Identities = 22/70 (31%), Positives = 35/70 (49%), Gaps = 9/70 (12%)
Query: 139 DIDEKSKKFRRRIRSIAVD----GNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEERI 194
D+DEK + FRR + S+A + L++ K+++ R E A+ + K E I
Sbjct: 15 DLDEKKESFRRNVVSLATELKQVRGRLVSQEQSFLKETITRKE-----AEKRGKNMEMEI 69
Query: 195 EKLKKIRGER 204
KL+K ER
Sbjct: 70 CKLQKRLEER 79
>At2g36490 hypothetical protein
Length = 1207
Score = 30.8 bits (68), Expect = 0.59
Identities = 39/167 (23%), Positives = 69/167 (40%), Gaps = 27/167 (16%)
Query: 4 EELNNGYCSSSGDEDGDAAWKAAIDSVAGTSSYVTSFMNGFSATNNNDTKKKHNDNQNPK 63
E L +GYCS + K +D+ VT + + + TK+K+
Sbjct: 378 EHLTSGYCSKPQQNN-----KILVDT------RVTVSKKKPTKSEKSQTKQKN------L 420
Query: 64 SPKIKHYQLKALKLLDDIL---ENTIEIVKEPIPVLDEDPNVDDCGIRLFRHSKPGIVFD 120
P + + L D L N+IE + E + +LD + + + + + ++F
Sbjct: 421 LPNLCRFPPSFTGLSPDELWKRRNSIETISELLRLLDINREHSETALVPYTMNSQIVLFG 480
Query: 121 H---ADEPQPPMKRPKLVPGEDIDEKS----KKFRRRIRSIAVDGND 160
A P P+K+P+ P D+D+++ K I S VDG+D
Sbjct: 481 GGAGAIVPVTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVDGSD 527
>At5g53800 unknown protein
Length = 339
Score = 30.4 bits (67), Expect = 0.77
Identities = 19/64 (29%), Positives = 31/64 (47%), Gaps = 1/64 (1%)
Query: 141 DEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEERIEKLKKI 200
D KS + RRR R + +D + + Y S E++D + K KR+E E+ ++
Sbjct: 90 DRKSSRSRRRRRDYSSSSSDSESESESEYSDS-EESESEDERRRRKRKRKEREEEEKERK 148
Query: 201 RGER 204
R R
Sbjct: 149 RRRR 152
>At5g52550 unknown protein
Length = 360
Score = 30.4 bits (67), Expect = 0.77
Identities = 23/79 (29%), Positives = 38/79 (47%), Gaps = 5/79 (6%)
Query: 131 RPKLVPG--EDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEA---KDAAAKA 185
R K V G + I + KK RR ++IA K +LEA +D+A A
Sbjct: 19 RKKKVKGVVDPIKQAEKKNRRLEKAIATSAAIRAELEKKKQMKKEGQLEAADEEDSADAA 78
Query: 186 KAKREEERIEKLKKIRGER 204
K K+E + +E++K+ ++
Sbjct: 79 KKKQERDELERIKQAENKK 97
>At5g59390 transcriptional regulator - like protein
Length = 561
Score = 29.6 bits (65), Expect = 1.3
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 4/65 (6%)
Query: 144 SKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEERI----EKLKK 199
+K ++ + + + +L D ++KSLA LEAK +A+ E+R E+++K
Sbjct: 215 NKNYQEGFQKMQMKMEELYQQVLDGHEKSLAELEAKREKLDERARLIEQRAIINEEEMEK 274
Query: 200 IRGER 204
R ER
Sbjct: 275 SRLER 279
>At3g53020 60S ribosomal protein - like
Length = 163
Score = 29.6 bits (65), Expect = 1.3
Identities = 20/79 (25%), Positives = 38/79 (47%)
Query: 138 EDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEERIEKL 197
+D +++ K RRR + + A KK + E +DAA +A + +ERI+K
Sbjct: 63 KDAAQEAVKRRRRATKKPYSRSIVGATLEVIQKKRAEKPEVRDAAREAALREIKERIKKT 122
Query: 198 KKIRGERWLPSMAKEMQAK 216
K + + + +K+ + K
Sbjct: 123 KDEKKAKKVEFASKQQKVK 141
>At5g65180 unknown protein
Length = 439
Score = 29.3 bits (64), Expect = 1.7
Identities = 21/63 (33%), Positives = 31/63 (48%), Gaps = 8/63 (12%)
Query: 143 KSKKFRRRIRSIAVDGNDLIAAANDAYKKSLAR-LEAKDAAAKAKAKREEERIEKLKKIR 201
K K RRIR + D D + A D K+SLA+ LE ++ + + +EKLK +
Sbjct: 191 KCKSAVRRIRKMEKDVEDACSTAKDPRKESLAKELEEEENILR-------QSVEKLKSVE 243
Query: 202 GER 204
R
Sbjct: 244 ESR 246
>At1g78790 unknown protein
Length = 104
Score = 29.3 bits (64), Expect = 1.7
Identities = 22/80 (27%), Positives = 33/80 (40%), Gaps = 14/80 (17%)
Query: 144 SKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEA--------------KDAAAKAKAKR 189
+++FR R RS A+D + A K LA A +A +A
Sbjct: 21 ARRFRERERSDAIDATEAEVALGTTKKNRLASANANALKLSCELLKSFVSEAVQRAAIIA 80
Query: 190 EEERIEKLKKIRGERWLPSM 209
E E +EK++ ER LP +
Sbjct: 81 EAEGMEKIEATHLERILPQL 100
>At5g26770 unknown protein
Length = 335
Score = 28.9 bits (63), Expect = 2.2
Identities = 22/66 (33%), Positives = 33/66 (49%), Gaps = 9/66 (13%)
Query: 138 EDIDEKSKKFRRRIRSIAVD----GNDLIAAANDAYKKSLARLEAKDAAAKAKAKREEER 193
+D+D K + FRR + S+A + L++ K+S R E A+ KAK E
Sbjct: 14 KDLDGKKESFRRNVVSMAAELKQVRGRLVSQEQFFVKESFCRKE-----AEKKAKNMEME 68
Query: 194 IEKLKK 199
I KL+K
Sbjct: 69 ICKLQK 74
>At2g45890 hypothetical protein
Length = 492
Score = 28.9 bits (63), Expect = 2.2
Identities = 33/150 (22%), Positives = 53/150 (35%), Gaps = 16/150 (10%)
Query: 23 WKAAIDSVAGTSSYVTSFMNGFSATNNNDTKKKHNDNQNPKSPKIKHYQLKALKLLDDIL 82
WK ++ Y+ + TN T+ K + P ++ +++LD
Sbjct: 181 WKREMNCFMSICDYIVEVIPRSLGTNVEITETKLRSDILMSLPALRKLDNMLMEILDSFT 240
Query: 83 ENTIEIVKEPIPVLDEDPNVDDCGIRLFRHSKPGIVFDHADE----PQPPMKRPKLVPGE 138
EN V+ ++ D G FR +V DE P P VP E
Sbjct: 241 ENEFWYVERGSSSMNSGGGGRDSG--TFRK----VVVQRKDEKWWLPVP------CVPAE 288
Query: 139 DIDEKSKKFRRRIRSIAVDGNDLIAAANDA 168
+ E+ +K R R A + A ND+
Sbjct: 289 GLSEEERKHLRHKRDCASQIHKAALAINDS 318
>At1g63105 putative protein
Length = 122
Score = 28.9 bits (63), Expect = 2.2
Identities = 19/54 (35%), Positives = 27/54 (49%), Gaps = 3/54 (5%)
Query: 137 GEDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARLEAKDAAAKAKAKRE 190
G+D+ EKSK +R VD + A ++A K+ EA+ K KAK E
Sbjct: 61 GDDVFEKSKTALDIMRKAVVDAKERKKARDEAIKE---EEEARKEEVKKKAKNE 111
>At1g36160 hypothetical protein, 3' partial
Length = 2252
Score = 28.9 bits (63), Expect = 2.2
Identities = 13/40 (32%), Positives = 22/40 (54%), Gaps = 2/40 (5%)
Query: 22 AWKAAIDSVAGTSS--YVTSFMNGFSATNNNDTKKKHNDN 59
AW+ + +V G + +V+S F TNN ++K H D+
Sbjct: 1452 AWRVVVANVTGRTCTVHVSSAYKKFGCTNNTESKSTHLDD 1491
>At4g12770 auxilin-like protein
Length = 909
Score = 28.5 bits (62), Expect = 2.9
Identities = 45/213 (21%), Positives = 77/213 (36%), Gaps = 27/213 (12%)
Query: 8 NGYCSSSGDEDGD---------AAWKAAIDSVAGTSSYVTSFMNGFSATNNNDTKKKHND 58
NGY S ED D AA K A+D + S + + H +
Sbjct: 434 NGYPDPSSGEDSDVFSTAAASAAAMKDAMDKAEAKFRHAKERREKESLKASRSREGDHTE 493
Query: 59 NQNPKSPKIKHYQLKALKLLDDILENTIEIVKEPIPVLDE----DPNVDDCGIRLFRHSK 114
N + + +++ K ++L + E E+ K +E ++ RL
Sbjct: 494 NYDSRERELRE---KQVRLDRERAEREAEMEKTQAREREEREREQKRIERERERLLARQA 550
Query: 115 PGIVFDHADE---PQPPMKRPKLVPGEDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKK 171
A E + K + G+ D + + R ++ + + AA
Sbjct: 551 VERATREARERAATEAHAKVQRAAVGKVTDARERAERAAVQRAHAEARERAAAG------ 604
Query: 172 SLARLEAKDAAAKAKAKREEERIEKLKKIRGER 204
AR +A+ AAA+A+ + E EK K+R ER
Sbjct: 605 --AREKAEKAAAEARERANAEVREKEAKVRAER 635
>At3g61260 putative DNA-binding protein
Length = 212
Score = 28.5 bits (62), Expect = 2.9
Identities = 30/123 (24%), Positives = 51/123 (41%), Gaps = 10/123 (8%)
Query: 108 RLFRHSKPGIVFDHADEPQPPMKRPKLVPGEDIDEKSKKFRRRIRSIAVDGNDLIAAAND 167
++F SK V + E P K D+ +R+ + + A +
Sbjct: 61 QIFDDSKALTVVEKPVEEPAPAKPASASLDRDVKLADLSKEKRLSFVRAWEESEKSKAEN 120
Query: 168 AYKKSLARLEA----KDAAAKAKAKREEERIEKLKKIRGERW------LPSMAKEMQAKI 217
+K +A + A K AA +A+ K+ EE++EK K ER + A+E +A I
Sbjct: 121 KAEKKIADVHAWENSKKAAVEAQLKKIEEQLEKKKAEYAERMKNKVAAIHKEAEERRAMI 180
Query: 218 KIK 220
+ K
Sbjct: 181 EAK 183
>At3g26840 unknown protein
Length = 701
Score = 28.5 bits (62), Expect = 2.9
Identities = 35/116 (30%), Positives = 46/116 (39%), Gaps = 22/116 (18%)
Query: 26 AIDSVAGTSSYVTSFMNGFSATNNNDTKKKHNDN--QNPKSPKIKHYQLKALKLLDDILE 83
AI SV TSS T NND + +NP S K++ + K L D LE
Sbjct: 34 AIKSVTSTSSPPTPSSGVQRRRKNNDENRATVAKVVENPYS-KVEAARPDLQKRLSDFLE 92
Query: 84 NTIEIVKE---------PI---------PVLDEDPNVDDCGIRLFRHSKP-GIVFD 120
E V + P+ P+L P +D G+ L RH K G +FD
Sbjct: 93 EAREFVGDGGGPPRWFSPLECGAQATNSPLLLYLPGIDGTGLGLIRHHKKLGEIFD 148
>At1g30320 unknown protein
Length = 509
Score = 28.5 bits (62), Expect = 2.9
Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 166 NDAYKKSLARLEAKDAAAKAKAKREEERIE-KLKKIRGERWLPSMAKEMQAK 216
N YK+ R++A ++ KAK + E RIE K+++++ E M K AK
Sbjct: 411 NARYKREEIRIQAWESQEKAKLEAEMRRIEAKVEQMKAEAEAKIMKKIALAK 462
>At3g57150 putative pseudouridine synthase (NAP57)
Length = 565
Score = 28.1 bits (61), Expect = 3.8
Identities = 24/101 (23%), Positives = 45/101 (43%), Gaps = 16/101 (15%)
Query: 120 DHADEPQP---PMKRPKLVPGEDIDEKSKKFRRRIRSIAVDGNDLIAAANDAYKKSLARL 176
D +D P P + K V GE+ +EK K +++ + + K+ A
Sbjct: 446 DSSDSPAPVTTKKSKTKEVEGEEAEEKVKSSKKKKKK-----------DKEEEKEEEAGS 494
Query: 177 EAKDAAAKAKAKREEERIEKLKKIRGERWLPSMAKEMQAKI 217
E K+ K K ++EE IE++ + E+ +K+ +A +
Sbjct: 495 EKKE--KKKKKDKKEEVIEEVASPKSEKKKKKKSKDTEAAV 533
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.311 0.131 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,436,740
Number of Sequences: 26719
Number of extensions: 248174
Number of successful extensions: 970
Number of sequences better than 10.0: 41
Number of HSP's better than 10.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 30
Number of HSP's that attempted gapping in prelim test: 941
Number of HSP's gapped (non-prelim): 55
length of query: 221
length of database: 11,318,596
effective HSP length: 95
effective length of query: 126
effective length of database: 8,780,291
effective search space: 1106316666
effective search space used: 1106316666
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.9 bits)
Medicago: description of AC147000.5