
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144929.16 + phase: 0
(143 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAV63926.1| hypothetical protein At5g14090 [Arabidopsis thali... 63 1e-09
dbj|BAB08288.1| unnamed protein product [Arabidopsis thaliana] g... 63 1e-09
ref|XP_317043.2| ENSANGP00000019222 [Anopheles gambiae str. PEST... 35 0.50
ref|XP_394021.1| PREDICTED: similar to GA18623-PA [Apis mellifera] 34 0.65
emb|CAH98116.1| hypothetical protein PB105643.00.0 [Plasmodium b... 34 0.85
gb|EAA69381.1| hypothetical protein FG00042.1 [Gibberella zeae P... 33 1.9
ref|XP_217570.3| PREDICTED: similar to ATRX protein [Rattus norv... 32 2.5
emb|CAG60956.1| unnamed protein product [Candida glabrata CBS138... 32 2.5
gb|AAM75032.1| LD11251p [Drosophila melanogaster] gi|7295490|gb|... 32 2.5
gb|AAF05591.1| alanyl-tRNA synthetase [Drosophila melanogaster] 32 2.5
ref|XP_647723.1| hypothetical protein DDB0216508 [Dictyostelium ... 32 2.5
ref|XP_647711.1| hypothetical protein DDB0216496 [Dictyostelium ... 32 2.5
sp|P70486|ATRX_RAT Transcriptional regulator ATRX (X-linked nucl... 32 2.5
pir||A24785 hypothetical protein 335 - slime mold (Dictyostelium... 32 3.2
ref|XP_647780.1| hypothetical protein DDB0216467 [Dictyostelium ... 32 3.2
ref|NP_083109.2| IQ motif containing E [Mus musculus] gi|3758920... 32 3.2
gb|AAS54247.1| AGL244Cp [Ashbya gossypii ATCC 10895] gi|45200853... 32 3.2
ref|XP_559187.1| ENSANGP00000029120 [Anopheles gambiae str. PEST... 32 3.2
gb|AAA33193.1| ORF1 32 3.2
dbj|BAB24605.1| unnamed protein product [Mus musculus] 32 3.2
>gb|AAV63926.1| hypothetical protein At5g14090 [Arabidopsis thaliana]
gi|52354463|gb|AAU44552.1| hypothetical protein
AT5G14090 [Arabidopsis thaliana]
Length = 358
Score = 63.2 bits (152), Expect = 1e-09
Identities = 50/159 (31%), Positives = 79/159 (49%), Gaps = 22/159 (13%)
Query: 3 DAAVTETQLKLISYELEKFVEAD-EECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKV 61
DA VTE LKLIS EL+KF+EA+ +E ++ SGR S + + + + +G++ E+ +
Sbjct: 123 DADVTENDLKLISNELDKFLEAEAKEGHHQPSGRNSDTNTIASTIEAIEGVDDEEDNQPM 182
Query: 62 ICPLQEYLLGSSFEIRE-KTEVRIERAS------VRETQVKQ--------------GRRS 100
PLQEY GS E+ E K + +RAS + E Q KQ +S
Sbjct: 183 KFPLQEYFFGSLIELPESKIAGKKDRASLGELFQITEVQDKQSENIYGKKKKQPNSAHKS 242
Query: 101 ALHIIKKMSNMVLSSSKSCNTYGNTADHATSTNEKLCKV 139
A H++KK+ + SS+ + D +K+ +V
Sbjct: 243 AKHLVKKVLKKIHPSSRGSVSGKPEVDSTKKKFQKMVQV 281
>dbj|BAB08288.1| unnamed protein product [Arabidopsis thaliana]
gi|15241307|ref|NP_196913.1| hypothetical protein
[Arabidopsis thaliana]
Length = 361
Score = 63.2 bits (152), Expect = 1e-09
Identities = 53/159 (33%), Positives = 80/159 (49%), Gaps = 25/159 (15%)
Query: 3 DAAVTETQLKLISYELEKFVEAD-EECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKV 61
DA VTE LKLIS EL+KF+EA+ +E ++ SGR S + + + + +G++ E+ +
Sbjct: 126 DADVTENDLKLISNELDKFLEAEAKEGHHQPSGRNSDTNTIASTIEAIEGVDDEEDNQPM 185
Query: 62 ICPLQEYLLGSSFEIRE-KTEVRIERAS------VRETQVKQ--------------GRRS 100
PLQEY GS E+ E K + +RAS + E Q KQ +S
Sbjct: 186 KFPLQEYFFGSLIELPESKIAGKKDRASLGELFQITEVQDKQSENIYGKKKKQPNSAHKS 245
Query: 101 ALHIIKKMSNMVLSSSKSCNTYGNTADHATSTNEKLCKV 139
A H++KK+ + SS+ + D ST +K KV
Sbjct: 246 AKHLVKKVLKKIHPSSRGSVSGKPEVD---STKKKFQKV 281
>ref|XP_317043.2| ENSANGP00000019222 [Anopheles gambiae str. PEST]
gi|55237282|gb|EAA12759.2| ENSANGP00000019222 [Anopheles
gambiae str. PEST]
Length = 243
Score = 34.7 bits (78), Expect = 0.50
Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 2/77 (2%)
Query: 52 LETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVKQGRRSALHIIKKMSNM 111
LET+ + C + + + SF + V+ E + +T + +G R LH I ++
Sbjct: 71 LETD--AGQYTCSVPQLGVSKSFNVVANVVVKFESTEIGKTNIVEGERLTLHCIAYGTDP 128
Query: 112 VLSSSKSCNTYGNTADH 128
++ + NTY + DH
Sbjct: 129 KITWTVGNNTYNASTDH 145
>ref|XP_394021.1| PREDICTED: similar to GA18623-PA [Apis mellifera]
Length = 527
Score = 34.3 bits (77), Expect = 0.65
Identities = 31/102 (30%), Positives = 48/102 (46%), Gaps = 8/102 (7%)
Query: 8 ETQLKLISYELEKFVEADEECFYESSGRISFRSNVTLSRKKTDGLETEDL--GNKVICPL 65
ET +LI +L++ +EE + +S +TL R K ETE L NK+ +
Sbjct: 336 ETTGRLIEEQLKELAHVEEESEIVRQQEKTLKSEITLLRSKLANCETELLQCKNKIRLVM 395
Query: 66 QEYLLGSSFEIREKTEVRI-ERASVRE-----TQVKQGRRSA 101
+E + RE E +I ER + E +V+Q +RSA
Sbjct: 396 EELAIEQHNISRETDERQIMERKIITEVEHLQNEVEQAKRSA 437
>emb|CAH98116.1| hypothetical protein PB105643.00.0 [Plasmodium berghei]
Length = 473
Score = 33.9 bits (76), Expect = 0.85
Identities = 25/97 (25%), Positives = 45/97 (45%), Gaps = 6/97 (6%)
Query: 27 ECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLL---GSSFEIREKTEVR 83
+ FYE IS N+ + + + GNK+I L+ L G+SF I +K +++
Sbjct: 180 QAFYEYENYISNNENIYICELRIYNMTCNSCGNKIINFLKNKNLIIDGNSFAIDDKIKLK 239
Query: 84 I---ERASVRETQVKQGRRSALHIIKKMSNMVLSSSK 117
I E S + + G ++ + +K N ++S K
Sbjct: 240 INIPENMSNYKNDISDGIKNKSNNVKFYVNKIMSEIK 276
>gb|EAA69381.1| hypothetical protein FG00042.1 [Gibberella zeae PH-1]
gi|46102676|ref|XP_380218.1| hypothetical protein
FG00042.1 [Gibberella zeae PH-1]
Length = 7791
Score = 32.7 bits (73), Expect = 1.9
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 81 EVRIERASVRETQVKQGRRSALHIIKKMSNMVLSSSKSCNTYGN--TADHATSTNEK 135
E+ + + ETQ K+ R H+IK+++N V S S SC N A+ S N+K
Sbjct: 4511 EIHFDAKLLEETQAKRLIRQLAHVIKQLANSVPSLSLSCIDMMNPDDAEEIKSWNKK 4567
>ref|XP_217570.3| PREDICTED: similar to ATRX protein [Rattus norvegicus]
Length = 2492
Score = 32.3 bits (72), Expect = 2.5
Identities = 27/123 (21%), Positives = 51/123 (40%), Gaps = 10/123 (8%)
Query: 14 ISYELEKFVEADEECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSS 73
+S +++ E C +SS R+ V+L KK L + G + C +
Sbjct: 1041 LSDSVDRLPVKGESC--DSSEDKKTRNRVSLREKKQFSLPAKSSGKRPECSSSDTERSVK 1098
Query: 74 FEIREKTEVRIERASVRETQVKQGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATSTN 133
E + T+ R++R +RE + +RS + V S S S + G++ D
Sbjct: 1099 GECCDSTDKRVKRIDLRERRSSNSKRS--------TKEVKSGSSSSDAEGSSEDAKKQKK 1150
Query: 134 EKL 136
+++
Sbjct: 1151 QRM 1153
>emb|CAG60956.1| unnamed protein product [Candida glabrata CBS138]
gi|50291145|ref|XP_448005.1| unnamed protein product
[Candida glabrata]
Length = 512
Score = 32.3 bits (72), Expect = 2.5
Identities = 24/105 (22%), Positives = 50/105 (46%), Gaps = 4/105 (3%)
Query: 36 ISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVK 95
IS N+ +S + + + +K + +++ ++E+ + +E+ RET
Sbjct: 344 ISSSLNIKISGYRLQPIGSTSKVSKFLADKEDW---ETYELYFSSNFSLEKIFKRETDFD 400
Query: 96 QGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATSTNEKLCKVG 140
+G S L I+K++S + S+ S +T N D T +N+ + G
Sbjct: 401 KGMESLLDIVKQIS-LSFSTVVSSDTDINIQDERTRSNDSMLNGG 444
>gb|AAM75032.1| LD11251p [Drosophila melanogaster] gi|7295490|gb|AAF50804.1|
CG4633-PA [Drosophila melanogaster]
gi|24658214|ref|NP_523932.2| CG4633-PA [Drosophila
melanogaster] gi|25012817|gb|AAN71499.1| RE74040p
[Drosophila melanogaster]
Length = 1012
Score = 32.3 bits (72), Expect = 2.5
Identities = 39/136 (28%), Positives = 64/136 (46%), Gaps = 23/136 (16%)
Query: 5 AVTETQLKLISYELEKFVEADEECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKVIC- 63
A + K ++Y++ V +D+ C E G + R +KTD EDL N+VIC
Sbjct: 664 AAIRSLFKKVTYQVSSSVSSDQ-CKLEL-GLLGKRI------QKTDVQLIEDLINRVICS 715
Query: 64 --PLQEYLLGSSFEIREKTEVRIERASVRETQVKQGRRSALHIIKKMSNMVLSSSKSCNT 121
P++ LL S+ E+ E+ ++ + E +QG R + + + LSS + C
Sbjct: 716 AAPVEVQLL-SAAEVLEQNDITMVPG---EVYPEQGLRL---VNVESPELQLSSKELC-- 766
Query: 122 YGNTADHATSTNEKLC 137
HAT+T+E C
Sbjct: 767 ---CGTHATNTSELSC 779
>gb|AAF05591.1| alanyl-tRNA synthetase [Drosophila melanogaster]
Length = 1012
Score = 32.3 bits (72), Expect = 2.5
Identities = 39/136 (28%), Positives = 64/136 (46%), Gaps = 23/136 (16%)
Query: 5 AVTETQLKLISYELEKFVEADEECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKVIC- 63
A + K ++Y++ V +D+ C E G + R +KTD EDL N+VIC
Sbjct: 664 AAIRSLFKKVTYQVSSSVSSDQ-CKLEL-GLLGKRI------QKTDVQLIEDLINRVICS 715
Query: 64 --PLQEYLLGSSFEIREKTEVRIERASVRETQVKQGRRSALHIIKKMSNMVLSSSKSCNT 121
P++ LL S+ E+ E+ ++ + E +QG R + + + LSS + C
Sbjct: 716 AAPVEVQLL-SAAEVLEQNDITMVPG---EVYPEQGLRL---VNVESPELQLSSKELC-- 766
Query: 122 YGNTADHATSTNEKLC 137
HAT+T+E C
Sbjct: 767 ---CGTHATNTSELSC 779
>ref|XP_647723.1| hypothetical protein DDB0216508 [Dictyostelium discoideum]
gi|60475868|gb|EAL73799.1| hypothetical protein
DDB0216508 [Dictyostelium discoideum]
Length = 335
Score = 32.3 bits (72), Expect = 2.5
Identities = 22/97 (22%), Positives = 46/97 (46%)
Query: 36 ISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVK 95
++ + +LSR + + + E G++V+ P++ F+ E VR S+R+
Sbjct: 201 LAVNAQASLSRVRRNNIAKEIYGSEVLLPIKIKDTPKMFDETETERVRKLAKSIRKNNEA 260
Query: 96 QGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATST 132
+ L+ K + L +S NT GN+++ +S+
Sbjct: 261 KQSLLKLNYHSKTNAKKLVNSSGNNTTGNSSNSKSSS 297
>ref|XP_647711.1| hypothetical protein DDB0216496 [Dictyostelium discoideum]
gi|60475856|gb|EAL73787.1| hypothetical protein
DDB0216496 [Dictyostelium discoideum]
Length = 673
Score = 32.3 bits (72), Expect = 2.5
Identities = 22/97 (22%), Positives = 46/97 (46%)
Query: 36 ISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVK 95
++F + +LSR + + + E G++V+ P++ F+ E VR S+R+
Sbjct: 187 LAFNAQASLSRVRRNNIAKEIYGSEVLLPIKIKDTPKMFDETETERVRKLAKSIRKNNEA 246
Query: 96 QGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATST 132
+ L+ K + +S NT GN+++ +S+
Sbjct: 247 KQSLLKLNYHSKSNVKKSVNSSGNNTTGNSSNSKSSS 283
>sp|P70486|ATRX_RAT Transcriptional regulator ATRX (X-linked nuclear protein) (pABP-2)
gi|1514945|dbj|BAA10936.1| helicase II [Rattus
norvegicus]
Length = 527
Score = 32.3 bits (72), Expect = 2.5
Identities = 27/123 (21%), Positives = 51/123 (40%), Gaps = 10/123 (8%)
Query: 14 ISYELEKFVEADEECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSS 73
+S +++ E C +SS R+ V+L KK L + G + C +
Sbjct: 161 LSDSVDRLPVKGESC--DSSEDKKTRNRVSLREKKQFSLPAKSSGKRPECSSSDTERSVK 218
Query: 74 FEIREKTEVRIERASVRETQVKQGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATSTN 133
E + T+ R++R +RE + +RS + V S S S + G++ D
Sbjct: 219 GECCDSTDKRVKRIDLRERRSSNSKRS--------TKEVKSGSSSSDAEGSSEDAKKQKK 270
Query: 134 EKL 136
+++
Sbjct: 271 QRM 273
>pir||A24785 hypothetical protein 335 - slime mold (Dictyostelium discoideum)
transposon DIRS-1
Length = 335
Score = 32.0 bits (71), Expect = 3.2
Identities = 22/97 (22%), Positives = 46/97 (46%)
Query: 36 ISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVK 95
++ + +LSR + + + E G++V+ P++ F+ E VR S+R+
Sbjct: 201 LAVNTQASLSRVRRNNIAKEIYGSEVLLPIKIKDTPKMFDETETERVRKLAKSIRKNNEA 260
Query: 96 QGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATST 132
+ L+ K + L +S NT GN+++ +S+
Sbjct: 261 KQSLLKLNYHSKSNVKKLVNSSGNNTTGNSSNSKSSS 297
>ref|XP_647780.1| hypothetical protein DDB0216467 [Dictyostelium discoideum]
gi|60475929|gb|EAL73856.1| hypothetical protein
DDB0216467 [Dictyostelium discoideum]
gi|903712|gb|AAA70200.1| unknown protein
Length = 335
Score = 32.0 bits (71), Expect = 3.2
Identities = 22/97 (22%), Positives = 46/97 (46%)
Query: 36 ISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVK 95
++ + +LSR + + + E G++V+ P++ F+ E VR S+R+
Sbjct: 201 LAVNAQASLSRVRRNNIAKEIYGSEVLLPIKIKDTPKMFDETETERVRKLAKSIRKNNEA 260
Query: 96 QGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATST 132
+ L+ K + L +S NT GN+++ +S+
Sbjct: 261 KQSLLKLNYHSKSNVKKLVNSSGNNTTGNSSNSKSSS 297
>ref|NP_083109.2| IQ motif containing E [Mus musculus] gi|37589209|gb|AAH59223.1| IQ
motif containing E [Mus musculus]
Length = 778
Score = 32.0 bits (71), Expect = 3.2
Identities = 24/98 (24%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 17 ELEKFVEADEECFYESSGRISFR------SNVTLSRKKTDGLETEDLGNKVICPLQEYLL 70
ELEK V + E +S ++ SN+++ ++ EDL C QE+L
Sbjct: 345 ELEKKVSSSESPKQSTSELVNPNPLVRSPSNISVQKQPKGDQSPEDLPKVAPCEEQEHLQ 404
Query: 71 GSSFEIREKTEVRIERASVRETQVKQGRRSALHIIKKM 108
G+ +RE+ E+ ++ ++KQ +S + + K++
Sbjct: 405 GTVKSLREELGALQEQLLEKDLEMKQLLQSKIDLEKEL 442
>gb|AAS54247.1| AGL244Cp [Ashbya gossypii ATCC 10895] gi|45200853|ref|NP_986423.1|
AGL244Cp [Eremothecium gossypii]
Length = 842
Score = 32.0 bits (71), Expect = 3.2
Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 14/84 (16%)
Query: 10 QLKLISYELEKFVEADEECFYESSGRISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYL 69
++ ++Y+L V+A + F ESSG + V +S + D LQ
Sbjct: 87 EMSSLAYQLNTLVDATTKIFVESSGFVLLEPFVKMSVEFAD--------------LQVLS 132
Query: 70 LGSSFEIREKTEVRIERASVRETQ 93
+ S ++IR+ E E+ + RE Q
Sbjct: 133 IVSDYDIRQTGETTFEQFNARERQ 156
>ref|XP_559187.1| ENSANGP00000029120 [Anopheles gambiae str. PEST]
gi|55242892|gb|EAL41073.1| ENSANGP00000029120 [Anopheles
gambiae str. PEST]
Length = 1092
Score = 32.0 bits (71), Expect = 3.2
Identities = 15/55 (27%), Positives = 27/55 (48%)
Query: 77 REKTEVRIERASVRETQVKQGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATS 131
R + +++ ++R Q RR+ H +K LS+ +S +T+ DHA S
Sbjct: 49 RNRRLALVKKRAIRLVQTFAARRAIPHYFQKRVKHTLSAKQSASTHNGATDHAPS 103
>gb|AAA33193.1| ORF1
Length = 335
Score = 32.0 bits (71), Expect = 3.2
Identities = 22/97 (22%), Positives = 46/97 (46%)
Query: 36 ISFRSNVTLSRKKTDGLETEDLGNKVICPLQEYLLGSSFEIREKTEVRIERASVRETQVK 95
++ + +LSR + + + E G++V+ P++ F+ E VR S+R+
Sbjct: 201 LAVNTQASLSRVRRNNIAKEIYGSEVLLPIKIKDTPKMFDETETERVRKLAKSIRKNNEA 260
Query: 96 QGRRSALHIIKKMSNMVLSSSKSCNTYGNTADHATST 132
+ L+ K + L +S NT GN+++ +S+
Sbjct: 261 KQSLLKLNYHSKSNVKKLVNSSGNNTTGNSSNSKSSS 297
>dbj|BAB24605.1| unnamed protein product [Mus musculus]
Length = 761
Score = 32.0 bits (71), Expect = 3.2
Identities = 24/98 (24%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 17 ELEKFVEADEECFYESSGRISFR------SNVTLSRKKTDGLETEDLGNKVICPLQEYLL 70
ELEK V + E +S ++ SN+++ ++ EDL C QE+L
Sbjct: 300 ELEKKVSSSESPKQSTSELVNPNPLVRSPSNISVQKQPKGDQSPEDLPKVAPCEEQEHLQ 359
Query: 71 GSSFEIREKTEVRIERASVRETQVKQGRRSALHIIKKM 108
G+ +RE+ E+ ++ ++KQ +S + + K++
Sbjct: 360 GTVKSLREELGALQEQLLEKDLEMKQLLQSKIDLEKEL 397
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.312 0.128 0.345
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 204,942,086
Number of Sequences: 2540612
Number of extensions: 6925667
Number of successful extensions: 13633
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 13610
Number of HSP's gapped (non-prelim): 52
length of query: 143
length of database: 863,360,394
effective HSP length: 119
effective length of query: 24
effective length of database: 561,027,566
effective search space: 13464661584
effective search space used: 13464661584
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 67 (30.4 bits)
Medicago: description of AC144929.16