
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148758.4 - phase: 0 /pseudo
(618 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                        (bits) Value
MDH_CAMJE (Q9PHY2) Probable malate dehydrogenase (EC 1.1.1.37)       35   0.58
CO7_HUMAN (P10643) Complement component C7 precursor                 33   1.7
S3A2_HUMAN (Q15428) Splicing factor 3A subunit 2 (Spliceosome as...  32   3.8
ILVC_BUCUM (Q9AQA0) Ketol-acid reductoisomerase (EC 1.1.1.86) (A...  32   3.8
YAT2_SCHPO (Q10149) Hypothetical protein C1D4.02c in chromosome I    32   4.9
SMN_BOVIN (O18870) Survival motor neuron protein                     32   4.9
SIR3_YEAST (P06701) Regulatory protein SIR3 (Silent information ...  32   4.9
SDC3_HUMAN (O75056) Syndecan-3 (SYND3)                               32   4.9
SMN_RAT (O35876) Survival motor neuron protein                       32   6.4
S3A2_MOUSE (Q62203) Splicing factor 3A subunit 2 (Spliceosome as...  32   6.4
MLL2_HUMAN (O14686) Myeloid/lymphoid or mixed-lineage leukemia p...  32   6.4
CA54_HUMAN (P29400) Collagen alpha 5(IV) chain precursor             31   8.4
>MDH_CAMJE (Q9PHY2) Probable malate dehydrogenase (EC 1.1.1.37)
Length = 300
Score = 35.0 bits (79), Expect = 0.58
Identities = 16/63 (25%), Positives = 34/63 (53%)
Query: 3 SKKKVLSVEDVIKSVGLGYDLTNDLRLKFCKYDSKLIAIDHDNLRTVELPGRVSIPNVPK 62
S KK++++ V+ + Y+L L +K + D++LI +D++ V+ V N+ +
Sbjct: 135 SSKKIIAMAGVLDNARFKYELAKKLNVKMSRVDTRLIGFHNDDMVLVKSYASVKNKNISE 194
Query: 63 SIN 65
+N
Sbjct: 195 FLN 197
>CO7_HUMAN (P10643) Complement component C7 precursor
Length = 843
Score = 33.5 bits (75), Expect = 1.7
Identities = 32/124 (25%), Positives = 49/124 (38%), Gaps = 26/124 (20%)
Query: 152 VPSSWDPAALARFIEKYGTHAVVGVKIGG--TDIIYAKQQYSSPLQPSDVQKK------- 202
+PS +D +A R I++YGTH + +GG + Y + + V++K
Sbjct: 283 LPSLYDYSAYRRLIDQYGTHYLQSGSLGGEYRVLFYVDSEKLKQNDFNSVEEKKCKSSGW 342
Query: 203 ----------LKDMADELFRGQAGQNNANDGTFNSKEKFMRDNGLGFLDIQAQSYRETEK 252
K++ + L QNN G E F+R G GF I SY E +
Sbjct: 343 HFVVKFSSHGCKELENALKAASGTQNNVLRG-----EPFIRGGGAGF--ISGLSYLELDN 395
Query: 253 NTTN 256
N
Sbjct: 396 PAGN 399
>S3A2_HUMAN (Q15428) Splicing factor 3A subunit 2 (Spliceosome
associated protein 62) (SAP 62) (SF3a66)
Length = 464
Score = 32.3 bits (72), Expect = 3.8
Identities = 23/80 (28%), Positives = 32/80 (39%), Gaps = 6/80 (7%)
Query: 488 KVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGPQKLPPPQPSDVNV 547
K FL+ HF P G PG+ + + + R P+ LPPP P + +
Sbjct: 204 KQFFLQFHFKMEKPPAPPSLPA--GPPGVKRPPPPLMNGLPPRPPLPESLPPPPPGGLPL 261
Query: 548 ----NSALYPGGPPVPAQAP 563
+ P GPP P Q P
Sbjct: 262 PPMPPTGPAPSGPPGPPQLP 281
>ILVC_BUCUM (Q9AQA0) Ketol-acid reductoisomerase (EC 1.1.1.86)
(Acetohydroxy-acid isomeroreductase)
(Alpha-keto-beta-hydroxylacil reductoisomerase)
(Fragment)
Length = 346
Score = 32.3 bits (72), Expect = 3.8
Identities = 24/99 (24%), Positives = 46/99 (46%), Gaps = 3/99 (3%)
Query: 149 KRAVPSSWDPAALARFIEKYGTHAVVGVKIGGTDIIYAKQQYSSPLQPSDVQKKLKDMAD 208
++ + + DPA A+ I+ +K GG ++ + S ++ + +K+K +
Sbjct: 171 EKLITNKHDPAYAAKLIQNGWETITESLKHGGITLMMDRLSNPSKIRAYKISEKIKKILT 230
Query: 209 ELFRGQAGQNNANDGTFNSK-EKFMRDNGLGFLDIQAQS 246
LF Q NN G F+S+ K ++N LD + Q+
Sbjct: 231 SLF--QQHMNNIISGEFSSEMMKDWKNNDKKLLDWRKQT 267
>YAT2_SCHPO (Q10149) Hypothetical protein C1D4.02c in chromosome I
Length = 345
Score = 32.0 bits (71), Expect = 4.9
Identities = 42/175 (24%), Positives = 66/175 (37%), Gaps = 31/175 (17%)
Query: 427 KTFQLKDETNRNVS-DASSERKYYEKVQWKSF----------------SHICTAPVESYD 469
+ F LK + R V+ +S+ K +QW S S + A + Y+
Sbjct: 81 EVFSLKGQITRKVNIKINSDEKIGMVLQWASIAPAVDAIWHILNVIDDSPVARASLVPYE 140
Query: 470 DNAVVTGAHFEVGETGLKKVLF------LRLH-FCKVADATR----VRAPEWDGSPGLTQ 518
D V T GE L ++ LRL+ + D+TR V W G+ +
Sbjct: 141 DYIVGTPEGMMTGEKALSDLIESHLNRPLRLYIYNHYRDSTRQVTIVPNRHWGGNGAIGC 200
Query: 519 KSGMISTFISTRFSGPQKLPPPQPSDVNVNSALYPGGPPVPAQAPKLLKFVDTTE 573
G + R P PPPQP D+ ++ + G +Q + F+ T E
Sbjct: 201 GVGH---GVLHRLPAPLSGPPPQPGDIVFSNPMLGGPDHKVSQPSETENFLPTPE 252
>SMN_BOVIN (O18870) Survival motor neuron protein
Length = 287
Score = 32.0 bits (71), Expect = 4.9
Identities = 20/65 (30%), Positives = 28/65 (42%), Gaps = 3/65 (4%)
Query: 504 RVRAPEWDG---SPGLTQKSGMISTFISTRFSGPQKLPPPQPSDVNVNSALYPGGPPVPA 560
R RA W+ P +SG+ FSGP PPP P ++ +P GPP+
Sbjct: 179 RSRAAPWNSFLPPPPHMPRSGLGPGKSGLNFSGPPPPPPPPPHFLSRWLPPFPAGPPMIP 238
Query: 561 QAPKL 565
P +
Sbjct: 239 PPPPI 243
>SIR3_YEAST (P06701) Regulatory protein SIR3 (Silent information
regulator 3)
Length = 978
Score = 32.0 bits (71), Expect = 4.9
Identities = 16/88 (18%), Positives = 41/88 (46%)
Query: 107 FQFSGVWQRDAANTKSLAFDGVSITLYNIALDKTHVVLSDHVKRAVPSSWDPAALARFIE 166
F ++ + + N +LA + V I + + D S +K+ + +W+PA ++
Sbjct: 865 FLYTLAQETEGTNRHTLALETVLIKMVKMLRDNPGYKASKEIKKVICGAWEPAITIEKLK 924
Query: 167 KYGTHAVVGVKIGGTDIIYAKQQYSSPL 194
++ +VV +G ++ ++ S+ +
Sbjct: 925 QFSWISVVNDLVGEKLVVVVLEEPSASI 952
>SDC3_HUMAN (O75056) Syndecan-3 (SYND3)
Length = 384
Score = 32.0 bits (71), Expect = 4.9
Identities = 29/95 (30%), Positives = 39/95 (40%), Gaps = 7/95 (7%)
Query: 484 TGLKKVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGPQKLPPP--- 540
TG++++L L L A AT AP + + +ST S P+ LP P
Sbjct: 148 TGVRRLLPLPLTTVATARATTPEAPSPPTTAAVLDTEAPTPRLVSTATSRPRALPRPATT 207
Query: 541 QPSDVNVNSALYPG----GPPVPAQAPKLLKFVDT 571
Q D+ S L G GP AQ P F+ T
Sbjct: 208 QEPDIPERSTLPLGTTAPGPTEVAQTPTPETFLTT 242
>SMN_RAT (O35876) Survival motor neuron protein
Length = 289
Score = 31.6 bits (70), Expect = 6.4
Identities = 14/36 (38%), Positives = 18/36 (49%)
Query: 530 RFSGPQKLPPPQPSDVNVNSALYPGGPPVPAQAPKL 565
RFSGP PPP P + +P GPP+ P +
Sbjct: 209 RFSGPPPPPPPPPPFLPCWMPPFPSGPPIIPPPPPI 244
>S3A2_MOUSE (Q62203) Splicing factor 3A subunit 2 (Spliceosome
associated protein 62) (SAP 62) (SF3a66)
Length = 475
Score = 31.6 bits (70), Expect = 6.4
Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 6/80 (7%)
Query: 488 KVLFLRLHFCKVADATRVRAPEWDGSPGLTQKSGMISTFISTRFSGPQKLPPPQPSDVNV 547
K FL+ HF P G PG+ + + + R P LPPP P + +
Sbjct: 194 KFFFLQFHFKMEKPPAPPSLPA--GPPGVKRPPPPLMNGLPPRPPLPDALPPPPPGGLPL 251
Query: 548 ----NSALYPGGPPVPAQAP 563
+ P GPP P Q P
Sbjct: 252 PPMPPTGPAPSGPPGPPQMP 271
>MLL2_HUMAN (O14686) Myeloid/lymphoid or mixed-lineage leukemia
protein 2 (ALL1-related protein)
Length = 5262
Score = 31.6 bits (70), Expect = 6.4
Identities = 25/70 (35%), Positives = 27/70 (37%), Gaps = 9/70 (12%)
Query: 502 ATRVRAPEWDGSPGLTQKSGMISTFISTRFSGPQKLPPPQPSDV--------NVNSALYP 553
A R P G Q G ISTR GP + P P P+ NV L P
Sbjct: 2557 AMSARFPSTPGPELGRQALGSPLAGISTRLPGPGE-PVPGPAGPAQFIELRHNVQKGLGP 2615
Query: 554 GGPPVPAQAP 563
GG P P Q P
Sbjct: 2616 GGTPFPGQGP 2625
>CA54_HUMAN (P29400) Collagen alpha 5(IV) chain precursor
Length = 1685
Score = 31.2 bits (69), Expect = 8.4
Identities = 21/55 (38%), Positives = 24/55 (43%), Gaps = 5/55 (9%)
Query: 508 PEWDGSPGLTQKSGMISTFISTRFSGPQKLP-PPQPSDVNVNSALYPGGPPVPAQ 561
P G PGL G+ T F GPQ + PP PS V PG P +P Q
Sbjct: 1011 PGLPGQPGLIGPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQ----PGSPGLPGQ 1061
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.322 0.138 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,904,751
Number of Sequences: 164201
Number of extensions: 3208571
Number of successful extensions: 7536
Number of sequences better than 10.0: 12
Number of HSP's better than 10.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 7523
Number of HSP's gapped (non-prelim): 18
length of query: 618
length of database: 59,974,054
effective HSP length: 116
effective length of query: 502
effective length of database: 40,926,738
effective search space: 20545222476
effective search space used: 20545222476
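The length figures above are related by simple arithmetic, which the following minimal sketch reproduces. It assumes the usual edge correction, in which the effective HSP length is subtracted once from the query and once per database sequence. The function name `effective_search_space` is hypothetical and is not part of BLAST; the real program also guards against very short sequences, which this sketch omits.

```python
def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumption: the effective HSP length is removed once from the query
    and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


# Values copied from the report: query length, database letters,
# number of database sequences, effective HSP length.
eff_query, eff_db, space = effective_search_space(618, 59_974_054, 164_201, 116)
print(eff_query, eff_db, space)  # 502 40926738 20545222476
```

The three printed numbers agree with the "effective length of query", "effective length of database" and "effective search space" lines of this report.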
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)
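The raw scores, bit scores and Expect values in this report can be related with the standard Karlin-Altschul formulas: bit score S' = (Lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'). The sketch below applies them with the gapped Lambda and K printed above. The function names `bit_score` and `expect_value` are hypothetical, and because the report prints Lambda and K to only a few digits, the results should be expected to match the reported values only after rounding.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score: (lam * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Gapped Lambda and K and the effective search space, as printed in the report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 20_545_222_476

# Top hit MDH_CAMJE: raw score 79, reported as 35.0 bits with Expect = 0.58.
bits = bit_score(79, LAMBDA_GAPPED, K_GAPPED)
print(round(bits, 1))                              # 35.0
print(round(expect_value(bits, SEARCH_SPACE), 2))  # 0.58
```

The same conversion applied to the cutoff S2 (raw score 69) gives about 31.2 bits, in line with the value shown next to it.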
Medicago: description of AC148758.4