FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0470, 301 aa
1>>>pF1KE0470 301 - 301 aa - 301 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8899+/-0.000861; mu= 18.2347+/- 0.052
mean_var=64.4107+/-12.736, 0's: 0 Z-trim(106.2): 22 B-trim: 0 in 0/50
Lambda= 0.159807
statistics sampled from 8819 (8832) to 8819 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.641), E-opt: 0.2 (0.271), width: 16
Scan time: 2.350
The best scores are:                              opt  bits E(32554)
CCDS35207.1 GPM6B gene_id:2824|Hs108|chrX ( 305) 2061 483.7 7.3e-137
CCDS35206.1 GPM6B gene_id:2824|Hs108|chrX ( 328) 1921 451.5   4e-127
CCDS48084.1 GPM6B gene_id:2824|Hs108|chrX ( 246) 1675 394.7 3.8e-110
CCDS14158.1 GPM6B gene_id:2824|Hs108|chrX ( 265) 1675 394.7   4e-110
CCDS14514.1 PLP1 gene_id:5354|Hs108|chrX  ( 242)  979 234.2  7.6e-62
CCDS3824.1 GPM6A gene_id:2823|Hs108|chr4  ( 278)  855 205.6  3.4e-53
CCDS54822.1 GPM6A gene_id:2823|Hs108|chr4 ( 267)  854 205.4  3.9e-53
CCDS58936.1 GPM6A gene_id:2823|Hs108|chr4 ( 271)  854 205.4  3.9e-53
CCDS14513.1 PLP1 gene_id:5354|Hs108|chrX  ( 277)  496 122.9  2.8e-28
>>CCDS35207.1 GPM6B gene_id:2824|Hs108|chrX (305 aa)
initn: 2061 init1: 2061 opt: 2061 Z-score: 2570.6 bits: 483.7 E(32554): 7.3e-137
Smith-Waterman score: 2061; 100.0% identity (100.0% similar) in 301 aa overlap (1-301:5-305)
10 20 30 40 50
pF1KE0 METAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MKPAMETAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE0 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE0 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
250 260 270 280 290 300
300
pF1KE0 CCTKF
:::::
CCDS35 CCTKF
>>CCDS35206.1 GPM6B gene_id:2824|Hs108|chrX (328 aa)
initn: 1919 init1: 1919 opt: 1921 Z-score: 2395.7 bits: 451.5 E(32554): 4e-127
Smith-Waterman score: 1921; 95.6% identity (97.6% similar) in 294 aa overlap (1-294:5-298)
10 20 30 40 50
pF1KE0 METAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 MKPAMETAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE0 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE0 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS35 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
190 200 210 220 230 240
240 250 260 270 280 290
pF1KE0 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
:::::::::::::::::::::::::::::::::::::::. ..: . :.: :: :.
CCDS35 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALIHFLMILSSNWAYLKDASKMQ
250 260 270 280 290 300
300
pF1KE0 CCTKF
CCDS35 AYQDIKAKEEQELQDIQSRSKEQLNSYT
310 320
>>CCDS48084.1 GPM6B gene_id:2824|Hs108|chrX (246 aa)
initn: 1675 init1: 1675 opt: 1675 Z-score: 2091.0 bits: 394.7 E(32554): 3.8e-110
Smith-Waterman score: 1675; 100.0% identity (100.0% similar) in 245 aa overlap (57-301:2-246)
30 40 50 60 70 80
pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL
::::::::::::::::::::::::::::::
CCDS48 MGCFECCIKCLGGVPYASLVATILCFSGVAL
10 20 30
90 100 110 120 130 140
pF1KE0 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF
40 50 60 70 80 90
150 160 170 180 190 200
pF1KE0 YTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEV
100 110 120 130 140 150
210 220 230 240 250 260
pF1KE0 IKSPQTNGTTGVEQICVDIRQYGIIPWNAFPGKICGSALENICNTNEFYMSYHLFIVACA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 IKSPQTNGTTGVEQICVDIRQYGIIPWNAFPGKICGSALENICNTNEFYMSYHLFIVACA
160 170 180 190 200 210
270 280 290 300
pF1KE0 GAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
:::::::::::::::::::::::::::::::::::
CCDS48 GAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
220 230 240
>>CCDS14158.1 GPM6B gene_id:2824|Hs108|chrX (265 aa)
initn: 1757 init1: 1675 opt: 1675 Z-score: 2090.5 bits: 394.7 E(32554): 4e-110
Smith-Waterman score: 1681; 86.7% identity (86.7% similar) in 301 aa overlap (1-301:5-265)
10 20 30 40 50
pF1KE0 METAAEENTEQSQERKVNSRAEMEIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSP
::::::::::::::::
CCDS14 MKPAMETAAEENTEQSQERK----------------------------------------
10 20
60 70 80 90 100 110
pF1KE0 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GCFECCIKCLGGVPYASLVATILCFSGVALFCGCGHVALAGTVAILEQHFSTNASDHALL
30 40 50 60 70 80
120 130 140 150 160 170
pF1KE0 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SEVIQLMQYVIYGIASFFFLYGIILLAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLT
90 100 110 120 130 140
180 190 200 210 220 230
pF1KE0 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 YVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGIIPWNAF
150 160 170 180 190 200
240 250 260 270 280 290
pF1KE0 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 PGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSRED
210 220 230 240 250 260
300
pF1KE0 CCTKF
:::::
CCDS14 CCTKF
>>CCDS14514.1 PLP1 gene_id:5354|Hs108|chrX (242 aa)
initn: 946 init1: 651 opt: 979 Z-score: 1223.8 bits: 234.2 E(32554): 7.6e-62
Smith-Waterman score: 979; 57.1% identity (83.2% similar) in 238 aa overlap (57-294:2-238)
30 40 50 60 70 80
pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL
: .::: .:: :.:.:::::: ::: ::::
CCDS14 MGLLECCARCLVGAPFASLVATGLCFFGVAL
10 20 30
90 100 110 120 130 140
pF1KE0 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF
:::::: ::.:: ..: .:: : .:. : .::. .:::::: :::::::: .::::::
CCDS14 FCGCGHEALTGTEKLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGF
40 50 60 70 80 90
150 160 170 180 190 200
pF1KE0 YTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEV
:::.::... :..::: ::. .:. :: .::.: :.:: ::. :::::....: :.::.
CCDS14 YTTGAVRQIFGDYKTTICGKGLSATFVGITYALTVVWLLVFACSAVPVYIYFNTWTTCQS
100 110 120 130 140 150
210 220 230 240 250 260
pF1KE0 IKSPQTNGTTGVEQICVDIRQYGIIPWNAFPGKICGSALENICNTNEFYMSYHLFIVACA
: : .. .... ..:.: :.::..::::::::.::: : .::.: :: :..::::.: .
CCDS14 IAFP-SKTSASIGSLCADARMYGVLPWNAFPGKVCGSNLLSICKTAEFQMTFHLFIAAFV
160 170 180 190 200 210
270 280 290 300
pF1KE0 GAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
::.::...:: .:.:.:::.::::. .:
CCDS14 GAAATLVSLLTFMIAATYNFAVLKLMGRGTKF
220 230 240
>>CCDS3824.1 GPM6A gene_id:2823|Hs108|chr4 (278 aa)
initn: 800 init1: 389 opt: 855 Z-score: 1068.5 bits: 205.6 E(32554): 3.4e-53
Smith-Waterman score: 855; 51.2% identity (80.7% similar) in 244 aa overlap (54-290:10-243)
30 40 50 60 70 80
pF1KE0 EIGRYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSG
.. ::::::::::::.:::::.:::: ..:
CCDS38 MEENMEEGQTQKGCFECCIKCLGGIPYASLIATILLYAG
10 20 30
90 100 110 120 130 140
pF1KE0 VALFCGCGHVALAGTVAILEQHF--STNASDHALLSEVIQLMQYVIYGIASFFFLYGIIL
::::::::: ::.::: ::. .: . .:.: . .:....:::::::. ::.:::.:
CCDS38 VALFCGCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILL
40 50 60 70 80 90
150 160 170 180 190 200
pF1KE0 LAEGFYTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIW
..:::.::.:.:.:.:.:: :.::::.:. :..:::.. .::::: .:...::.:..:.:
CCDS38 MVEGFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLW
100 110 120 130 140 150
210 220 230 240 250
pF1KE0 STCEVIKSPQTNGTTGVE--QICVDIRQYGIIPWNAFPGKICGSALEN---ICNTNEFYM
. :. .:: :: ..:.:.::.::. . ::: .. :: .:...:. :
CCDS38 TICR--------NTTLVEGANLCLDLRQFGIVTIGE-EKKIC-TVSENFLRMCESTELNM
160 170 180 190 200
260 270 280 290 300
pF1KE0 SYHLFIVACAGAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
..:::::: :::::.:::.. :.:. . :.: .:
CCDS38 TFHLFIVALAGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTR
210 220 230 240 250 260
CCDS38 SKERLNAYT
270
>>CCDS54822.1 GPM6A gene_id:2823|Hs108|chr4 (267 aa)
initn: 800 init1: 389 opt: 854 Z-score: 1067.5 bits: 205.4 E(32554): 3.9e-53
Smith-Waterman score: 854; 51.9% identity (80.9% similar) in 241 aa overlap (57-290:2-232)
30 40 50 60 70 80
pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL
::::::::::::.:::::.:::: ..::::
CCDS54 MGCFECCIKCLGGIPYASLIATILLYAGVAL
10 20 30
90 100 110 120 130 140
pF1KE0 FCGCGHVALAGTVAILEQHF--STNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAE
:::::: ::.::: ::. .: . .:.: . .:....:::::::. ::.:::.:..:
CCDS54 FCGCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILLMVE
40 50 60 70 80 90
150 160 170 180 190 200
pF1KE0 GFYTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTC
::.::.:.:.:.:.:: :.::::.:. :..:::.. .::::: .:...::.:..:.:. :
CCDS54 GFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLWTIC
100 110 120 130 140 150
210 220 230 240 250
pF1KE0 EVIKSPQTNGTTGVE--QICVDIRQYGIIPWNAFPGKICGSALEN---ICNTNEFYMSYH
. .:: :: ..:.:.::.::. . ::: .. :: .:...:. :..:
CCDS54 R--------NTTLVEGANLCLDLRQFGIVTIGE-EKKIC-TVSENFLRMCESTELNMTFH
160 170 180 190 200
260 270 280 290 300
pF1KE0 LFIVACAGAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
::::: :::::.:::.. :.:. . :.: .:
CCDS54 LFIVALAGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTRSKE
210 220 230 240 250 260
CCDS54 RLNAYT
>>CCDS58936.1 GPM6A gene_id:2823|Hs108|chr4 (271 aa)
initn: 800 init1: 389 opt: 854 Z-score: 1067.4 bits: 205.4 E(32554): 3.9e-53
Smith-Waterman score: 854; 51.9% identity (80.9% similar) in 241 aa overlap (57-290:6-236)
30 40 50 60 70 80
pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL
::::::::::::.:::::.:::: ..::::
CCDS58 MTDLEGCFECCIKCLGGIPYASLIATILLYAGVAL
10 20 30
90 100 110 120 130 140
pF1KE0 FCGCGHVALAGTVAILEQHF--STNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAE
:::::: ::.::: ::. .: . .:.: . .:....:::::::. ::.:::.:..:
CCDS58 FCGCGHEALSGTVNILQTYFEMARTAGDTLDVFTMIDIFKYVIYGIAAAFFVYGILLMVE
40 50 60 70 80 90
150 160 170 180 190 200
pF1KE0 GFYTTSAVKELHGEFKTTACGRCISGMFVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTC
::.::.:.:.:.:.:: :.::::.:. :..:::.. .::::: .:...::.:..:.:. :
CCDS58 GFFTTGAIKDLYGDFKITTCGRCVSAWFIMLTYLFMLAWLGVTAFTSLPVYMYFNLWTIC
100 110 120 130 140 150
210 220 230 240 250
pF1KE0 EVIKSPQTNGTTGVE--QICVDIRQYGIIPWNAFPGKICGSALEN---ICNTNEFYMSYH
. .:: :: ..:.:.::.::. . ::: .. :: .:...:. :..:
CCDS58 R--------NTTLVEGANLCLDLRQFGIVTIGE-EKKIC-TVSENFLRMCESTELNMTFH
160 170 180 190 200
260 270 280 290 300
pF1KE0 LFIVACAGAGATVIALLIYMMATTYNYAVLKFKSREDCCTKF
::::: :::::.:::.. :.:. . :.: .:
CCDS58 LFIVALAGAGAAVIAMVHYLMVLSANWAYVKDACRMQKYEDIKSKEEQELHDIHSTRSKE
210 220 230 240 250 260
CCDS58 RLNAYT
270
>>CCDS14513.1 PLP1 gene_id:5354|Hs108|chrX (277 aa)
initn: 926 init1: 489 opt: 496 Z-score: 621.2 bits: 122.9 E(32554): 2.8e-28
Smith-Waterman score: 899; 49.8% identity (72.5% similar) in 273 aa overlap (57-294:2-273)
30 40 50 60 70 80
pF1KE0 RYHWMYPGSKNHQYHPVPTLGDRASPLSSPGCFECCIKCLGGVPYASLVATILCFSGVAL
: .::: .:: :.:.:::::: ::: ::::
CCDS14 MGLLECCARCLVGAPFASLVATGLCFFGVAL
10 20 30
90 100 110 120 130 140
pF1KE0 FCGCGHVALAGTVAILEQHFSTNASDHALLSEVIQLMQYVIYGIASFFFLYGIILLAEGF
:::::: ::.:: ..: .:: : .:. : .::. .:::::: :::::::: .::::::
CCDS14 FCGCGHEALTGTEKLIETYFSKNYQDYEYLINVIHAFQYVIYGTASFFFLYGALLLAEGF
40 50 60 70 80 90
150 160 170
pF1KE0 YTTSAVKELHGEFKTTACGRCISGM-----------------------------------
:::.::... :..::: ::. .:.
CCDS14 YTTGAVRQIFGDYKTTICGKGLSATVTGGQKGRGSRGQHQAHSLERVCHCLGKWLGHPDK
100 110 120 130 140 150
180 190 200 210 220 230
pF1KE0 FVFLTYVLGVAWLGVFGFSAVPVFMFYNIWSTCEVIKSPQTNGTTGVEQICVDIRQYGII
:: .::.: :.:: ::. :::::....: :.::. : : .. .... ..:.: :.::..
CCDS14 FVGITYALTVVWLLVFACSAVPVYIYFNTWTTCQSIAFP-SKTSASIGSLCADARMYGVL
160 170 180 190 200 210
240 250 260 270 280 290
pF1KE0 PWNAFPGKICGSALENICNTNEFYMSYHLFIVACAGAGATVIALLIYMMATTYNYAVLKF
::::::::.::: : .::.: :: :..::::.: .::.::...:: .:.:.:::.::::.
CCDS14 PWNAFPGKVCGSNLLSICKTAEFQMTFHLFIAAFVGAAATLVSLLTFMIAATYNFAVLKL
220 230 240 250 260 270
300
pF1KE0 KSREDCCTKF
.:
CCDS14 MGRGTKF
301 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 06:17:55 2016 done: Thu Nov 3 06:17:55 2016
Total Scan time: 2.350 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]