FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE3068, 224 aa
1>>>pF1KE3068 224 - 224 aa - 224 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4348+/-0.000772; mu= 12.4755+/- 0.046
mean_var=57.4412+/-11.353, 0's: 0 Z-trim(107.4): 17 B-trim: 159 in 2/47
Lambda= 0.169224
statistics sampled from 9562 (9576) to 9562 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.294), width: 16
Scan time: 1.750
The best scores are: opt bits E(32554)
CCDS2893.1 FAM3D gene_id:131177|Hs108|chr3 ( 224) 1523 379.8 7.6e-106
CCDS35453.1 FAM3A gene_id:60343|Hs108|chrX ( 230) 868 219.9 1.1e-57
CCDS55542.1 FAM3A gene_id:60343|Hs108|chrX ( 192) 821 208.4 2.6e-54
CCDS76060.1 FAM3A gene_id:60343|Hs108|chrX ( 244) 821 208.4 3.2e-54
CCDS5782.1 FAM3C gene_id:10447|Hs108|chr7 ( 227) 794 201.8 2.9e-52
CCDS55543.1 FAM3A gene_id:60343|Hs108|chrX ( 213) 467 122.0 2.9e-28
CCDS13671.1 FAM3B gene_id:54097|Hs108|chr21 ( 235) 464 121.3 5.3e-28
CCDS42930.1 FAM3B gene_id:54097|Hs108|chr21 ( 187) 456 119.3 1.7e-27
>>CCDS2893.1 FAM3D gene_id:131177|Hs108|chr3 (224 aa)
initn: 1523 init1: 1523 opt: 1523 Z-score: 2013.8 bits: 379.8 E(32554): 7.6e-106
Smith-Waterman score: 1523; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224)
10 20 30 40 50 60
pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE3 CPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 CPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE3 MYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWV
130 140 150 160 170 180
190 200 210 220
pF1KE3 FIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
::::::::::::::::::::::::::::::::::::::::::::
CCDS28 FIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
190 200 210 220
>>CCDS35453.1 FAM3A gene_id:60343|Hs108|chrX (230 aa)
initn: 887 init1: 814 opt: 868 Z-score: 1149.4 bits: 219.9 E(32554): 1.1e-57
Smith-Waterman score: 868; 54.5% identity (81.2% similar) in 224 aa overlap (1-220:1-224)
10 20 30 40 50
pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEI----QVKKYKCG
::..: ::...:. .. .::. . .. . . . : .:: . . ...:::::
CCDS35 MRLAGPLRIVVLVVSVGVTWIVVSILLGGPGSGFPRIQQLFTSPESSVTAAPRARKYKCG
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE3 LIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQ
: .::: ...::.. ::::::.:: .:.::.:.:: ::.:::::::::::::..: ..
CCDS35 LPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEA
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE3 KAFDMYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFR
.::::..::: :.::.. . :.::.:::::::.::::.:.:::::.::: ::.:.::
CCDS35 RAFDMWAGDVNDLLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFR
130 140 150 160 170 180
180 190 200 210 220
pF1KE3 DSWVFIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
:::::.::: ...:::::: .::: .:::::::: ::::::.:
CCDS35 DSWVFVGAKGVQNKSPFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS
190 200 210 220 230
>>CCDS55542.1 FAM3A gene_id:60343|Hs108|chrX (192 aa)
initn: 831 init1: 814 opt: 821 Z-score: 1088.6 bits: 208.4 E(32554): 2.6e-54
Smith-Waterman score: 821; 63.1% identity (86.6% similar) in 179 aa overlap (42-220:8-186)
20 30 40 50 60 70
pF1KE3 LIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPANYFAFKIC
.: : ...:::::: .::: ...::..
CCDS55 MRLAGPESSVTAAPRARKYKCGLPQPCPEEHLAFRVV
10 20 30
80 90 100 110 120 130
pF1KE3 SGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDVMHLVK
::::::.:: .:.::.:.:: ::.:::::::::::::..: .. .::::..::: :.:
CCDS55 SGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEARAFDMWAGDVNDLLK
40 50 60 70 80 90
140 150 160 170 180 190
pF1KE3 FLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKDLRGKS
:.. . :.::.:::::::.::::.:.:::::.::: ::.:.:::::::.::: ...::
CCDS55 FIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFRDSWVFVGAKGVQNKS
100 110 120 130 140 150
200 210 220
pF1KE3 PFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
:::: .::: .:::::::: ::::::.:
CCDS55 PFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS
160 170 180 190
>>CCDS76060.1 FAM3A gene_id:60343|Hs108|chrX (244 aa)
initn: 836 init1: 814 opt: 821 Z-score: 1086.9 bits: 208.4 E(32554): 3.2e-54
Smith-Waterman score: 821; 63.1% identity (86.6% similar) in 179 aa overlap (42-220:60-238)
20 30 40 50 60 70
pF1KE3 LIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPANYFAFKIC
.: : ...:::::: .::: ...::..
CCDS76 AISVHCNLYLLGSSDSSAPASRVAETTGPESSVTAAPRARKYKCGLPQPCPEEHLAFRVV
30 40 50 60 70 80
80 90 100 110 120 130
pF1KE3 SGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDVMHLVK
::::::.:: .:.::.:.:: ::.:::::::::::::..: .. .::::..::: :.:
CCDS76 SGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEARAFDMWAGDVNDLLK
90 100 110 120 130 140
140 150 160 170 180 190
pF1KE3 FLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKDLRGKS
:.. . :.::.:::::::.::::.:.:::::.::: ::.:.:::::::.::: ...::
CCDS76 FIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFRDSWVFVGAKGVQNKS
150 160 170 180 190 200
200 210 220
pF1KE3 PFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
:::: .::: .:::::::: ::::::.:
CCDS76 PFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS
210 220 230 240
>>CCDS5782.1 FAM3C gene_id:10447|Hs108|chr7 (227 aa)
initn: 825 init1: 768 opt: 794 Z-score: 1051.8 bits: 201.8 E(32554): 2.9e-52
Smith-Waterman score: 794; 51.3% identity (77.4% similar) in 226 aa overlap (1-222:1-225)
10 20 30 40 50
pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASP----TKEIQVKKYKCG
:::.:. .:.. . ... :.. : . . ..: . : .: : .. . .::::
CCDS57 MRVAGAAKLVVAVAVFLLTFYVISQVFEIKMDA-SLGNLFARSALDTAARSTKPPRYKCG
10 20 30 40 50
60 70 80 90 100 110
pF1KE3 LIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQ
. : :: ..::::. ::::::::: .:.:: ..:: ::::::::.:.::.:: :: ::
CCDS57 ISKACPEKHFAFKMASGAANVVGPKICLEDNVLMSGVKNNVGRGINVALANGKTGEVLDT
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE3 KAFDMYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFR
: :::..::: ...::: : :..::...::: .::.:::.:.:..::::. .::::
CCDS57 KYFDMWGGDVAPFIEFLKAIQDGTIVLMGTYDDGATKLNDEARRLIADLGSTSITNLGFR
120 130 140 150 160 170
180 190 200 210 220
pF1KE3 DSWVFIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
:.::: :.: .. :::::: .::. ::::::::::..:::::.: :
CCDS57 DNWVFCGGKGIKTKSPFEQHIKNNKDTNKYEGWPEVVEMEGCIPQKQD
180 190 200 210 220
>>CCDS55543.1 FAM3A gene_id:60343|Hs108|chrX (213 aa)
initn: 802 init1: 461 opt: 467 Z-score: 620.8 bits: 122.0 E(32554): 2.9e-28
Smith-Waterman score: 766; 51.3% identity (75.0% similar) in 224 aa overlap (1-220:1-207)
10 20 30 40 50
pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEI----QVKKYKCG
::..: ::...:. .. .::. . .. . . . : .:: . . ...:::::
CCDS55 MRLAGPLRIVVLVVSVGVTWIVVSILLGGPGSGFPRIQQLFTSPESSVTAAPRARKYKCG
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE3 LIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQ
: .::: ...::.. ::::::.:: .:.::.:.:: ::.::::::::::::
CCDS55 LPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVN---------
70 80 90 100 110
120 130 140 150 160 170
pF1KE3 KAFDMYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFR
:: :.::.. . :.::.:::::::.::::.:.:::::.::: ::.:.::
CCDS55 --------DVNDLLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFR
120 130 140 150 160
180 190 200 210 220
pF1KE3 DSWVFIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF
:::::.::: ...:::::: .::: .:::::::: ::::::.:
CCDS55 DSWVFVGAKGVQNKSPFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS
170 180 190 200 210
>>CCDS13671.1 FAM3B gene_id:54097|Hs108|chr21 (235 aa)
initn: 430 init1: 232 opt: 464 Z-score: 616.1 bits: 121.3 E(32554): 5.3e-28
Smith-Waterman score: 464; 32.9% identity (64.9% similar) in 228 aa overlap (5-220:7-231)
10 20 30 40 50
pF1KE3 MRVSGVLRLLALIFAIVTTW--------MFIRSYMSFSMKTIRLPRWLAASPTKEIQV
:.:... ..:: . .: .. . .: . .:: .. :. . :
CCDS13 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRS---IGERPVLKAPV
10 20 30 40 50
60 70 80 90 100
pF1KE3 -KKYKCGLIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGT
:. :: :::.. .:... ::.. .:::: ..:. .::.::.:::.:: .
CCDS13 PKRQKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYV
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE3 TGAVLGQKAFDMYSGDVMH-LVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSS
:: : . . :::: :: ..::.. .:.....::: .:..:..... . :::.
CCDS13 TGNVTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSK
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE3 YAKQLGFRDSWVFIGAKDLRGKSPFEQFLKNSPDT--NKYEGWPELLEMEGCMPPKPF
... ::.:::::.:: :. : ... : :. :.: ::: ...:::.:
CCDS13 EIRNMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS
180 190 200 210 220 230
>>CCDS42930.1 FAM3B gene_id:54097|Hs108|chr21 (187 aa)
initn: 430 init1: 232 opt: 456 Z-score: 607.2 bits: 119.3 E(32554): 1.7e-27
Smith-Waterman score: 456; 37.1% identity (68.3% similar) in 186 aa overlap (38-220:2-183)
10 20 30 40 50 60
pF1KE3 RLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPANYFA
: ::..:. :. :: :::.. .:
CCDS42 MRPLAGAPVP----KRQKCDHWTPCPSDTYA
10 20
70 80 90 100 110 120
pF1KE3 FKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDVM
... ::.. .:::: ..:. .::.::.:::.:: .:: : . . :::: ::
CCDS42 YRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRCFDMYEGDNS
30 40 50 60 70 80
130 140 150 160 170 180
pF1KE3 H-LVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKD
..::.. .:.....::: .:..:..... . :::. ... ::.:::::.::
CCDS42 GPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWVFIAAKG
90 100 110 120 130 140
190 200 210 220
pF1KE3 LRGKSPFEQFLKNSPDT--NKYEGWPELLEMEGCMPPKPF
:. : ... : :. :.: ::: ...:::.:
CCDS42 LELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS
150 160 170 180
224 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 02:21:10 2016 done: Sun Nov 6 02:21:11 2016
Total Scan time: 1.750 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]