FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3068, 224 aa 1>>>pF1KE3068 224 - 224 aa - 224 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4348+/-0.000772; mu= 12.4755+/- 0.046 mean_var=57.4412+/-11.353, 0's: 0 Z-trim(107.4): 17 B-trim: 159 in 2/47 Lambda= 0.169224 statistics sampled from 9562 (9576) to 9562 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.294), width: 16 Scan time: 1.750 The best scores are: opt bits E(32554) CCDS2893.1 FAM3D gene_id:131177|Hs108|chr3 ( 224) 1523 379.8 7.6e-106 CCDS35453.1 FAM3A gene_id:60343|Hs108|chrX ( 230) 868 219.9 1.1e-57 CCDS55542.1 FAM3A gene_id:60343|Hs108|chrX ( 192) 821 208.4 2.6e-54 CCDS76060.1 FAM3A gene_id:60343|Hs108|chrX ( 244) 821 208.4 3.2e-54 CCDS5782.1 FAM3C gene_id:10447|Hs108|chr7 ( 227) 794 201.8 2.9e-52 CCDS55543.1 FAM3A gene_id:60343|Hs108|chrX ( 213) 467 122.0 2.9e-28 CCDS13671.1 FAM3B gene_id:54097|Hs108|chr21 ( 235) 464 121.3 5.3e-28 CCDS42930.1 FAM3B gene_id:54097|Hs108|chr21 ( 187) 456 119.3 1.7e-27 >>CCDS2893.1 FAM3D gene_id:131177|Hs108|chr3 (224 aa) initn: 1523 init1: 1523 opt: 1523 Z-score: 2013.8 bits: 379.8 E(32554): 7.6e-106 Smith-Waterman score: 1523; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224) 10 20 30 40 50 60 pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 CPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 CPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 MYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS28 MYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWV 130 140 150 160 170 180 190 200 210 220 pF1KE3 FIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF :::::::::::::::::::::::::::::::::::::::::::: CCDS28 FIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF 190 200 210 220 >>CCDS35453.1 FAM3A gene_id:60343|Hs108|chrX (230 aa) initn: 887 init1: 814 opt: 868 Z-score: 1149.4 bits: 219.9 E(32554): 1.1e-57 Smith-Waterman score: 868; 54.5% identity (81.2% similar) in 224 aa overlap (1-220:1-224) 10 20 30 40 50 pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEI----QVKKYKCG ::..: ::...:. .. .::. . .. . . . : .:: . . ...::::: CCDS35 MRLAGPLRIVVLVVSVGVTWIVVSILLGGPGSGFPRIQQLFTSPESSVTAAPRARKYKCG 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 LIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQ : .::: ...::.. ::::::.:: .:.::.:.:: ::.:::::::::::::..: .. CCDS35 LPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 KAFDMYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFR .::::..::: :.::.. . :.::.:::::::.::::.:.:::::.::: ::.:.:: CCDS35 RAFDMWAGDVNDLLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFR 130 140 150 160 170 180 180 190 200 210 220 pF1KE3 DSWVFIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF :::::.::: ...:::::: .::: .:::::::: ::::::.: CCDS35 DSWVFVGAKGVQNKSPFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS 190 200 210 220 230 >>CCDS55542.1 FAM3A gene_id:60343|Hs108|chrX (192 aa) initn: 831 init1: 814 opt: 821 Z-score: 1088.6 bits: 208.4 E(32554): 2.6e-54 Smith-Waterman score: 821; 63.1% identity (86.6% similar) in 179 aa overlap (42-220:8-186) 20 30 40 50 60 70 pF1KE3 LIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPANYFAFKIC .: : ...:::::: .::: ...::.. CCDS55 MRLAGPESSVTAAPRARKYKCGLPQPCPEEHLAFRVV 10 20 30 80 90 100 110 120 130 pF1KE3 SGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDVMHLVK ::::::.:: .:.::.:.:: ::.:::::::::::::..: .. .::::..::: :.: CCDS55 SGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEARAFDMWAGDVNDLLK 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE3 FLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKDLRGKS :.. . :.::.:::::::.::::.:.:::::.::: ::.:.:::::::.::: ...:: CCDS55 FIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFRDSWVFVGAKGVQNKS 100 110 120 130 140 150 200 210 220 pF1KE3 PFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF :::: .::: .:::::::: ::::::.: CCDS55 PFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS 160 170 180 190 >>CCDS76060.1 FAM3A gene_id:60343|Hs108|chrX (244 aa) initn: 836 init1: 814 opt: 821 Z-score: 1086.9 bits: 208.4 E(32554): 3.2e-54 Smith-Waterman score: 821; 63.1% identity (86.6% similar) in 179 aa overlap (42-220:60-238) 20 30 40 50 60 70 pF1KE3 LIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPANYFAFKIC .: : ...:::::: .::: ...::.. CCDS76 AISVHCNLYLLGSSDSSAPASRVAETTGPESSVTAAPRARKYKCGLPQPCPEEHLAFRVV 30 40 50 60 70 80 80 90 100 110 120 130 pF1KE3 SGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDVMHLVK ::::::.:: .:.::.:.:: ::.:::::::::::::..: .. .::::..::: :.: CCDS76 SGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEARAFDMWAGDVNDLLK 90 100 110 120 130 140 140 150 160 170 180 190 pF1KE3 FLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKDLRGKS :.. . :.::.:::::::.::::.:.:::::.::: ::.:.:::::::.::: ...:: CCDS76 FIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFRDSWVFVGAKGVQNKS 150 160 170 180 190 200 200 210 220 pF1KE3 PFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF :::: .::: .:::::::: ::::::.: CCDS76 PFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS 210 220 230 240 >>CCDS5782.1 FAM3C gene_id:10447|Hs108|chr7 (227 aa) initn: 825 init1: 768 opt: 794 Z-score: 1051.8 bits: 201.8 E(32554): 2.9e-52 Smith-Waterman score: 794; 51.3% identity (77.4% similar) in 226 aa overlap (1-222:1-225) 10 20 30 40 50 pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASP----TKEIQVKKYKCG :::.:. .:.. . ... :.. : . . ..: . : .: : .. . .:::: CCDS57 MRVAGAAKLVVAVAVFLLTFYVISQVFEIKMDA-SLGNLFARSALDTAARSTKPPRYKCG 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 LIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQ . : :: ..::::. ::::::::: .:.:: ..:: ::::::::.:.::.:: :: :: CCDS57 ISKACPEKHFAFKMASGAANVVGPKICLEDNVLMSGVKNNVGRGINVALANGKTGEVLDT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 KAFDMYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFR : :::..::: ...::: : :..::...::: .::.:::.:.:..::::. .:::: CCDS57 KYFDMWGGDVAPFIEFLKAIQDGTIVLMGTYDDGATKLNDEARRLIADLGSTSITNLGFR 120 130 140 150 160 170 180 190 200 210 220 pF1KE3 DSWVFIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF :.::: :.: .. :::::: .::. ::::::::::..:::::.: : CCDS57 DNWVFCGGKGIKTKSPFEQHIKNNKDTNKYEGWPEVVEMEGCIPQKQD 180 190 200 210 220 >>CCDS55543.1 FAM3A gene_id:60343|Hs108|chrX (213 aa) initn: 802 init1: 461 opt: 467 Z-score: 620.8 bits: 122.0 E(32554): 2.9e-28 Smith-Waterman score: 766; 51.3% identity (75.0% similar) in 224 aa overlap (1-220:1-207) 10 20 30 40 50 pF1KE3 MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEI----QVKKYKCG ::..: ::...:. .. .::. . .. . . . : .:: . . ...::::: CCDS55 MRLAGPLRIVVLVVSVGVTWIVVSILLGGPGSGFPRIQQLFTSPESSVTAAPRARKYKCG 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 LIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQ : .::: ...::.. ::::::.:: .:.::.:.:: ::.:::::::::::: CCDS55 LPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVN--------- 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 KAFDMYSGDVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFR :: :.::.. . :.::.:::::::.::::.:.:::::.::: ::.:.:: CCDS55 --------DVNDLLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFR 120 130 140 150 160 180 190 200 210 220 pF1KE3 DSWVFIGAKDLRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF :::::.::: ...:::::: .::: .:::::::: ::::::.: CCDS55 DSWVFVGAKGVQNKSPFEQHVKNSKHSNKYEGWPEALEMEGCIPRRSTAS 170 180 190 200 210 >>CCDS13671.1 FAM3B gene_id:54097|Hs108|chr21 (235 aa) initn: 430 init1: 232 opt: 464 Z-score: 616.1 bits: 121.3 E(32554): 5.3e-28 Smith-Waterman score: 464; 32.9% identity (64.9% similar) in 228 aa overlap (5-220:7-231) 10 20 30 40 50 pF1KE3 MRVSGVLRLLALIFAIVTTW--------MFIRSYMSFSMKTIRLPRWLAASPTKEIQV :.:... ..:: . .: .. . .: . .:: .. :. . : CCDS13 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRS---IGERPVLKAPV 10 20 30 40 50 60 70 80 90 100 pF1KE3 -KKYKCGLIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGT :. :: :::.. .:... ::.. .:::: ..:. .::.::.:::.:: . CCDS13 PKRQKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYV 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 TGAVLGQKAFDMYSGDVMH-LVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSS :: : . . :::: :: ..::.. .:.....::: .:..:..... . :::. CCDS13 TGNVTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSK 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE3 YAKQLGFRDSWVFIGAKDLRGKSPFEQFLKNSPDT--NKYEGWPELLEMEGCMPPKPF ... ::.:::::.:: :. : ... : :. :.: ::: ...:::.: CCDS13 EIRNMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 180 190 200 210 220 230 >>CCDS42930.1 FAM3B gene_id:54097|Hs108|chr21 (187 aa) initn: 430 init1: 232 opt: 456 Z-score: 607.2 bits: 119.3 E(32554): 1.7e-27 Smith-Waterman score: 456; 37.1% identity (68.3% similar) in 186 aa overlap (38-220:2-183) 10 20 30 40 50 60 pF1KE3 RLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPANYFA : ::..:. :. :: :::.. .: CCDS42 MRPLAGAPVP----KRQKCDHWTPCPSDTYA 10 20 70 80 90 100 110 120 pF1KE3 FKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDVM ... ::.. .:::: ..:. .::.::.:::.:: .:: : . . :::: :: CCDS42 YRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRCFDMYEGDNS 30 40 50 60 70 80 130 140 150 160 170 180 pF1KE3 H-LVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKD ..::.. .:.....::: .:..:..... . :::. ... ::.:::::.:: CCDS42 GPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWVFIAAKG 90 100 110 120 130 140 190 200 210 220 pF1KE3 LRGKSPFEQFLKNSPDT--NKYEGWPELLEMEGCMPPKPF :. : ... : :. :.: ::: ...:::.: CCDS42 LELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 150 160 170 180 224 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 02:21:10 2016 done: Sun Nov 6 02:21:11 2016 Total Scan time: 1.750 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]