FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3075, 235 aa 1>>>pF1KE3075 235 - 235 aa - 235 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7163+/-0.000807; mu= 11.2443+/- 0.048 mean_var=58.6098+/-11.496, 0's: 0 Z-trim(106.3): 15 B-trim: 32 in 1/48 Lambda= 0.167529 statistics sampled from 8920 (8932) to 8920 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.66), E-opt: 0.2 (0.274), width: 16 Scan time: 2.190 The best scores are: opt bits E(32554) CCDS13671.1 FAM3B gene_id:54097|Hs108|chr21 ( 235) 1580 390.1 6.8e-109 CCDS42930.1 FAM3B gene_id:54097|Hs108|chr21 ( 187) 1244 308.8 1.5e-84 CCDS76060.1 FAM3A gene_id:60343|Hs108|chrX ( 244) 509 131.2 5.9e-31 CCDS35453.1 FAM3A gene_id:60343|Hs108|chrX ( 230) 507 130.7 7.8e-31 CCDS5782.1 FAM3C gene_id:10447|Hs108|chr7 ( 227) 501 129.3 2.1e-30 CCDS55542.1 FAM3A gene_id:60343|Hs108|chrX ( 192) 497 128.3 3.5e-30 CCDS2893.1 FAM3D gene_id:131177|Hs108|chr3 ( 224) 464 120.3 1e-27 >>CCDS13671.1 FAM3B gene_id:54097|Hs108|chr21 (235 aa) initn: 1580 init1: 1580 opt: 1580 Z-score: 2068.5 bits: 390.1 E(32554): 6.8e-109 Smith-Waterman score: 1580; 100.0% identity (100.0% similar) in 235 aa overlap (1-235:1-235) 10 20 30 40 50 60 pF1KE3 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKAPVPKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKAPVPKR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 QKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 QKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 VTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 VTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIR 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 NMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 NMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 190 200 210 220 230 >>CCDS42930.1 FAM3B gene_id:54097|Hs108|chr21 (187 aa) initn: 1244 init1: 1244 opt: 1244 Z-score: 1631.3 bits: 308.8 E(32554): 1.5e-84 Smith-Waterman score: 1244; 98.4% identity (98.9% similar) in 186 aa overlap (50-235:2-187) 20 30 40 50 60 70 pF1KE3 CAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKAPVPKRQKCDHWTPCPSDTYAYRLL ::. ::::::::::::::::::::::::: CCDS42 MRPLAGAPVPKRQKCDHWTPCPSDTYAYRLL 10 20 30 80 90 100 110 120 130 pF1KE3 SGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRCFDMYEGDNSGPMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRCFDMYEGDNSGPMT 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE3 KFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWVFIAAKGLELP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWVFIAAKGLELP 100 110 120 130 140 150 200 210 220 230 pF1KE3 SEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS :::::::::::::::::::::::::::::::::::: CCDS42 SEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 160 170 180 >>CCDS76060.1 FAM3A gene_id:60343|Hs108|chrX (244 aa) initn: 492 init1: 248 opt: 509 Z-score: 669.2 bits: 131.2 E(32554): 5.9e-31 Smith-Waterman score: 509; 37.1% identity (68.6% similar) in 210 aa overlap (26-235:38-242) 10 20 30 40 50 pF1KE3 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKA :::. .:: : .: . . : : : CCDS76 PGWAWQWLSSHPATLHQLECSGAISVHCNLYLLGSSDSSAPASRVAET--TGPESSVTAA 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 PVPKRQKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVN : .. :: ::: . :.:..::.. :::.::..::. ::.::.:::.:: CCDS76 PRARKYKCGLPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVN 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 YVTGNVTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALG :.:.. .: :::. :: . . :::. .:.:...::: .:..:..... . :: CCDS76 GVSGELIEARAFDMWAGDVND-LLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELG 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE3 SKEIRNMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS :.. ... ::.::::..:::.. : .... : . ..:.: ::: ...:::::.. . CCDS76 SRNAKELAFRDSWVFVGAKGVQNKSPFEQHVKN--SKHSNKYEGWPEALEMEGCIPRRST 190 200 210 220 230 240 CCDS76 AS >>CCDS35453.1 FAM3A gene_id:60343|Hs108|chrX (230 aa) initn: 497 init1: 248 opt: 507 Z-score: 667.0 bits: 130.7 E(32554): 7.8e-31 Smith-Waterman score: 507; 35.4% identity (68.4% similar) in 237 aa overlap (1-235:1-228) 10 20 30 40 50 pF1KE3 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSS--AAYSIRSIGERPVLKAPVP :: ::: : ::.:: ... .: ... .. .: :. .. . : : :: CCDS35 MR-LAGPLRIVVLVVSVGV-TW----IVVSILLGGPGSGFPRIQQLFTSPESSVTAAPRA 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 KRQKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVT .. :: ::: . :.:..::.. :::.::..::. ::.::.:::.:: :. CCDS35 RKYKCGLPQPCPEEHLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 GNVTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKE :.. .: :::. :: . . :::. .:.:...::: .:..:..... . :::.. CCDS35 GELIEARAFDMWAGD-VNDLLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRN 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE3 IRNMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS ... ::.::::..:::.. : .. .....: ..:.: ::: ...:::::.. . CCDS35 AKELAFRDSWVFVGAKGVQNKSPFE-QHVKNSK-HSNKYEGWPEALEMEGCIPRRSTAS 180 190 200 210 220 230 >>CCDS5782.1 FAM3C gene_id:10447|Hs108|chr7 (227 aa) initn: 437 init1: 221 opt: 501 Z-score: 659.3 bits: 129.3 E(32554): 2.1e-30 Smith-Waterman score: 501; 36.8% identity (65.8% similar) in 228 aa overlap (7-234:5-226) 10 20 30 40 50 60 pF1KE3 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKAPVPKR : :.: .: . : ..: . :. :: :.. :: . . .. : : CCDS57 MRVAGAAKLVVAVAVFLLTFYVISQVFEIKMDASLGN--LFARSALDTAA-RSTKPPR 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 QKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGN :: :: .:... ::.. :::.:::.::. .::.::::.:..: ::. CCDS57 YKCGISKACPEKHFAFKMASGAANVVGPKICLEDNVLMSGVKNNVGRGINVALANGKTGE 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE3 VTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIR : :. :::. :: .:. .:... ....: :::::.:.::..:. : ::: : CCDS57 VLDTKYFDMWGGD-VAPFIEFLKAIQDGTIVLMGTYDDGATKLNDEARRLIADLGSTSIT 120 130 140 150 160 170 190 200 210 220 230 pF1KE3 NMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS :. ::..::: ..::.. : .... :..:. :.: ::: ...:::::... CCDS57 NLGFRDNWVFCGGKGIKTKSPFEQHIKNNKDT--NKYEGWPEVVEMEGCIPQKQD 180 190 200 210 220 >>CCDS55542.1 FAM3A gene_id:60343|Hs108|chrX (192 aa) initn: 475 init1: 248 opt: 497 Z-score: 655.3 bits: 128.3 E(32554): 3.5e-30 Smith-Waterman score: 497; 37.8% identity (71.0% similar) in 193 aa overlap (44-235:1-190) 20 30 40 50 60 70 pF1KE3 VVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIG-ERPVLKAPVPKRQKCDHWTPCPSD .: : : : :: .. :: ::: . CCDS55 MRLAGPESSVTAAPRARKYKCGLPQPCPEE 10 20 30 80 90 100 110 120 130 pF1KE3 TYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRCFDMYEG :.:..::.. :::.::..::. ::.::.:::.:: :.:.. .: :::. : CCDS55 HLAFRVVSGAANVIGPKICLEDKMLMSSVKDNVGRGLNIALVNGVSGELIEARAFDMWAG 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE3 DNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWVFIA : . . :::. .:.:...::: .:..:..... . :::.. ... ::.::::.. CCDS55 DVND-LLKFIRPLHEGTLVFVASYDDPATKMNEETRKLFSELGSRNAKELAFRDSWVFVG 100 110 120 130 140 200 210 220 230 pF1KE3 AKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS :::.. : .. .....: ..:.: ::: ...:::::.. . CCDS55 AKGVQNKSPFE-QHVKNSK-HSNKYEGWPEALEMEGCIPRRSTAS 150 160 170 180 190 >>CCDS2893.1 FAM3D gene_id:131177|Hs108|chr3 (224 aa) initn: 430 init1: 232 opt: 464 Z-score: 611.1 bits: 120.3 E(32554): 1e-27 Smith-Waterman score: 464; 32.9% identity (64.9% similar) in 228 aa overlap (7-231:5-220) 10 20 30 40 50 pF1KE3 MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRS---IGERPVLKAPV :.:... ..:: . .: .. . .: . .:: .. :. . : CCDS28 MRVSGVLRLLALIFAIVTTW--------MFIRSYMSFSMKTIRLPRWLAASPTKEIQV 10 20 30 40 50 60 70 80 90 100 110 pF1KE3 PKRQKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYV :. :: :::.. .:... ::.. .:::: ..:. .::.::.:::.:: . CCDS28 -KKYKCGLIKPCPANYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGT 60 70 80 90 100 120 130 140 150 160 170 pF1KE3 TGNVTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSK :: : . . :::: :: ..::.. .:.....::: .:..:..... . :::. CCDS28 TGAVLGQKAFDMYSGDVMH-LVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSS 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE3 EIRNMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS ... ::.:::::.:: :. : ... : :. :.: ::: ...:::.: CCDS28 YAKQLGFRDSWVFIGAKDLRGKSPFEQFLKNSPDT--NKYEGWPELLEMEGCMPPKPF 170 180 190 200 210 220 235 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:26:33 2016 done: Sun Nov 6 04:26:33 2016 Total Scan time: 2.190 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]