FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6307, 314 aa
  1>>>pF1KE6307 314 - 314 aa - 314 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2730+/-0.00105; mu= 16.1509+/- 0.063
 mean_var=73.5427+/-14.381, 0's: 0 Z-trim(104.1): 47 B-trim: 38 in 1/50
 Lambda= 0.149556
 statistics sampled from 7693 (7729) to 7693 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.237), width: 16
 Scan time: 1.590

The best scores are:                                       opt  bits E(32554)
CCDS1833.1 EPCAM gene_id:4072|Hs108|chr2           ( 314) 2046 450.9 5.8e-127
CCDS609.1 TACSTD2 gene_id:4070|Hs108|chr1          ( 323) 1012 227.8 8.6e-60

>>CCDS1833.1 EPCAM gene_id:4072|Hs108|chr2 (314 aa)
 initn: 2046 init1: 2046 opt: 2046 Z-score: 2392.8 bits: 450.9 E(32554): 5.8e-127
Smith-Waterman score: 2046; 99.7% identity (99.7% similar) in 314 aa overlap (1-314:1-314)

               10        20        30        40        50        60
pF1KE6 MAPPQVLAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGAQNTVICS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 MAPPQVLAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGAQNTVICS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 KLAAKCLVMKAEMNGSKLGRRAKPEGALQNNDGLYDPDCDESGLFKAKQCNGTSTCWCVN
       :::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::
CCDS18 KLAAKCLVMKAEMNGSKLGRRAKPEGALQNNDGLYDPDCDESGLFKAKQCNGTSMCWCVN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 TAGVRRTDKDTEITCSERVRTYWIIIELKHKAREKPYDSKSLRTALQKEITTRYQLDPKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 TAGVRRTDKDTEITCSERVRTYWIIIELKHKAREKPYDSKSLRTALQKEITTRYQLDPKF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 ITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKKMDLTVNGEQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 ITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKKMDLTVNGEQL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 DLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVIVVVVIAVVAGIVVLVISRKKRMAKYEKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 DLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVIVVVVIAVVAGIVVLVISRKKRMAKYEKA
              250       260       270       280       290       300

              310
pF1KE6 EIKEMGEMHRELNA
       ::::::::::::::
CCDS18 EIKEMGEMHRELNA
              310

>>CCDS609.1 TACSTD2 gene_id:4070|Hs108|chr1 (323 aa)
 initn: 922 init1: 415 opt: 1012 Z-score: 1186.9 bits: 227.8 E(32554): 8.6e-60
Smith-Waterman score: 1012; 49.7% identity (78.8% similar) in 316 aa overlap (1-311:7-320)

                      10        20        30        40        50
pF1KE6       MAPPQV-LAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGA
             .::: . : . ::. ::..  .:::..:.: . :..:    . . .::: ..:.
CCDS60 MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDGPGGRCQCRALGS
               10        20        30        40        50        60

            60        70        80          90       100       110
pF1KE6 QNTVICSKLAAKCLVMKAEMNGSKLGRR-AKP-EGALQNNDGLYDPDCDESGLFKAKQCN
         .: :: :..:::..::.:.. : .:  ..: : :: .::::::::::  : :::.:::
CCDS60 GMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCDPEGRFKARQCN
               70        80        90       100       110       120

             120        130       140       150       160       170
pF1KE6 GTSTCWCVNTAGVRRTDK-DTEITCSERVRTYWIIIELKHKAREKPYDSKSLRTALQKEI
        ::.:::::..::::::: :  . :.: :::. :.:.:.:.     .. ..: . :.. .
CCDS60 QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFNHSDLDAELRRLF
              130       140       150       160       170       180

              180       190       200       210       220       230
pF1KE6 TTRYQLDPKFITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKK
         ::.: :::.... ::. .: :.: ::.:::. .::::.:.:::::.:.::::::...
CCDS60 RERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVDIGDAAYYFERDIKGESLFQGRG
              190       200       210       220       230       240

               240       250       260       270       280
pF1KE6 -MDLTVNGEQLDLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVIVVVVIAVVAGIVVLVIS
        .:: : :: :...  .:::::.::  :.:::. : ::.::::::::.:.:::..::::.
CCDS60 GLDLRVRGEPLQVE--RTLIYYLDEIPPKFSMKRLTAGLIAVIVVVVVALVAGMAVLVIT
              250         260       270       280       290

     290       300       310
pF1KE6 RKKRMAKYEKAEIKEMGEMHRELNA
        ... .::.:.::::.::...:
CCDS60 NRRKSGKYKKVEIKELGELRKEPSL
      300       310       320

314 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 19:35:15 2016 done: Mon Nov 7 19:35:15 2016
 Total Scan time: 1.590 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]