FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8259, 299 aa 1>>>pF1KB8259 299 - 299 aa - 299 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.1558+/-0.00087; mu= 3.5482+/- 0.052 mean_var=144.1449+/-31.606, 0's: 0 Z-trim(111.6): 18 B-trim: 657 in 1/50 Lambda= 0.106825 statistics sampled from 12476 (12493) to 12476 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.73), E-opt: 0.2 (0.384), width: 16 Scan time: 2.540 The best scores are: opt bits E(32554) CCDS13553.1 TCEA2 gene_id:6919|Hs108|chr20 ( 299) 1981 316.3 1.8e-86 CCDS13554.1 TCEA2 gene_id:6919|Hs108|chr20 ( 272) 1817 291.0 6.8e-79 CCDS47858.1 TCEA1 gene_id:6917|Hs108|chr8 ( 301) 1304 211.9 4.7e-55 CCDS47857.1 TCEA1 gene_id:6917|Hs108|chr8 ( 280) 1129 184.9 5.8e-47 CCDS44086.1 TCEA3 gene_id:6920|Hs108|chr1 ( 348) 870 145.1 7.3e-35 >>CCDS13553.1 TCEA2 gene_id:6919|Hs108|chr20 (299 aa) initn: 1981 init1: 1981 opt: 1981 Z-score: 1665.8 bits: 316.3 E(32554): 1.8e-86 Smith-Waterman score: 1981; 100.0% identity (100.0% similar) in 299 aa overlap (1-299:1-299) 10 20 30 40 50 60 pF1KB8 MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLLQSTRVGMSVNALRKQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLLQSTRVGMSVNALRKQS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 SDEEVIALAKSLIKSWKKLLDASDAKARERGRGMPLPTSSRDASEAPDPSRKRPELPRAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SDEEVIALAKSLIKSWKKLLDASDAKARERGRGMPLPTSSRDASEAPDPSRKRPELPRAP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 STPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECIFRDVGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 STPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECIFRDVGN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 TDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEIRKAMTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEIRKAMTK 190 200 210 220 230 240 250 260 270 280 290 pF1KB8 EAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGNRWKFC ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 EAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGNRWKFC 250 260 270 280 290 >>CCDS13554.1 TCEA2 gene_id:6919|Hs108|chr20 (272 aa) initn: 1817 init1: 1817 opt: 1817 Z-score: 1529.8 bits: 291.0 E(32554): 6.8e-79 Smith-Waterman score: 1817; 100.0% identity (100.0% similar) in 272 aa overlap (28-299:1-272) 10 20 30 40 50 60 pF1KB8 MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLLQSTRVGMSVNALRKQS ::::::::::::::::::::::::::::::::: CCDS13 MDLLRELKAMPITLHLLQSTRVGMSVNALRKQS 10 20 30 70 80 90 100 110 120 pF1KB8 SDEEVIALAKSLIKSWKKLLDASDAKARERGRGMPLPTSSRDASEAPDPSRKRPELPRAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SDEEVIALAKSLIKSWKKLLDASDAKARERGRGMPLPTSSRDASEAPDPSRKRPELPRAP 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB8 STPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECIFRDVGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 STPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECIFRDVGN 100 110 120 130 140 150 190 200 210 220 230 240 pF1KB8 TDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEIRKAMTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEIRKAMTK 160 170 180 190 200 210 250 260 270 280 290 pF1KB8 EAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGNRWKFC ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 EAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGNRWKFC 220 230 240 250 260 270 >>CCDS47858.1 TCEA1 gene_id:6917|Hs108|chr8 (301 aa) initn: 1266 init1: 940 opt: 1304 Z-score: 1101.9 bits: 211.9 E(32554): 4.7e-55 Smith-Waterman score: 1304; 66.8% identity (84.4% similar) in 301 aa overlap (5-299:2-301) 10 20 30 40 50 60 pF1KB8 MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLLQSTRVGMSVNALRKQS :.:..:.:...:::: ::.: ::.:::.::: .:.::.::::::.::::::.:::: CCDS47 MEDEVVRFAKKMDKMVQKKNAAGALDLLKELKNIPMTLELLQSTRIGMSVNAIRKQS 10 20 30 40 50 70 80 90 100 110 pF1KB8 SDEEVIALAKSLIKSWKKLLDA-SDAKARERGRGMPLPTS-----SRDASEAPDPSRKRP .:::: .::::::::::::::. : : .. . : :: .:. : . .: CCDS47 TDEEVTSLAKSLIKSWKKLLDGPSTEKDLDEKKKEPAITSQNSPEAREESTSSGNVSNRK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 ELPRAPSTPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECI . : .: ...:: .: : :.:: ::::::.:::.: :..::::: :.:..:::: : CCDS47 DETNARDT-YVSSFPRAPSTSDSVRLKCREMLAAALRTGDDYIAIGADEEELGSQIEEAI 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 FRDVGNTDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEI .... :::::::::::::::::::::::.::.::::: : :. .: ::.::::::::::. CCDS47 YQEIRNTDMKYKNRVRSRISNLKDAKNPNLRKNVLCGNIPPDLFARMTAEEMASDELKEM 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB8 RKAMTKEAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGN :: .:::::::::::.::::::::::::::.:::::::::::::.::::::::::::::: CCDS47 RKNLTKEAIREHQMAKTGGTQTDLFTCGKCKKKNCTYTQVQTRSADEPMTTFVVCNECGN 240 250 260 270 280 290 pF1KB8 RWKFC ::::: CCDS47 RWKFC 300 >>CCDS47857.1 TCEA1 gene_id:6917|Hs108|chr8 (280 aa) initn: 1159 init1: 940 opt: 1129 Z-score: 956.6 bits: 184.9 E(32554): 5.8e-47 Smith-Waterman score: 1159; 62.1% identity (78.1% similar) in 301 aa overlap (5-299:2-280) 10 20 30 40 50 60 pF1KB8 MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLLQSTRVGMSVNALRKQS :.:..:.:...:::: ::.: :::.::::::.:::: CCDS47 MEDEVVRFAKKMDKMVQKKNA---------------------STRIGMSVNAIRKQS 10 20 30 70 80 90 100 110 pF1KB8 SDEEVIALAKSLIKSWKKLLDA-SDAKARERGRGMPLPTS-----SRDASEAPDPSRKRP .:::: .::::::::::::::. : : .. . : :: .:. : . .: CCDS47 TDEEVTSLAKSLIKSWKKLLDGPSTEKDLDEKKKEPAITSQNSPEAREESTSSGNVSNRK 40 50 60 70 80 90 120 130 140 150 160 170 pF1KB8 ELPRAPSTPRITTFPPVPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECI . : .: ...:: .: : :.:: ::::::.:::.: :..::::: :.:..:::: : CCDS47 DETNARDT-YVSSFPRAPSTSDSVRLKCREMLAAALRTGDDYIAIGADEEELGSQIEEAI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KB8 FRDVGNTDMKYKNRVRSRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEI .... :::::::::::::::::::::::.::.::::: : :. .: ::.::::::::::. CCDS47 YQEIRNTDMKYKNRVRSRISNLKDAKNPNLRKNVLCGNIPPDLFARMTAEEMASDELKEM 160 170 180 190 200 210 240 250 260 270 280 290 pF1KB8 RKAMTKEAIREHQMARTGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGN :: .:::::::::::.::::::::::::::.:::::::::::::.::::::::::::::: CCDS47 RKNLTKEAIREHQMAKTGGTQTDLFTCGKCKKKNCTYTQVQTRSADEPMTTFVVCNECGN 220 230 240 250 260 270 pF1KB8 RWKFC ::::: CCDS47 RWKFC 280 >>CCDS44086.1 TCEA3 gene_id:6920|Hs108|chr1 (348 aa) initn: 1188 init1: 823 opt: 870 Z-score: 739.4 bits: 145.1 E(32554): 7.3e-35 Smith-Waterman score: 1048; 50.8% identity (73.4% similar) in 331 aa overlap (19-299:18-348) 10 20 30 40 50 60 pF1KB8 MMGKEEEIARIARRLDKMVTKKSAEGAMDLLRELKAMPITLHLLQSTRVGMSVNALRKQS :..:..:::.:::..:.. ....:::.::.:..::..::. CCDS44 MGQEEELLRIAKKLEKMVARKNTEGALDLLKKLHSCQMSIQLLQTTRIGVAVNGVRKHC 10 20 30 40 50 70 80 90 pF1KB8 SDEEVIALAKSLIKSWKKLLDASD------------AKARERG---------RGMPLPTS ::.::..::: :::.::.:::. :: .:.: :. : . CCDS44 SDKEVVSLAKVLIKNWKRLLDSPGPPKGEKGEEREKAKKKEKGLECSDWKPEAGLSPPRK 60 70 80 90 100 110 100 110 120 130 pF1KB8 SRD--------------ASEAP--------DPSRKRPELPRAPSTPRITTFP-------P .:. :: .: . :... : :..::.: :: : CCDS44 KREDPKTRRDSVDSKSSASSSPKRPSVERSNSSKSKAESPKTPSSPLTPTFASSMCLLAP 120 130 140 150 160 170 140 150 160 170 180 190 pF1KB8 VPVTCDAVRNKCREMLTAALQTDHDHVAIGADCERLSAQIEECIFRDVGNTDMKYKNRVR .: :.::.:: :::.:::..: :. :..:......::. :.... .:::::.:::: CCDS44 CYLTGDSVRDKCVEMLSAALKADDDYKDYGVNCDKMASEIEDHIYQELKSTDMKYRNRVR 180 190 200 210 220 230 200 210 220 230 240 250 pF1KB8 SRISNLKDAKNPDLRRNVLCGAITPQQIAVMTSEEMASDELKEIRKAMTKEAIREHQMAR :::::::: .:: :::::: :::. :: ::.::::::::.:.:.:::.:::::::::. CCDS44 SRISNLKDPRNPGLRRNVLSGAISAGLIAKMTAEEMASDELRELRNAMTQEAIREHQMAK 240 250 260 270 280 290 260 270 280 290 pF1KB8 TGGTQTDLFTCGKCRKKNCTYTQVQTRSSDEPMTTFVVCNECGNRWKFC :::: :::: :.::.::::::.::::::.::::::::.::::::::::: CCDS44 TGGTTTDLFQCSKCKKKNCTYNQVQTRSADEPMTTFVLCNECGNRWKFC 300 310 320 330 340 299 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 14:30:11 2016 done: Mon Nov 7 14:30:11 2016 Total Scan time: 2.540 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]