FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1948, 503 aa 1>>>pF1KE1948 503 - 503 aa - 503 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.7588+/-0.0011; mu= 10.7875+/- 0.066 mean_var=106.3722+/-21.585, 0's: 0 Z-trim(105.4): 23 B-trim: 77 in 1/48 Lambda= 0.124354 statistics sampled from 8373 (8379) to 8373 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.62), E-opt: 0.2 (0.257), width: 16 Scan time: 3.320 The best scores are: opt bits E(32554) CCDS1805.2 THUMPD2 gene_id:80745|Hs108|chr2 ( 503) 3287 601.0 1e-171 CCDS2573.1 THUMPD3 gene_id:25917|Hs108|chr3 ( 507) 425 87.5 3.8e-17 >>CCDS1805.2 THUMPD2 gene_id:80745|Hs108|chr2 (503 aa) initn: 3287 init1: 3287 opt: 3287 Z-score: 3196.4 bits: 601.0 E(32554): 1e-171 Smith-Waterman score: 3287; 100.0% identity (100.0% similar) in 503 aa overlap (1-503:1-503) 10 20 30 40 50 60 pF1KE1 MSEARGEPGSGPEAGARFFCTAGRGLEPFVMREVRARLAATQVEYISGKVFFTTCSDLNM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 MSEARGEPGSGPEAGARFFCTAGRGLEPFVMREVRARLAATQVEYISGKVFFTTCSDLNM 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 LKKLKSAERLFLLIKKQFPLIISSVSKGKIFNEMQRLINEDPGSWLNAISIWKNLLELDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 LKKLKSAERLFLLIKKQFPLIISSVSKGKIFNEMQRLINEDPGSWLNAISIWKNLLELDA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 KKEKLSQRDDNQLKRKVGENEIIAKKLKIEQMQKIEENRDCQLEKQIKEETLEQRDFTTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 KKEKLSQRDDNQLKRKVGENEIIAKKLKIEQMQKIEENRDCQLEKQIKEETLEQRDFTTK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 SEKFQEEEFQNDIEKAIDTHNQNDLTFRVSCRCSGTIGKAFTAQEVGKVIGIAIMKHFGW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 SEKFQEEEFQNDIEKAIDTHNQNDLTFRVSCRCSGTIGKAFTAQEVGKVIGIAIMKHFGW 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 KADLRNPQLEIFIHLNDIYSVVGIPVFRVSLASRAYIKTAGLRSTIAWAMASLADIKAGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 KADLRNPQLEIFIHLNDIYSVVGIPVFRVSLASRAYIKTAGLRSTIAWAMASLADIKAGA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 FVLDPMCGLGTILLEAAKEWPDVYYVGADVSDSQLLGTWDNLKAAGLEDKIELLKISVIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 FVLDPMCGLGTILLEAAKEWPDVYYVGADVSDSQLLGTWDNLKAAGLEDKIELLKISVIE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 LPLPSESVDIIISDIPFGKKFKLGKDIKSILQEMERVLHVGGTIVLLLSEDHHRRLTDCK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 LPLPSESVDIIISDIPFGKKFKLGKDIKSILQEMERVLHVGGTIVLLLSEDHHRRLTDCK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 ESNIPFNSKDSHTDEPGIKKCLNPEEKTGAFKTASTSFEASNHKFLDRMSPFGSLVPVEC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS18 ESNIPFNSKDSHTDEPGIKKCLNPEEKTGAFKTASTSFEASNHKFLDRMSPFGSLVPVEC 430 440 450 460 470 480 490 500 pF1KE1 YKVSLGKTDAFICKYKKSHSSGL ::::::::::::::::::::::: CCDS18 YKVSLGKTDAFICKYKKSHSSGL 490 500 >>CCDS2573.1 THUMPD3 gene_id:25917|Hs108|chr3 (507 aa) initn: 350 init1: 155 opt: 425 Z-score: 421.4 bits: 87.5 E(32554): 3.8e-17 Smith-Waterman score: 492; 27.5% identity (60.8% similar) in 426 aa overlap (9-411:31-447) 10 20 30 pF1KE1 MSEARGEPGSGPEAGARFFCTAGRGLEPFVMREVRARL :: : . . :. :.: . ::: .: CCDS25 MCDIEEATNQLLDVNLHENQKSVQVTESDLGSESELLVTIGATVPTGFEQTAADEVREKL 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE1 AAT-QVEYISGKVFFT-TCSDLNMLKKLKSAERLFLLIKKQFPLIISSVSKGKIFNEMQR ... .. ::..:. . .: ... :.:.. ::.... .: . .: ....... CCDS25 GSSCKISRDRGKIYFVISVESLAQVHCLRSVDNLFVVVQ-EFQDYQFKQTKEEVLKDFED 70 80 90 100 110 100 110 120 130 140 150 pF1KE1 LINEDPGSWLNAISIWKNLLELDAKKEKLSQRDDNQLKRKVGENEIIAKKLKIEQMQKIE : .. : : : ...:: . :: : .. ..:. :.:..... ..::.: . . CCDS25 LAGKLP--WSNPLKVWKINASFKKKKAKRKKINQNSSKEKINNGQ----EVKIDQRNVKK 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE1 ENRDCQLEKQI---KEETLEQRDFTTK-SEKFQEEEFQNDIEKAIDTHNQNDLTFRVSCR : . :...: :. ..: .: .. . . ..: . .:. : : :::.: CCDS25 EFTSHALDSHILDYYENPAIKEDVSTLIGDDLASCKDETDESSKEETEPQV-LKFRVTCN 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE1 CSGTIGKAFTAQEVGKVIGIAIMKHFGWKADLRNPQLEIFIHLNDIYSVVGIPVFRVSLA .: . ::..:... .: :.. .: ::::. : ..:......: .::: . . :: CCDS25 RAGE-KHCFTSNEAARDFGGAVQDYFKWKADMTNFDVEVLLNIHDNEVIVGIALTEESLH 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE1 SR--AYIKTAGLRSTIAWAMASLADIKAGAFVLDPMCGLGTILLEAAKEWPDVYYVGADV : ... . ::::.:..: : : ...::::: :.: .:.: :: : .....: CCDS25 RRNITHFGPTTLRSTLAYGMLRLCDPLPYDIIVDPMCGTGAIPIEGATEWSDCFHIAGDN 300 310 320 330 340 350 340 350 360 370 pF1KE1 SDSQLLGTWDNL-----KAAGLEDK------IELLKISVIELPLPSESVDIIISDIPFGK . . . .:. :. : : :. .. .. .::: . :::::..:.:::: CCDS25 NPLAVNRAANNIASLLTKSQIKEGKPSWGLPIDAVQWDICNLPLRTGSVDIIVTDLPFGK 360 370 380 390 400 410 380 390 400 410 420 430 pF1KE1 KFKLGKDIKSI----LQEMERVLHVGGTIVLLLSEDHHRRLTDCKESNIPFNSKDSHTDE .. : .. :.:: :: ..::..: CCDS25 RMGSKKRNWNLYPACLREMSRVCTPTTGRAVLLTQDTKCFTKALSGMRHVWRKVDTVWVN 420 430 440 450 460 470 440 450 460 470 480 490 pF1KE1 PGIKKCLNPEEKTGAFKTASTSFEASNHKFLDRMSPFGSLVPVECYKVSLGKTDAFICKY CCDS25 VGGLRAAVYVLIRTPQAFVHPSEQDGERGTLWQCKE 480 490 500 503 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 18:30:50 2016 done: Sun Nov 6 18:30:51 2016 Total Scan time: 3.320 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]