FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1950, 507 aa 1>>>pF1KE1950 507 - 507 aa - 507 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1155+/-0.00109; mu= 14.0445+/- 0.066 mean_var=75.3177+/-14.798, 0's: 0 Z-trim(103.6): 34 B-trim: 43 in 1/49 Lambda= 0.147783 statistics sampled from 7481 (7487) to 7481 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.591), E-opt: 0.2 (0.23), width: 16 Scan time: 3.150 The best scores are: opt bits E(32554) CCDS2573.1 THUMPD3 gene_id:25917|Hs108|chr3 ( 507) 3401 734.9 4.9e-212 CCDS1805.2 THUMPD2 gene_id:80745|Hs108|chr2 ( 503) 425 100.4 4.9e-21 >>CCDS2573.1 THUMPD3 gene_id:25917|Hs108|chr3 (507 aa) initn: 3401 init1: 3401 opt: 3401 Z-score: 3920.2 bits: 734.9 E(32554): 4.9e-212 Smith-Waterman score: 3401; 100.0% identity (100.0% similar) in 507 aa overlap (1-507:1-507) 10 20 30 40 50 60 pF1KE1 MCDIEEATNQLLDVNLHENQKSVQVTESDLGSESELLVTIGATVPTGFEQTAADEVREKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 MCDIEEATNQLLDVNLHENQKSVQVTESDLGSESELLVTIGATVPTGFEQTAADEVREKL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GSSCKISRDRGKIYFVISVESLAQVHCLRSVDNLFVVVQEFQDYQFKQTKEEVLKDFEDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 GSSCKISRDRGKIYFVISVESLAQVHCLRSVDNLFVVVQEFQDYQFKQTKEEVLKDFEDL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 AGKLPWSNPLKVWKINASFKKKKAKRKKINQNSSKEKINNGQEVKIDQRNVKKEFTSHAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 AGKLPWSNPLKVWKINASFKKKKAKRKKINQNSSKEKINNGQEVKIDQRNVKKEFTSHAL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 DSHILDYYENPAIKEDVSTLIGDDLASCKDETDESSKEETEPQVLKFRVTCNRAGEKHCF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 DSHILDYYENPAIKEDVSTLIGDDLASCKDETDESSKEETEPQVLKFRVTCNRAGEKHCF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 TSNEAARDFGGAVQDYFKWKADMTNFDVEVLLNIHDNEVIVGIALTEESLHRRNITHFGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 TSNEAARDFGGAVQDYFKWKADMTNFDVEVLLNIHDNEVIVGIALTEESLHRRNITHFGP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 TTLRSTLAYGMLRLCDPLPYDIIVDPMCGTGAIPIEGATEWSDCFHIAGDNNPLAVNRAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 TTLRSTLAYGMLRLCDPLPYDIIVDPMCGTGAIPIEGATEWSDCFHIAGDNNPLAVNRAA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 NNIASLLTKSQIKEGKPSWGLPIDAVQWDICNLPLRTGSVDIIVTDLPFGKRMGSKKRNW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 NNIASLLTKSQIKEGKPSWGLPIDAVQWDICNLPLRTGSVDIIVTDLPFGKRMGSKKRNW 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 NLYPACLREMSRVCTPTTGRAVLLTQDTKCFTKALSGMRHVWRKVDTVWVNVGGLRAAVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS25 NLYPACLREMSRVCTPTTGRAVLLTQDTKCFTKALSGMRHVWRKVDTVWVNVGGLRAAVY 430 440 450 460 470 480 490 500 pF1KE1 VLIRTPQAFVHPSEQDGERGTLWQCKE ::::::::::::::::::::::::::: CCDS25 VLIRTPQAFVHPSEQDGERGTLWQCKE 490 500 >>CCDS1805.2 THUMPD2 gene_id:80745|Hs108|chr2 (503 aa) initn: 350 init1: 155 opt: 425 Z-score: 491.2 bits: 100.4 E(32554): 4.9e-21 Smith-Waterman score: 492; 27.5% identity (60.8% similar) in 426 aa overlap (31-447:9-411) 10 20 30 40 50 60 pF1KE1 MCDIEEATNQLLDVNLHENQKSVQVTESDLGSESELLVTIGATVPTGFEQTAADEVREKL :: : . . :. :.: . ::: .: CCDS18 MSEARGEPGSGPEAGARFFCTAGRGLEPFVMREVRARL 10 20 30 70 80 90 100 110 pF1KE1 GSSCKISRDRGKIYFVISVESLAQVHCLRSVDNLFVVVQ-EFQDYQFKQTKEEVLKDFED ... .. ::..:. . .: ... :.:.. ::.... .: . .: ....... CCDS18 AAT-QVEYISGKVFFT-TCSDLNMLKKLKSAERLFLLIKKQFPLIISSVSKGKIFNEMQR 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE1 LAGKLP--WSNPLKVWKINASFKKKKAKRKKINQNSSKEKINNGQ----EVKIDQRNVKK : .. : : : ...:: . :: : .. ..:. :.:..... ..::.: . . CCDS18 LINEDPGSWLNAISIWKNLLELDAKKEKLSQRDDNQLKRKVGENEIIAKKLKIEQMQKIE 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE1 EFTSHALDSHILDYYENPAIKEDVSTLIGDDLASCKDETDESSKEETEPQV-LKFRVTCN : . :...: :. ..: .: .. . . ..: . .:. : : :::.: CCDS18 ENRDCQLEKQI---KEETLEQRDFTTK-SEKFQEEEFQNDIEKAIDTHNQNDLTFRVSCR 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE1 RAGE-KHCFTSNEAARDFGGAVQDYFKWKADMTNFDVEVLLNIHDNEVIVGIALTEESLH .: . ::..:... .: :.. .: ::::. : ..:......: .::: . . :: CCDS18 CSGTIGKAFTAQEVGKVIGIAIMKHFGWKADLRNPQLEIFIHLNDIYSVVGIPVFRVSLA 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE1 RRNITHFGPTTLRSTLAYGMLRLCDPLPYDIIVDPMCGTGAIPIEGATEWSDCFHIAGDN : ... . ::::.:..: : : ...::::: :.: .:.: :: : .....: CCDS18 SR--AYIKTAGLRSTIAWAMASLADIKAGAFVLDPMCGLGTILLEAAKEWPDVYYVGADV 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE1 NPLAVNRAANNIASLLTKSQIKEGKPSWGLPIDAVQWDICNLPLRTGSVDIIVTDLPFGK . . . .:. :. : : :. .. .. .::: . :::::..:.:::: CCDS18 SDSQLLGTWDNL-----KAAGLEDK------IELLKISVIELPLPSESVDIIISDIPFGK 340 350 360 370 420 430 440 450 460 470 pF1KE1 RMGSKKRNWNLYPACLREMSRVCTPTTGRAVLLTQDTKCFTKALSGMRHVWRKVDTVWVN .. : .. :.:: :: ..::..: CCDS18 KFKLGKDIKSI----LQEMERVLHVGGTIVLLLSEDHHRRLTDCKESNIPFNSKDSHTDE 380 390 400 410 420 430 480 490 500 pF1KE1 VGGLRAAVYVLIRTPQAFVHPSEQDGERGTLWQCKE CCDS18 PGIKKCLNPEEKTGAFKTASTSFEASNHKFLDRMSPFGSLVPVECYKVSLGKTDAFICKY 440 450 460 470 480 490 507 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 17:55:04 2016 done: Sun Nov 6 17:55:05 2016 Total Scan time: 3.150 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]