FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6749, 542 aa 1>>>pF1KE6749 542 - 542 aa - 542 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.2589+/-0.00104; mu= 0.0500+/- 0.062 mean_var=188.1386+/-38.115, 0's: 0 Z-trim(110.8): 21 B-trim: 163 in 1/51 Lambda= 0.093505 statistics sampled from 11889 (11904) to 11889 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.366), width: 16 Scan time: 2.670 The best scores are: opt bits E(32554) CCDS3871.1 PAPD7 gene_id:11044|Hs108|chr5 ( 542) 3609 499.4 4.3e-141 CCDS54006.1 PAPD5 gene_id:64282|Hs108|chr16 ( 698) 1648 234.9 2.3e-61 >>CCDS3871.1 PAPD7 gene_id:11044|Hs108|chr5 (542 aa) initn: 3609 init1: 3609 opt: 3609 Z-score: 2646.5 bits: 499.4 E(32554): 4.3e-141 Smith-Waterman score: 3609; 100.0% identity (100.0% similar) in 542 aa overlap (1-542:1-542) 10 20 30 40 50 60 pF1KE6 MSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 MSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 LQLLEQALRKHNVAEPCSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 LQLLEQALRKHNVAEPCSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 MKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQLHPRIDARRADENLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 MKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQLHPRIDARRADENLG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 MLLVEFFELYGRNFNYLKTGIRIKEGGAYIAKEEIMKAMTSGYRPSMLCIEDPLLPGNDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 MLLVEFFELYGRNFNYLKTGIRIKEGGAYIAKEEIMKAMTSGYRPSMLCIEDPLLPGNDV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GRSSYGAMQVKQVFDYAYIVLSHAVSPLARSYPNRDAESTLGRIIKVTQEVIDYRRWIKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 GRSSYGAMQVKQVFDYAYIVLSHAVSPLARSYPNRDAESTLGRIIKVTQEVIDYRRWIKE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 KWGSKAHPSPGMDSRIKIKERIATCNGEQTQNREPESPYGQRLTLSLSSPQLLSSGSSAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 KWGSKAHPSPGMDSRIKIKERIATCNGEQTQNREPESPYGQRLTLSLSSPQLLSSGSSAS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 SVSSLSGSDVDSDTPPCTTPSVYQFSLQAPAPLMAGLPTALPMPSGKPQPTTSRTLIMTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 SVSSLSGSDVDSDTPPCTTPSVYQFSLQAPAPLMAGLPTALPMPSGKPQPTTSRTLIMTT 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 NNQTRFTIPPPTLGVAPVPCRQAGVEGTASLKAVHHMSSPAIPSASPNPLSSPHLYHKQH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 NNQTRFTIPPPTLGVAPVPCRQAGVEGTASLKAVHHMSSPAIPSASPNPLSSPHLYHKQH 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 NGMKLSMKGSHGHTQGGGYSSVGSGGVRPPVGNRGHHQYNRTGWRRKKHTHTRDSLPVSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 NGMKLSMKGSHGHTQGGGYSSVGSGGVRPPVGNRGHHQYNRTGWRRKKHTHTRDSLPVSL 490 500 510 520 530 540 pF1KE6 SR :: CCDS38 SR >>CCDS54006.1 PAPD5 gene_id:64282|Hs108|chr16 (698 aa) initn: 1642 init1: 1554 opt: 1648 Z-score: 1215.1 bits: 234.9 E(32554): 2.3e-61 Smith-Waterman score: 1648; 66.1% identity (84.5% similar) in 386 aa overlap (1-380:210-595) 10 20 30 pF1KE6 MSPCPEEAAMRREVVKRIETVVKDLWPTAD ::: ::: :: :::.:::.:.:.:::.:: CCDS54 VVYSGTPWKRRNYNQGVVGLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSAD 180 190 200 210 220 230 40 50 60 70 80 90 pF1KE6 VQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALRKHNVAEPCSIKVLDKATVPII :::::::.::::::::::::::::::: :: ::.:::::.::. :.::::::::::: CCDS54 VQIFGSFKTGLYLPTSDIDLVVFGKWENLPLWTLEEALRKHKVADEDSVKVLDKATVPII 240 250 260 270 280 290 100 110 120 130 140 150 pF1KE6 KLTDQETEVKVDISFNMETGVRAAEFIKNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGG ::::. ::::::::::...:::::..::.. ::: .::::.::::::::::::::::::: CCDS54 KLTDSFTEVKVDISFNVQNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGG 300 310 320 330 340 350 160 170 180 190 200 210 pF1KE6 ISSYSLILMAISFLQLHPRIDARRADENLGMLLVEFFELYGRNFNYLKTGIRIKEGGAYI :.::::.:::.:::::::: :: . : :.::.::::::::.:::::::::::.::.:. CCDS54 IGSYSLFLMAVSFLQLHPREDACIPNTNYGVLLIEFFELYGRHFNYLKTGIRIKDGGSYV 360 370 380 390 400 410 220 230 240 250 260 270 pF1KE6 AKEEIMKAMTSGYRPSMLCIEDPLLPGNDVGRSSYGAMQVKQVFDYAYIVLSHAVSPLAR ::.:..: : .::::::: ::::: :::::::::::::::::.:::::.::::::::.:. CCDS54 AKDEVQKNMLDGYRPSMLYIEDPLQPGNDVGRSSYGAMQVKQAFDYAYVVLSHAVSPIAK 420 430 440 450 460 470 280 290 300 310 320 pF1KE6 SYPNRDAESTLGRIIKVTQEVIDYRRWIKEKWGSKAHPSPGMDSR----IKIKERIATCN ::: ..:: :::::.::.:: :: ::...:: : .: :. .. : ... :: CCDS54 YYPNNETESILGRIIRVTDEVATYRDWISKQWGLKNRPEPSCNGNGVTLIVDTQQLDKCN 480 490 500 510 520 530 330 340 350 360 370 380 pF1KE6 GEQTQNREPESPYGQRLTLSLSSPQLLSSGS--SASSVSSLSGSDVDSDTPPCTTPSVYQ .. ... : . .. . :::. . ::.. :.::... :.::::::. :: :: CCDS54 NNLSEENEALGKCRSKTSESLSKHSSNSSSGPVSSSSATQSSSSDVDSDATPCKTPKQLL 540 550 560 570 580 590 390 400 410 420 430 440 pF1KE6 FSLQAPAPLMAGLPTALPMPSGKPQPTTSRTLIMTTNNQTRFTIPPPTLGVAPVPCRQAG CCDS54 CRPSTGNRVGSQDVSLESSQAVGKMQSTQTTNTSNSTNKSQHGSARLFRSSSKGFQGTTQ 600 610 620 630 640 650 542 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 16:00:13 2016 done: Tue Nov 8 16:00:13 2016 Total Scan time: 2.670 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]