FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1568, 309 aa 1>>>pF1KE1568 309 - 309 aa - 309 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8013+/-0.000882; mu= 13.5843+/- 0.053 mean_var=79.1676+/-15.638, 0's: 0 Z-trim(106.8): 55 B-trim: 0 in 0/50 Lambda= 0.144145 statistics sampled from 9155 (9210) to 9155 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.666), E-opt: 0.2 (0.283), width: 16 Scan time: 2.640 The best scores are: opt bits E(32554) CCDS13340.1 TOMM34 gene_id:10953|Hs108|chr20 ( 309) 2007 426.8 1e-119 CCDS34930.1 SPAG1 gene_id:6674|Hs108|chr8 ( 926) 895 195.8 1e-49 CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 ( 665) 289 69.7 6.8e-12 CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 ( 631) 284 68.7 1.3e-11 CCDS53782.1 RPAP3 gene_id:79657|Hs108|chr12 ( 506) 270 65.7 8.4e-11 CCDS4348.1 TTC1 gene_id:7265|Hs108|chr5 ( 292) 266 64.8 9.4e-11 >>CCDS13340.1 TOMM34 gene_id:10953|Hs108|chr20 (309 aa) initn: 2007 init1: 2007 opt: 2007 Z-score: 2262.9 bits: 426.8 E(32554): 1e-119 Smith-Waterman score: 2007; 100.0% identity (100.0% similar) in 309 aa overlap (1-309:1-309) 10 20 30 40 50 60 pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQAQGSSDPEEESVLYSNRAAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQAQGSSDPEEESVLYSNRAAC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 HLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 HLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 AVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 AVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTAT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 KNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLLCSNLESATYSNRALCYLVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 KNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLLCSNLESATYSNRALCYLVL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 KQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 KQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKL 250 260 270 280 290 300 pF1KE1 RQEVKQNLH ::::::::: CCDS13 RQEVKQNLH >>CCDS34930.1 SPAG1 gene_id:6674|Hs108|chr8 (926 aa) initn: 879 init1: 546 opt: 895 Z-score: 1005.9 bits: 195.8 E(32554): 1e-49 Smith-Waterman score: 895; 48.8% identity (73.2% similar) in 299 aa overlap (12-309:448-739) 10 20 30 40 pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA :.. ::: ::.::.:::.. :. :. .:. CCDS34 SPRRASAAAAAGGGATGHPGGGQGAENPAGLKSQGNELFRSGQFAEAAGKYSAAIALLEP 420 430 440 450 460 470 50 60 70 80 90 100 pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY :: .. :.::::::::.::.::: ::.::. :: : :::.::::::: :::.::.: CCDS34 AGSEIADDLSILYSNRAACYLKEGNCSGCIQDCNRALELHPFSMKPLLRRAMAYETLEQY 480 490 500 510 520 530 110 120 130 140 150 160 pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQ-KRWNS :::::::::::: .. : ...::..: ::. ::.:: :: :: ::.:. . :. CCDS34 GKAYVDYKTVLQIDCGLQLANDSVNRLSRILMELDGPNWREKLSPIPAVPASVPLQAWH- 540 550 560 570 580 590 170 180 190 200 210 220 pF1KE1 LPSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESL :. ::: .... .... .: . : . ..::::::. :. :.: :. :::: : CCDS34 -PA---KEMISKQAGDSSS--HRQQGITDEKTFKALKEEGNQCVNDKNYKDALSKYSECL 600 610 620 630 640 650 230 240 250 260 270 280 pF1KE1 LCSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSS .: : : :.::::::: : :. :: .:: .::.: ::::::::: :::.::.:..: CCDS34 KINNKECAIYTNRALCYLKLCQFEEAKQDCDQALQLADGNVKAFYRRALAHKGLKNYQKS 660 670 680 690 700 710 290 300 pF1KE1 FADISNLLQIEPRNGPAQKLRQEVKQNLH . :..... ..: :. .:: . :. CCDS34 LIDLNKVILLDPSIIEAKMELEEVTRLLNLKDKTAPFNKEKERRKIEIQEVNEGKEEPGR 720 730 740 750 760 770 >>CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 (665 aa) initn: 463 init1: 218 opt: 289 Z-score: 327.0 bits: 69.7 E(32554): 6.8e-12 Smith-Waterman score: 359; 29.3% identity (57.2% similar) in 297 aa overlap (12-308:136-397) 10 20 30 40 pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA :. ::. :..:.: :: : ... CCDS87 DKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAIDCYTKGM----- 110 120 130 140 150 160 50 60 70 80 90 100 pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY ..:: . :: .:::. ... . .::. :.:: : ::..: ::.: CCDS87 --DADPYNP-VLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFALQKL 170 180 190 200 210 110 120 130 140 150 160 pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSL : ::. ::... : :.. . ....:: :.:. :: CCDS87 EEAKKDYERVLELEPNNFEATNELRKISQAL---------------------ASKE-NSY 220 230 240 250 170 180 190 200 210 220 pF1KE1 PSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLL :.: : : : . .... . . ..: :..:: . :.:....::: :.... CCDS87 PKE-----ADIVIKSTEGERKQIEAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIA 260 270 280 290 300 310 230 240 250 260 270 280 pF1KE1 CSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSF .. .. .:::. :: ...: :: ::::.:. :::. ::: ::. :. : . . CCDS87 ADGANALLPANRAMAYLKIQKYEEAEKDCTQAILLDGSYSKAFARRGTARTFLGKLNEAK 320 330 340 350 360 370 290 300 pF1KE1 ADISNLLQIEPRNGPAQKLRQEVKQNLH :. ..: .:: : : ...:..: CCDS87 QDFETVLLLEPGNKQAVTELSKIKKELIEKGHWDDVFLDSTQRQNVVKPIDNPPHPGSTK 380 390 400 410 420 430 >>CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 (631 aa) initn: 463 init1: 218 opt: 284 Z-score: 321.7 bits: 68.7 E(32554): 1.3e-11 Smith-Waterman score: 354; 29.2% identity (56.9% similar) in 295 aa overlap (12-306:136-395) 10 20 30 40 pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA :. ::. :..:.: :: : ... CCDS53 DKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAIDCYTKGM----- 110 120 130 140 150 160 50 60 70 80 90 100 pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY ..:: . :: .:::. ... . .::. :.:: : ::..: ::.: CCDS53 --DADPYNP-VLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFALQKL 170 180 190 200 210 110 120 130 140 150 160 pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSL : ::. ::... : :.. . ....:: :.:. :: CCDS53 EEAKKDYERVLELEPNNFEATNELRKISQAL---------------------ASKE-NSY 220 230 240 250 170 180 190 200 210 220 pF1KE1 PSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLL :.: : : : . .... . . ..: :..:: . :.:....::: :.... CCDS53 PKE-----ADIVIKSTEGERKQIEAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIA 260 270 280 290 300 310 230 240 250 260 270 280 pF1KE1 CSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSF .. .. .:::. :: ...: :: ::::.:. :::. ::: ::. :. : . . CCDS53 ADGANALLPANRAMAYLKIQKYEEAEKDCTQAILLDGSYSKAFARRGTARTFLGKLNEAK 320 330 340 350 360 370 290 300 pF1KE1 ADISNLLQIEPRNGPAQKLRQEVKQNLH :. ..: .:: : : ...:. CCDS53 QDFETVLLLEPGNKQAVTELSKIKKKPLKKVIIEETGNLIQTIDVPDSTTAAAPENNPIN 380 390 400 410 420 430 >>CCDS53782.1 RPAP3 gene_id:79657|Hs108|chr12 (506 aa) initn: 336 init1: 218 opt: 270 Z-score: 307.4 bits: 65.7 E(32554): 8.4e-11 Smith-Waterman score: 320; 29.5% identity (57.6% similar) in 264 aa overlap (45-308:3-238) 20 30 40 50 60 70 pF1KE1 AGNESFRNGQYAEASALYGRALRVLQAQGSSDPEEESVLYSNRAACHLKDGNCRDCIKDC .:: . :: .:::. ... . .:: CCDS53 MDADPYNP-VLPTNRASAYFRLKKFAVAESDC 10 20 30 80 90 100 110 120 130 pF1KE1 TSALALVPFSIKPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTSAVEGINRMTRALMD . :.:: : ::..: ::.: : ::. ::... : :.. . ....:: CCDS53 NLAVALNRSYTKAYSRRGAARFALQKLEEAKKDYERVLELEPNNFEATNELRKISQAL-- 40 50 60 70 80 140 150 160 170 180 190 pF1KE1 SLGPEWRLKLPSIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTATKNRVPSAGDVEKAR :.:. :: :.: : : : . .... . . ..: CCDS53 -------------------ASKE-NSYPKE-----ADIVIKSTEGERKQIEAQQNKQQAI 90 100 110 120 200 210 220 230 240 250 pF1KE1 VLKEEGNELVKKGNHKKAIEKYSESLLCSNLESATYSNRALCYLVLKQYTEAVKDCTEAL :..:: . :.:....::: :.... .. .. .:::. :: ...: :: ::::.:. CCDS53 SEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYEEAEKDCTQAI 130 140 150 160 170 180 260 270 280 290 300 pF1KE1 KLDGKNVKAFYRRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKLRQEVKQNLH :::. ::: ::. :. : . . :. ..: .:: : : ...:..: CCDS53 LLDGSYSKAFARRGTARTFLGKLNEAKQDFETVLLLEPGNKQAVTELSKIKKELIEKGHW 190 200 210 220 230 240 CCDS53 DDVFLDSTQRQNVVKPIDNPPHPGSTKPLKKVIIEETGNLIQTIDVPDSTTAAAPENNPI 250 260 270 280 290 300 >>CCDS4348.1 TTC1 gene_id:7265|Hs108|chr5 (292 aa) initn: 175 init1: 175 opt: 266 Z-score: 306.5 bits: 64.8 E(32554): 9.4e-11 Smith-Waterman score: 266; 38.0% identity (66.9% similar) in 121 aa overlap (12-132:119-236) 10 20 30 40 pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA :. :::.:..:.: :: . :.:::.. CCDS43 SSELDEEYLIELEKNMSDEEKQKRREESTRLKEEGNEQFKKGDYIEAESSYSRALEMCP- 90 100 110 120 130 140 50 60 70 80 90 100 pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY : .:.:.:.::::: ..:. . . :.::..:. : : :. .:::: :: .: CCDS43 --SCFQKERSILFSNRAAARMKQDKKEMAINDCSKAIQLNPSYIRAILRRAELYEKTDKL 150 160 170 180 190 200 110 120 130 140 150 160 pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSL : :::..:. : .. .: :. :. . . CCDS43 DEALEDYKSILEKDPSIHQAREACMRLPKQIEERNERLKEEMLGKLKDLGNLVLRPFGLS 210 220 230 240 250 260 170 180 190 200 210 220 pF1KE1 PSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLL CCDS43 TENFQIKQDSSTGSYSINFVQNPNNNR 270 280 290 309 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:23:48 2016 done: Mon Nov 7 01:23:48 2016 Total Scan time: 2.640 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]