FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1568, 309 aa
1>>>pF1KE1568 309 - 309 aa - 309 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.8013+/-0.000882; mu= 13.5843+/- 0.053
mean_var=79.1676+/-15.638, 0's: 0 Z-trim(106.8): 55 B-trim: 0 in 0/50
Lambda= 0.144145
statistics sampled from 9155 (9210) to 9155 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.666), E-opt: 0.2 (0.283), width: 16
Scan time: 2.640
The best scores are: opt bits E(32554)
CCDS13340.1 TOMM34 gene_id:10953|Hs108|chr20 ( 309) 2007 426.8 1e-119
CCDS34930.1 SPAG1 gene_id:6674|Hs108|chr8 ( 926) 895 195.8 1e-49
CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 ( 665) 289 69.7 6.8e-12
CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 ( 631) 284 68.7 1.3e-11
CCDS53782.1 RPAP3 gene_id:79657|Hs108|chr12 ( 506) 270 65.7 8.4e-11
CCDS4348.1 TTC1 gene_id:7265|Hs108|chr5 ( 292) 266 64.8 9.4e-11
>>CCDS13340.1 TOMM34 gene_id:10953|Hs108|chr20 (309 aa)
initn: 2007 init1: 2007 opt: 2007 Z-score: 2262.9 bits: 426.8 E(32554): 1e-119
Smith-Waterman score: 2007; 100.0% identity (100.0% similar) in 309 aa overlap (1-309:1-309)
10 20 30 40 50 60
pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQAQGSSDPEEESVLYSNRAAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQAQGSSDPEEESVLYSNRAAC
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 HLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 HLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 AVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 AVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTAT
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 KNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLLCSNLESATYSNRALCYLVL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 KNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLLCSNLESATYSNRALCYLVL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 KQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 KQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKL
250 260 270 280 290 300
pF1KE1 RQEVKQNLH
:::::::::
CCDS13 RQEVKQNLH
>>CCDS34930.1 SPAG1 gene_id:6674|Hs108|chr8 (926 aa)
initn: 879 init1: 546 opt: 895 Z-score: 1005.9 bits: 195.8 E(32554): 1e-49
Smith-Waterman score: 895; 48.8% identity (73.2% similar) in 299 aa overlap (12-309:448-739)
10 20 30 40
pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA
:.. ::: ::.::.:::.. :. :. .:.
CCDS34 SPRRASAAAAAGGGATGHPGGGQGAENPAGLKSQGNELFRSGQFAEAAGKYSAAIALLEP
420 430 440 450 460 470
50 60 70 80 90 100
pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY
:: .. :.::::::::.::.::: ::.::. :: : :::.::::::: :::.::.:
CCDS34 AGSEIADDLSILYSNRAACYLKEGNCSGCIQDCNRALELHPFSMKPLLRRAMAYETLEQY
480 490 500 510 520 530
110 120 130 140 150 160
pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQ-KRWNS
:::::::::::: .. : ...::..: ::. ::.:: :: :: ::.:. . :.
CCDS34 GKAYVDYKTVLQIDCGLQLANDSVNRLSRILMELDGPNWREKLSPIPAVPASVPLQAWH-
540 550 560 570 580 590
170 180 190 200 210 220
pF1KE1 LPSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESL
:. ::: .... .... .: . : . ..::::::. :. :.: :. :::: :
CCDS34 -PA---KEMISKQAGDSSS--HRQQGITDEKTFKALKEEGNQCVNDKNYKDALSKYSECL
600 610 620 630 640 650
230 240 250 260 270 280
pF1KE1 LCSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSS
.: : : :.::::::: : :. :: .:: .::.: ::::::::: :::.::.:..:
CCDS34 KINNKECAIYTNRALCYLKLCQFEEAKQDCDQALQLADGNVKAFYRRALAHKGLKNYQKS
660 670 680 690 700 710
290 300
pF1KE1 FADISNLLQIEPRNGPAQKLRQEVKQNLH
. :..... ..: :. .:: . :.
CCDS34 LIDLNKVILLDPSIIEAKMELEEVTRLLNLKDKTAPFNKEKERRKIEIQEVNEGKEEPGR
720 730 740 750 760 770
>>CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 (665 aa)
initn: 463 init1: 218 opt: 289 Z-score: 327.0 bits: 69.7 E(32554): 6.8e-12
Smith-Waterman score: 359; 29.3% identity (57.2% similar) in 297 aa overlap (12-308:136-397)
10 20 30 40
pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA
:. ::. :..:.: :: : ...
CCDS87 DKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAIDCYTKGM-----
110 120 130 140 150 160
50 60 70 80 90 100
pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY
..:: . :: .:::. ... . .::. :.:: : ::..: ::.:
CCDS87 --DADPYNP-VLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFALQKL
170 180 190 200 210
110 120 130 140 150 160
pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSL
: ::. ::... : :.. . ....:: :.:. ::
CCDS87 EEAKKDYERVLELEPNNFEATNELRKISQAL---------------------ASKE-NSY
220 230 240 250
170 180 190 200 210 220
pF1KE1 PSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLL
:.: : : : . .... . . ..: :..:: . :.:....::: :....
CCDS87 PKE-----ADIVIKSTEGERKQIEAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIA
260 270 280 290 300 310
230 240 250 260 270 280
pF1KE1 CSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSF
.. .. .:::. :: ...: :: ::::.:. :::. ::: ::. :. : . .
CCDS87 ADGANALLPANRAMAYLKIQKYEEAEKDCTQAILLDGSYSKAFARRGTARTFLGKLNEAK
320 330 340 350 360 370
290 300
pF1KE1 ADISNLLQIEPRNGPAQKLRQEVKQNLH
:. ..: .:: : : ...:..:
CCDS87 QDFETVLLLEPGNKQAVTELSKIKKELIEKGHWDDVFLDSTQRQNVVKPIDNPPHPGSTK
380 390 400 410 420 430
>>CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 (631 aa)
initn: 463 init1: 218 opt: 284 Z-score: 321.7 bits: 68.7 E(32554): 1.3e-11
Smith-Waterman score: 354; 29.2% identity (56.9% similar) in 295 aa overlap (12-306:136-395)
10 20 30 40
pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA
:. ::. :..:.: :: : ...
CCDS53 DKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAIDCYTKGM-----
110 120 130 140 150 160
50 60 70 80 90 100
pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY
..:: . :: .:::. ... . .::. :.:: : ::..: ::.:
CCDS53 --DADPYNP-VLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFALQKL
170 180 190 200 210
110 120 130 140 150 160
pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSL
: ::. ::... : :.. . ....:: :.:. ::
CCDS53 EEAKKDYERVLELEPNNFEATNELRKISQAL---------------------ASKE-NSY
220 230 240 250
170 180 190 200 210 220
pF1KE1 PSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLL
:.: : : : . .... . . ..: :..:: . :.:....::: :....
CCDS53 PKE-----ADIVIKSTEGERKQIEAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIA
260 270 280 290 300 310
230 240 250 260 270 280
pF1KE1 CSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSF
.. .. .:::. :: ...: :: ::::.:. :::. ::: ::. :. : . .
CCDS53 ADGANALLPANRAMAYLKIQKYEEAEKDCTQAILLDGSYSKAFARRGTARTFLGKLNEAK
320 330 340 350 360 370
290 300
pF1KE1 ADISNLLQIEPRNGPAQKLRQEVKQNLH
:. ..: .:: : : ...:.
CCDS53 QDFETVLLLEPGNKQAVTELSKIKKKPLKKVIIEETGNLIQTIDVPDSTTAAAPENNPIN
380 390 400 410 420 430
>>CCDS53782.1 RPAP3 gene_id:79657|Hs108|chr12 (506 aa)
initn: 336 init1: 218 opt: 270 Z-score: 307.4 bits: 65.7 E(32554): 8.4e-11
Smith-Waterman score: 320; 29.5% identity (57.6% similar) in 264 aa overlap (45-308:3-238)
20 30 40 50 60 70
pF1KE1 AGNESFRNGQYAEASALYGRALRVLQAQGSSDPEEESVLYSNRAACHLKDGNCRDCIKDC
.:: . :: .:::. ... . .::
CCDS53 MDADPYNP-VLPTNRASAYFRLKKFAVAESDC
10 20 30
80 90 100 110 120 130
pF1KE1 TSALALVPFSIKPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTSAVEGINRMTRALMD
. :.:: : ::..: ::.: : ::. ::... : :.. . ....::
CCDS53 NLAVALNRSYTKAYSRRGAARFALQKLEEAKKDYERVLELEPNNFEATNELRKISQAL--
40 50 60 70 80
140 150 160 170 180 190
pF1KE1 SLGPEWRLKLPSIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTATKNRVPSAGDVEKAR
:.:. :: :.: : : : . .... . . ..:
CCDS53 -------------------ASKE-NSYPKE-----ADIVIKSTEGERKQIEAQQNKQQAI
90 100 110 120
200 210 220 230 240 250
pF1KE1 VLKEEGNELVKKGNHKKAIEKYSESLLCSNLESATYSNRALCYLVLKQYTEAVKDCTEAL
:..:: . :.:....::: :.... .. .. .:::. :: ...: :: ::::.:.
CCDS53 SEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYEEAEKDCTQAI
130 140 150 160 170 180
260 270 280 290 300
pF1KE1 KLDGKNVKAFYRRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKLRQEVKQNLH
:::. ::: ::. :. : . . :. ..: .:: : : ...:..:
CCDS53 LLDGSYSKAFARRGTARTFLGKLNEAKQDFETVLLLEPGNKQAVTELSKIKKELIEKGHW
190 200 210 220 230 240
CCDS53 DDVFLDSTQRQNVVKPIDNPPHPGSTKPLKKVIIEETGNLIQTIDVPDSTTAAAPENNPI
250 260 270 280 290 300
>>CCDS4348.1 TTC1 gene_id:7265|Hs108|chr5 (292 aa)
initn: 175 init1: 175 opt: 266 Z-score: 306.5 bits: 64.8 E(32554): 9.4e-11
Smith-Waterman score: 266; 38.0% identity (66.9% similar) in 121 aa overlap (12-132:119-236)
10 20 30 40
pF1KE1 MAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQA
:. :::.:..:.: :: . :.:::..
CCDS43 SSELDEEYLIELEKNMSDEEKQKRREESTRLKEEGNEQFKKGDYIEAESSYSRALEMCP-
90 100 110 120 130 140
50 60 70 80 90 100
pF1KE1 QGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKY
: .:.:.:.::::: ..:. . . :.::..:. : : :. .:::: :: .:
CCDS43 --SCFQKERSILFSNRAAARMKQDKKEMAINDCSKAIQLNPSYIRAILRRAELYEKTDKL
150 160 170 180 190 200
110 120 130 140 150 160
pF1KE1 PMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSL
: :::..:. : .. .: :. :. . .
CCDS43 DEALEDYKSILEKDPSIHQAREACMRLPKQIEERNERLKEEMLGKLKDLGNLVLRPFGLS
210 220 230 240 250 260
170 180 190 200 210 220
pF1KE1 PSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLL
CCDS43 TENFQIKQDSSTGSYSINFVQNPNNNR
270 280 290
309 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 01:23:48 2016 done: Mon Nov 7 01:23:48 2016
Total Scan time: 2.640 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
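
The "best scores" table in the report above (one line per library hit, giving the sequence length in parentheses followed by the opt score, bit score, and E-value) can be pulled into structured records for downstream filtering. The following is a minimal sketch, not part of the FASTA package: the names `Hit`, `HIT_RE`, and `parse_best_scores` are hypothetical, and the regular expression assumes the single-space layout shown in this report rather than any officially documented format.

```python
import re
from typing import Iterable, List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'The best scores are:' table (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row layout: accession, free-text description, "( length)",
# then opt, bits and E-value separated by whitespace.
HIT_RE = re.compile(
    r"^(\S+)\s+(.*?)\s*\(\s*(\d+)\)\s+(\d+)\s+([\d.]+)\s+(\S+)\s*$"
)


def parse_best_scores(lines: Iterable[str]) -> List[Hit]:
    """Collect hit rows between the table header and the first '>>' alignment."""
    hits: List[Hit] = []
    in_table = False
    for line in lines:
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            # First alignment block reached: the table is finished.
            break
        match = HIT_RE.match(line)
        if match:
            acc, desc, length, opt, bits, evalue = match.groups()
            hits.append(
                Hit(acc, desc, int(length), int(opt), float(bits), float(evalue))
            )
    return hits


if __name__ == "__main__":
    sample = [
        "The best scores are: opt bits E(32554)",
        "CCDS13340.1 TOMM34 gene_id:10953|Hs108|chr20 ( 309) 2007 426.8 1e-119",
        "CCDS4348.1 TTC1 gene_id:7265|Hs108|chr5 ( 292) 266 64.8 9.4e-11",
        ">>CCDS13340.1 TOMM34 gene_id:10953|Hs108|chr20 (309 aa)",
    ]
    for hit in parse_best_scores(sample):
        print(hit.accession, hit.length, hit.opt, hit.bits, hit.evalue)
```

To use it on a saved report, pass the file's lines to `parse_best_scores` and filter the resulting list, for example by keeping hits whose `evalue` is below a chosen cutoff.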