FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6228, 260 aa
1>>>pF1KE6228 260 - 260 aa - 260 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1146+/-0.000765; mu= 15.5233+/- 0.046
mean_var=61.3483+/-12.454, 0's: 0 Z-trim(107.7): 27 B-trim: 495 in 1/49
Lambda= 0.163747
statistics sampled from 9720 (9740) to 9720 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.299), width: 16
Scan time: 1.750
The best scores are: opt bits E(32554)
CCDS32366.1 HAGH gene_id:3029|Hs108|chr16 ( 260) 1757 423.3 8.3e-119
CCDS10447.2 HAGH gene_id:3029|Hs108|chr16 ( 308) 1757 423.3 9.5e-119
CCDS32354.1 HAGHL gene_id:84264|Hs108|chr16 ( 282) 892 219.0 2.9e-57
CCDS2413.1 PNKD gene_id:25953|Hs108|chr2 ( 361) 647 161.2 9.4e-40
CCDS2411.1 PNKD gene_id:25953|Hs108|chr2 ( 385) 647 161.2 9.9e-40
CCDS66900.1 HAGH gene_id:3029|Hs108|chr16 ( 236) 631 157.3 9e-39
CCDS12622.1 ETHE1 gene_id:23474|Hs108|chr19 ( 254) 252 67.8 8.6e-12
>>CCDS32366.1 HAGH gene_id:3029|Hs108|chr16 (260 aa)
initn: 1757 init1: 1757 opt: 1757 Z-score: 2246.5 bits: 423.3 E(32554): 8.3e-119
Smith-Waterman score: 1757; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:1-260)
10 20 30 40 50 60
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD
190 200 210 220 230 240
250 260
pF1KE6 PVTTMRAVRREKDQFKMPRD
::::::::::::::::::::
CCDS32 PVTTMRAVRREKDQFKMPRD
250 260
>>CCDS10447.2 HAGH gene_id:3029|Hs108|chr16 (308 aa)
initn: 1757 init1: 1757 opt: 1757 Z-score: 2245.4 bits: 423.3 E(32554): 9.5e-119
Smith-Waterman score: 1757; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:49-308)
10 20 30
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP
::::::::::::::::::::::::::::::
CCDS10 ACARRGLGPALLGVFCHTDLRKNLTVDEGTMKVEVLPALTDNYMYLVIDDETKEAAIVDP
20 30 40 50 60 70
40 50 60 70 80 90
pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT
80 90 100 110 120 130
100 110 120 130 140 150
pF1KE6 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTAD
140 150 160 170 180 190
160 170 180 190 200 210
pF1KE6 EMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 EMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPT
200 210 220 230 240 250
220 230 240 250 260
pF1KE6 VPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 VPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD
260 270 280 290 300
>>CCDS32354.1 HAGHL gene_id:84264|Hs108|chr16 (282 aa)
initn: 890 init1: 562 opt: 892 Z-score: 1141.6 bits: 219.0 E(32554): 2.9e-57
Smith-Waterman score: 892; 49.6% identity (79.8% similar) in 262 aa overlap (1-259:1-261)
10 20 30 40 50 60
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA
:::.:.:.: ::::::::.. :.::. :: . :..... . ..::.::.:::::::::::
CCDS32 MKVKVIPVLEDNYMYLVIEELTREAVAVDVAVPKRLLEIVGREGVSLTAVLTTHHHWDHA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS
:: .:..:. :: : :.:.:: .::....: :. :...:.:: :: ::.::. ::.
CCDS32 RGNPELARLRPGLAVLGADERIFSLTRRLAHGEELRFGAIHVRCLLTPGHTAGHMSYFLW
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL
. .:::.:.::.: :::::. ::.:..: ..: : :: :::.:.:.::::.:..::
CCDS32 EDDCPDPPALFSGDALSVAGCGSCLEGSAQQMYQSLAE-LGTLPPETKVFCGHEHTLSNL
130 140 150 160 170
190 200 210 220 230 240
pF1KE6 KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD
.::..::: : .: ::.:::.. :::::::.:: ::::.:: :. :.. .:..
CCDS32 EFAQKVEPCNDHVRAKLSWAKKRDEDDVPTVPSTLGEERLYNPFLRVAEEPVRKFTGKAV
180 190 200 210 220 230
250 260
pF1KE6 PVTTMRAVRREKDQFKM---PRD
:. ...:. .:. .:.. ::
CCDS32 PADVLEALCKERARFEQAGEPRQPQARALLALQWGLLSAAPHD
240 250 260 270 280
>>CCDS2413.1 PNKD gene_id:25953|Hs108|chr2 (361 aa)
initn: 550 init1: 324 opt: 647 Z-score: 827.2 bits: 161.2 E(32554): 9.4e-40
Smith-Waterman score: 648; 40.6% identity (69.2% similar) in 266 aa overlap (1-256:95-359)
10 20 30
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP
.:: .:.:.::: ::.:: ... :. :::
CCDS24 GYLFYRQQLRRARNRYPKGHSKTQPRLFNGVKVLPIPVLSDNYSYLIIDTQAQLAVAVDP
70 80 90 100 110 120
40 50 60 70 80
pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGG-DDRIGALTHKI
.:. : . .:.:: :...: ::.::::.:::. : . . .:::. .: : ::: .
CCDS24 SDPRAVQASIEKEGVTLVAILCTHKHWDHSGGNRDLSRRHRDCRVYGSPQDGIPYLTHPL
130 140 150 160 170 180
90 100 110 120 130 140
pF1KE6 THLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTA
: ....:: :... :::: ::.::. :... . : .:.:: ::..:::. .::.:
CCDS24 CHQDVVSVGRLQIRALATPGHTQGHLVYLLDGEPYKGPSCLFSGDLLFLSGCGRTFEGNA
190 200 210 220 230 240
150 160 170 180 190 200
pF1KE6 DEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEP
. : ..: ::: : :: .. ::::. .:: :: ::: : : ..:. :.... .
CCDS24 ETMLSSLDTVLG-LGDDTLLWPGHEYAEENLGFAGVVEPENLARERKMQWVQRQRLERKG
250 260 270 280 290 300
210 220 230 240 250 260
pF1KE6 TVPSTLAEEFTYNPFMRVREKTVQQH-------AGETDP--VTTMRAVRREKDQFKMPRD
: ::::.:: .::::.:.. ..:. .:. : . .. .:: ::. :
CCDS24 TCPSTLGEERSYNPFLRTHCLALQEALGPGPGPTGDDDYSRAQLLEELRRLKDMHKSK
310 320 330 340 350 360
>>CCDS2411.1 PNKD gene_id:25953|Hs108|chr2 (385 aa)
initn: 550 init1: 324 opt: 647 Z-score: 826.8 bits: 161.2 E(32554): 9.9e-40
Smith-Waterman score: 648; 40.6% identity (69.2% similar) in 266 aa overlap (1-256:119-383)
10 20 30
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP
.:: .:.:.::: ::.:: ... :. :::
CCDS24 GYLFYRQQLRRARNRYPKGHSKTQPRLFNGVKVLPIPVLSDNYSYLIIDTQAQLAVAVDP
90 100 110 120 130 140
40 50 60 70 80
pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGG-DDRIGALTHKI
.:. : . .:.:: :...: ::.::::.:::. : . . .:::. .: : ::: .
CCDS24 SDPRAVQASIEKEGVTLVAILCTHKHWDHSGGNRDLSRRHRDCRVYGSPQDGIPYLTHPL
150 160 170 180 190 200
90 100 110 120 130 140
pF1KE6 THLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTA
: ....:: :... :::: ::.::. :... . : .:.:: ::..:::. .::.:
CCDS24 CHQDVVSVGRLQIRALATPGHTQGHLVYLLDGEPYKGPSCLFSGDLLFLSGCGRTFEGNA
210 220 230 240 250 260
150 160 170 180 190 200
pF1KE6 DEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEP
. : ..: ::: : :: .. ::::. .:: :: ::: : : ..:. :.... .
CCDS24 ETMLSSLDTVLG-LGDDTLLWPGHEYAEENLGFAGVVEPENLARERKMQWVQRQRLERKG
270 280 290 300 310 320
210 220 230 240 250 260
pF1KE6 TVPSTLAEEFTYNPFMRVREKTVQQH-------AGETDP--VTTMRAVRREKDQFKMPRD
: ::::.:: .::::.:.. ..:. .:. : . .. .:: ::. :
CCDS24 TCPSTLGEERSYNPFLRTHCLALQEALGPGPGPTGDDDYSRAQLLEELRRLKDMHKSK
330 340 350 360 370 380
>>CCDS66900.1 HAGH gene_id:3029|Hs108|chr16 (236 aa)
initn: 631 init1: 631 opt: 631 Z-score: 809.5 bits: 157.3 E(32554): 9e-39
Smith-Waterman score: 631; 100.0% identity (100.0% similar) in 97 aa overlap (1-97:49-145)
10 20 30
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP
::::::::::::::::::::::::::::::
CCDS66 ACARRGLGPALLGVFCHTDLRKNLTVDEGTMKVEVLPALTDNYMYLVIDDETKEAAIVDP
20 30 40 50 60 70
40 50 60 70 80 90
pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT
80 90 100 110 120 130
100 110 120 130 140 150
pF1KE6 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTAD
:::::::
CCDS66 HLSTLQVTPCLWLAAGSSMKGLRMRCVKLCWRSWAGSPRTQESTVATSTPSTTSSLHATW
140 150 160 170 180 190
>>CCDS12622.1 ETHE1 gene_id:23474|Hs108|chr19 (254 aa)
initn: 212 init1: 73 opt: 252 Z-score: 325.2 bits: 67.8 E(32554): 8.6e-12
Smith-Waterman score: 252; 27.5% identity (58.0% similar) in 193 aa overlap (4-188:27-213)
10 20 30
pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVV
... .. .. ::. : :..::...:::
CCDS12 MAEAVLRVARRQLSQRGGSGAPILLRQMFEPVSCTFTYLLGDRESREAVLIDPVLETAPR
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE6 DAA--RKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLK-VYGGDDRIGALTHKITHLST
:: .. :..: ...:: : :: :. : .: : . : . . : : : ..
CCDS12 DAQLIKELGLRLLYAVNTHCHADHITGSGLLRSLLPGCQSVISRLSGAQADLH-IEDGDS
70 80 90 100 110
100 110 120 130 140 150
pF1KE6 LQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGK--FYEGTADEM
.. : . .. :.: :: : . . .. . .::::.:.. :::. : .: : .
CCDS12 IRFGRFALETRASPGHTPGCVTFVLN-----DHSMAFTGDALLIRGCGRTDFQQGCAKTL
120 130 140 150 160 170
160 170 180 190 200
pF1KE6 CKALLEVLGRLPPDTRVYCGHEY---TINNLKFARHVEPGNAAIREKLAWAKEKYSIGEP
... : . :: : .: .:.: :..... : ..:
CCDS12 YHSVHEKIFTLPGDCLIYPAHDYHGFTVSTVEEERTLNPRLTLSCEEFVKIMGNLNLPKP
180 190 200 210 220 230
210 220 230 240 250 260
pF1KE6 TVPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD
CCDS12 QQIDFAVPANMRCGVQTPTA
240 250
260 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 11:16:22 2016 done: Tue Nov 8 11:16:22 2016
Total Scan time: 1.750 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]