FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6228, 260 aa 1>>>pF1KE6228 260 - 260 aa - 260 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1146+/-0.000765; mu= 15.5233+/- 0.046 mean_var=61.3483+/-12.454, 0's: 0 Z-trim(107.7): 27 B-trim: 495 in 1/49 Lambda= 0.163747 statistics sampled from 9720 (9740) to 9720 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.299), width: 16 Scan time: 1.750 The best scores are: opt bits E(32554) CCDS32366.1 HAGH gene_id:3029|Hs108|chr16 ( 260) 1757 423.3 8.3e-119 CCDS10447.2 HAGH gene_id:3029|Hs108|chr16 ( 308) 1757 423.3 9.5e-119 CCDS32354.1 HAGHL gene_id:84264|Hs108|chr16 ( 282) 892 219.0 2.9e-57 CCDS2413.1 PNKD gene_id:25953|Hs108|chr2 ( 361) 647 161.2 9.4e-40 CCDS2411.1 PNKD gene_id:25953|Hs108|chr2 ( 385) 647 161.2 9.9e-40 CCDS66900.1 HAGH gene_id:3029|Hs108|chr16 ( 236) 631 157.3 9e-39 CCDS12622.1 ETHE1 gene_id:23474|Hs108|chr19 ( 254) 252 67.8 8.6e-12 >>CCDS32366.1 HAGH gene_id:3029|Hs108|chr16 (260 aa) initn: 1757 init1: 1757 opt: 1757 Z-score: 2246.5 bits: 423.3 E(32554): 8.3e-119 Smith-Waterman score: 1757; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:1-260) 10 20 30 40 50 60 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD 190 200 210 220 230 240 250 260 pF1KE6 PVTTMRAVRREKDQFKMPRD :::::::::::::::::::: CCDS32 PVTTMRAVRREKDQFKMPRD 250 260 >>CCDS10447.2 HAGH gene_id:3029|Hs108|chr16 (308 aa) initn: 1757 init1: 1757 opt: 1757 Z-score: 2245.4 bits: 423.3 E(32554): 9.5e-119 Smith-Waterman score: 1757; 100.0% identity (100.0% similar) in 260 aa overlap (1-260:49-308) 10 20 30 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP :::::::::::::::::::::::::::::: CCDS10 ACARRGLGPALLGVFCHTDLRKNLTVDEGTMKVEVLPALTDNYMYLVIDDETKEAAIVDP 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE6 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTAD 140 150 160 170 180 190 160 170 180 190 200 210 pF1KE6 EMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPT 200 210 220 230 240 250 220 230 240 250 260 pF1KE6 VPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD 260 270 280 290 300 >>CCDS32354.1 HAGHL gene_id:84264|Hs108|chr16 (282 aa) initn: 890 init1: 562 opt: 892 Z-score: 1141.6 bits: 219.0 E(32554): 2.9e-57 Smith-Waterman score: 892; 49.6% identity (79.8% similar) in 262 aa overlap (1-259:1-261) 10 20 30 40 50 60 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA :::.:.:.: ::::::::.. :.::. :: . :..... . ..::.::.::::::::::: CCDS32 MKVKVIPVLEDNYMYLVIEELTREAVAVDVAVPKRLLEIVGREGVSLTAVLTTHHHWDHA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS :: .:..:. :: : :.:.:: .::....: :. :...:.:: :: ::.::. ::. CCDS32 RGNPELARLRPGLAVLGADERIFSLTRRLAHGEELRFGAIHVRCLLTPGHTAGHMSYFLW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL . .:::.:.::.: :::::. ::.:..: ..: : :: :::.:.:.::::.:..:: CCDS32 EDDCPDPPALFSGDALSVAGCGSCLEGSAQQMYQSLAE-LGTLPPETKVFCGHEHTLSNL 130 140 150 160 170 190 200 210 220 230 240 pF1KE6 KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD .::..::: : .: ::.:::.. :::::::.:: ::::.:: :. :.. .:.. CCDS32 EFAQKVEPCNDHVRAKLSWAKKRDEDDVPTVPSTLGEERLYNPFLRVAEEPVRKFTGKAV 180 190 200 210 220 230 250 260 pF1KE6 PVTTMRAVRREKDQFKM---PRD :. ...:. .:. .:.. :: CCDS32 PADVLEALCKERARFEQAGEPRQPQARALLALQWGLLSAAPHD 240 250 260 270 280 >>CCDS2413.1 PNKD gene_id:25953|Hs108|chr2 (361 aa) initn: 550 init1: 324 opt: 647 Z-score: 827.2 bits: 161.2 E(32554): 9.4e-40 Smith-Waterman score: 648; 40.6% identity (69.2% similar) in 266 aa overlap (1-256:95-359) 10 20 30 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP .:: .:.:.::: ::.:: ... :. ::: CCDS24 GYLFYRQQLRRARNRYPKGHSKTQPRLFNGVKVLPIPVLSDNYSYLIIDTQAQLAVAVDP 70 80 90 100 110 120 40 50 60 70 80 pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGG-DDRIGALTHKI .:. : . .:.:: :...: ::.::::.:::. : . . .:::. .: : ::: . CCDS24 SDPRAVQASIEKEGVTLVAILCTHKHWDHSGGNRDLSRRHRDCRVYGSPQDGIPYLTHPL 130 140 150 160 170 180 90 100 110 120 130 140 pF1KE6 THLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTA : ....:: :... :::: ::.::. :... . : .:.:: ::..:::. .::.: CCDS24 CHQDVVSVGRLQIRALATPGHTQGHLVYLLDGEPYKGPSCLFSGDLLFLSGCGRTFEGNA 190 200 210 220 230 240 150 160 170 180 190 200 pF1KE6 DEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEP . : ..: ::: : :: .. ::::. .:: :: ::: : : ..:. :.... . CCDS24 ETMLSSLDTVLG-LGDDTLLWPGHEYAEENLGFAGVVEPENLARERKMQWVQRQRLERKG 250 260 270 280 290 300 210 220 230 240 250 260 pF1KE6 TVPSTLAEEFTYNPFMRVREKTVQQH-------AGETDP--VTTMRAVRREKDQFKMPRD : ::::.:: .::::.:.. ..:. .:. : . .. .:: ::. : CCDS24 TCPSTLGEERSYNPFLRTHCLALQEALGPGPGPTGDDDYSRAQLLEELRRLKDMHKSK 310 320 330 340 350 360 >>CCDS2411.1 PNKD gene_id:25953|Hs108|chr2 (385 aa) initn: 550 init1: 324 opt: 647 Z-score: 826.8 bits: 161.2 E(32554): 9.9e-40 Smith-Waterman score: 648; 40.6% identity (69.2% similar) in 266 aa overlap (1-256:119-383) 10 20 30 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP .:: .:.:.::: ::.:: ... :. ::: CCDS24 GYLFYRQQLRRARNRYPKGHSKTQPRLFNGVKVLPIPVLSDNYSYLIIDTQAQLAVAVDP 90 100 110 120 130 140 40 50 60 70 80 pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGG-DDRIGALTHKI .:. : . .:.:: :...: ::.::::.:::. : . . .:::. .: : ::: . CCDS24 SDPRAVQASIEKEGVTLVAILCTHKHWDHSGGNRDLSRRHRDCRVYGSPQDGIPYLTHPL 150 160 170 180 190 200 90 100 110 120 130 140 pF1KE6 THLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTA : ....:: :... :::: ::.::. :... . : .:.:: ::..:::. .::.: CCDS24 CHQDVVSVGRLQIRALATPGHTQGHLVYLLDGEPYKGPSCLFSGDLLFLSGCGRTFEGNA 210 220 230 240 250 260 150 160 170 180 190 200 pF1KE6 DEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEP . : ..: ::: : :: .. ::::. .:: :: ::: : : ..:. :.... . CCDS24 ETMLSSLDTVLG-LGDDTLLWPGHEYAEENLGFAGVVEPENLARERKMQWVQRQRLERKG 270 280 290 300 310 320 210 220 230 240 250 260 pF1KE6 TVPSTLAEEFTYNPFMRVREKTVQQH-------AGETDP--VTTMRAVRREKDQFKMPRD : ::::.:: .::::.:.. ..:. .:. : . .. .:: ::. : CCDS24 TCPSTLGEERSYNPFLRTHCLALQEALGPGPGPTGDDDYSRAQLLEELRRLKDMHKSK 330 340 350 360 370 380 >>CCDS66900.1 HAGH gene_id:3029|Hs108|chr16 (236 aa) initn: 631 init1: 631 opt: 631 Z-score: 809.5 bits: 157.3 E(32554): 9e-39 Smith-Waterman score: 631; 100.0% identity (100.0% similar) in 97 aa overlap (1-97:49-145) 10 20 30 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDP :::::::::::::::::::::::::::::: CCDS66 ACARRGLGPALLGVFCHTDLRKNLTVDEGTMKVEVLPALTDNYMYLVIDDETKEAAIVDP 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE6 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 VQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDDRIGALTHKIT 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE6 HLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTAD ::::::: CCDS66 HLSTLQVTPCLWLAAGSSMKGLRMRCVKLCWRSWAGSPRTQESTVATSTPSTTSSLHATW 140 150 160 170 180 190 >>CCDS12622.1 ETHE1 gene_id:23474|Hs108|chr19 (254 aa) initn: 212 init1: 73 opt: 252 Z-score: 325.2 bits: 67.8 E(32554): 8.6e-12 Smith-Waterman score: 252; 27.5% identity (58.0% similar) in 193 aa overlap (4-188:27-213) 10 20 30 pF1KE6 MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVV ... .. .. ::. : :..::...::: CCDS12 MAEAVLRVARRQLSQRGGSGAPILLRQMFEPVSCTFTYLLGDRESREAVLIDPVLETAPR 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 DAA--RKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLK-VYGGDDRIGALTHKITHLST :: .. :..: ...:: : :: :. : .: : . : . . : : : .. CCDS12 DAQLIKELGLRLLYAVNTHCHADHITGSGLLRSLLPGCQSVISRLSGAQADLH-IEDGDS 70 80 90 100 110 100 110 120 130 140 150 pF1KE6 LQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGK--FYEGTADEM .. : . .. :.: :: : . . .. . .::::.:.. :::. : .: : . CCDS12 IRFGRFALETRASPGHTPGCVTFVLN-----DHSMAFTGDALLIRGCGRTDFQQGCAKTL 120 130 140 150 160 170 160 170 180 190 200 pF1KE6 CKALLEVLGRLPPDTRVYCGHEY---TINNLKFARHVEPGNAAIREKLAWAKEKYSIGEP ... : . :: : .: .:.: :..... : ..: CCDS12 YHSVHEKIFTLPGDCLIYPAHDYHGFTVSTVEEERTLNPRLTLSCEEFVKIMGNLNLPKP 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE6 TVPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREKDQFKMPRD CCDS12 QQIDFAVPANMRCGVQTPTA 240 250 260 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:16:22 2016 done: Tue Nov 8 11:16:22 2016 Total Scan time: 1.750 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]