FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3586, 205 aa 1>>>pF1KE3586 205 - 205 aa - 205 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6019+/-0.000698; mu= 11.8848+/- 0.042 mean_var=64.2682+/-13.127, 0's: 0 Z-trim(109.2): 15 B-trim: 0 in 0/51 Lambda= 0.159984 statistics sampled from 10737 (10749) to 10737 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.733), E-opt: 0.2 (0.33), width: 16 Scan time: 1.760 The best scores are: opt bits E(32554) CCDS1091.1 EFNA1 gene_id:1942|Hs108|chr1 ( 205) 1420 335.9 1e-92 CCDS1092.1 EFNA1 gene_id:1942|Hs108|chr1 ( 183) 941 225.3 1.8e-59 CCDS4097.1 EFNA5 gene_id:1946|Hs108|chr5 ( 228) 503 124.3 5.9e-29 CCDS1090.1 EFNA3 gene_id:1944|Hs108|chr1 ( 238) 502 124.1 7.2e-29 CCDS12061.1 EFNA2 gene_id:1943|Hs108|chr19 ( 213) 457 113.6 8.7e-26 CCDS1089.1 EFNA4 gene_id:1945|Hs108|chr1 ( 201) 435 108.6 2.8e-24 CCDS41407.1 EFNA4 gene_id:1945|Hs108|chr1 ( 193) 434 108.3 3.2e-24 CCDS44237.1 EFNA4 gene_id:1945|Hs108|chr1 ( 207) 421 105.3 2.7e-23 CCDS14391.1 EFNB1 gene_id:1947|Hs108|chrX ( 346) 244 64.6 8.5e-11 >>CCDS1091.1 EFNA1 gene_id:1942|Hs108|chr1 (205 aa) initn: 1420 init1: 1420 opt: 1420 Z-score: 1777.9 bits: 335.9 E(32554): 1e-92 Smith-Waterman score: 1420; 99.0% identity (99.5% similar) in 205 aa overlap (1-205:1-205) 10 20 30 40 50 60 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPHYEDHSVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPHYEDHSVA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 DAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQRFTPFTLGKEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQRFTPFTLGKEF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHVNPQEKRLAADDPEVRVLHSIA :::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::. CCDS10 KEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHDNPQEKRLAADDPEVRVLHSIG 130 140 150 160 170 180 190 200 pF1KE3 HSAAPRLFPLAWTVLLLPLLLLQTP ::::::::::::::::::::::::: CCDS10 HSAAPRLFPLAWTVLLLPLLLLQTP 190 200 >>CCDS1092.1 EFNA1 gene_id:1942|Hs108|chr1 (183 aa) initn: 939 init1: 939 opt: 941 Z-score: 1181.2 bits: 225.3 E(32554): 1.8e-59 Smith-Waterman score: 1217; 88.3% identity (88.8% similar) in 205 aa overlap (1-205:1-183) 10 20 30 40 50 60 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPHYEDHSVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPHYEDHSVA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 DAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQRFTPFTLGKEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQRFTPFTLGKEF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 KEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHVNPQEKRLAADDPEVRVLHSIA :::::::::: :::::: ::::::::::::::::::::. CCDS10 KEGHSYYYIS----------------------HSPQAHDNPQEKRLAADDPEVRVLHSIG 130 140 150 190 200 pF1KE3 HSAAPRLFPLAWTVLLLPLLLLQTP ::::::::::::::::::::::::: CCDS10 HSAAPRLFPLAWTVLLLPLLLLQTP 160 170 180 >>CCDS4097.1 EFNA5 gene_id:1946|Hs108|chr5 (228 aa) initn: 504 init1: 275 opt: 503 Z-score: 633.3 bits: 124.3 E(32554): 5.9e-29 Smith-Waterman score: 503; 48.1% identity (74.1% similar) in 158 aa overlap (14-170:25-179) 10 20 30 40 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDI : :.:::..:.::::::.:. :: : : .:::.:. CCDS40 MLHVEMLTLVFLVLWMCVFSQDPGSKAVADRYAVYWNSSNPRFQRGDYHIDVCINDYLDV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 ICPHYEDHSVADAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQ .:::::: :: . :.:.::.:. . :. :. :: ::.:::: . .:: :.::::: CCDS40 FCPHYED-SVPEDKTERYVLYMVNFDGYSACDHTSKGFKRWECNRPHSPNGPLKFSEKFQ 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 RFTPFTLGKEFKEGHSYYYISKPIHQHEDR-CLRLKVTVSGKITHSPQAHVNPQEKRLAA ::::.:: ::. :. :.:::. : .. : ::.::: : . :.: . .. ... . . CCDS40 LFTPFSLGFEFRPGREYFYISSAIPDNGRRSCLKLKVFV--RPTNSCMKTIGVHDRVFDV 120 130 140 150 160 170 170 180 190 200 pF1KE3 DDPEVRVLHSIAHSAAPRLFPLAWTVLLLPLLLLQTP .: CCDS40 NDKVENSLEPADDTVHESAEPSRGENAAQTPRIPSRLLAILLFLLAMLLTL 180 190 200 210 220 >>CCDS1090.1 EFNA3 gene_id:1944|Hs108|chr1 (238 aa) initn: 503 init1: 271 opt: 502 Z-score: 631.8 bits: 124.1 E(32554): 7.2e-29 Smith-Waterman score: 510; 39.4% identity (66.5% similar) in 221 aa overlap (7-204:19-238) 10 20 30 40 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVD :::. . : ..::.:.::::: ..: : ::..:..:::.: CCDS10 MAAAPLLLLLLLVPVPLLPLLAQGPGGALGNRHAVYWNSSNQHLRREGYTVQVNVNDYLD 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 IICPHYEDHSVADAA-------MEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGP : ::::.. .:. .: :::.::.: .. :. :. :. ::.:::: : :.: CCDS10 IYCPHYNSSGVGPGAGPGPGGGAEQYVLYMVSRNGYRTCNA-SQGFKRWECNRPHAPHSP 70 80 90 100 110 110 120 130 140 150 pF1KE3 EKLSEKFQRFTPFTLGKEFKEGHSYYYISKPIHQHEDRCLRLKVTVS-GKITHS------ :.::::::.. :.:: ::. :: ::::: : :. . .:::.:: : .. .:: CCDS10 IKFSEKFQRYSAFSLGYEFHAGHEYYYISTPTHNLHWKCLRMKVFVCCASTSHSGEKPVP 120 130 140 150 160 170 160 170 180 190 200 pF1KE3 --PQAHVNPQEK-----RLAADDPEVRVLHSIAHSAAPRL--FPLAWTVLLLPLLLLQTP :: ..:. : . ...:.: :.. ...:. .::: . .. . .: . CCDS10 TLPQFTMGPNVKINVLEDFEGENPQVPKLEKSISGTSPKREHLPLAVGIAFFLMTFLAS 180 190 200 210 220 230 >>CCDS12061.1 EFNA2 gene_id:1943|Hs108|chr19 (213 aa) initn: 470 init1: 362 opt: 457 Z-score: 576.4 bits: 113.6 E(32554): 8.7e-26 Smith-Waterman score: 467; 42.7% identity (62.5% similar) in 192 aa overlap (16-195:32-211) 10 20 30 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFR---NED---YTI : .::..:.:: :::.:. ..: ::. CCDS12 APAQRPLLPLLLLLLPLPPPPFARAEDAARANSDRYAVYWNRSNPRFHAGAGDDGGGYTV 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE3 HVQLNDYVDIICPHYEDHSVADAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKH .:..:::.:: :::: ::.:.::.:. : . :. ... ::.::::.: CCDS12 EVSINDYLDIYCPHYGAPLPPAERMEHYVLYMVNGEGHASCDHRQRGFKRWECNRPAAPG 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE3 GPEKLSEKFQRFTPFTLGKEFKEGHSYYYISKPIHQHEDR-CLRLKVTVSGKITHSPQAH :: :.::::: ::::.:: ::. :: ::::: . :: :::::: . CCDS12 GPLKFSEKFQLFTPFSLGFEFRPGHEYYYISATPPNAVDRPCLRLKV------------Y 130 140 150 160 160 170 180 190 200 pF1KE3 VNPQEKRL-AADDPEVRVLHSIAHSAAPRLF----PLAWTVLLLPLLLLQTP : : .. : : .: .: . .. ::: :. ::.: CCDS12 VRPTNETLYEAPEPIFTSNNSCSSPGGCRLFLSTIPVLWTLLGS 170 180 190 200 210 >>CCDS1089.1 EFNA4 gene_id:1945|Hs108|chr1 (201 aa) initn: 427 init1: 196 opt: 435 Z-score: 549.4 bits: 108.6 E(32554): 2.8e-24 Smith-Waterman score: 435; 43.6% identity (67.3% similar) in 156 aa overlap (4-158:11-162) 10 20 30 40 50 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPH ::: .:: ... ::.:.::::::.. : .... ::::.::.::: CCDS10 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 YEDHSVADAAMEQYILYLVEHEEYQLCQPQS-KDQVRWQCNRPSAKHGPEKLSEKFQRFT :: . .. : . ::.:. :. :: .. . :: :. : : ..:::.:::: CCDS10 YEGPGPPEGP-ETFALYMVDWPGYESCQAEGPRAYKRWVCSLP---FGHVQFSEKIQRFT 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 PFTLGKEFKEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHVNPQEKRLAADDPE ::.:: :: :..::::: : . .::::.:.: : .: .:: CCDS10 PFSLGFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGWR 120 130 140 150 160 170 180 190 200 pF1KE3 VRVLHSIAHSAAPRLFPLAWTVLLLPLLLLQTP CCDS10 GGDTPSPLCLLLLLLLLILRLLRIL 180 190 200 >>CCDS41407.1 EFNA4 gene_id:1945|Hs108|chr1 (193 aa) initn: 389 init1: 196 opt: 434 Z-score: 548.4 bits: 108.3 E(32554): 3.2e-24 Smith-Waterman score: 434; 42.0% identity (66.9% similar) in 169 aa overlap (4-171:11-171) 10 20 30 40 50 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPH ::: .:: ... ::.:.::::::.. : .... ::::.::.::: CCDS41 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 YEDHSVADAAMEQYILYLVEHEEYQLCQPQS-KDQVRWQCNRPSAKHGPEKLSEKFQRFT :: . .. : . ::.:. :. :: .. . :: :. : : ..:::.:::: CCDS41 YEGPGPPEGP-ETFALYMVDWPGYESCQAEGPRAYKRWVCSLP---FGHVQFSEKIQRFT 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 PFTLGKEFKEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHVNPQEKRLAADDPE ::.:: :: :..::::: : . .::::.:.: : . : .: :.: . ...:: CCDS41 PFSLGFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERNLP-SH--PKEPE-SSQDPL 120 130 140 150 160 170 180 190 200 pF1KE3 VRVLHSIAHSAAPRLFPLAWTVLLLPLLLLQTP CCDS41 EEEGSLLPALGVPIQTDKMEH 180 190 >>CCDS44237.1 EFNA4 gene_id:1945|Hs108|chr1 (207 aa) initn: 389 init1: 196 opt: 421 Z-score: 531.7 bits: 105.3 E(32554): 2.7e-23 Smith-Waterman score: 421; 43.9% identity (67.6% similar) in 148 aa overlap (4-150:11-154) 10 20 30 40 50 pF1KE3 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPH ::: .:: ... ::.:.::::::.. : .... ::::.::.::: CCDS44 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 YEDHSVADAAMEQYILYLVEHEEYQLCQPQS-KDQVRWQCNRPSAKHGPEKLSEKFQRFT :: . .. : . ::.:. :. :: .. . :: :. : : ..:::.:::: CCDS44 YEGPGPPEGP-ETFALYMVDWPGYESCQAEGPRAYKRWVCSLP---FGHVQFSEKIQRFT 70 80 90 100 110 120 130 140 150 160 170 pF1KE3 PFTLGKEFKEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHVNPQEKRLAADDPE ::.:: :: :..::::: : . .::::.:.: : CCDS44 PFSLGFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERRARVLPRSPGGGGIPAACTG 120 130 140 150 160 170 180 190 200 pF1KE3 VRVLHSIAHSAAPRLFPLAWTVLLLPLLLLQTP CCDS44 GANSDRQDGALMGEIRGSEVTLAGACPLITG 180 190 200 >>CCDS14391.1 EFNB1 gene_id:1947|Hs108|chrX (346 aa) initn: 221 init1: 91 opt: 244 Z-score: 307.3 bits: 64.6 E(32554): 8.5e-11 Smith-Waterman score: 244; 32.4% identity (56.6% similar) in 173 aa overlap (2-163:11-175) 10 20 30 40 pF1KE3 MEFLWAPLLGLCCSLA---AADRHTVFWNSSNPKF-RNEDYTIHVQLNDYV ..: : .. : :: : . . : :.: :::: .. .:. ...: . CCDS14 MARPGQRWLGKWLVAMVVWALCRLATPLAKNLEPVSWSSLNPKFLSGKGLVIYPKIGDKL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE3 DIICPHYEDHSVADAAMEQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEK :::::. : : .: : ::::. :. :. .: :::: . ... : CCDS14 DIICPRAE----AGRPYEYYKLYLVRPEQAAACSTVLDPNVLVTCNRPEQEI---RFTIK 70 80 90 100 110 110 120 130 140 150 160 pF1KE3 FQRFTPFTLGKEFKEGHSYYYISKP---IHQHEDR----CLRLKVTVSGKITHSPQAHVN ::.:.: .: :::. :.:: : .. :.: : . . :. ..:.: :. CCDS14 FQEFSPNYMGLEFKKHHDYYITSTSNGSLEGLENREGGVCRTRTMKIIMKVGQDPNA-VT 120 130 140 150 160 170 170 180 190 200 pF1KE3 PQEKRLAADDPEVRVLHSIAHSAAPRLFPLAWTVLLLPLLLLQTP :.. CCDS14 PEQLTTSRPSKEADNTVKMATQAPGSRGSLGDSDGKHETVNQEEKSGPGASGGSSGDPDG 180 190 200 210 220 230 205 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:57:03 2016 done: Sun Nov 6 01:57:03 2016 Total Scan time: 1.760 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]