FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5268, 201 aa 1>>>pF1KE5268 201 - 201 aa - 201 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0390+/-0.000527; mu= 16.0944+/- 0.032 mean_var=72.0360+/-14.382, 0's: 0 Z-trim(115.0): 17 B-trim: 0 in 0/54 Lambda= 0.151112 statistics sampled from 15575 (15591) to 15575 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.479), width: 16 Scan time: 2.250 The best scores are: opt bits E(32554) CCDS1089.1 EFNA4 gene_id:1945|Hs108|chr1 ( 201) 1417 316.8 5.7e-87 CCDS44237.1 EFNA4 gene_id:1945|Hs108|chr1 ( 207) 1131 254.4 3.4e-68 CCDS41407.1 EFNA4 gene_id:1945|Hs108|chr1 ( 193) 1128 253.8 5.1e-68 CCDS4097.1 EFNA5 gene_id:1946|Hs108|chr5 ( 228) 480 112.5 2e-25 CCDS1091.1 EFNA1 gene_id:1942|Hs108|chr1 ( 205) 435 102.7 1.6e-22 CCDS12061.1 EFNA2 gene_id:1943|Hs108|chr19 ( 213) 396 94.2 6.1e-20 CCDS1092.1 EFNA1 gene_id:1942|Hs108|chr1 ( 183) 377 90.0 9.5e-19 CCDS1090.1 EFNA3 gene_id:1944|Hs108|chr1 ( 238) 333 80.5 9e-16 >>CCDS1089.1 EFNA4 gene_id:1945|Hs108|chr1 (201 aa) initn: 1417 init1: 1417 opt: 1417 Z-score: 1674.8 bits: 316.8 E(32554): 5.7e-87 Smith-Waterman score: 1417; 100.0% identity (100.0% similar) in 201 aa overlap (1-201:1-201) 10 20 30 40 50 60 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFGHVQFSEKIQRFTPFSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFGHVQFSEKIQRFTPFSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGWRGGDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGWRGGDT 130 140 150 160 170 180 190 200 pF1KE5 PSPLCLLLLLLLLILRLLRIL ::::::::::::::::::::: CCDS10 PSPLCLLLLLLLLILRLLRIL 190 200 >>CCDS44237.1 EFNA4 gene_id:1945|Hs108|chr1 (207 aa) initn: 1143 init1: 1123 opt: 1131 Z-score: 1337.7 bits: 254.4 E(32554): 3.4e-68 Smith-Waterman score: 1131; 94.2% identity (97.1% similar) in 171 aa overlap (1-171:1-169) 10 20 30 40 50 60 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFGHVQFSEKIQRFTPFSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 YEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFGHVQFSEKIQRFTPFSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGWRGGDT ::::::::::::::::::::::::::::::::::::... . : ::: .: CCDS44 GFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERRAR-VLPR-SPGGGGIPAACTGGA 130 140 150 160 170 190 200 pF1KE5 PSPLCLLLLLLLLILRLLRIL CCDS44 NSDRQDGALMGEIRGSEVTLAGACPLITG 180 190 200 >>CCDS41407.1 EFNA4 gene_id:1945|Hs108|chr1 (193 aa) initn: 1140 init1: 1120 opt: 1128 Z-score: 1334.6 bits: 253.8 E(32554): 5.1e-68 Smith-Waterman score: 1128; 97.5% identity (98.2% similar) in 163 aa overlap (1-163:1-162) 10 20 30 40 50 60 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 YEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFGHVQFSEKIQRFTPFSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 YEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFGHVQFSEKIQRFTPFSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGWRGGDT ::::::::::::::::::::::::::::::::::::. : :: CCDS41 GFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERNLPS-HPKEPESSQDPLEEEGSLL 130 140 150 160 170 190 200 pF1KE5 PSPLCLLLLLLLLILRLLRIL CCDS41 PALGVPIQTDKMEH 180 190 >>CCDS4097.1 EFNA5 gene_id:1946|Hs108|chr5 (228 aa) initn: 516 init1: 244 opt: 480 Z-score: 570.1 bits: 112.5 E(32554): 2e-25 Smith-Waterman score: 483; 39.4% identity (63.3% similar) in 226 aa overlap (1-200:4-228) 10 20 30 40 50 pF1KE5 MRLLPLLRTVLWA-AFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDI ...: :. ::: .: .: . . :..:::::::::. ::: ... .:::::. CCDS40 MLHVEMLTLVFLVLWMCVFSQDPGSKAVADRYAVYWNSSNPRFQRGDYHIDVCINDYLDV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 VCPHYEGPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPF---GHVQFSEKIQ ::::: : . : ..::::.. :: .:. . ...::: :. : : ..::::.: CCDS40 FCPHYEDSVPEDKTERYVLYMVNFDGYSACDHTS-KGFKRWECNRPHSPNGPLKFSEKFQ 70 80 90 100 110 120 130 140 150 pF1KE5 RFTPFSLGFEFLPGETYYYISVPTPESSGQ-CLRLQVSV-----CCK------------E :::::::::: ::. :.::: :... . ::.:.: : : : . CCDS40 LFTPFSLGFEFRPGREYFYISSAIPDNGRRSCLKLKVFVRPTNSCMKTIGVHDRVFDVND 120 130 140 150 160 170 160 170 180 190 200 pF1KE5 RKSESAHPVGSP-GESG--TSGWRGGDTPS-PLCLLLLLLLLILRLLRIL . .: .:. . ::. . : ...:: : :: .::.:. :: . CCDS40 KVENSLEPADDTVHESAEPSRGENAAQTPRIPSRLLAILLFLLAMLLTL 180 190 200 210 220 >>CCDS1091.1 EFNA1 gene_id:1942|Hs108|chr1 (205 aa) initn: 427 init1: 196 opt: 435 Z-score: 517.7 bits: 102.7 E(32554): 1.6e-22 Smith-Waterman score: 435; 43.6% identity (67.3% similar) in 156 aa overlap (11-162:4-158) 10 20 30 40 50 60 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH ::: .:: ... ::.:.::::::.. : .... ::::.::.::: CCDS10 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPH 10 20 30 40 50 70 80 90 100 110 pF1KE5 YEGPGPPEGP-ETFALYMVDWPGYESCQAEGPRAYKRWVCSLP---FGHVQFSEKIQRFT :: . .. : . ::.:. :. :: .. . :: :. : : ..:::.:::: CCDS10 YEDHSVADAAMEQYILYLVEHEEYQLCQPQS-KDQVRWQCNRPSAKHGPEKLSEKFQRFT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 PFSLGFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGWR ::.:: :: :..::::: : . .::::.:.: : .: .:: CCDS10 PFTLGKEFKEGHSYYYISKPIHQHEDRCLRLKVTVSGKITHSPQAHDNPQEKRLAADDPE 120 130 140 150 160 170 180 190 200 pF1KE5 GGDTPSPLCLLLLLLLLILRLLRIL CCDS10 VRVLHSIGHSAAPRLFPLAWTVLLLPLLLLQTP 180 190 200 >>CCDS12061.1 EFNA2 gene_id:1943|Hs108|chr19 (213 aa) initn: 440 init1: 172 opt: 396 Z-score: 471.5 bits: 94.2 E(32554): 6.1e-20 Smith-Waterman score: 445; 44.4% identity (65.2% similar) in 187 aa overlap (24-198:33-210) 10 20 30 40 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDA------VVE .: :..:::: ::::. : . .:: CCDS12 PAQRPLLPLLLLLLPLPPPPFARAEDAARANSDRYAVYWNRSNPRFHAGAGDDGGGYTVE 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE5 LGLNDYLDIVCPHYEGPGPP-EGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPF--- ...:::::: :::: .: :: : : ..::::. :. ::. . :..::: :. : CCDS12 VSINDYLDIYCPHYGAPLPPAERMEHYVLYMVNGEGHASCDHR-QRGFKRWECNRPAAPG 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE5 GHVQFSEKIQRFTPFSLGFEFLPGETYYYISVPTPESSGQ-CLRLQVSVC-CKERKSESA : ..::::.: :::::::::: ::. :::::. :.. . ::::.: : .: :. CCDS12 GPLKFSEKFQLFTPFSLGFEFRPGHEYYYISATPPNAVDRPCLRLKVYVRPTNETLYEAP 130 140 150 160 170 180 170 180 190 200 pF1KE5 HPVGSPGESGTSGWRGGDTPSPLCLLLLLLLLILRLLRIL .:. . ..: .: :: : :.: . .: : CCDS12 EPIFTSNNSCSSP--GG------CRLFLSTIPVLWTLLGS 190 200 210 >>CCDS1092.1 EFNA1 gene_id:1942|Hs108|chr1 (183 aa) initn: 377 init1: 196 opt: 377 Z-score: 450.0 bits: 90.0 E(32554): 9.5e-19 Smith-Waterman score: 377; 43.0% identity (66.9% similar) in 142 aa overlap (11-147:4-144) 10 20 30 40 50 60 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLDIVCPH ::: .:: ... ::.:.::::::.. : .... ::::.::.::: CCDS10 MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPH 10 20 30 40 50 70 80 90 100 110 pF1KE5 YEGPGPPEGP-ETFALYMVDWPGYESCQAEGPRAYKRWVCSLP---FGHVQFSEKIQRFT :: . .. : . ::.:. :. :: .. . :: :. : : ..:::.:::: CCDS10 YEDHSVADAAMEQYILYLVEHEEYQLCQPQS-KDQVRWQCNRPSAKHGPEKLSEKFQRFT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 PFSLGFEFLPGETYYYIS-VPTPESSGQCLRLQVSVCCKERKSESAHPVGSPGESGTSGW ::.:: :: :..::::: : ... : :: CCDS10 PFTLGKEFKEGHSYYYISHSPQAHDNPQEKRLAADDPEVRVLHSIGHSAAPRLFPLAWTV 120 130 140 150 160 170 180 190 200 pF1KE5 RGGDTPSPLCLLLLLLLLILRLLRIL CCDS10 LLLPLLLLQTP 180 >>CCDS1090.1 EFNA3 gene_id:1944|Hs108|chr1 (238 aa) initn: 467 init1: 218 opt: 333 Z-score: 396.6 bits: 80.5 E(32554): 9e-16 Smith-Waterman score: 474; 45.1% identity (67.6% similar) in 173 aa overlap (3-164:8-178) 10 20 30 40 50 pF1KE5 MRLLPLLRTVLWAAFLGSPLRGGSSLRHVVYWNSSNPRLLRGDAVVELGLNDYLD :: :: : .:.. :. . ::.::::::: .: : .:....::::: CCDS10 MAAAPLLLLLLLVPVPLLPLLAQGPGGALGNRHAVYWNSSNQHLRREGYTVQVNVNDYLD 10 20 30 40 50 60 60 70 80 90 100 pF1KE5 IVCPHYE--------GPGPPEGPETFALYMVDWPGYESCQAEGPRAYKRWVCSLPFG-H- : ::::. :::: : : ..::::. ::..:.: ...::: :. : . : CCDS10 IYCPHYNSSGVGPGAGPGPGGGAEQYVLYMVSRNGYRTCNA--SQGFKRWECNRPHAPHS 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 -VQFSEKIQRFTPFSLGFEFLPGETYYYISVPTPESSGQCLRLQVSVCCKERKSESAHPV ..::::.::.. ::::.:: :. :::::.:: . .:::..: ::: . . .:: CCDS10 PIKFSEKFQRYSAFSLGYEFHAGHEYYYISTPTHNLHWKCLRMKVFVCCASTSHSGEKPV 120 130 140 150 160 170 170 180 190 200 pF1KE5 GSPGESGTSGWRGGDTPSPLCLLLLLLLLILRLLRIL CCDS10 PTLPQFTMGPNVKINVLEDFEGENPQVPKLEKSISGTSPKREHLPLAVGIAFFLMTFLAS 180 190 200 210 220 230 201 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 23:17:30 2016 done: Mon Nov 7 23:17:31 2016 Total Scan time: 2.250 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]