FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4122, 505 aa
  1>>>pF1KE4122 505 - 505 aa - 505 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.1048+/-0.000932; mu= 14.3541+/- 0.056
 mean_var=81.7034+/-16.329, 0's: 0 Z-trim(106.6): 43 B-trim: 0 in 0/50
 Lambda= 0.141891
 statistics sampled from 9012 (9055) to 9012 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.665), E-opt: 0.2 (0.278), width: 16
 Scan time: 2.640

The best scores are:                                      opt bits E(32554)
CCDS4832.1 TBC1D22B gene_id:55633|Hs108|chr6       ( 505) 3406 707.1 1.1e-203
CCDS63512.1 TBC1D22A gene_id:25771|Hs108|chr22     ( 470) 1805 379.4 4.8e-105
CCDS74877.1 TBC1D22A gene_id:25771|Hs108|chr22     ( 487) 1805 379.4 4.9e-105
CCDS14078.1 TBC1D22A gene_id:25771|Hs108|chr22     ( 517) 1805 379.4 5.2e-105
CCDS63511.1 TBC1D22A gene_id:25771|Hs108|chr22     ( 439) 1655 348.7 7.8e-96

>>CCDS4832.1 TBC1D22B gene_id:55633|Hs108|chr6 (505 aa)
 initn: 3406 init1: 3406 opt: 3406 Z-score: 3770.2 bits: 707.1 E(32554): 1.1e-203
Smith-Waterman score: 3406; 100.0% identity (100.0% similar) in 505 aa overlap (1-505:1-505)

               10        20        30        40        50        60
pF1KE4 MAAENSKQFWKRSAKLPGSIQPVYGAQHPPLDPRLTKNFIKERSKVNTVPLKNKKASSFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MAAENSKQFWKRSAKLPGSIQPVYGAQHPPLDPRLTKNFIKERSKVNTVPLKNKKASSFH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 EFARNTSDAWDIGDDEEEDFSSPSFQTLNSKVALATAAQVLENHSKLRVKPERSQSTTSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 EFARNTSDAWDIGDDEEEDFSSPSFQTLNSKVALATAAQVLENHSKLRVKPERSQSTTSD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 VPANYKVIKSSSDAQLSRNSSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGAPPMTV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 VPANYKVIKSSSDAQLSRNSSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGAPPMTV
              130       140       150       160       170
180 190 200 210 220 230 240 pF1KE4 REKTRLEKFRQLLSSQNTDLDELRKCSWPGVPREVRPITWRLLSGYLPANTERRKLTLQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 REKTRLEKFRQLLSSQNTDLDELRKCSWPGVPREVRPITWRLLSGYLPANTERRKLTLQR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 KREEYFGFIEQYYDSRNEEHHQDTYRQIHIDIPRTNPLIPLFQQPLVQEIFERILFIWAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 KREEYFGFIEQYYDSRNEEHHQDTYRQIHIDIPRTNPLIPLFQQPLVQEIFERILFIWAI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 RHPASGYVQGINDLVTPFFVVFLSEYVEEDVENFDVTNLSQDMLRSIEADSFWCMSKLLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 RHPASGYVQGINDLVTPFFVVFLSEYVEEDVENFDVTNLSQDMLRSIEADSFWCMSKLLD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 GIQDNYTFAQPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLLMRELPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 GIQDNYTFAQPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLLMRELPL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 RCTIRLWDTYQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQGLLMLLQNLPTIHWGNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS48 RCTIRLWDTYQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQGLLMLLQNLPTIHWGNE 430 440 450 460 470 480 490 500 pF1KE4 EIGLLLAEAYRLKYMFADAPNHYRR ::::::::::::::::::::::::: CCDS48 EIGLLLAEAYRLKYMFADAPNHYRR 490 500 >>CCDS63512.1 TBC1D22A gene_id:25771|Hs108|chr22 (470 aa) initn: 1912 init1: 1454 opt: 1805 Z-score: 1999.4 bits: 379.4 E(32554): 4.8e-105 Smith-Waterman score: 1964; 62.0% identity (82.2% similar) in 471 aa overlap (48-505:3-470) 20 30 40 50 60 70 pF1KE4 GSIQPVYGAQHPPLDPRLTKNFIKERSKVNTVPLKNKKASSFHEFARNTSDAWDIGDDEE :.:.: :..:.:.:: ::::::: :.:.. CCDS63 MPTTPVKAKRVSTFQEFESNTSDAWDAGEDDD 10 20 30 80 90 100 110 120 pF1KE4 EDFSSPSFQTLNSKVALATAAQVLENHSK------------LRVKPERSQSTTSDVPANY : .. . ..:::.:.. :: .::.:::. :. ::. : .. 
CCDS63 ELLAMAA-ESLNSEVVMETANRVLRNHSQRQGRPTLQEGPGLQQKPRPEAEPPSPPSGDL 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE4 KVIKSSSDAQLSRNSSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGAPPMTVREKTR ...:: :... : . .. ::...:::: . : . :: .. .. .. :: .: CCDS63 RLVKSVSESHTSCPAESASDAAPLQRSQSLPHSATVTL-GGTSDPSTLSSSALSEREASR 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE4 LEKFRQLLSSQNTDLDELRKCSWPGVPREVRPITWRLLSGYLPANTERRKLTLQRKREEY :.::.:::.. ::::.:::. :: :.:. :::.::.:::::::::..:: :::::..:: CCDS63 LDKFKQLLAGPNTDLEELRRLSWSGIPKPVRPMTWKLLSGYLPANVDRRPATLQRKQKEY 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE4 FGFIEQYYDSRNEEHHQDTYRQIHIDIPRTNPLIPLFQQPLVQEIFERILFIWAIRHPAS :.:::.::::::.: :::::::::::::: .: :. :: : ::::::::::::::::: CCDS63 FAFIEHYYDSRNDEVHQDTYRQIHIDIPRMSPEA-LILQPKVTEIFERILFIWAIRHPAS 220 230 240 250 260 310 320 330 340 350 360 pF1KE4 GYVQGINDLVTPFFVVFLSEYVE-EDVENFDVTNLSQDMLRSIEADSFWCMSKLLDGIQD :::::::::::::::::. ::.: :.:.. ::... ..: .::::..:::::::::::: CCDS63 GYVQGINDLVTPFFVVFICEYIEAEEVDTVDVSGVPAEVLCNIEADTYWCMSKLLDGIQD 270 280 290 300 310 320 370 380 390 400 410 420 pF1KE4 NYTFAQPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLLMRELPLRCTI :::::::::: ::: ::::::::::::: :. ..::.::::::::::::::::.:::::: CCDS63 NYTFAQPGIQMKVKMLEELVSRIDEQVHRHLDQHEVRYLQFAFRWMNNLLMREVPLRCTI 330 340 350 360 370 380 430 440 450 460 470 480 pF1KE4 RLWDTYQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQGLLMLLQNLPTIHWGNEEIGL ::::::::::.::::::::::::::..::::::.:.::: ::..:::::: :: .:.:.: CCDS63 RLWDTYQSEPDGFSHFHLYVCAAFLVRWRKEILEEKDFQELLLFLQNLPTAHWDDEDISL 390 400 410 420 430 440 490 500 pF1KE4 LLAEAYRLKYMFADAPNHYRR :::::::::. ::::::::.. CCDS63 LLAEAYRLKFAFADAPNHYKK 450 460 470 >>CCDS74877.1 TBC1D22A gene_id:25771|Hs108|chr22 (487 aa) initn: 1912 init1: 1454 opt: 1805 Z-score: 1999.2 bits: 379.4 E(32554): 4.9e-105 Smith-Waterman score: 1975; 60.9% identity (81.9% similar) in 481 aa overlap (38-505:10-487) 10 20 30 40 50 60 pF1KE4 QFWKRSAKLPGSIQPVYGAQHPPLDPRLTKNFIKERSKVNTVPLKNKKASSFHEFARNTS .... .:. 
:.:.: :..:.:.:: ::: CCDS74 MWEPQPDVGSLLRSTAKMPTTPVKAKRVSTFQEFESNTS 10 20 30 70 80 90 100 110 pF1KE4 DAWDIGDDEEEDFSSPSFQTLNSKVALATAAQVLENHSK------------LRVKPERSQ :::: :.:..: .. . ..:::.:.. :: .::.:::. :. ::. CCDS74 DAWDAGEDDDELLAMAA-ESLNSEVVMETANRVLRNHSQRQGRPTLQEGPGLQQKPRPEA 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE4 STTSDVPANYKVIKSSSDAQLSRNSSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGA : .. ...:: :... : . .. ::...:::: . : . :: .. .. CCDS74 EPPSPPSGDLRLVKSVSESHTSCPAESASDAAPLQRSQSLPHSATVTL-GGTSDPSTLSS 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE4 PPMTVREKTRLEKFRQLLSSQNTDLDELRKCSWPGVPREVRPITWRLLSGYLPANTERRK .. :: .::.::.:::.. ::::.:::. :: :.:. :::.::.:::::::::..:: CCDS74 SALSEREASRLDKFKQLLAGPNTDLEELRRLSWSGIPKPVRPMTWKLLSGYLPANVDRRP 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 LTLQRKREEYFGFIEQYYDSRNEEHHQDTYRQIHIDIPRTNPLIPLFQQPLVQEIFERIL :::::..:::.:::.::::::.: :::::::::::::: .: :. :: : ::::::: CCDS74 ATLQRKQKEYFAFIEHYYDSRNDEVHQDTYRQIHIDIPRMSPEA-LILQPKVTEIFERIL 220 230 240 250 260 270 300 310 320 330 340 350 pF1KE4 FIWAIRHPASGYVQGINDLVTPFFVVFLSEYVE-EDVENFDVTNLSQDMLRSIEADSFWC :::::::::::::::::::::::::::. ::.: :.:.. ::... ..: .::::..:: CCDS74 FIWAIRHPASGYVQGINDLVTPFFVVFICEYIEAEEVDTVDVSGVPAEVLCNIEADTYWC 280 290 300 310 320 330 360 370 380 390 400 410 pF1KE4 MSKLLDGIQDNYTFAQPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLL :::::::::::::::::::: ::: ::::::::::::: :. ..::.::::::::::::: CCDS74 MSKLLDGIQDNYTFAQPGIQMKVKMLEELVSRIDEQVHRHLDQHEVRYLQFAFRWMNNLL 340 350 360 370 380 390 420 430 440 450 460 470 pF1KE4 MRELPLRCTIRLWDTYQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQGLLMLLQNLPT :::.::::::::::::::::.::::::::::::::..::::::.:.::: ::..:::::: CCDS74 MREVPLRCTIRLWDTYQSEPDGFSHFHLYVCAAFLVRWRKEILEEKDFQELLLFLQNLPT 400 410 420 430 440 450 480 490 500 pF1KE4 IHWGNEEIGLLLAEAYRLKYMFADAPNHYRR :: .:.:.::::::::::. ::::::::.. 
CCDS74 AHWDDEDISLLLAEAYRLKFAFADAPNHYKK 460 470 480 >>CCDS14078.1 TBC1D22A gene_id:25771|Hs108|chr22 (517 aa) initn: 2100 init1: 1454 opt: 1805 Z-score: 1998.8 bits: 379.4 E(32554): 5.2e-105 Smith-Waterman score: 2145; 61.5% identity (82.1% similar) in 520 aa overlap (1-505:1-517) 10 20 30 40 50 pF1KE4 MAAENS-KQFWKRS-AKLPGSIQPVYGAQHPPLDPRLTKNFIKERSKVNTVPLKNKKASS ::.... ::::::: .::::::: ::::::::.:: : .... .:. :.:.: :..:. CCDS14 MASDGARKQFWKRSNSKLPGSIQHVYGAQHPPFDPLLHGTLLRSTAKMPTTPVKAKRVST 10 20 30 40 50 60 60 70 80 90 100 pF1KE4 FHEFARNTSDAWDIGDDEEEDFSSPSFQTLNSKVALATAAQVLENHSK------------ :.:: ::::::: :.:..: .. . ..:::.:.. :: .::.:::. CCDS14 FQEFESNTSDAWDAGEDDDELLAMAA-ESLNSEVVMETANRVLRNHSQRQGRPTLQEGPG 70 80 90 100 110 110 120 130 140 150 160 pF1KE4 LRVKPERSQSTTSDVPANYKVIKSSSDAQLSRNSSDTCLRNPLHKQQSLPLRPIIPLVAR :. ::. : .. ...:: :... : . .. ::...:::: . : . CCDS14 LQQKPRPEAEPPSPPSGDLRLVKSVSESHTSCPAESASDAAPLQRSQSLPHSATVTL-GG 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 ISDQNASGAPPMTVREKTRLEKFRQLLSSQNTDLDELRKCSWPGVPREVRPITWRLLSGY :: .. .. .. :: .::.::.:::.. ::::.:::. :: :.:. :::.::.::::: CCDS14 TSDPSTLSSSALSEREASRLDKFKQLLAGPNTDLEELRRLSWSGIPKPVRPMTWKLLSGY 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 LPANTERRKLTLQRKREEYFGFIEQYYDSRNEEHHQDTYRQIHIDIPRTNPLIPLFQQPL ::::..:: :::::..:::.:::.::::::.: :::::::::::::: .: :. :: CCDS14 LPANVDRRPATLQRKQKEYFAFIEHYYDSRNDEVHQDTYRQIHIDIPRMSPEA-LILQPK 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE4 VQEIFERILFIWAIRHPASGYVQGINDLVTPFFVVFLSEYVE-EDVENFDVTNLSQDMLR : ::::::::::::::::::::::::::::::::::. ::.: :.:.. ::... ..: CCDS14 VTEIFERILFIWAIRHPASGYVQGINDLVTPFFVVFICEYIEAEEVDTVDVSGVPAEVLC 300 310 320 330 340 350 350 360 370 380 390 400 pF1KE4 SIEADSFWCMSKLLDGIQDNYTFAQPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQF .::::..:::::::::::::::::::::: ::: ::::::::::::: :. 
..::.:::: CCDS14 NIEADTYWCMSKLLDGIQDNYTFAQPGIQMKVKMLEELVSRIDEQVHRHLDQHEVRYLQF 360 370 380 390 400 410 410 420 430 440 450 460 pF1KE4 AFRWMNNLLMRELPLRCTIRLWDTYQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQGL ::::::::::::.::::::::::::::::.::::::::::::::..::::::.:.::: : CCDS14 AFRWMNNLLMREVPLRCTIRLWDTYQSEPDGFSHFHLYVCAAFLVRWRKEILEEKDFQEL 420 430 440 450 460 470 470 480 490 500 pF1KE4 LMLLQNLPTIHWGNEEIGLLLAEAYRLKYMFADAPNHYRR :..:::::: :: .:.:.::::::::::. ::::::::.. CCDS14 LLFLQNLPTAHWDDEDISLLLAEAYRLKFAFADAPNHYKK 480 490 500 510 >>CCDS63511.1 TBC1D22A gene_id:25771|Hs108|chr22 (439 aa) initn: 1792 init1: 1354 opt: 1655 Z-score: 1834.0 bits: 348.7 E(32554): 7.8e-96 Smith-Waterman score: 1799; 57.1% identity (73.8% similar) in 508 aa overlap (1-505:1-439) 10 20 30 40 50 pF1KE4 MAAENS-KQFWKRS-AKLPGSIQPVYGAQHPPLDPRLTKNFIKERSKVNTVPLKNKKASS ::.... ::::::: .:::::. .. .:. :.:.: :..:. CCDS63 MASDGARKQFWKRSNSKLPGSL-------------------LRSTAKMPTTPVKAKRVST 10 20 30 40 60 70 80 90 100 110 pF1KE4 FHEFARNTSDAWDIGDDEEEDFSSPSFQTLNSKVALATAAQVLENHSKLRVKPERSQSTT :.:: ::::::: :.:..: .. . ..:::.:.. :: .::.:::. . .: CCDS63 FQEFESNTSDAWDAGEDDDELLAMAA-ESLNSEVVMETANRVLRNHSQRQGRP------- 50 60 70 80 90 120 130 140 150 160 170 pF1KE4 SDVPANYKVIKSSSDAQLSRNSSDTCLRNPLHKQQSLPLRPIIPLVARISDQNASGAPPM : ..: .:. : :: . :: CCDS63 ------------------------TLQEGPGLQQK--P-RP-------------EAEPPS 100 110 180 190 200 210 220 230 pF1KE4 TVREKTRLEKFRQLLSSQNTDLDELRKCSWPGVPREVRPITWRLLSGYLPANTERRKLTL :: : . : . .:::. :: :.:. :::.::.:::::::::..:: :: CCDS63 PPSGDLRLVKSVSE-SHTSCPAEELRRLSWSGIPKPVRPMTWKLLSGYLPANVDRRPATL 120 130 140 150 160 170 240 250 260 270 280 290 pF1KE4 QRKREEYFGFIEQYYDSRNEEHHQDTYRQIHIDIPRTNPLIPLFQQPLVQEIFERILFIW :::..:::.:::.::::::.: :::::::::::::: .: :. :: : :::::::::: CCDS63 QRKQKEYFAFIEHYYDSRNDEVHQDTYRQIHIDIPRMSPEA-LILQPKVTEIFERILFIW 180 190 200 210 220 230 300 310 320 330 340 350 pF1KE4 AIRHPASGYVQGINDLVTPFFVVFLSEYVE-EDVENFDVTNLSQDMLRSIEADSFWCMSK ::::::::::::::::::::::::. ::.: :.:.. ::... 
..: .::::..::::: CCDS63 AIRHPASGYVQGINDLVTPFFVVFICEYIEAEEVDTVDVSGVPAEVLCNIEADTYWCMSK 240 250 260 270 280 290 360 370 380 390 400 410 pF1KE4 LLDGIQDNYTFAQPGIQKKVKALEELVSRIDEQVHNHFRRYEVEYLQFAFRWMNNLLMRE ::::::::::::::::: ::: ::::::::::::: :. ..::.:::::::::::::::: CCDS63 LLDGIQDNYTFAQPGIQMKVKMLEELVSRIDEQVHRHLDQHEVRYLQFAFRWMNNLLMRE 300 310 320 330 340 350 420 430 440 450 460 470 pF1KE4 LPLRCTIRLWDTYQSEPEGFSHFHLYVCAAFLIKWRKEILDEEDFQGLLMLLQNLPTIHW .::::::::::::::::.::::::::::::::..::::::.:.::: ::..:::::: :: CCDS63 VPLRCTIRLWDTYQSEPDGFSHFHLYVCAAFLVRWRKEILEEKDFQELLLFLQNLPTAHW 360 370 380 390 400 410 480 490 500 pF1KE4 GNEEIGLLLAEAYRLKYMFADAPNHYRR .:.:.::::::::::. ::::::::.. CCDS63 DDEDISLLLAEAYRLKFAFADAPNHYKK 420 430 505 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:10:29 2016 done: Sun Nov 6 04:10:30 2016 Total Scan time: 2.640 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]