FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1546, 228 aa 1>>>pF1KE1546 228 - 228 aa - 228 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7818+/-0.000928; mu= 17.2463+/- 0.055 mean_var=69.9754+/-14.294, 0's: 0 Z-trim(105.9): 45 B-trim: 0 in 0/49 Lambda= 0.153321 statistics sampled from 8631 (8675) to 8631 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.63), E-opt: 0.2 (0.266), width: 16 Scan time: 1.730 The best scores are: opt bits E(32554) CCDS8540.1 CD9 gene_id:928|Hs108|chr12 ( 228) 1537 348.8 1.7e-96 CCDS81654.1 CD9 gene_id:928|Hs108|chr12 ( 159) 1078 247.2 4.7e-66 CCDS7734.1 CD81 gene_id:975|Hs108|chr11 ( 236) 722 168.6 3.2e-42 CCDS881.1 TSPAN2 gene_id:10100|Hs108|chr1 ( 221) 717 167.4 6.5e-42 CCDS73240.1 CD81 gene_id:975|Hs108|chr11 ( 165) 464 111.4 3.7e-25 CCDS8999.1 TSPAN8 gene_id:7103|Hs108|chr12 ( 237) 421 102.0 3.5e-22 CCDS76193.1 TSPAN2 gene_id:10100|Hs108|chr1 ( 196) 380 92.8 1.6e-19 CCDS7910.1 TSPAN18 gene_id:90139|Hs108|chr11 ( 248) 367 90.1 1.4e-18 CCDS829.1 CD53 gene_id:963|Hs108|chr1 ( 219) 364 89.3 2.1e-18 CCDS7909.1 CD82 gene_id:3732|Hs108|chr11 ( 267) 351 86.5 1.8e-17 CCDS7719.1 CD151 gene_id:977|Hs108|chr11 ( 253) 349 86.1 2.3e-17 CCDS7721.1 TSPAN4 gene_id:7106|Hs108|chr11 ( 238) 312 77.9 6.4e-15 CCDS7369.1 TSPAN14 gene_id:81619|Hs108|chr10 ( 270) 306 76.6 1.8e-14 CCDS8520.1 TSPAN9 gene_id:10867|Hs108|chr12 ( 239) 292 73.5 1.4e-13 CCDS3646.1 TSPAN5 gene_id:10098|Hs108|chr4 ( 268) 288 72.6 2.7e-13 CCDS14248.1 TSPAN7 gene_id:7102|Hs108|chrX ( 249) 282 71.3 6.5e-13 CCDS8890.1 CD63 gene_id:967|Hs108|chr12 ( 238) 280 70.8 8.6e-13 CCDS14470.1 TSPAN6 gene_id:7105|Hs108|chrX ( 245) 271 68.8 3.5e-12 CCDS31469.1 CD82 gene_id:3732|Hs108|chr11 ( 242) 270 68.6 4e-12 CCDS47346.2 TSPAN17 gene_id:26262|Hs108|chr5 ( 263) 269 68.4 5e-12 CCDS31765.1 TSPAN11 gene_id:441631|Hs108|chr12 ( 253) 268 68.2 5.7e-12 CCDS54952.1 TSPAN17 gene_id:26262|Hs108|chr5 ( 329) 269 68.5 5.9e-12 CCDS34298.1 TSPAN17 gene_id:26262|Hs108|chr5 ( 332) 269 68.5 5.9e-12 CCDS12760.1 CD37 gene_id:951|Hs108|chr19 ( 281) 266 67.8 8.3e-12 CCDS5777.1 TSPAN12 gene_id:23554|Hs108|chr7 ( 305) 260 66.5 2.2e-11 CCDS530.1 TSPAN1 gene_id:10103|Hs108|chr1 ( 241) 257 65.7 2.9e-11 >>CCDS8540.1 CD9 gene_id:928|Hs108|chr12 (228 aa) initn: 1537 init1: 1537 opt: 1537 Z-score: 1846.0 bits: 348.8 E(32554): 1.7e-96 Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228) 10 20 30 40 50 60 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 VQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTVKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 VQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTVKS 130 140 150 160 170 180 190 200 210 220 pF1KE1 CPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV :::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 CPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV 190 200 210 220 >>CCDS81654.1 CD9 gene_id:928|Hs108|chr12 (159 aa) initn: 1078 init1: 1078 opt: 1078 Z-score: 1299.4 bits: 247.2 E(32554): 4.7e-66 Smith-Waterman score: 1078; 100.0% identity (100.0% similar) in 159 aa overlap (70-228:1-159) 40 50 60 70 80 90 pF1KE1 QTKSIFEQETNNNNSSFYTGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVI :::::::::::::::::::::::::::::: CCDS81 MMLVGFLGCCGAVQESQCMLGLFFGFLLVI 10 20 30 100 110 120 130 140 150 pF1KE1 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGV 40 50 60 70 80 90 160 170 180 190 200 210 pF1KE1 EQFISDICPKKDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 EQFISDICPKKDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCC 100 110 120 130 140 150 220 pF1KE1 AIRRNREMV ::::::::: CCDS81 AIRRNREMV >>CCDS7734.1 CD81 gene_id:975|Hs108|chr11 (236 aa) initn: 714 init1: 335 opt: 722 Z-score: 871.6 bits: 168.6 E(32554): 3.2e-42 Smith-Waterman score: 722; 44.6% identity (78.4% similar) in 231 aa overlap (1-222:1-231) 10 20 30 40 50 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNS--SFYT : :.: ::::::::: :::.::::: ..:...:::: : :: ... : ... . .::. CCDS77 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 GVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVI :.::::..::.::.:::::: ::.:::::.:: :: :...:: :.::.:::. .::.. CCDS77 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE1 KEVQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCG----LAGGVEQFISDICPK-KD :.:..:: .. .. . :. . . ..:..: .:.::: : . . ...::. .. CCDS77 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSN 130 140 150 160 170 180 180 190 200 210 220 pF1KE1 VLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV .. .. ..: . : ..:..:...:: ..: .::.::: ::.::.:::.:: CCDS77 IISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY 190 200 210 220 230 >>CCDS881.1 TSPAN2 gene_id:10100|Hs108|chr1 (221 aa) initn: 684 init1: 338 opt: 717 Z-score: 866.0 bits: 167.4 E(32554): 6.5e-42 Smith-Waterman score: 717; 48.2% identity (76.5% similar) in 226 aa overlap (4-228:5-221) 10 20 30 40 50 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTG .:: .::::::.:::..::::: ::.:.:::.:: . : . .. . . ::.: CCDS88 MGRFRGGLRCIKYLLLGFNLLFWLAGSAVIAFGLWFRFGGAIKELSSEDKSPEY--FYVG 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 VYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIK .:.:.::::::: :::.:::::..::::.:: :: :::::: :........ : .:. CCDS88 LYVLVGAGALMMAVGFFGCCGAMRESQCVLGSFFTCLLVIFAAEVTTGVFAFIGKGVAIR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 EVQEFYKDTYNKLKTKDEPQRE-TLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTV .:: .:...:: ::. . . :: ..: ...::: .. :: .. ::: : . CCDS88 HVQTMYEEAYNDY-LKDRGKGNGTLITFHSTFQCCGKESS-EQ-VQPTCPK----ELLGH 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 KSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV :.: : :. ... :...:: :::::: . ::::::::.:::::: .:... CCDS88 KNCIDEIETIISVKLQLIGIVGIGIAGLTIFGMIFSMVLCCAIRNSRDVI 180 190 200 210 220 >>CCDS73240.1 CD81 gene_id:975|Hs108|chr11 (165 aa) initn: 454 init1: 267 opt: 464 Z-score: 565.2 bits: 111.4 E(32554): 3.7e-25 Smith-Waterman score: 464; 40.6% identity (76.9% similar) in 160 aa overlap (70-222:1-160) 40 50 60 70 80 90 pF1KE1 QTKSIFEQETNNNNSSFYTGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVI ::.:::::: ::.:::::.:: :: :... CCDS73 MMFVGFLGCYGAIQESQCLLGTFFTCLVIL 10 20 30 100 110 120 130 140 150 pF1KE1 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCG--- :: :.::.:::. .::.. :.:..:: .. .. . :. . . ..:..: .:.::: CCDS73 FACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSST 40 50 60 70 80 90 160 170 180 190 200 210 pF1KE1 -LAGGVEQFISDICPK-KDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMI : . . ...::. .... .. ..: . : ..:..:...:: ..: .::.::: :: CCDS73 LTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMI 100 110 120 130 140 150 220 pF1KE1 FSMILCCAIRRNREMV .::.:::.:: CCDS73 LSMVLCCGIRNSSVY 160 >>CCDS8999.1 TSPAN8 gene_id:7103|Hs108|chr12 (237 aa) initn: 272 init1: 239 opt: 421 Z-score: 511.7 bits: 102.0 E(32554): 3.5e-22 Smith-Waterman score: 505; 35.9% identity (69.7% similar) in 234 aa overlap (5-221:3-234) 10 20 30 40 50 60 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV : . :::: .: :::.::: :: .::...:.: ......:: .: . .:: :..: CCDS89 MAGVSACIKYSMFTFNFLFWLCGILILALAIWVRVSNDSQAIFGSE--DVGSSSYVAV 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE :::..::..:..::::::::..::.::: ::: ::.:. ...:..: : :.. . CCDS89 DILIAVGAIIMILGFLGCCGAIKESRCMLLLFFIGLLLILLLQVATGILGAVFKSKSDRI 60 70 80 90 100 110 130 140 150 160 pF1KE1 VQEFYKDTYNKLKTKDEPQRETLKAI---HYALNCCGLAGGVE------QFISDICP--- :.: .. . :.. : ... .:: . ..::::..:. : ..: CCDS89 VNETLYENTKLLSATGESEKQFQEAIIVFQEEFKCCGLVNGAADWGNNFQHYPELCACLD 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 KKDVLETFTVK-----SCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRR :. .... : .: . ::. . ... :. ....:.::. :.:..:::.: : : CCDS89 KQRPCQSYNGKQVYKETCISFIKDFLAKNLIIVIGISFGLAVIEILGLVFSMVLYCQIGN 180 190 200 210 220 230 pF1KE1 NREMV CCDS89 K >>CCDS76193.1 TSPAN2 gene_id:10100|Hs108|chr1 (196 aa) initn: 613 init1: 205 opt: 380 Z-score: 463.8 bits: 92.8 E(32554): 1.6e-19 Smith-Waterman score: 599; 43.8% identity (68.6% similar) in 226 aa overlap (4-228:5-196) 10 20 30 40 50 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTG .:: .::::::.:::..::::: ::.:.:::.:: . : . .. . . ::.: CCDS76 MGRFRGGLRCIKYLLLGFNLLFWLAGSAVIAFGLWFRFGGAIKELSSEDKSPEY--FYVG 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 VYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIK .:.:.::::::: :::.:::::..::::.:: .:. CCDS76 LYVLVGAGALMMAVGFFGCCGAMRESQCVLG-------------------------SAIR 60 70 80 90 120 130 140 150 160 170 pF1KE1 EVQEFYKDTYNKLKTKDEPQRE-TLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTV .:: .:...:: ::. . . :: ..: ...::: .. :: .. ::: : . CCDS76 HVQTMYEEAYNDY-LKDRGKGNGTLITFHSTFQCCGKESS-EQ-VQPTCPK----ELLGH 100 110 120 130 140 180 190 200 210 220 pF1KE1 KSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV :.: : :. ... :...:: :::::: . ::::::::.:::::: .:... CCDS76 KNCIDEIETIISVKLQLIGIVGIGIAGLTIFGMIFSMVLCCAIRNSRDVI 150 160 170 180 190 >>CCDS7910.1 TSPAN18 gene_id:90139|Hs108|chr11 (248 aa) initn: 408 init1: 256 opt: 367 Z-score: 446.9 bits: 90.1 E(32554): 1.4e-18 Smith-Waterman score: 396; 31.8% identity (62.0% similar) in 242 aa overlap (9-217:8-243) 10 20 30 40 50 60 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV :.:::.: :::...:.: .::::.:. : . : .: : . ::. CCDS79 MEGDCLSCMKYLMFVFNFFIFLGGACLLAIGIWVMVDP---TGF-REIVAANPLLLTGA 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE :::.. :.:..:.:::::::::.:..:.: .:: :.:.:: :..::: .. .... .: CCDS79 YILLAMGGLLFLLGFLGCCGAVRENKCLLLFFFLFILIIFLAELSAAILAFIFRENLTRE 60 70 80 90 100 110 130 140 150 160 pF1KE1 VQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCGLAGGVE-QFIS------------- : :. .. . ... . : ... ...:::. : . .: : CCDS79 F--FTKELTKHYQGNNDTDVFSATWNSVMITFGCCGVNGPEDFKFASVFRLLTLDSEEVP 120 130 140 150 160 170 170 180 190 200 pF1KE1 DICPKKD------VL----ETFTVKS-------CPDAIKEVFDNKFHIIGAVGIGIAVVM . : ... :: : . .: : .: ..:.. .. ::..::. .. CCDS79 EACCRREPQSRDGVLLSREECLLGRSLFLNKQGCYTVILNTFETYVYLAGALAIGVLAIE 180 190 200 210 220 230 210 220 pF1KE1 IFGMIFSMILCCAIRRNREMV .:.:::.: : CCDS79 LFAMIFAMCLFRGIQ 240 >>CCDS829.1 CD53 gene_id:963|Hs108|chr1 (219 aa) initn: 179 init1: 150 opt: 364 Z-score: 444.0 bits: 89.3 E(32554): 2.1e-18 Smith-Waterman score: 364; 27.9% identity (62.6% similar) in 222 aa overlap (8-228:7-217) 10 20 30 40 50 pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTG- : .::.:: ::..::. : .:..:..: . .. .: .: :. : CCDS82 MGMSSLKLLKYVLFFFNLLFWICGCCILGFGIYLLIHNNFGVLF-----HNLPSLTLGN 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 VYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIK :....:. ..:.:.:::: :...:..:.: :: .::.:. :.. :: . ..... . CCDS82 VFVIVGS--IIMVVAFLGCMGSIKENKCLLMSFFILLLIILLAEVTLAILLFVYEQKLNE 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 EVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTVK : . :. .. .. :. . . .:. :.:::. .:. .. : : . :. CCDS82 YVAKGLTDSIHRYHS-DNSTKAAWDSIQSFLQCCGI-NGTSDWTSG--PPASCPSDRKVE 120 130 140 150 160 180 190 200 210 220 pF1KE1 SCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV .: . : ..: :: . : . :. ..:: :.. : : : .. . . CCDS82 GCYAKARLWFHSNFLYIGIITICVCVIEVLGMSFALTLNCQIDKTSQTIGL 170 180 190 200 210 >>CCDS7909.1 CD82 gene_id:3732|Hs108|chr11 (267 aa) initn: 427 init1: 222 opt: 351 Z-score: 427.4 bits: 86.5 E(32554): 1.8e-17 Smith-Waterman score: 366; 28.7% identity (61.2% similar) in 258 aa overlap (6-218:2-253) 10 20 30 40 50 pF1KE1 MPVKGGTKCIK---YLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFY :. ::: :.:: ::.::.. : ..:..:.:. : :: : . ....::. CCDS79 MGSACIKVTKYFLFLFNLIFFILGAVILGFGVWILAD---KSSFISVLQTSSSSLR 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 TGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEV :.:..::.::. ::.::::: :::.: .:.:::.:.:::.:. ...:. : . .. CCDS79 MGAYVFIGVGAVTMLMGFLGCIGAVNEVRCLLGLYFAFLLLILIAQVTAGALFYFNMGKL 60 70 80 90 100 110 120 130 140 150 160 pF1KE1 IKE----VQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAG------GVEQF----I .: : :. .: ::. .... ... .. ..::: .. ..: . . CCDS79 KQEMGGIVTELIRD-YNS--SREDSLQDAWDYVQAQVKCCGWVSFYNWTDNAELMNRPEV 120 130 140 150 160 170 170 180 190 pF1KE1 SDICP---KKDVLETFTVKS-------------------------CPDAIKEVFDNKFHI . : : . ....:.. : . .. ..... : CCDS79 TYPCSCEVKGEEDNSLSVRKGFCEAPGNRTQSGNHPEDWPVYQEGCMEKVQAWLQENLGI 180 190 200 210 220 230 200 210 220 pF1KE1 IGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV : .::.:.:.. ..::..:. :: CCDS79 ILGVGVGVAIIELLGMVLSICLCRHVHSEDYSKVPKY 240 250 260 228 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:45:10 2016 done: Mon Nov 7 01:45:10 2016 Total Scan time: 1.730 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]