FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5228, 236 aa 1>>>pF1KE5228 236 - 236 aa - 236 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9570+/-0.000882; mu= 15.7706+/- 0.053 mean_var=62.6592+/-12.826, 0's: 0 Z-trim(105.9): 41 B-trim: 997 in 2/47 Lambda= 0.162025 statistics sampled from 8631 (8672) to 8631 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.629), E-opt: 0.2 (0.266), width: 16 Scan time: 1.410 The best scores are: opt bits E(32554) CCDS7734.1 CD81 gene_id:975|Hs108|chr11 ( 236) 1581 378.1 2.8e-105 CCDS73240.1 CD81 gene_id:975|Hs108|chr11 ( 165) 1096 264.6 2.9e-71 CCDS8540.1 CD9 gene_id:928|Hs108|chr12 ( 228) 722 177.2 7.7e-45 CCDS881.1 TSPAN2 gene_id:10100|Hs108|chr1 ( 221) 480 120.7 8e-28 CCDS81654.1 CD9 gene_id:928|Hs108|chr12 ( 159) 464 116.8 8.2e-27 CCDS8520.1 TSPAN9 gene_id:10867|Hs108|chr12 ( 239) 384 98.3 4.9e-21 CCDS8999.1 TSPAN8 gene_id:7103|Hs108|chr12 ( 237) 380 97.3 9.2e-21 CCDS7721.1 TSPAN4 gene_id:7106|Hs108|chr11 ( 238) 365 93.8 1.1e-19 CCDS7910.1 TSPAN18 gene_id:90139|Hs108|chr11 ( 248) 334 86.6 1.7e-17 CCDS7909.1 CD82 gene_id:3732|Hs108|chr11 ( 267) 315 82.2 3.8e-16 CCDS76193.1 TSPAN2 gene_id:10100|Hs108|chr1 ( 196) 305 79.7 1.5e-15 CCDS31765.1 TSPAN11 gene_id:441631|Hs108|chr12 ( 253) 299 78.4 4.9e-15 CCDS530.1 TSPAN1 gene_id:10103|Hs108|chr1 ( 241) 297 77.9 6.5e-15 CCDS31469.1 CD82 gene_id:3732|Hs108|chr11 ( 242) 293 77.0 1.2e-14 CCDS829.1 CD53 gene_id:963|Hs108|chr1 ( 219) 292 76.7 1.3e-14 CCDS7369.1 TSPAN14 gene_id:81619|Hs108|chr10 ( 270) 289 76.1 2.6e-14 CCDS7719.1 CD151 gene_id:977|Hs108|chr11 ( 253) 286 75.4 4e-14 CCDS3646.1 TSPAN5 gene_id:10098|Hs108|chr4 ( 268) 283 74.7 6.8e-14 CCDS47346.2 TSPAN17 gene_id:26262|Hs108|chr5 ( 263) 278 73.5 1.5e-13 CCDS14470.1 TSPAN6 gene_id:7105|Hs108|chrX ( 245) 277 73.2 1.7e-13 CCDS54952.1 TSPAN17 gene_id:26262|Hs108|chr5 ( 329) 278 73.6 1.8e-13 CCDS34298.1 TSPAN17 gene_id:26262|Hs108|chr5 ( 332) 278 73.6 1.8e-13 CCDS12760.1 CD37 gene_id:951|Hs108|chr19 ( 281) 275 72.8 2.6e-13 CCDS8890.1 CD63 gene_id:967|Hs108|chr12 ( 238) 252 67.4 9.4e-12 >>CCDS7734.1 CD81 gene_id:975|Hs108|chr11 (236 aa) initn: 1581 init1: 1581 opt: 1581 Z-score: 2003.5 bits: 378.1 E(32554): 2.8e-105 Smith-Waterman score: 1581; 100.0% identity (100.0% similar) in 236 aa overlap (1-236:1-236) 10 20 30 40 50 60 pF1KE5 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSN 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 IISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS77 IISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY 190 200 210 220 230 >>CCDS73240.1 CD81 gene_id:975|Hs108|chr11 (165 aa) initn: 1096 init1: 1096 opt: 1096 Z-score: 1393.0 bits: 264.6 E(32554): 2.9e-71 Smith-Waterman score: 1096; 100.0% identity (100.0% similar) in 165 aa overlap (72-236:1-165) 50 60 70 80 90 100 pF1KE5 TNLLYLELGDKPAPNTFYVGIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVIL :::::::::::::::::::::::::::::: CCDS73 MMFVGFLGCYGAIQESQCLLGTFFTCLVIL 10 20 30 110 120 130 140 150 160 pF1KE5 FACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 FACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSST 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE5 LTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 LTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMI 100 110 120 130 140 150 230 pF1KE5 LSMVLCCGIRNSSVY ::::::::::::::: CCDS73 LSMVLCCGIRNSSVY 160 >>CCDS8540.1 CD9 gene_id:928|Hs108|chr12 (228 aa) initn: 714 init1: 335 opt: 722 Z-score: 918.5 bits: 177.2 E(32554): 7.7e-45 Smith-Waterman score: 722; 44.6% identity (78.4% similar) in 231 aa overlap (1-231:1-222) 10 20 30 40 50 60 pF1KE5 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV : :.: ::::::::: :::.::::: ..:...:::: : :: ... : ... ..::. CCDS85 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNN--SSFYT 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA :.::::..::.::.:::::: ::.:::::.:: :: :...:: :.::.:::. .::.. CCDS85 GVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVI 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE5 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSN :.:..:: .. .. . :. . . ..:..: .:.::: : . . ...::. .. CCDS85 KEVQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCG----LAGGVEQFISDICPK-KD 120 130 140 150 160 170 190 200 210 220 230 pF1KE5 IISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY .. .. ..: . : ..:..:...:: ..: .::.::: ::.::.:::.:: CCDS85 VLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV 180 190 200 210 220 >>CCDS881.1 TSPAN2 gene_id:10100|Hs108|chr1 (221 aa) initn: 631 init1: 315 opt: 480 Z-score: 613.0 bits: 120.7 E(32554): 8e-28 Smith-Waterman score: 610; 41.0% identity (70.9% similar) in 234 aa overlap (1-233:1-217) 10 20 30 40 50 pF1KE5 MG-VEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFY :: .: .::::::. ::..:::::..... .::.: .: .. .:. :: CCDS88 MGRFRGGLRCIKYLLLGFNLLFWLAGSAVIAFGLWFRFGGAIKELS----SEDKSPEYFY 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 VGIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQI ::.:.:...::.:: :::.:: ::..::::.::.:::::...:: ::..:...:..: CCDS88 VGLYVLVGAGALMMAVGFFGCCGAMRESQCVLGSFFTCLLVIFAAEVTTGVFAFIGKGVA 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 AKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGS . :. .:..: .. . : .:. . ::: :..:::. . . . ::. CCDS88 IRHVQTMYEEAYNDYLKDRGKGNGTLI--TFHSTFQCCGKESSEQVQPT------CPK-- 120 130 140 150 160 180 190 200 210 220 230 pF1KE5 NIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY : ...: ..:. ..: :: ::::..: .: . :: ::.::::::.:::: CCDS88 ---ELLGHKNCIDEIETIISVKLQLIGIVGIGIAGLTIFGMIFSMVLCCAIRNSRDVI 170 180 190 200 210 220 >>CCDS81654.1 CD9 gene_id:928|Hs108|chr12 (159 aa) initn: 454 init1: 267 opt: 464 Z-score: 594.8 bits: 116.8 E(32554): 8.2e-27 Smith-Waterman score: 464; 40.6% identity (76.9% similar) in 160 aa overlap (72-231:1-153) 50 60 70 80 90 100 pF1KE5 TNLLYLELGDKPAPNTFYVGIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVIL ::.:::::: ::.:::::.:: :: :... CCDS81 MMLVGFLGCCGAVQESQCMLGLFFGFLLVI 10 20 30 110 120 130 140 150 160 pF1KE5 FACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSST :: :.::.:::. .::.. :.:..:: .. .. . :. . . ..:..: .:.::: CCDS81 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCG--- 40 50 60 70 80 170 180 190 200 210 220 pF1KE5 LTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMI : . . ...::. .... .. ..: . : ..:..:...:: ..: .::.::: :: CCDS81 -LAGGVEQFISDICPK-KDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMI 90 100 110 120 130 140 230 pF1KE5 LSMVLCCGIRNSSVY .::.:::.:: CCDS81 FSMILCCAIRRNREMV 150 >>CCDS8520.1 TSPAN9 gene_id:10867|Hs108|chr12 (239 aa) initn: 368 init1: 180 opt: 384 Z-score: 491.2 bits: 98.3 E(32554): 4.9e-21 Smith-Waterman score: 384; 28.6% identity (59.8% similar) in 234 aa overlap (5-226:4-226) 10 20 30 40 50 60 pF1KE5 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV :: :.::..:.::..::: : .:::..:: . . . . . : : CCDS85 MARGCLCCLKYMMFLFNLIFWLCGCGLLGVGIWLSVSQGNFATFSPSFPSLSAAN---- 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA ..::.:...: .::::: :::.:..::: .:: :.... :. : :: :.. CCDS85 ---LVIAIGTIVMVTGFLGCLGAIKENKCLLLSFFIVLLVILLAELILLILFFVYMDKVN 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 KDVKQFYDQALQQAVVDDDAN--NAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPS- ...:. ..: ...... :: .... . ::: . : :: .: :. CCDS85 ENAKKDLKEGLLLYHTENNVGLKNAWNIIQA---EMRCCGVTDYTDWYP-VLGENTVPDR 120 130 140 150 160 180 190 200 210 220 pF1KE5 ---------GSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCC : : . :.. :..:. :. . ...: ... . ...:. : .::.: CCDS85 CCMENSQGCGRNATTPLWRTGCYEKVKMWFDDNKHVLGTVGMCILIMQILGMAFSMTLFQ 170 180 190 200 210 220 230 pF1KE5 GIRNSSVY CCDS85 HIHRTGKKYDA 230 >>CCDS8999.1 TSPAN8 gene_id:7103|Hs108|chr12 (237 aa) initn: 472 init1: 209 opt: 380 Z-score: 486.2 bits: 97.3 E(32554): 9.2e-21 Smith-Waterman score: 448; 33.6% identity (67.6% similar) in 238 aa overlap (5-232:3-236) 10 20 30 40 50 60 pF1KE5 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV : . :::: .:.:::.::: : .::..:.:.: . .. .. :.. . .. :: CCDS89 MAGVSACIKYSMFTFNFLFWLCGILILALAIWVRVSNDSQAIF----GSEDVGSSSYV 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA .. :::::::..:..::::: :::.::.:.: :: :.... .::.:: : : :.. CCDS89 AVDILIAVGAIIMILGFLGCCGAIKESRCMLLLFFIGLLLILLLQVATGILGAVFKSKSD 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 KDVKQ-FYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKN--NLC-- . :.. .:... ... .. .. . .. .:.: . ::: . .: . ... .:: CCDS89 RIVNETLYENTKLLSATGESEKQFQEAIIVFQEEFKCCGLVNGAADWGNNFQHYPELCAC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 -----PSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGI : : ....:: : . : :... .: .. .. .::: :. ...:::: : : CCDS89 LDKQRPCQSYNGKQVYKETCISFIKDFLAKNLIIVIGISFGLAVIEILGLVFSMVLYCQI 180 190 200 210 220 230 pF1KE5 RNSSVY : CCDS89 GNK >>CCDS7721.1 TSPAN4 gene_id:7106|Hs108|chr11 (238 aa) initn: 321 init1: 156 opt: 365 Z-score: 467.2 bits: 93.8 E(32554): 1.1e-19 Smith-Waterman score: 365; 27.7% identity (61.2% similar) in 242 aa overlap (6-236:5-236) 10 20 30 40 50 60 pF1KE5 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV : . .:::.:.::..:::.: .:::..:: . : . . : : CCDS77 MARACLQAVKYLMFAFNLLFWLGGCGVLGVGIWLAATQGSFATLSSSFPSLSAAN---- 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA .:: .:: .: .::.:: :::.:..::: ::: :...: :.. .: :. :.: CCDS77 ---LLIITGAFVMAIGFVGCLGAIKENKCLLLTFFLLLLLVFLLEATIAILFFAYTDKID 60 70 80 90 100 110 130 140 150 160 170 pF1KE5 KDVKQFYDQALQQAVVDDDAN--NAKAVVKTFHETLDCCGSSTLT----ALTTSVLKNNL . ..: ..:. .. ... :: ....: . ::: :. : . ... . .. CCDS77 RYAQQDLKKGLHLYGTQGNVGLTNAWSIIQT---DFRCCGVSNYTDWFEVYNATRVPDSC 120 130 140 150 160 180 190 200 210 220 pF1KE5 C-----PSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCG : : . .. .: :.. . .. .: .:: .. .:...:. . ..:.. : CCDS77 CLEFSESCGLHAPGTWWKAPCYETVKVWLQENLLAVGIFGLCTALVQILGLTFAMTMYCQ 170 180 190 200 210 220 230 pF1KE5 IRNSSVY . ....: CCDS77 VVKADTYCA 230 >>CCDS7910.1 TSPAN18 gene_id:90139|Hs108|chr11 (248 aa) initn: 417 init1: 187 opt: 334 Z-score: 427.8 bits: 86.6 E(32554): 1.7e-17 Smith-Waterman score: 375; 30.4% identity (60.3% similar) in 257 aa overlap (3-231:1-248) 10 20 30 40 50 pF1KE5 MGVEG-CTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFY .:: : .:.:::.::::: ..:.:. .:....:. :: . :. : . CCDS79 MEGDCLSCMKYLMFVFNFFIFLGGACLLAIGIWVMVDPTG----FREI--VAANPLLL 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 VGIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQI .: :::.:.:......::::: ::..:..::: :: ..:.: :..:.: .:. .... CCDS79 TGAYILLAMGGLLFLLGFLGCCGAVRENKCLLLFFFLFILIIFLAELSAAILAFIFRENL 60 70 80 90 100 110 120 130 140 150 160 pF1KE5 AKDVKQFYDQALQQAVV-DDDANNAKAVVKTFHETLDCCGSS-----------TLTALTT . ..:. . : . ..:.. .:. .. :. ::: . : .: . CCDS79 T---REFFTKELTKHYQGNNDTDVFSATWNSVMITFGCCGVNGPEDFKFASVFRLLTLDS 120 130 140 150 160 170 180 190 200 210 pF1KE5 SVLKNNLC---PSGSN--IIS--------NLF--KEDCHQKIDDLFSGKLYLIGIAAIVV . . : :.. . ..: .:: :. :. : . : .:: : :: : CCDS79 EEVPEACCRREPQSRDGVLLSREECLLGRSLFLNKQGCYTVILNTFETYVYLAGALAIGV 170 180 190 200 210 220 220 230 pF1KE5 AVIMIFEMILSMVLCCGIRNSSVY .: .: ::..: : ::. CCDS79 LAIELFAMIFAMCLFRGIQ 230 240 >>CCDS7909.1 CD82 gene_id:3732|Hs108|chr11 (267 aa) initn: 356 init1: 201 opt: 315 Z-score: 403.3 bits: 82.2 E(32554): 3.8e-16 Smith-Waterman score: 333; 25.6% identity (59.2% similar) in 262 aa overlap (1-227:1-253) 10 20 30 40 50 60 pF1KE5 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV :: .: : ::.::.::..:.. :.:::: ..:. : ... . :. .. ... . CCDS79 MG-SACIKVTKYFLFLFNLIFFILGAVILGFGVWILAD-KSSFISVLQTSS----SSLRM 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA : :..:.:::: :..::::: ::..: .:::: .:. :.... .:.:: . : .. CCDS79 GAYVFIGVGAVTMLMGFLGCIGAVNEVRCLLGLYFAFLLLILIAQVTAGALFYFNMGKLK 60 70 80 90 100 110 130 140 150 160 pF1KE5 KDVKQFYDQALQQ--AVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSV--------- ... . . ... . .:. ..: :.. . ::: .. : .. CCDS79 QEMGGIVTELIRDYNSSREDSLQDAWDYVQA---QVKCCGWVSFYNWTDNAELMNRPEVT 120 130 140 150 160 170 170 180 190 200 pF1KE5 ----------------LKNNLCPSGSNIISN--------LFKEDCHQKIDDLFSGKLYLI .....: . .: .. ...: : .:.. .. .: .: CCDS79 YPCSCEVKGEEDNSLSVRKGFCEAPGNRTQSGNHPEDWPVYQEGCMEKVQAWLQENLGII 180 190 200 210 220 230 210 220 230 pF1KE5 GIAAIVVAVIMIFEMILSMVLCCGIRNSSVY ... ::.: .. :.::. :: CCDS79 LGVGVGVAIIELLGMVLSICLCRHVHSEDYSKVPKY 240 250 260 236 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 20:26:13 2016 done: Mon Nov 7 20:26:13 2016 Total Scan time: 1.410 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]