FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1546, 228 aa
1>>>pF1KE1546 228 - 228 aa - 228 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.7818+/-0.000928; mu= 17.2463+/- 0.055
mean_var=69.9754+/-14.294, 0's: 0 Z-trim(105.9): 45 B-trim: 0 in 0/49
Lambda= 0.153321
statistics sampled from 8631 (8675) to 8631 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.63), E-opt: 0.2 (0.266), width: 16
Scan time: 1.730
The best scores are: opt bits E(32554)
CCDS8540.1 CD9 gene_id:928|Hs108|chr12 ( 228) 1537 348.8 1.7e-96
CCDS81654.1 CD9 gene_id:928|Hs108|chr12 ( 159) 1078 247.2 4.7e-66
CCDS7734.1 CD81 gene_id:975|Hs108|chr11 ( 236) 722 168.6 3.2e-42
CCDS881.1 TSPAN2 gene_id:10100|Hs108|chr1 ( 221) 717 167.4 6.5e-42
CCDS73240.1 CD81 gene_id:975|Hs108|chr11 ( 165) 464 111.4 3.7e-25
CCDS8999.1 TSPAN8 gene_id:7103|Hs108|chr12 ( 237) 421 102.0 3.5e-22
CCDS76193.1 TSPAN2 gene_id:10100|Hs108|chr1 ( 196) 380 92.8 1.6e-19
CCDS7910.1 TSPAN18 gene_id:90139|Hs108|chr11 ( 248) 367 90.1 1.4e-18
CCDS829.1 CD53 gene_id:963|Hs108|chr1 ( 219) 364 89.3 2.1e-18
CCDS7909.1 CD82 gene_id:3732|Hs108|chr11 ( 267) 351 86.5 1.8e-17
CCDS7719.1 CD151 gene_id:977|Hs108|chr11 ( 253) 349 86.1 2.3e-17
CCDS7721.1 TSPAN4 gene_id:7106|Hs108|chr11 ( 238) 312 77.9 6.4e-15
CCDS7369.1 TSPAN14 gene_id:81619|Hs108|chr10 ( 270) 306 76.6 1.8e-14
CCDS8520.1 TSPAN9 gene_id:10867|Hs108|chr12 ( 239) 292 73.5 1.4e-13
CCDS3646.1 TSPAN5 gene_id:10098|Hs108|chr4 ( 268) 288 72.6 2.7e-13
CCDS14248.1 TSPAN7 gene_id:7102|Hs108|chrX ( 249) 282 71.3 6.5e-13
CCDS8890.1 CD63 gene_id:967|Hs108|chr12 ( 238) 280 70.8 8.6e-13
CCDS14470.1 TSPAN6 gene_id:7105|Hs108|chrX ( 245) 271 68.8 3.5e-12
CCDS31469.1 CD82 gene_id:3732|Hs108|chr11 ( 242) 270 68.6 4e-12
CCDS47346.2 TSPAN17 gene_id:26262|Hs108|chr5 ( 263) 269 68.4 5e-12
CCDS31765.1 TSPAN11 gene_id:441631|Hs108|chr12 ( 253) 268 68.2 5.7e-12
CCDS54952.1 TSPAN17 gene_id:26262|Hs108|chr5 ( 329) 269 68.5 5.9e-12
CCDS34298.1 TSPAN17 gene_id:26262|Hs108|chr5 ( 332) 269 68.5 5.9e-12
CCDS12760.1 CD37 gene_id:951|Hs108|chr19 ( 281) 266 67.8 8.3e-12
CCDS5777.1 TSPAN12 gene_id:23554|Hs108|chr7 ( 305) 260 66.5 2.2e-11
CCDS530.1 TSPAN1 gene_id:10103|Hs108|chr1 ( 241) 257 65.7 2.9e-11
>>CCDS8540.1 CD9 gene_id:928|Hs108|chr12 (228 aa)
initn: 1537 init1: 1537 opt: 1537 Z-score: 1846.0 bits: 348.8 E(32554): 1.7e-96
Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 228 aa overlap (1-228:1-228)
10 20 30 40 50 60
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 VQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTVKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 VQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTVKS
130 140 150 160 170 180
190 200 210 220
pF1KE1 CPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 CPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
190 200 210 220
>>CCDS81654.1 CD9 gene_id:928|Hs108|chr12 (159 aa)
initn: 1078 init1: 1078 opt: 1078 Z-score: 1299.4 bits: 247.2 E(32554): 4.7e-66
Smith-Waterman score: 1078; 100.0% identity (100.0% similar) in 159 aa overlap (70-228:1-159)
40 50 60 70 80 90
pF1KE1 QTKSIFEQETNNNNSSFYTGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVI
::::::::::::::::::::::::::::::
CCDS81 MMLVGFLGCCGAVQESQCMLGLFFGFLLVI
10 20 30
100 110 120 130 140 150
pF1KE1 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGV
40 50 60 70 80 90
160 170 180 190 200 210
pF1KE1 EQFISDICPKKDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 EQFISDICPKKDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCC
100 110 120 130 140 150
220
pF1KE1 AIRRNREMV
:::::::::
CCDS81 AIRRNREMV
>>CCDS7734.1 CD81 gene_id:975|Hs108|chr11 (236 aa)
initn: 714 init1: 335 opt: 722 Z-score: 871.6 bits: 168.6 E(32554): 3.2e-42
Smith-Waterman score: 722; 44.6% identity (78.4% similar) in 231 aa overlap (1-222:1-231)
10 20 30 40 50
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNS--SFYT
: :.: ::::::::: :::.::::: ..:...:::: : :: ... : ... . .::.
CCDS77 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 GVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVI
:.::::..::.::.:::::: ::.:::::.:: :: :...:: :.::.:::. .::..
CCDS77 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE1 KEVQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCG----LAGGVEQFISDICPK-KD
:.:..:: .. .. . :. . . ..:..: .:.::: : . . ...::. ..
CCDS77 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSN
130 140 150 160 170 180
180 190 200 210 220
pF1KE1 VLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
.. .. ..: . : ..:..:...:: ..: .::.::: ::.::.:::.::
CCDS77 IISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY
190 200 210 220 230
>>CCDS881.1 TSPAN2 gene_id:10100|Hs108|chr1 (221 aa)
initn: 684 init1: 338 opt: 717 Z-score: 866.0 bits: 167.4 E(32554): 6.5e-42
Smith-Waterman score: 717; 48.2% identity (76.5% similar) in 226 aa overlap (4-228:5-221)
10 20 30 40 50
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTG
.:: .::::::.:::..::::: ::.:.:::.:: . : . .. . . ::.:
CCDS88 MGRFRGGLRCIKYLLLGFNLLFWLAGSAVIAFGLWFRFGGAIKELSSEDKSPEY--FYVG
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 VYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIK
.:.:.::::::: :::.:::::..::::.:: :: :::::: :........ : .:.
CCDS88 LYVLVGAGALMMAVGFFGCCGAMRESQCVLGSFFTCLLVIFAAEVTTGVFAFIGKGVAIR
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 EVQEFYKDTYNKLKTKDEPQRE-TLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTV
.:: .:...:: ::. . . :: ..: ...::: .. :: .. ::: : .
CCDS88 HVQTMYEEAYNDY-LKDRGKGNGTLITFHSTFQCCGKESS-EQ-VQPTCPK----ELLGH
120 130 140 150 160 170
180 190 200 210 220
pF1KE1 KSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
:.: : :. ... :...:: :::::: . ::::::::.:::::: .:...
CCDS88 KNCIDEIETIISVKLQLIGIVGIGIAGLTIFGMIFSMVLCCAIRNSRDVI
180 190 200 210 220
>>CCDS73240.1 CD81 gene_id:975|Hs108|chr11 (165 aa)
initn: 454 init1: 267 opt: 464 Z-score: 565.2 bits: 111.4 E(32554): 3.7e-25
Smith-Waterman score: 464; 40.6% identity (76.9% similar) in 160 aa overlap (70-222:1-160)
40 50 60 70 80 90
pF1KE1 QTKSIFEQETNNNNSSFYTGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVI
::.:::::: ::.:::::.:: :: :...
CCDS73 MMFVGFLGCYGAIQESQCLLGTFFTCLVIL
10 20 30
100 110 120 130 140 150
pF1KE1 FAIEIAAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCG---
:: :.::.:::. .::.. :.:..:: .. .. . :. . . ..:..: .:.:::
CCDS73 FACEVAAGIWGFVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSST
40 50 60 70 80 90
160 170 180 190 200 210
pF1KE1 -LAGGVEQFISDICPK-KDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMI
: . . ...::. .... .. ..: . : ..:..:...:: ..: .::.::: ::
CCDS73 LTALTTSVLKNNLCPSGSNIISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMI
100 110 120 130 140 150
220
pF1KE1 FSMILCCAIRRNREMV
.::.:::.::
CCDS73 LSMVLCCGIRNSSVY
160
>>CCDS8999.1 TSPAN8 gene_id:7103|Hs108|chr12 (237 aa)
initn: 272 init1: 239 opt: 421 Z-score: 511.7 bits: 102.0 E(32554): 3.5e-22
Smith-Waterman score: 505; 35.9% identity (69.7% similar) in 234 aa overlap (5-221:3-234)
10 20 30 40 50 60
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV
: . :::: .: :::.::: :: .::...:.: ......:: .: . .:: :..:
CCDS89 MAGVSACIKYSMFTFNFLFWLCGILILALAIWVRVSNDSQAIFGSE--DVGSSSYVAV
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE
:::..::..:..::::::::..::.::: ::: ::.:. ...:..: : :.. .
CCDS89 DILIAVGAIIMILGFLGCCGAIKESRCMLLLFFIGLLLILLLQVATGILGAVFKSKSDRI
60 70 80 90 100 110
130 140 150 160
pF1KE1 VQEFYKDTYNKLKTKDEPQRETLKAI---HYALNCCGLAGGVE------QFISDICP---
:.: .. . :.. : ... .:: . ..::::..:. : ..:
CCDS89 VNETLYENTKLLSATGESEKQFQEAIIVFQEEFKCCGLVNGAADWGNNFQHYPELCACLD
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE1 KKDVLETFTVK-----SCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRR
:. .... : .: . ::. . ... :. ....:.::. :.:..:::.: : :
CCDS89 KQRPCQSYNGKQVYKETCISFIKDFLAKNLIIVIGISFGLAVIEILGLVFSMVLYCQIGN
180 190 200 210 220 230
pF1KE1 NREMV
CCDS89 K
>>CCDS76193.1 TSPAN2 gene_id:10100|Hs108|chr1 (196 aa)
initn: 613 init1: 205 opt: 380 Z-score: 463.8 bits: 92.8 E(32554): 1.6e-19
Smith-Waterman score: 599; 43.8% identity (68.6% similar) in 226 aa overlap (4-228:5-196)
10 20 30 40 50
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTG
.:: .::::::.:::..::::: ::.:.:::.:: . : . .. . . ::.:
CCDS76 MGRFRGGLRCIKYLLLGFNLLFWLAGSAVIAFGLWFRFGGAIKELSSEDKSPEY--FYVG
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 VYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIK
.:.:.::::::: :::.:::::..::::.:: .:.
CCDS76 LYVLVGAGALMMAVGFFGCCGAMRESQCVLG-------------------------SAIR
60 70 80 90
120 130 140 150 160 170
pF1KE1 EVQEFYKDTYNKLKTKDEPQRE-TLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTV
.:: .:...:: ::. . . :: ..: ...::: .. :: .. ::: : .
CCDS76 HVQTMYEEAYNDY-LKDRGKGNGTLITFHSTFQCCGKESS-EQ-VQPTCPK----ELLGH
100 110 120 130 140
180 190 200 210 220
pF1KE1 KSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
:.: : :. ... :...:: :::::: . ::::::::.:::::: .:...
CCDS76 KNCIDEIETIISVKLQLIGIVGIGIAGLTIFGMIFSMVLCCAIRNSRDVI
150 160 170 180 190
>>CCDS7910.1 TSPAN18 gene_id:90139|Hs108|chr11 (248 aa)
initn: 408 init1: 256 opt: 367 Z-score: 446.9 bits: 90.1 E(32554): 1.4e-18
Smith-Waterman score: 396; 31.8% identity (62.0% similar) in 242 aa overlap (9-217:8-243)
10 20 30 40 50 60
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTGV
:.:::.: :::...:.: .::::.:. : . : .: : . ::.
CCDS79 MEGDCLSCMKYLMFVFNFFIFLGGACLLAIGIWVMVDP---TGF-REIVAANPLLLTGA
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 YILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIKE
:::.. :.:..:.:::::::::.:..:.: .:: :.:.:: :..::: .. .... .:
CCDS79 YILLAMGGLLFLLGFLGCCGAVRENKCLLLFFFLFILIIFLAELSAAILAFIFRENLTRE
60 70 80 90 100 110
130 140 150 160
pF1KE1 VQEFYKDTYNKLKTKDEPQ--RETLKAIHYALNCCGLAGGVE-QFIS-------------
: :. .. . ... . : ... ...:::. : . .: :
CCDS79 F--FTKELTKHYQGNNDTDVFSATWNSVMITFGCCGVNGPEDFKFASVFRLLTLDSEEVP
120 130 140 150 160 170
170 180 190 200
pF1KE1 DICPKKD------VL----ETFTVKS-------CPDAIKEVFDNKFHIIGAVGIGIAVVM
. : ... :: : . .: : .: ..:.. .. ::..::. ..
CCDS79 EACCRREPQSRDGVLLSREECLLGRSLFLNKQGCYTVILNTFETYVYLAGALAIGVLAIE
180 190 200 210 220 230
210 220
pF1KE1 IFGMIFSMILCCAIRRNREMV
.:.:::.: :
CCDS79 LFAMIFAMCLFRGIQ
240
>>CCDS829.1 CD53 gene_id:963|Hs108|chr1 (219 aa)
initn: 179 init1: 150 opt: 364 Z-score: 444.0 bits: 89.3 E(32554): 2.1e-18
Smith-Waterman score: 364; 27.9% identity (62.6% similar) in 222 aa overlap (8-228:7-217)
10 20 30 40 50
pF1KE1 MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFYTG-
: .::.:: ::..::. : .:..:..: . .. .: .: :. :
CCDS82 MGMSSLKLLKYVLFFFNLLFWICGCCILGFGIYLLIHNNFGVLF-----HNLPSLTLGN
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 VYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEVIK
:....:. ..:.:.:::: :...:..:.: :: .::.:. :.. :: . ..... .
CCDS82 VFVIVGS--IIMVVAFLGCMGSIKENKCLLMSFFILLLIILLAEVTLAILLFVYEQKLNE
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 EVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFISDICPKKDVLETFTVK
: . :. .. .. :. . . .:. :.:::. .:. .. : : . :.
CCDS82 YVAKGLTDSIHRYHS-DNSTKAAWDSIQSFLQCCGI-NGTSDWTSG--PPASCPSDRKVE
120 130 140 150 160
180 190 200 210 220
pF1KE1 SCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
.: . : ..: :: . : . :. ..:: :.. : : : .. . .
CCDS82 GCYAKARLWFHSNFLYIGIITICVCVIEVLGMSFALTLNCQIDKTSQTIGL
170 180 190 200 210
>>CCDS7909.1 CD82 gene_id:3732|Hs108|chr11 (267 aa)
initn: 427 init1: 222 opt: 351 Z-score: 427.4 bits: 86.5 E(32554): 1.8e-17
Smith-Waterman score: 366; 28.7% identity (61.2% similar) in 258 aa overlap (6-218:2-253)
10 20 30 40 50
pF1KE1 MPVKGGTKCIK---YLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIFEQETNNNNSSFY
:. ::: :.:: ::.::.. : ..:..:.:. : :: : . ....::.
CCDS79 MGSACIKVTKYFLFLFNLIFFILGAVILGFGVWILAD---KSSFISVLQTSSSSLR
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 TGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEIAAAIWGYSHKDEV
:.:..::.::. ::.::::: :::.: .:.:::.:.:::.:. ...:. : . ..
CCDS79 MGAYVFIGVGAVTMLMGFLGCIGAVNEVRCLLGLYFAFLLLILIAQVTAGALFYFNMGKL
60 70 80 90 100 110
120 130 140 150 160
pF1KE1 IKE----VQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAG------GVEQF----I
.: : :. .: ::. .... ... .. ..::: .. ..: . .
CCDS79 KQEMGGIVTELIRD-YNS--SREDSLQDAWDYVQAQVKCCGWVSFYNWTDNAELMNRPEV
120 130 140 150 160 170
170 180 190
pF1KE1 SDICP---KKDVLETFTVKS-------------------------CPDAIKEVFDNKFHI
. : : . ....:.. : . .. ..... :
CCDS79 TYPCSCEVKGEEDNSLSVRKGFCEAPGNRTQSGNHPEDWPVYQEGCMEKVQAWLQENLGI
180 190 200 210 220 230
200 210 220
pF1KE1 IGAVGIGIAVVMIFGMIFSMILCCAIRRNREMV
: .::.:.:.. ..::..:. ::
CCDS79 ILGVGVGVAIIELLGMVLSICLCRHVHSEDYSKVPKY
240 250 260
228 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 01:45:10 2016 done: Mon Nov 7 01:45:10 2016
Total Scan time: 1.730 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]