FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3993, 418 aa
1>>>pF1KB3993 418 - 418 aa - 418 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8938+/-0.000747; mu= 20.0215+/- 0.045
mean_var=72.9988+/-14.962, 0's: 0 Z-trim(109.4): 13 B-trim: 0 in 0/50
Lambda= 0.150112
statistics sampled from 10827 (10835) to 10827 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.333), width: 16
Scan time: 1.420
The best scores are:                                      opt  bits E(32554)
CCDS41669.1 TM7SF2 gene_id:7108|Hs108|chr11       ( 418) 2854 627.2   9e-180
CCDS60846.1 TM7SF2 gene_id:7108|Hs108|chr11       ( 391) 2002 442.6   3e-124
CCDS1545.1 LBR gene_id:3930|Hs108|chr1            ( 615) 1676 372.2 7.5e-103
CCDS8200.1 DHCR7 gene_id:1717|Hs108|chr11         ( 475)  546 127.4  2.9e-29
>>CCDS41669.1 TM7SF2 gene_id:7108|Hs108|chr11 (418 aa)
initn: 2854 init1: 2854 opt: 2854 Z-score: 3341.0 bits: 627.2 E(32554): 9e-180
Smith-Waterman score: 2854; 100.0% identity (100.0% similar) in 418 aa overlap (1-418:1-418)
               10        20        30        40        50        60
pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB3 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB3 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB3 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB3 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLINATG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLINATG
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB3 YYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 YYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA
              310       320       330       340       350       360
              370       380       390       400       410
pF1KB3 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY
              370       380       390       400       410
>>CCDS60846.1 TM7SF2 gene_id:7108|Hs108|chr11 (391 aa)
initn: 2002 init1: 2002 opt: 2002 Z-score: 2344.1 bits: 442.6 E(32554): 3e-124
Smith-Waterman score: 2608; 93.5% identity (93.5% similar) in 418 aa overlap (1-418:1-391)
               10        20        30        40        50        60
pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB3 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB3 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB3 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB3 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLINATG
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLIN---
              250       260       270       280       290
              310       320       330       340       350       360
pF1KB3 YYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA
                               ::::::::::::::::::::::::::::::::::::
CCDS60 ------------------------GLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA
                               300       310       320       330
              370       380       390       400       410
pF1KB3 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS60 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY
           340       350       360       370       380       390
>>CCDS1545.1 LBR gene_id:3930|Hs108|chr1 (615 aa)
initn: 1419 init1: 1081 opt: 1676 Z-score: 1960.0 bits: 372.2 E(32554): 7.5e-103
Smith-Waterman score: 1676; 59.0% identity (81.0% similar) in 410 aa overlap (11-418:207-615)
10 20 30 40
pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSG
::::: :. ... ::. .: ::: ..
CCDS15 LKEIDSKEEKYVAKELAVRTFEVTPIRAKDLEFGGVPGVFLIMFGLPVFLFLLLLMCKQK
180 190 200 210 220 230
50 60 70 80 90 100
pF1KB3 PARLLGPPASLPGLEVLWSPRALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPI
::. : ::.: :: :.. ..: :. .:. .:::: ::.:: : : ::.: .
CCDS15 DPSLLNFPPPLPALYELWETRVFGVYLLWFLIQVLFYLLPIGKVVEGTPLIDGRRLKYRL
240 250 260 270 280 290
110 120 130 140 150 160
pF1KB3 NGFQALVLTALLVGLGMSAGLPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSAL
::: :..::. ..: .. :. . . .: .:..::. ..:..:::.. :: . :
CCDS15 NGFYAFILTSAVIGTSLFQGVEFHYVYSHFLQFALAATVFCVVLSVYLYMRSLKAPRNDL
300 310 320 330 340 350
170 180 190 200 210
pF1KB3 APGGNSGNPIYDFFLGRELNPRICFFDFKYFCELRPGLIGWVLINLALLMKEAEL--RGS
.:. .::: .::::.:::::::: ::.::::::::::::::.:::..:. : .. :.
CCDS15 SPA-SSGNAVYDFFIGRELNPRIGTFDLKYFCELRPGLIGWVVINLVMLLAEMKIQDRAV
360 370 380 390 400 410
220 230 240 250 260 270
pF1KB3 PSLAMWLVNGFQLLYVGDALWHEEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFL
::::: :::.:::::: ::::.:::.:::::: ::::::::::::..:::: ::.:: .:
CCDS15 PSLAMILVNSFQLLYVVDALWNEEALLTTMDIIHDGFGFMLAFGDLVWVPFIYSFQAFYL
420 430 440 450 460 470
280 290 300 310 320 330
pF1KB3 LHHPQPLGLPMASVICLINATGYYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLL
. ::. .. ::::.: ... :: ::::::::::.::::::::..: :.:: :.::..::
CCDS15 VSHPNEVSWPMASLIIVLKLCGYVIFRGANSQKNAFRKNPSDPKLAHLKTIHTSTGKNLL
480 490 500 510 520 530
340 350 360 370 380 390
pF1KB3 VSGWWGMVRHPNYLGDLIMALAWSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKY
::::::.::::::::::::::::::::: .:.:::::..::: :::::::::: .: .::
CCDS15 VSGWWGFVRHPNYLGDLIMALAWSLPCGFNHILPYFYIIYFTMLLVHREARDEYHCKKKY
540 550 560 570 580 590
400 410
pF1KB3 GLAWQEYCRRVPYRIMPYIY
:.::..::.::::::.::::
CCDS15 GVAWEKYCQRVPYRIFPYIY
600 610
>>CCDS8200.1 DHCR7 gene_id:1717|Hs108|chr11 (475 aa)
initn: 864 init1: 427 opt: 546 Z-score: 638.9 bits: 127.4 E(32554): 2.9e-29
Smith-Waterman score: 901; 37.6% identity (63.8% similar) in 431 aa overlap (22-418:46-475)
10 20 30 40 50
pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASL
:::. : ......: . : :: ...
CCDS82 DGVTNDRTASQGQWGRAWEVDWFSLASVIFLLLFAPFIVYYFIMACDQYSCALTGPVVDI
20 30 40 50 60 70
60 70 80 90
pF1KB3 -PG---LEVLW--SP----RALLLWLAWLGLQAALY---------LLPAR--KVAEGQEL
: : .: .: .: :. :. .:. :: .::. . ::
CCDS82 VTGHARLSDIWAKTPPITRKAAQLYTLWVTFQVLLYTSLPDFCHKFLPGYVGGIQEGAVT
80 90 100 110 120 130
100 110 120 130 140
pF1KB3 KDKSRLRYPINGFQALVLTALL--VGLGMSAGLPLGALPEMLLPLAFVATLTAFIFSLFL
.: :::.:: .:: :: .. . . . . . .:: . :.. .. : :
CCDS82 PAGVVNKYQINGLQAWLLTHLLWFANAHLLSWFSPTIIFDNWIPLLWCANILGYAVSTFA
140 150 160 170 180 190
150 160 170 180 190 200
pF1KB3 YMKAQVAPVSALAPGGNSGNPIYDFFLGRELNPRIC-FFDFKYFCELRPGLIGWVLINLA
..:. :.:: .:: .:....: :.:::: .:::: : . :::...:.::::.
CCDS82 MVKGYFFPTSA-RDCKFTGNFFYNYMMGIEFNPRIGKWFDFKLFFNGRPGIVAWTLINLS
200 210 220 230 240 250
210 220 230 240 250 260
pF1KB3 LLMKEAELRGSPSLAMWLVNGFQLLYVGDALWHEEAVLTTMDITHDGFGFMLAFGDMAWV
. :. ::.. . :: ::: .: .:: : .:.: : :.:: :: ::..:..:: .:.
CCDS82 FAAKQRELHSHVTNAMVLVNVLQAIYVIDFFWNETWYLKTIDICHDHFGWYLGWGDCVWL
260 270 280 290 300 310
270 280 290 300 310 320
pF1KB3 PFTYSLQAQFLLHHPQPLGLPMASVICLINATGYYIFRGANSQKNTFRKNPSDPRVAGLE
:. :.::. .:..:: :. : : . :.. .:::::: :: ::. ::.. . . : .
CCDS82 PYLYTLQGLYLVYHPVQLSTPHAVGVLLLGLVGYYIFRVANHQKDLFRRTDGRCLIWGRK
320 330 340 350 360 370
330 340 350 360 370
pF1KB3 ------TISTATGR----KLLVSGWWGMVRHPNYLGDLIMALAWSLPCGVSHLLPYFYLL
. ..: :. ::::::.::..:: ::.:::. .::. : :: .:::::::..
CCDS82 PKVIECSYTSADGQRHHSKLLVSGFWGVARHFNYVGDLMGSLAYCLACGGGHLLPYFYII
380 390 400 410 420 430
380 390 400 410
pF1KB3 YFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY
:.. ::.:: :::..: .::: :..: ::::..: :.
CCDS82 YMAILLTHRCLRDEHRCASKYGRDWERYTAAVPYRLLPGIF
440 450 460 470
418 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 16 14:50:25 2017 done: Thu Nov 16 14:50:25 2017
Total Scan time: 1.420 Total Display time: 0.040
Function used was FASTA [36.3.4 Apr, 2011]