FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3993, 418 aa 1>>>pF1KB3993 418 - 418 aa - 418 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8938+/-0.000747; mu= 20.0215+/- 0.045 mean_var=72.9988+/-14.962, 0's: 0 Z-trim(109.4): 13 B-trim: 0 in 0/50 Lambda= 0.150112 statistics sampled from 10827 (10835) to 10827 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.69), E-opt: 0.2 (0.333), width: 16 Scan time: 1.420 The best scores are: opt bits E(32554) CCDS41669.1 TM7SF2 gene_id:7108|Hs108|chr11 ( 418) 2854 627.2 9e-180 CCDS60846.1 TM7SF2 gene_id:7108|Hs108|chr11 ( 391) 2002 442.6 3e-124 CCDS1545.1 LBR gene_id:3930|Hs108|chr1 ( 615) 1676 372.2 7.5e-103 CCDS8200.1 DHCR7 gene_id:1717|Hs108|chr11 ( 475) 546 127.4 2.9e-29 >>CCDS41669.1 TM7SF2 gene_id:7108|Hs108|chr11 (418 aa) initn: 2854 init1: 2854 opt: 2854 Z-score: 3341.0 bits: 627.2 E(32554): 9e-180 Smith-Waterman score: 2854; 100.0% identity (100.0% similar) in 418 aa overlap (1-418:1-418) 10 20 30 40 50 60 pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLINATG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLINATG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 YYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 YYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA 310 320 330 340 350 360 370 380 390 400 410 pF1KB3 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY 370 380 390 400 410 >>CCDS60846.1 TM7SF2 gene_id:7108|Hs108|chr11 (391 aa) initn: 2002 init1: 2002 opt: 2002 Z-score: 2344.1 bits: 442.6 E(32554): 3e-124 Smith-Waterman score: 2608; 93.5% identity (93.5% similar) in 418 aa overlap (1-418:1-391) 10 20 30 40 50 60 pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASLPGLEVLWSP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 RALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPINGFQALVLTALLVGLGMSAG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 LPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSALAPGGNSGNPIYDFFLGRELN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 PRICFFDFKYFCELRPGLIGWVLINLALLMKEAELRGSPSLAMWLVNGFQLLYVGDALWH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLINATG ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 EEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFLLHHPQPLGLPMASVICLIN--- 250 260 270 280 290 310 320 330 340 350 360 pF1KB3 YYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA :::::::::::::::::::::::::::::::::::: CCDS60 ------------------------GLETISTATGRKLLVSGWWGMVRHPNYLGDLIMALA 300 310 320 330 370 380 390 400 410 pF1KB3 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 WSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY 340 350 360 370 380 390 >>CCDS1545.1 LBR gene_id:3930|Hs108|chr1 (615 aa) initn: 1419 init1: 1081 opt: 1676 Z-score: 1960.0 bits: 372.2 E(32554): 7.5e-103 Smith-Waterman score: 1676; 59.0% identity (81.0% similar) in 410 aa overlap (11-418:207-615) 10 20 30 40 pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSG ::::: :. ... ::. .: ::: .. CCDS15 LKEIDSKEEKYVAKELAVRTFEVTPIRAKDLEFGGVPGVFLIMFGLPVFLFLLLLMCKQK 180 190 200 210 220 230 50 60 70 80 90 100 pF1KB3 PARLLGPPASLPGLEVLWSPRALLLWLAWLGLQAALYLLPARKVAEGQELKDKSRLRYPI ::. : ::.: :: :.. ..: :. .:. .:::: ::.:: : : ::.: . CCDS15 DPSLLNFPPPLPALYELWETRVFGVYLLWFLIQVLFYLLPIGKVVEGTPLIDGRRLKYRL 240 250 260 270 280 290 110 120 130 140 150 160 pF1KB3 NGFQALVLTALLVGLGMSAGLPLGALPEMLLPLAFVATLTAFIFSLFLYMKAQVAPVSAL ::: :..::. ..: .. :. . . .: .:..::. ..:..:::.. :: . : CCDS15 NGFYAFILTSAVIGTSLFQGVEFHYVYSHFLQFALAATVFCVVLSVYLYMRSLKAPRNDL 300 310 320 330 340 350 170 180 190 200 210 pF1KB3 APGGNSGNPIYDFFLGRELNPRICFFDFKYFCELRPGLIGWVLINLALLMKEAEL--RGS .:. .::: .::::.:::::::: ::.::::::::::::::.:::..:. : .. :. CCDS15 SPA-SSGNAVYDFFIGRELNPRIGTFDLKYFCELRPGLIGWVVINLVMLLAEMKIQDRAV 360 370 380 390 400 410 220 230 240 250 260 270 pF1KB3 PSLAMWLVNGFQLLYVGDALWHEEAVLTTMDITHDGFGFMLAFGDMAWVPFTYSLQAQFL ::::: :::.:::::: ::::.:::.:::::: ::::::::::::..:::: ::.:: .: CCDS15 PSLAMILVNSFQLLYVVDALWNEEALLTTMDIIHDGFGFMLAFGDLVWVPFIYSFQAFYL 420 430 440 450 460 470 280 290 300 310 320 330 pF1KB3 LHHPQPLGLPMASVICLINATGYYIFRGANSQKNTFRKNPSDPRVAGLETISTATGRKLL . ::. .. ::::.: ... :: ::::::::::.::::::::..: :.:: :.::..:: CCDS15 VSHPNEVSWPMASLIIVLKLCGYVIFRGANSQKNAFRKNPSDPKLAHLKTIHTSTGKNLL 480 490 500 510 520 530 340 350 360 370 380 390 pF1KB3 VSGWWGMVRHPNYLGDLIMALAWSLPCGVSHLLPYFYLLYFTALLVHREARDERQCLQKY ::::::.::::::::::::::::::::: .:.:::::..::: :::::::::: .: .:: CCDS15 VSGWWGFVRHPNYLGDLIMALAWSLPCGFNHILPYFYIIYFTMLLVHREARDEYHCKKKY 540 550 560 570 580 590 400 410 pF1KB3 GLAWQEYCRRVPYRIMPYIY :.::..::.::::::.:::: CCDS15 GVAWEKYCQRVPYRIFPYIY 600 610 >>CCDS8200.1 DHCR7 gene_id:1717|Hs108|chr11 (475 aa) initn: 864 init1: 427 opt: 546 Z-score: 638.9 bits: 127.4 E(32554): 2.9e-29 Smith-Waterman score: 901; 37.6% identity (63.8% similar) in 431 aa overlap (22-418:46-475) 10 20 30 40 50 pF1KB3 MAPTQGPRAPLEFGGPLGAAALLLLLPATMFHLLLAARSGPARLLGPPASL :::. : ......: . : :: ... CCDS82 DGVTNDRTASQGQWGRAWEVDWFSLASVIFLLLFAPFIVYYFIMACDQYSCALTGPVVDI 20 30 40 50 60 70 60 70 80 90 pF1KB3 -PG---LEVLW--SP----RALLLWLAWLGLQAALY---------LLPAR--KVAEGQEL : : .: .: .: :. :. .:. :: .::. . :: CCDS82 VTGHARLSDIWAKTPPITRKAAQLYTLWVTFQVLLYTSLPDFCHKFLPGYVGGIQEGAVT 80 90 100 110 120 130 100 110 120 130 140 pF1KB3 KDKSRLRYPINGFQALVLTALL--VGLGMSAGLPLGALPEMLLPLAFVATLTAFIFSLFL .: :::.:: .:: :: .. . . . . . .:: . :.. .. : : CCDS82 PAGVVNKYQINGLQAWLLTHLLWFANAHLLSWFSPTIIFDNWIPLLWCANILGYAVSTFA 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB3 YMKAQVAPVSALAPGGNSGNPIYDFFLGRELNPRIC-FFDFKYFCELRPGLIGWVLINLA ..:. :.:: .:: .:....: :.:::: .:::: : . :::...:.::::. CCDS82 MVKGYFFPTSA-RDCKFTGNFFYNYMMGIEFNPRIGKWFDFKLFFNGRPGIVAWTLINLS 200 210 220 230 240 250 210 220 230 240 250 260 pF1KB3 LLMKEAELRGSPSLAMWLVNGFQLLYVGDALWHEEAVLTTMDITHDGFGFMLAFGDMAWV . :. ::.. . :: ::: .: .:: : .:.: : :.:: :: ::..:..:: .:. CCDS82 FAAKQRELHSHVTNAMVLVNVLQAIYVIDFFWNETWYLKTIDICHDHFGWYLGWGDCVWL 260 270 280 290 300 310 270 280 290 300 310 320 pF1KB3 PFTYSLQAQFLLHHPQPLGLPMASVICLINATGYYIFRGANSQKNTFRKNPSDPRVAGLE :. :.::. .:..:: :. : : . :.. .:::::: :: ::. ::.. . . : . CCDS82 PYLYTLQGLYLVYHPVQLSTPHAVGVLLLGLVGYYIFRVANHQKDLFRRTDGRCLIWGRK 320 330 340 350 360 370 330 340 350 360 370 pF1KB3 ------TISTATGR----KLLVSGWWGMVRHPNYLGDLIMALAWSLPCGVSHLLPYFYLL . ..: :. ::::::.::..:: ::.:::. .::. : :: .:::::::.. CCDS82 PKVIECSYTSADGQRHHSKLLVSGFWGVARHFNYVGDLMGSLAYCLACGGGHLLPYFYII 380 390 400 410 420 430 380 390 400 410 pF1KB3 YFTALLVHREARDERQCLQKYGLAWQEYCRRVPYRIMPYIY :.. ::.:: :::..: .::: :..: ::::..: :. CCDS82 YMAILLTHRCLRDEHRCASKYGRDWERYTAAVPYRLLPGIF 440 450 460 470 418 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 16 14:50:25 2017 done: Thu Nov 16 14:50:25 2017 Total Scan time: 1.420 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]