FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4176, 638 aa 1>>>pF1KE4176 638 - 638 aa - 638 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.8269+/-0.000941; mu= 7.1989+/- 0.057 mean_var=318.3040+/-65.409, 0's: 0 Z-trim(115.5): 48 B-trim: 0 in 0/54 Lambda= 0.071887 statistics sampled from 16000 (16041) to 16000 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.792), E-opt: 0.2 (0.493), width: 16 Scan time: 4.380 The best scores are: opt bits E(32554) CCDS32737.1 TRIM47 gene_id:91107|Hs108|chr17 ( 638) 4439 474.2 2.4e-133 CCDS11171.1 TRIM16 gene_id:10626|Hs108|chr17 ( 564) 624 78.5 2.8e-14 >>CCDS32737.1 TRIM47 gene_id:91107|Hs108|chr17 (638 aa) initn: 4439 init1: 4439 opt: 4439 Z-score: 2507.6 bits: 474.2 E(32554): 2.4e-133 Smith-Waterman score: 4439; 99.8% identity (99.8% similar) in 638 aa overlap (1-638:1-638) 10 20 30 40 50 60 pF1KE4 MDGSGPFSCPICLEPLREPVTLPCGHNFCLACLGALWPHRGASGAGGPGGAARCPLCQEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MDGSGPFSCPICLEPLREPVTLPCGHNFCLACLGALWPHRGASGAGGPGGAARCPLCQEP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 FPDGLQLRKNHTLSELLQLRQGSGPGSGPGPAPALAPEPSAPSALPSVPEPSAPCAPEPW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 FPDGLQLRKNHTLSELLQLRQGSGPGSGPGPAPALAPEPSAPSALPSVPEPSAPCAPEPW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 PAGEEPVRCDACPEGAALPAALSCLSCLASFCPAHLGPHERSPALRGHRLVPPLRRLEES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 PAGEEPVRCDACPEGAALPAALSCLSCLASFCPAHLGPHERSPALRGHRLVPPLRRLEES 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 LCPRHLWPLERYCRAERVCLCEACAAQEHRGHELVPLEQERALQEAEQSKVLSAVEDRMD :::::: ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 LCPRHLRPLERYCRAERVCLCEACAAQEHRGHELVPLEQERALQEAEQSKVLSAVEDRMD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ELGAGIAQSRRTVALIKSAAVAERERVSRLFADAAAALQGFQTQVLGFIEEGEAAMLGRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 ELGAGIAQSRRTVALIKSAAVAERERVSRLFADAAAALQGFQTQVLGFIEEGEAAMLGRS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 QGDLRRQEEQRSRLSRARQNLSQVPEADSVSFLQELLALRLALEDGCGPGPGPPRELSFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 QGDLRRQEEQRSRLSRARQNLSQVPEADSVSFLQELLALRLALEDGCGPGPGPPRELSFT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 KSSQAVRAVRDMLAVACVNQWEQLRGPGGNEDGPQKLDSEADAEPQDLESTNLLESEAPR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KSSQAVRAVRDMLAVACVNQWEQLRGPGGNEDGPQKLDSEADAEPQDLESTNLLESEAPR 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 DYFLKFAYIVDLDSDTADKFLQLFGTKGVKRVLCPINYPLSPTRFTHCEQVLGEGALDRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 DYFLKFAYIVDLDSDTADKFLQLFGTKGVKRVLCPINYPLSPTRFTHCEQVLGEGALDRG 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE4 TYYWEVEIIEGWVSMGVMAEDFSPQEPYDRGRLGRNAHSCCLQWNGRSFSVWFHGLEAPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 TYYWEVEIIEGWVSMGVMAEDFSPQEPYDRGRLGRNAHSCCLQWNGRSFSVWFHGLEAPL 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE4 PHPFSPTVGVCLEYADRALAFYAVRDGKMSLLRRLKASRPRRGGIPASPIDPFQSRLDSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 PHPFSPTVGVCLEYADRALAFYAVRDGKMSLLRRLKASRPRRGGIPASPIDPFQSRLDSH 550 560 570 580 590 600 610 620 630 pF1KE4 FAGLFTHRLKPAFFLESVDAHLQIGPLKKSCISVLKRR :::::::::::::::::::::::::::::::::::::: CCDS32 FAGLFTHRLKPAFFLESVDAHLQIGPLKKSCISVLKRR 610 620 630 >>CCDS11171.1 TRIM16 gene_id:10626|Hs108|chr17 (564 aa) initn: 604 init1: 203 opt: 624 Z-score: 369.9 bits: 78.5 E(32554): 2.8e-14 Smith-Waterman score: 629; 27.9% identity (56.9% similar) in 541 aa overlap (73-580:1-529) 50 60 70 80 90 pF1KE4 SGAGGPGGAARCPLCQEPFPDGLQLRKNHTLSELLQLRQGSGP-GSGPGPAP----ALAP ..:: . : : ... ::: . .: CCDS11 MAELDLMAPGPLPRATAQPPAPLSPDSGSP 10 20 30 100 110 120 130 140 pF1KE4 EPSAPSALP--------------SVPEPSAPCAPEPWPAGE-EPVRCDACPEGAA-LPAA :.. :: : . : .. : . :::: . : :: : . . . :. CCDS11 SPDSGSASPVEEEDVGSSEKLGRETEEQDSDSAEQGDPAGEGKEVLCDFCLDDTRRVKAV 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE4 LSCLSCLASFCPAHLGPHERSPALRGHRLVPPLRRLEESLCPRHLWPLERYCRAERVCLC :::.:....: :: ::. . :..: :. :.. . :: : :: .: .. :.: CCDS11 KSCLTCMVNYCEEHLQPHQVNIKLQSHLLTEPVKDHNWRYCPAHHSPLSAFCCPDQQCIC 100 110 120 130 140 150 210 220 230 240 250 pF1KE4 EACAAQEHRGHELVPLEQERALQEAEQSKVLSAVEDRM--DELGAGIAQSRRTVALIKSA . : ::: :: .: :. : .::: . . .: .. .: . . :. . .:. CCDS11 QDCC-QEHSGHTIVSLDAARRDKEAELQCTQLDLERKLKLNENAISRLQANQKSVLV--- 160 170 180 190 200 260 270 280 290 300 310 pF1KE4 AVAERERVSRL-FADAAAALQGFQTQVLGFIEEGEAAMLGRSQGDLRRQEEQRS-RLSRA .:.: . :... :.. ::.. :..:. :.:: : : :....: .. . : :: .. .. CCDS11 SVSEVKAVAEMQFGELLAAVRKAQANVMLFLEEKEQAALSQANG-IKAHLEYRSAEMEKS 210 220 230 240 250 260 320 330 340 350 360 370 pF1KE4 RQNLSQVPE-ADSVSFLQELLALRLALEDGCGPGPGPPRELSFTKSSQAVRAVRDMLAVA .:.: .. ...:.::.: .. :: :. ... . ...: : .: CCDS11 KQELERMAAISNTVQFLEEYCKFK-NTEDITFPSV----YVGLKDKLSGIRKVITESTVH 270 280 290 300 310 320 380 390 400 410 420 430 pF1KE4 CVNQWEQLRGPGGNEDGPQKLD--SEADAEPQDLESTNLLESEAPRDYFLKFAYIVDLDS .. :. . . . .. : ....: : :. : . :. ::..:: . .: CCDS11 LIQLLENYKKKLQEFSKEEEYDIRTQVSAVVQRKYWTSKPEP-STREQFLQYAYDITFDP 330 340 350 360 370 440 450 460 470 480 490 pF1KE4 DTADKFLQLFGTKGVKRVLCPIN--YPLSPTRFTHCEQVLGEGALDRGTYYWEVEIIEGW ::: :.:.: . : . :: :.:: : .:::.. .: ::.::::. . CCDS11 DTAHKYLRLQEENRKVTNTTPWEHPYPDLPSRFLHWRQVLSQQSLYLHRYYFEVEIFGAG 380 390 400 410 420 430 500 510 520 530 540 550 pF1KE4 VSMGVMAEDFSPQEPYDRGRLGRNAHSCCLQWNGRSFSVWFHGLEAPLPH-PFSPTVGVC . .:. . .. . . .. : : :::::. :..:. .:.:: :: .:: CCDS11 TYVGLTCKGIDRKGEERNSCISGNNFSWSLQWNGKEFTAWYSDMETPLKAGPFR-RLGVY 440 450 460 470 480 490 560 570 580 590 600 pF1KE4 LEYADRALAFYAVRDGKMSLLRRL--KASRPRRGGIPASPIDPFQSRLDSHFAGLFTHRL ... :.::.:. :.:.... : :.: CCDS11 IDFPGGILSFYGVEYDTMTLVHKFACKFSEPVYAAFWLSKKENAIRIVDLGEEPEKPAPS 500 510 520 530 540 550 610 620 630 pF1KE4 KPAFFLESVDAHLQIGPLKKSCISVLKRR CCDS11 LVGTAP 560 638 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 22:54:38 2016 done: Sat Nov 5 22:54:39 2016 Total Scan time: 4.380 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]