FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9568, 398 aa 1>>>pF1KE9568 398 - 398 aa - 398 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0537+/-0.00133; mu= 12.5991+/- 0.079 mean_var=169.9780+/-79.602, 0's: 0 Z-trim(101.5): 320 B-trim: 847 in 2/46 Lambda= 0.098373 statistics sampled from 5988 (6559) to 5988 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.537), E-opt: 0.2 (0.201), width: 16 Scan time: 2.640 The best scores are: opt bits E(32554) CCDS6311.1 TRHR gene_id:7201|Hs108|chr8 ( 398) 2615 384.6 8.5e-107 CCDS2486.1 NMUR1 gene_id:10316|Hs108|chr2 ( 426) 448 77.1 3.4e-14 CCDS13502.1 NTSR1 gene_id:4923|Hs108|chr20 ( 418) 393 69.3 7.5e-12 >>CCDS6311.1 TRHR gene_id:7201|Hs108|chr8 (398 aa) initn: 2615 init1: 2615 opt: 2615 Z-score: 2030.8 bits: 384.6 E(32554): 8.5e-107 Smith-Waterman score: 2615; 100.0% identity (100.0% similar) in 398 aa overlap (1-398:1-398) 10 20 30 40 50 60 pF1KE9 MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNIMVVLVVMRTKHMRTPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNIMVVLVVMRTKHMRTPT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 NCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 NCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 IERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 IERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 YKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 YKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 QNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 QNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENW 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE9 FLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKLCNCKQKPTEKPANYSVALNYSVIKES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 FLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKLCNCKQKPTEKPANYSVALNYSVIKES 310 320 330 340 350 360 370 380 390 pF1KE9 DHFSTELDDITVTDTYLSATKVSFDDTCLASEVSFSQS :::::::::::::::::::::::::::::::::::::: CCDS63 DHFSTELDDITVTDTYLSATKVSFDDTCLASEVSFSQS 370 380 390 >>CCDS2486.1 NMUR1 gene_id:10316|Hs108|chr2 (426 aa) initn: 367 init1: 150 opt: 448 Z-score: 368.3 bits: 77.1 E(32554): 3.4e-14 Smith-Waterman score: 449; 29.5% identity (57.4% similar) in 376 aa overlap (3-351:41-390) 10 20 30 pF1KE9 MENETVSELNQTQLQPRAVVALEYQVVTILLV : : : : :. . . .: ::. CCDS24 LPGDLYPGGARNPMACNGSAARGHFDPEDLNLTDEALRLKYLGPQQTELFMPICATYLLI 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE9 LIICGLGIVGNIMVVLVVMRTKHMRTPTNCYLVSLAVADLMVLVAAGLPNITDSIYGSW- ... : ::: .. ::..: : :::::: :: ::::.::.::.. ::: .: : CCDS24 FVV---GAVGNGLTCLVILRHKAMRTPTNYYLFSLAVSDLLVLLV-GLPL---ELYEMWH 80 90 100 110 120 100 110 120 130 140 pF1KE9 ----VYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIII . : :: : : . :: ..::...:::.:. ::..:. . : ....... CCDS24 NYPFLLGVGGCYFRTLLFEMVCLASVLNVTALSVERYVAVVHPLQARSMVTRAHVRRVLG 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE9 FVWAFTSLYCMLWFFLL----DLNI---STYKDAIVISCGYKISRNYYSPIYLMDFGVFY ::.. .. : : : .:.. . :. : : : :. . .:. CCDS24 AVWGL-AMLCSLPNTSLHGIRQLHVPCRGPVPDSAV--CMLVRPRALYNMVVQTTALLFF 190 200 210 220 230 240 210 220 230 240 250 pF1KE9 VVPMILATVLYGFIA------RILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCF .:: . .::: .:. :.:.. ...: . .... . .. .: CCDS24 CLPMAIMSVLYLLIGLRLRRERLLLM--------QEAKGRGSAAARSRYTCRLQQHDR-- 250 260 270 280 290 260 270 280 290 300 pF1KE9 NSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENWFLLFCRICI----- .:.:::::: :.:..:.. : :... :. : .:. . .. : : .. . CCDS24 -----GRRQVTKMLFVLVVVFGICWAPFHADRVMWSVVSQ-WTDGLHLAFQHVHVISGIF 300 310 320 330 340 310 320 330 340 350 360 pF1KE9 -YLNSAINPVIYNLMSQKFRAAFRK-LC--NCKQKPTEKPANYSVALNYSVIKESDHFST ::.:: :::.:.:::..:: .:.. :: : .. . ...:.. CCDS24 FYLGSAANPVLYSLMSSRFRETFQEALCLGACCHRLRPRHSSHSLSRMTTGSTLCDVGSL 350 360 370 380 390 400 370 380 390 pF1KE9 ELDDITVTDTYLSATKVSFDDTCLASEVSFSQS CCDS24 GSWVHPLAGNDGPEAQQETDPS 410 420 >>CCDS13502.1 NTSR1 gene_id:4923|Hs108|chr20 (418 aa) initn: 323 init1: 186 opt: 393 Z-score: 326.2 bits: 69.3 E(32554): 7.5e-12 Smith-Waterman score: 393; 24.6% identity (59.4% similar) in 362 aa overlap (3-346:43-394) 10 20 30 pF1KE9 MENETVSELNQTQLQPRAVVALEYQVVTILLV .: : ...:. . . . :... :. CCDS13 TPAADPFQRAQAGLEEALLAPGFGNASGNASERVLAAPSSELDVNTDIYSKVLVTAVYLA 20 30 40 50 60 70 40 50 60 70 80 pF1KE9 LIICGLGIVGNIMVVLVVMRTKHMRT---PTNCYLVSLAVADLMVLVAAGLPNITDSIY- :.. .: ::: ...... : : ... .. .: :::..::..:. : .. . :. CCDS13 LFV--VGTVGNTVTAFTLARKKSLQSLQSTVHYHLGSLALSDLLTLLLAMPVELYNFIWV 80 90 100 110 120 130 90 100 110 120 130 140 pF1KE9 -GSWVYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIII :..: .:: .:. :.. .......:::.:::::.::. : . ::.::.: CCDS13 HHPWAFGDAGCRGYYFLRDACTYATALNVASLSVERYLAICHPFKAKTLMSRSRTKKFIS 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE9 FVWAFTSLYCMLWFFLL-DLNISTY-KDAIVISCGYKISRNYYSPIYLMDFGVFYVVPMI .: ..: . .: . . : :. . : . : : . . .. . .. ::. CCDS13 AIWLASALLAVPMLFTMGEQNRSADGQHAGGLVCTPTIHTATVKVVIQVNTFMSFIFPMV 200 210 220 230 240 250 210 220 230 240 250 260 pF1KE9 LATVLYGFIARILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCFNSTVSSRKQVT . .:: .:: : . . . :.... . :.. .. .. . :.. .. . CCDS13 VISVLNTIIANKLTV--MVRQAAEQGQVCTVGGEHSTFSMAIEPGR------VQALRHGV 260 270 280 290 300 270 280 290 300 310 pF1KE9 KMLAVVVILFALLWMPYRTLVVVNSFLS----SPFQENWFLLFCRIC---IYLNSAINPV ..: .::: :.. :.::.. .. ..: .:: ... : . .:..:.:::. CCDS13 RVLRAVVIAFVVCWLPYHVRRLMFCYISDEQWTPFLYDFYHYFYMVTNALFYVSSTINPI 310 320 330 340 350 360 320 330 340 350 360 370 pF1KE9 IYNLMSQKFRAAFRK----LCNCKQKPTEKPANYSVALNYSVIKESDHFSTELDDITVTD .:::.: .:: : :: .. ..:: CCDS13 LYNLVSANFRHIFLATLACLCPVWRRRRKRPAFSRKADSVSSNHTLSSNATRETLY 370 380 390 400 410 380 390 pF1KE9 TYLSATKVSFDDTCLASEVSFSQS 398 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 16:04:21 2016 done: Sun Nov 6 16:04:22 2016 Total Scan time: 2.640 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]