FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9568, 398 aa
1>>>pF1KE9568 398 - 398 aa - 398 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.0537+/-0.00133; mu= 12.5991+/- 0.079
mean_var=169.9780+/-79.602, 0's: 0 Z-trim(101.5): 320 B-trim: 847 in 2/46
Lambda= 0.098373
statistics sampled from 5988 (6559) to 5988 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.537), E-opt: 0.2 (0.201), width: 16
Scan time: 2.640
The best scores are:                                opt  bits E(32554)
CCDS6311.1 TRHR gene_id:7201|Hs108|chr8     ( 398) 2615 384.6 8.5e-107
CCDS2486.1 NMUR1 gene_id:10316|Hs108|chr2   ( 426)  448  77.1 3.4e-14
CCDS13502.1 NTSR1 gene_id:4923|Hs108|chr20  ( 418)  393  69.3 7.5e-12
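
The best-scores rows above can be read into records with a small parser. This is a minimal sketch, not part of the FASTA distribution: the function name parse_best_scores and the field names are hypothetical, and the row layout (description, parenthesized library sequence length, opt, bits, E-value) is assumed from the three rows shown in this report.

```python
import re

# Assumed row layout, inferred from this report only:
#   <accession> <description> ( <length>) <opt> <bits> <E-value>
# The pattern tolerates any amount of whitespace between columns.
ROW = re.compile(
    r"^(?P<desc>\S.*?)\s+\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)

def parse_best_scores(lines):
    """Return one dict per hit row; lines that do not match are skipped."""
    hits = []
    for line in lines:
        m = ROW.match(line)
        if m is None:
            continue  # header line or anything else that is not a hit row
        accession, _, annotation = m.group("desc").partition(" ")
        hits.append({
            "accession": accession,
            "annotation": annotation,
            "length": int(m.group("length")),
            "opt": int(m.group("opt")),
            "bits": float(m.group("bits")),
            "evalue": float(m.group("evalue")),
        })
    return hits

SAMPLE = [
    "The best scores are: opt bits E(32554)",
    "CCDS6311.1 TRHR gene_id:7201|Hs108|chr8 ( 398) 2615 384.6 8.5e-107",
    "CCDS2486.1 NMUR1 gene_id:10316|Hs108|chr2 ( 426) 448 77.1 3.4e-14",
    "CCDS13502.1 NTSR1 gene_id:4923|Hs108|chr20 ( 418) 393 69.3 7.5e-12",
]

for hit in parse_best_scores(SAMPLE):
    print(hit["accession"], hit["opt"], hit["bits"], hit["evalue"])
```

The header line is skipped because it has no parenthesized length followed by numeric columns.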
>>CCDS6311.1 TRHR gene_id:7201|Hs108|chr8 (398 aa)
initn: 2615 init1: 2615 opt: 2615 Z-score: 2030.8 bits: 384.6 E(32554): 8.5e-107
Smith-Waterman score: 2615; 100.0% identity (100.0% similar) in 398 aa overlap (1-398:1-398)
               10        20        30        40        50        60
pF1KE9 MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNIMVVLVVMRTKHMRTPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 MENETVSELNQTQLQPRAVVALEYQVVTILLVLIICGLGIVGNIMVVLVVMRTKHMRTPT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE9 NCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 NCYLVSLAVADLMVLVAAGLPNITDSIYGSWVYGYVGCLCITYLQYLGINASSCSITAFT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE9 IERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 IERYIAICHPIKAQFLCTFSRAKKIIIFVWAFTSLYCMLWFFLLDLNISTYKDAIVISCG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE9 YKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 YKISRNYYSPIYLMDFGVFYVVPMILATVLYGFIARILFLNPIPSDPKENSKTWKNDSTH
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE9 QNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 QNTNLNVNTSNRCFNSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENW
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE9 FLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKLCNCKQKPTEKPANYSVALNYSVIKES
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 FLLFCRICIYLNSAINPVIYNLMSQKFRAAFRKLCNCKQKPTEKPANYSVALNYSVIKES
              310       320       330       340       350       360

              370       380       390
pF1KE9 DHFSTELDDITVTDTYLSATKVSFDDTCLASEVSFSQS
       ::::::::::::::::::::::::::::::::::::::
CCDS63 DHFSTELDDITVTDTYLSATKVSFDDTCLASEVSFSQS
              370       380       390
>>CCDS2486.1 NMUR1 gene_id:10316|Hs108|chr2 (426 aa)
initn: 367 init1: 150 opt: 448 Z-score: 368.3 bits: 77.1 E(32554): 3.4e-14
Smith-Waterman score: 449; 29.5% identity (57.4% similar) in 376 aa overlap (3-351:41-390)
10 20 30
pF1KE9 MENETVSELNQTQLQPRAVVALEYQVVTILLV
: : : : :. . . .: ::.
CCDS24 LPGDLYPGGARNPMACNGSAARGHFDPEDLNLTDEALRLKYLGPQQTELFMPICATYLLI
20 30 40 50 60 70
40 50 60 70 80 90
pF1KE9 LIICGLGIVGNIMVVLVVMRTKHMRTPTNCYLVSLAVADLMVLVAAGLPNITDSIYGSW-
... : ::: .. ::..: : :::::: :: ::::.::.::.. ::: .: :
CCDS24 FVV---GAVGNGLTCLVILRHKAMRTPTNYYLFSLAVSDLLVLLV-GLPL---ELYEMWH
80 90 100 110 120
100 110 120 130 140
pF1KE9 ----VYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIII
. : :: : : . :: ..::...:::.:. ::..:. . : .......
CCDS24 NYPFLLGVGGCYFRTLLFEMVCLASVLNVTALSVERYVAVVHPLQARSMVTRAHVRRVLG
130 140 150 160 170 180
150 160 170 180 190 200
pF1KE9 FVWAFTSLYCMLWFFLL----DLNI---STYKDAIVISCGYKISRNYYSPIYLMDFGVFY
::.. .. : : : .:.. . :. : : : :. . .:.
CCDS24 AVWGL-AMLCSLPNTSLHGIRQLHVPCRGPVPDSAV--CMLVRPRALYNMVVQTTALLFF
190 200 210 220 230 240
210 220 230 240 250
pF1KE9 VVPMILATVLYGFIA------RILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCF
.:: . .::: .:. :.:.. ...: . .... . .. .:
CCDS24 CLPMAIMSVLYLLIGLRLRRERLLLM--------QEAKGRGSAAARSRYTCRLQQHDR--
250 260 270 280 290
260 270 280 290 300
pF1KE9 NSTVSSRKQVTKMLAVVVILFALLWMPYRTLVVVNSFLSSPFQENWFLLFCRICI-----
.:.:::::: :.:..:.. : :... :. : .:. . .. : : .. .
CCDS24 -----GRRQVTKMLFVLVVVFGICWAPFHADRVMWSVVSQ-WTDGLHLAFQHVHVISGIF
300 310 320 330 340
310 320 330 340 350 360
pF1KE9 -YLNSAINPVIYNLMSQKFRAAFRK-LC--NCKQKPTEKPANYSVALNYSVIKESDHFST
::.:: :::.:.:::..:: .:.. :: : .. . ...:..
CCDS24 FYLGSAANPVLYSLMSSRFRETFQEALCLGACCHRLRPRHSSHSLSRMTTGSTLCDVGSL
350 360 370 380 390 400
370 380 390
pF1KE9 ELDDITVTDTYLSATKVSFDDTCLASEVSFSQS
CCDS24 GSWVHPLAGNDGPEAQQETDPS
410 420
>>CCDS13502.1 NTSR1 gene_id:4923|Hs108|chr20 (418 aa)
initn: 323 init1: 186 opt: 393 Z-score: 326.2 bits: 69.3 E(32554): 7.5e-12
Smith-Waterman score: 393; 24.6% identity (59.4% similar) in 362 aa overlap (3-346:43-394)
10 20 30
pF1KE9 MENETVSELNQTQLQPRAVVALEYQVVTILLV
.: : ...:. . . . :... :.
CCDS13 TPAADPFQRAQAGLEEALLAPGFGNASGNASERVLAAPSSELDVNTDIYSKVLVTAVYLA
20 30 40 50 60 70
40 50 60 70 80
pF1KE9 LIICGLGIVGNIMVVLVVMRTKHMRT---PTNCYLVSLAVADLMVLVAAGLPNITDSIY-
:.. .: ::: ...... : : ... .. .: :::..::..:. : .. . :.
CCDS13 LFV--VGTVGNTVTAFTLARKKSLQSLQSTVHYHLGSLALSDLLTLLLAMPVELYNFIWV
80 90 100 110 120 130
90 100 110 120 130 140
pF1KE9 -GSWVYGYVGCLCITYLQYLGINASSCSITAFTIERYIAICHPIKAQFLCTFSRAKKIII
:..: .:: .:. :.. .......:::.:::::.::. : . ::.::.:
CCDS13 HHPWAFGDAGCRGYYFLRDACTYATALNVASLSVERYLAICHPFKAKTLMSRSRTKKFIS
140 150 160 170 180 190
150 160 170 180 190 200
pF1KE9 FVWAFTSLYCMLWFFLL-DLNISTY-KDAIVISCGYKISRNYYSPIYLMDFGVFYVVPMI
.: ..: . .: . . : :. . : . : : . . .. . .. ::.
CCDS13 AIWLASALLAVPMLFTMGEQNRSADGQHAGGLVCTPTIHTATVKVVIQVNTFMSFIFPMV
200 210 220 230 240 250
210 220 230 240 250 260
pF1KE9 LATVLYGFIARILFLNPIPSDPKENSKTWKNDSTHQNTNLNVNTSNRCFNSTVSSRKQVT
. .:: .:: : . . . :.... . :.. .. .. . :.. .. .
CCDS13 VISVLNTIIANKLTV--MVRQAAEQGQVCTVGGEHSTFSMAIEPGR------VQALRHGV
260 270 280 290 300
270 280 290 300 310
pF1KE9 KMLAVVVILFALLWMPYRTLVVVNSFLS----SPFQENWFLLFCRIC---IYLNSAINPV
..: .::: :.. :.::.. .. ..: .:: ... : . .:..:.:::.
CCDS13 RVLRAVVIAFVVCWLPYHVRRLMFCYISDEQWTPFLYDFYHYFYMVTNALFYVSSTINPI
310 320 330 340 350 360
320 330 340 350 360 370
pF1KE9 IYNLMSQKFRAAFRK----LCNCKQKPTEKPANYSVALNYSVIKESDHFSTELDDITVTD
.:::.: .:: : :: .. ..::
CCDS13 LYNLVSANFRHIFLATLACLCPVWRRRRKRPAFSRKADSVSSNHTLSSNATRETLY
370 380 390 400 410
380 390
pF1KE9 TYLSATKVSFDDTCLASEVSFSQS
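
Each alignment above is preceded by a "Smith-Waterman score:" line giving the score, percent identity, percent similarity, overlap length, and the aligned coordinate ranges. The sketch below pulls those fields out with a regular expression. The name parse_sw_summary and the dictionary keys are hypothetical, and the pattern is assumed from the three summary lines in this report (protein searches, "aa overlap"); other FASTA versions or DNA searches may word the line differently.

```python
import re

# Pattern assumed from the summary lines in this report, e.g.
# "Smith-Waterman score: 449; 29.5% identity (57.4% similar)
#  in 376 aa overlap (3-351:41-390)"
SW_SUMMARY = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<identity>[\d.]+)% identity\s*\((?P<similar>[\d.]+)% similar\)\s*"
    r"in\s+(?P<overlap>\d+) aa overlap\s*"
    r"\((?P<q_start>\d+)-(?P<q_end>\d+):(?P<s_start>\d+)-(?P<s_end>\d+)\)"
)

def parse_sw_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SW_SUMMARY.search(line)
    if m is None:
        return None
    return {
        "score": int(m.group("score")),
        "identity_pct": float(m.group("identity")),
        "similar_pct": float(m.group("similar")),
        "overlap": int(m.group("overlap")),
        "query_range": (int(m.group("q_start")), int(m.group("q_end"))),
        "subject_range": (int(m.group("s_start")), int(m.group("s_end"))),
    }

line = ("Smith-Waterman score: 449; 29.5% identity (57.4% similar) "
        "in 376 aa overlap (3-351:41-390)")
print(parse_sw_summary(line))
```

The overlap length counts alignment columns including gaps, so it can exceed the span of either coordinate range, as it does for the NMUR1 hit (376 columns over query residues 3-351).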
398 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 16:04:21 2016 done: Sun Nov 6 16:04:22 2016
Total Scan time: 2.640 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]
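
The E(32554) values in this report can be cross-checked against the reported Z-scores. The sketch below assumes that the Z-scores are scaled to a mean of 50 and a standard deviation of 10 and follow an extreme-value distribution, so that P(Z) = 1 - exp(-exp(-(pi/sqrt(6)) * z - 0.5772)) with z = (Z - 50) / 10, and E = N * P(Z), where N is the number of library sequences (32554 here). The function name evalue_from_zscore is hypothetical, and the formula is an assumption about how the program derives E(), not something stated in the output itself.

```python
import math

EULER_GAMMA = 0.5772156649015329

def evalue_from_zscore(z_score, n_library):
    """Estimate E() from a Z-score, assuming mean 50 / sd 10 scaling
    and an extreme-value distribution of unrelated-sequence scores."""
    z_std = (z_score - 50.0) / 10.0           # back to mean 0, sd 1
    x = math.exp(-(math.pi / math.sqrt(6.0)) * z_std - EULER_GAMMA)
    p = -math.expm1(-x)                       # 1 - exp(-x), stable for tiny x
    return n_library * p

# Z-scores and library size as reported for the three hits in this run.
for name, z in [("TRHR", 2030.8), ("NMUR1", 368.3), ("NTSR1", 326.2)]:
    print(f"{name}: E ~ {evalue_from_zscore(z, 32554):.1e}")
```

Because the report rounds Z-scores to one decimal place, the recomputed values should be expected to agree with the printed E() values only to about two significant figures.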