FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8797, 333 aa
1>>>pF1KB8797 333 - 333 aa - 333 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6061+/-0.000841; mu= 13.9052+/- 0.051
mean_var=63.0041+/-13.689, 0's: 0 Z-trim(106.1): 90 B-trim: 1046 in 2/46
Lambda= 0.161581
statistics sampled from 8716 (8816) to 8716 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.637), E-opt: 0.2 (0.271), width: 16
Scan time: 2.710
The best scores are:                                  opt   bits  E(32554)
CCDS5321.1 GPR146 gene_id:115330|Hs108|chr7  ( 333)  2221  526.4  1.3e-149
CCDS5322.1 GPER1 gene_id:2852|Hs108|chr7     ( 375)   350   90.2  2.8e-18
CCDS2408.1 CXCR2 gene_id:3579|Hs108|chr2     ( 360)   248   66.5  3.8e-11
CCDS2409.1 CXCR1 gene_id:3577|Hs108|chr2     ( 350)   244   65.5  7.1e-11
>>CCDS5321.1 GPR146 gene_id:115330|Hs108|chr7 (333 aa)
initn: 2221 init1: 2221 opt: 2221 Z-score: 2799.8 bits: 526.4 E(32554): 1.3e-149
Smith-Waterman score: 2221; 100.0% identity (100.0% similar) in 333 aa overlap (1-333:1-333)
               10        20        30        40        50        60
pF1KB8 MWSCSWFNGTGLVEELPACQDLQLGLSLLSLLGLVVGVPVGLCYNALLVLANLHSKASMT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MWSCSWFNGTGLVEELPACQDLQLGLSLLSLLGLVVGVPVGLCYNALLVLANLHSKASMT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB8 MPDVYFVNMAVAGLVLSALAPVHLLGPPSSRWALWSVGGEVHVALQIPFNVSSLVAMYST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MPDVYFVNMAVAGLVLSALAPVHLLGPPSSRWALWSVGGEVHVALQIPFNVSSLVAMYST
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB8 ALLSLDHYIERALPRTYMASVYNTRHVCGFVWGGALLTSFSSLLFYICSHVSTRALECAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 ALLSLDHYIERALPRTYMASVYNTRHVCGFVWGGALLTSFSSLLFYICSHVSTRALECAK
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB8 MQNAEAADATLVFIGYVVPALATLYALVLLSRVRREDTPLDRDTGRLEPSAHRLLVATVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MQNAEAADATLVFIGYVVPALATLYALVLLSRVRREDTPLDRDTGRLEPSAHRLLVATVC
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB8 TQFGLWTPHYLILLGHTVIISRGKPVDAHYLGLLHFVKDFSKLLAFSSSFVTPLLYRYMN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 TQFGLWTPHYLILLGHTVIISRGKPVDAHYLGLLHFVKDFSKLLAFSSSFVTPLLYRYMN
              250       260       270       280       290       300
              310       320       330
pF1KB8 QSFPSKLQRLMKKLPCGDRHCSPDHMGVQQVLA
       :::::::::::::::::::::::::::::::::
CCDS53 QSFPSKLQRLMKKLPCGDRHCSPDHMGVQQVLA
              310       320       330
>>CCDS5322.1 GPER1 gene_id:2852|Hs108|chr7 (375 aa)
initn: 248 init1: 135 opt: 350 Z-score: 441.8 bits: 90.2 E(32554): 2.8e-18
Smith-Waterman score: 354; 27.3% identity (58.4% similar) in 341 aa overlap (8-332:44-360)
10 20 30
pF1KB8 MWSCSWFNGTGLVEELPACQDLQLGLSLLSLLGLVVG
:::: :: :. .:: .:: : .
CCDS53 MYPGTAQPAAPNTTSPELNLSHPLLGTALANGTG---ELSEHQQYVIGL-FLSCLYTIFL
20 30 40 50 60
40 50 60 70 80 90
pF1KB8 VPVGLCYNALLVLANLHSKASMTMPDVYFVNMAVAGLVLSALAPVHLLGPPSSRWALWSV
:.:. : :....:. . .::.::.::.:.::: :.: : . ..... . . .
CCDS53 FPIGFVGNILILVVNISFREKMTIPDLYFINLAVADLILVADSLIEVFNLHERYYDIAVL
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB8 GGEVHVALQIPFNVSSLVAMYSTALLSLDHYIERALPRTYMASVYNTRH----VCGFVWG
. . ::. :. : : . . .:.:.:: :: :.. :.. :.: ::..:
CCDS53 CTFMSLFLQV--NMYSSV--FFLTWMSFDRYI--ALARAMRCSLFRTKHHARLSCGLIW-
130 140 150 160 170 180
160 170 180 190 200 210
pF1KB8 GALLTSFSSLLFYICSHVSTRALECAKMQNAEAADATLVFIGYVVP-ALATL-YALVLLS
. . ..:. . :.. : . ... .. : .:..:: :. : :.:..
CCDS53 --MASVSATLVPFTAVHLQHTDEACFCFADVREVQWLEVTLGFIVPFAIIGLCYSLIVRV
190 200 210 220 230 240
220 230 240 250 260
pF1KB8 RVRREDTPLDRDTGRLEP---SAHRLLVATVCTQFGLWTPHYLILLGHTVIISRGKPVDA
:: . : : :.: .: :...:.: . : : :. ... : ...: .: :
CCDS53 LVRAH-----RHRG-LRPRRQKALRMILAVVLVFFVCWLPENVFISVH--LLQRTQPGAA
250 260 270 280 290
270 280 290 300 310 320
pF1KB8 HYLGLLHFVKDFS----KLLAFSSSFVTPLLYRYMNQSFPSKLQRLMKK---LPCGDRHC
.. .. .. .: :::.: ..::.: .....: .::. ... :: .: :
CCDS53 PCKQSFRHAHPLTGHIVNLAAFSNSCLNPLIYSFLGETFRDKLRLYIEQKTNLPALNRFC
300 310 320 330 340 350
330
pF1KB8 SPDHMGVQQVLA
: ... :.
CCDS53 ---HAALKAVIPDSTEQSDVRFSSAV
360 370
>>CCDS2408.1 CXCR2 gene_id:3579|Hs108|chr2 (360 aa)
initn: 92 init1: 72 opt: 248 Z-score: 313.5 bits: 66.5 E(32554): 3.8e-11
Smith-Waterman score: 248; 24.0% identity (54.8% similar) in 279 aa overlap (40-311:61-329)
10 20 30 40 50 60
pF1KB8 TGLVEELPACQDLQLGLSLLSLLGLVVGVPVGLCYNALLVLANLHSKASMTMPDVYFVNM
..: :.:..:. :.:... .. :::..:.
CCDS24 PFLLDAAPCEPESLEINKYFVVIIYALVFLLSLLGNSLVMLVILYSRVGRSVTDVYLLNL
40 50 60 70 80 90
70 80 90 100 110 120
pF1KB8 AVAGLVLSALAPVHLLGPPSSRWALWSVGGEVHVALQIPFNVSSLVAMYSTALLSLDHYI
:.: :... :. .:. : : . .... .:. .. : .:.:.:.
CCDS24 ALADLLFALTLPIW----AASKVNGWIFGTFLCKVVSLLKEVNFYSGILLLACISVDRYL
100 110 120 130 140
130 140 150 160 170 180
pF1KB8 ERA-LPRTYMASVYNTRHVCGFVWGGALLTSFSSLLFYICSHVSTRALECAK-MQNAEAA
. :: . : .. .: .:: .:: .. ::: . :. . : . : : :
CCDS24 AIVHATRTLTQKRYLVKFICLSIWGLSLLLALPVLLFRRTVYSSNVSPACYEDMGNNTAN
150 160 170 180 190 200
190 200 210 220 230 240
pF1KB8 DATLVFI-----GYVVPALATLYALVLLSRVRREDTPLDRDTGRLEPSAHRLLVATVCTQ
:. : :..:: : :. . : : . :. . : :.. :.:
CCDS24 WRMLLRILPQSFGFIVPLLIMLFCYGFTLR-----TLFKAHMGQ-KHRAMRVIFAVVLIF
210 220 230 240 250 260
250 260 270 280 290 300
pF1KB8 FGLWTPHYLILLGHTVIISRGKPVDAHYLGLLHFVKDFSKLLAFSSSFVTPLLYRYMNQS
. : :. :.::. :.. .. . . . . : ...:.. : ..::.: ...:.
CCDS24 LLCWLPYNLVLLADTLMRTQVIQETCERRNHIDRALDATEILGILHSCLNPLIYAFIGQK
270 280 290 300 310 320
310 320 330
pF1KB8 FPSKLQRLMKKLPCGDRHCSPDHMGVQQVLA
: : ...
CCDS24 FRHGLLKILAIHGLISKDSLPKDSRPSFVGSSSGHTSTTL
330 340 350 360
>>CCDS2409.1 CXCR1 gene_id:3577|Hs108|chr2 (350 aa)
initn: 90 init1: 49 opt: 244 Z-score: 308.7 bits: 65.5 E(32554): 7.1e-11
Smith-Waterman score: 244; 22.2% identity (54.2% similar) in 325 aa overlap (1-311:9-320)
10 20 30 40
pF1KB8 MWSCSWFNGTGLVEELPACQDLQ-LGLSLLSLLGLVVGVPVGLCY------N
::. . .: ::. :: .: . : .: :: . .: . :
CCDS24 MSNITDPQMWDFDDLNFTGMP---PADEDYSPCMLETETLNKYVVIIAYALVFLLSLLGN
10 20 30 40 50
50 60 70 80 90 100
pF1KB8 ALLVLANLHSKASMTMPDVYFVNMAVAGLVLSALAPVHLLGPPSSRWALWSVGGEVHVAL
.:..:. :.:... .. :::..:.:.: :... :. .:. : : . ..
CCDS24 SLVMLVILYSRVGRSVTDVYLLNLALADLLFALTLPIW----AASKVNGWIFGTFLCKVV
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB8 QIPFNVSSLVAMYSTALLSLDHYIERA-LPRTYMASVYNTRHVCGFVWGGALLTSFSSLL
.. .:. .. : .:.:.:. . :: . . .. :: :: .. :. .:
CCDS24 SLLKEVNFYSGILLLACISVDRYLAIVHATRTLTQKRHLVKFVCLGCWGLSMNLSLPFFL
120 130 140 150 160 170
170 180 190 200 210
pF1KB8 FYICSHVSTRALECAKMQNAEAADATLVF------IGYVVPALATLYALVLLSRVRREDT
: : .. . : .. . ..: .:. .:..:: .. :. . : :
CCDS24 FRQAYHPNNSSPVCYEVLGNDTAKWRMVLRILPHTFGFIVPLFVMLFCYGFTLR-----T
180 190 200 210 220
220 230 240 250 260 270
pF1KB8 PLDRDTGRLEPSAHRLLVATVCTQFGLWTPHYLILLGHTVIISRGKPVDAHYLGLLHFVK
. :. . : :.. :.: . : :. :.::. :.. .. . . . . .
CCDS24 LFKAHMGQ-KHRAMRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQESCERRNNIGRAL
230 240 250 260 270 280
280 290 300 310 320 330
pF1KB8 DFSKLLAFSSSFVTPLLYRYMNQSFPSKLQRLMKKLPCGDRHCSPDHMGVQQVLA
: ...:.: : ..:..: ...:.: . ...
CCDS24 DATEILGFLHSCLNPIIYAFIGQNFRHGFLKILAMHGLVSKEFLARHRVTSYTSSSVNVS
290 300 310 320 330 340
CCDS24 SNL
350
333 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 15:40:50 2016 done: Fri Nov 4 15:40:51 2016
Total Scan time: 2.710 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]