FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4368, 386 aa
1>>>pF1KE4368 386 - 386 aa - 386 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2726+/-0.000869; mu= 17.2052+/- 0.052
mean_var=73.9040+/-15.136, 0's: 0 Z-trim(107.0): 27 B-trim: 0 in 0/50
Lambda= 0.149190
statistics sampled from 9295 (9317) to 9295 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.663), E-opt: 0.2 (0.286), width: 16
Scan time: 2.460
The best scores are:                                  opt  bits E(32554)
CCDS8511.1 ADIPOR2 gene_id:79602|Hs108|chr12  ( 386) 2724 595.7 2.4e-170
CCDS1430.1 ADIPOR1 gene_id:51094|Hs108|chr1   ( 375) 1832 403.7 1.4e-112
CCDS34020.1 PAQR3 gene_id:152559|Hs108|chr4   ( 311)  514 119.9 3.1e-27
CCDS4941.1 PAQR8 gene_id:85315|Hs108|chr6     ( 354)  302  74.3 1.9e-13
CCDS267.1 PAQR7 gene_id:164091|Hs108|chr1     ( 346)  296  73.0 4.5e-13
CCDS10232.1 PAQR5 gene_id:54852|Hs108|chr15   ( 330)  261  65.5 8e-11
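Each row of the best-scores table carries an accession, a free-text description, the library sequence length in parentheses, the opt score, the bit score, and the E-value. A minimal Python sketch for pulling those fields out of one row follows. The names `Hit`, `ROW`, and `parse_hit` are hypothetical helpers introduced here for illustration; they are not part of the FASTA package, and the pattern assumes rows shaped like the ones shown above.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical container)."""
    accession: str
    description: str
    length: int      # library sequence length, from '( 386)'
    opt: int         # optimized score
    bits: float      # bit score
    evalue: float    # E(library size)


# accession, description, '( length)', opt, bits, E-value; whitespace-tolerant
ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<e>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)\s*$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a table row, or None if the line is not a row."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )
```

Lines that do not fit the row shape, such as the table header, yield `None`, so the function can be mapped over the whole report and the results filtered.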
>>CCDS8511.1 ADIPOR2 gene_id:79602|Hs108|chr12 (386 aa)
initn: 2724 init1: 2724 opt: 2724 Z-score: 3171.9 bits: 595.7 E(32554): 2.4e-170
Smith-Waterman score: 2724; 100.0% identity (100.0% similar) in 386 aa overlap (1-386:1-386)
               10        20        30        40        50        60
pF1KE4 MNEPTENRLGCSRTPEPDIRLRKGHQLDGTRRGDNDSHQGDLEPILEASVLSSHHKKSSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MNEPTENRLGCSRTPEPDIRLRKGHQLDGTRRGDNDSHQGDLEPILEASVLSSHHKKSSE
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE4 EHEYSDEAPQEDEGFMGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 EHEYSDEAPQEDEGFMGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDF
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE4 LLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 LLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQE
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE4 KVVFGLFFLGAILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSFY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 KVVFGLFFLGAILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSFY
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE4 CNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGLSGIIPTLHYVISE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 CNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGLSGIIPTLHYVISE
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE4 GFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 GFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAF
              310       320       330       340       350       360
              370       380
pF1KE4 VHFHGVSNLQEFRFMIGGGCSEEDAL
       ::::::::::::::::::::::::::
CCDS85 VHFHGVSNLQEFRFMIGGGCSEEDAL
              370       380
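Every alignment record opens with a score line (initn, init1, opt, Z-score, bits, E-value) and a Smith-Waterman summary line (score, percent identity, percent similarity, overlap length, and the query:library coordinate ranges). The sketch below parses those two lines into a dictionary. The function name `parse_alignment_header` and the dictionary keys are hypothetical, chosen for illustration only, and the patterns assume the line layout seen in this report.

```python
import re

# 'initn: 2724 init1: 2724 opt: 2724 Z-score: 3171.9 bits: 595.7 E(32554): 2.4e-170'
SCORES = re.compile(
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<z>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\((?P<db>\d+)\):\s*(?P<e>\S+)"
)

# 'Smith-Waterman score: 2724; 100.0% identity (100.0% similar) in 386 aa overlap (1-386:1-386)'
SUMMARY = re.compile(
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<ident>[\d.]+)% identity\s*"
    r"\((?P<sim>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap\s*"
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)


def parse_alignment_header(score_line: str, summary_line: str) -> dict:
    """Parse the two header lines that follow a '>>' hit line."""
    s = SCORES.search(score_line)
    w = SUMMARY.search(summary_line)
    if s is None or w is None:
        raise ValueError("line does not look like a FASTA alignment header")
    return {
        "initn": int(s["initn"]),
        "init1": int(s["init1"]),
        "opt": int(s["opt"]),
        "z_score": float(s["z"]),
        "bits": float(s["bits"]),
        "library_size": int(s["db"]),
        "evalue": float(s["e"]),
        "sw_score": int(w["sw"]),
        "pct_identity": float(w["ident"]),
        "pct_similar": float(w["sim"]),
        "overlap": int(w["overlap"]),
        "query_range": (int(w["qs"]), int(w["qe"])),
        "subject_range": (int(w["ss"]), int(w["se"])),
    }
```

Keeping `opt` and `sw_score` as separate fields is deliberate: the two can differ for the same hit, as in the PAQR7 record in this report (opt 296, Smith-Waterman score 301).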
>>CCDS1430.1 ADIPOR1 gene_id:51094|Hs108|chr1 (375 aa)
initn: 1880 init1: 1806 opt: 1832 Z-score: 2134.4 bits: 403.7 E(32554): 1.4e-112
Smith-Waterman score: 1832; 70.7% identity (87.4% similar) in 365 aa overlap (23-386:11-375)
10 20 30 40 50 60
pF1KE4 MNEPTENRLGCSRTPEPDIRLRKGHQLDGTRRGDNDSHQGDLEPILEASVLSSHHKKSSE
.:. .. : . . ..: :.:: . . .
CCDS14 MSSHKGSVVAQGNGAPASNREADTVELAELGPLLEEKGKRVIANPPKA
10 20 30 40
70 80 90 100 110
pF1KE4 EHEYSDEAPQEDEGFMGMSPL-LQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDND
:.: . .:::.: . . : :::::::::::::: :::::::::::.:::::::::::
CCDS14 EEEQTCPVPQEEEEEVRVLTLPLQAHHAMEKMEEFVYKVWEGRWRVIPYDVLPDWLKDND
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE4 FLLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQ
.::::::::::::::::::::::::::::::::::: :.:: :::. :.:::. :.::::
CCDS14 YLLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGFVLFLFLGILTMLRPNMYFMAPLQ
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE4 EKVVFGLFFLGAILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSF
::::::.:::::.::::::::::::::::: ::: :::::::::::::::::::::::::
CCDS14 EKVVFGMFFLGAVLCLSFSWLFHTVYCHSEKVSRTFSKLDYSGIALLIMGSFVPWLYYSF
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE4 YCNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGLSGIIPTLHYVIS
::.::: .::: ..:::::.::::.::: ::::..: .::::::::::::..::.:..:.
CCDS14 YCSPQPRLIYLSIVCVLGISAIIVAQWDRFATPKHRQTRAGVFLGLGLSGVVPTMHFTIA
230 240 250 260 270 280
300 310 320 330 340 350
pF1KE4 EGFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGA
:::.::.:.::.::..::: .:::::.:::::::::::::: ::::.:::.::..:::.:
CCDS14 EGFVKATTVGQMGWFFLMAVMYITGAGLYAARIPERFFPGKFDIWFQSHQIFHVLVVAAA
290 300 310 320 330 340
360 370 380
pF1KE4 FVHFHGVSNLQEFRFMIGGGCSEEDAL
::::.:::::::::. . :::... :
CCDS14 FVHFYGVSNLQEFRYGLEGGCTDDTLL
350 360 370
>>CCDS34020.1 PAQR3 gene_id:152559|Hs108|chr4 (311 aa)
initn: 448 init1: 197 opt: 514 Z-score: 602.4 bits: 119.9 E(32554): 3.1e-27
Smith-Waterman score: 514; 30.6% identity (63.6% similar) in 297 aa overlap (81-373:5-299)
60 70 80 90 100
pF1KE4 LSSHHKKSSEEHEYSDEAPQEDEGFMGMSPLLQAHHAME--KMEEFVCKVWEGRWRVIPH
::.. : .: ... . : .: :. .
CCDS34 MHQKLLKSAHYIELGSYQYWPVLVPRG-IRLYTY
10 20 30
110 120 130 140 150 160
pF1KE4 DVLPDWLKDNDFLLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMF
. .: :::: .. :.: .:: : :.::.: . .:: :::.:::: .:. :::. :
CCDS34 EQIPGSLKDNPYITDGYRAYLPS-RLCIKSLFILSNETVNIWSHLLGFFLFFTLGIYDMT
40 50 60 70 80 90
170 180 190 200 210 220
pF1KE4 RPNISFVAPLQEKVVFGLFFLGAILCLSFSWLFHTVYCH-SEGVSRLFSKLDYSGIALLI
: : .. :. .. .. .:. : .: :: :: . : . :::.::.. :
CCDS34 SVLPSASASREDFVICSICLFCFQVCMLCSVGYHLFSCHRSEKTCRRWMALDYAGISIGI
100 110 120 130 140 150
230 240 250 260 270 280
pF1KE4 MGSFVPWLYYSFYCNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGL
.: .: ..:.:::: .:::.. .. .:..... . : :.. .:. .: ...
CCDS34 LGCYVSGVFYAFYCNNYWRQVYLITVLAMILAVFFAQIHPNYLTQQWQRLRSIIFCSVSG
160 170 180 190 200 210
290 300 310 320 330 340
pF1KE4 SGIIPTLHYVISEGFLKAATIGQIG-WLMLMASLYITGAALYAARIPERFFPGKCDIWFH
:.:::::.: .: . : . ... ...: . . . .: ...:::.:::. .
CCDS34 YGVIPTLHWVWLNGGIGAPIVQDFAPRVIVMYMIALLAFLFYISKVPERYFPGQLNYLGS
220 230 240 250 260 270
350 360 370 380
pF1KE4 SHQLFHIFVVAGAFVHFHGVSNLQEFRFMIGGGCSEEDAL
:::..::..:. . ... ....:
CCDS34 SHQIWHILAVVMLYWWHQSTVYVMQYRHSKPCPDYVSHL
280 290 300 310
>>CCDS4941.1 PAQR8 gene_id:85315|Hs108|chr6 (354 aa)
initn: 281 init1: 123 opt: 302 Z-score: 355.0 bits: 74.3 E(32554): 1.9e-13
Smith-Waterman score: 302; 27.5% identity (55.4% similar) in 258 aa overlap (106-355:37-289)
80 90 100 110 120 130
pF1KE4 MGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDFLLHGHRPPMPSFRAC
.:. .:. ... .. :.:: .:
CCDS49 ERLSTLSVSGQQLRRLPKILEDGLPKMPCTVPETDVPQLFRE-PYIRTGYRPTGHEWRYY
10 20 30 40 50 60
140 150 160 170 180 190
pF1KE4 FKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQEKVVFGLFFLGAILCL
: :.:. :.:. :.:::::. . : :. : .. ... . ::.:..: :
CCDS49 FFSLFQKHNEVVNVWTHLLAALAVLLR--FWAFAEAEALPWASTHSLPLLLFILSSITYL
70 80 90 100 110 120
200 210 220 230 240 250
pF1KE4 SFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYS----FYCNPQPCFIYLI
. : : : . .:: : .:: :... .:: . ..:: .: :.
CCDS49 TCSLLAHLLQSKSELSHYTFYFVDYVGVSVYQYGSALAHFFYSSDQAWYDRFWLFFLPAA
130 140 150 160 170 180
260 270 280 290 300
pF1KE4 VIC-VLGIAAIIVSQWDMFAT-PQYRGVRAGVFLGLG-LSGIIPTLHYVISEGFLKAATI
..: :. :. ... . : .: . : ::. . : :. : : . :.
CCDS49 AFCGWLSCAGCCYAKYRYRRPYPVMRKICQVVPAGLAFILDISPVAHRVALCHL--AGCQ
190 200 210 220 230 240
310 320 330 340 350 360
pF1KE4 GQIGWLMLMASLY-ITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAFVHFHGVS
: .: . :. ...: ... .::..:::.::: :.::.:: :.
CCDS49 EQAAWYHTLQILFFLVSAYFFSCPVPEKYFPGSCDIVGHGHQIFHAFLSICTLSQLEAIL
250 260 270 280 290 300
370 380
pF1KE4 NLQEFRFMIGGGCSEEDAL
CCDS49 LDYQGRQEIFLQRHGPLSVHMACLSFFFLAACSAATAALLRHKVKARLTKKDS
310 320 330 340 350
>>CCDS267.1 PAQR7 gene_id:164091|Hs108|chr1 (346 aa)
initn: 305 init1: 138 opt: 296 Z-score: 348.2 bits: 73.0 E(32554): 4.5e-13
Smith-Waterman score: 301; 27.2% identity (58.9% similar) in 265 aa overlap (120-373:46-300)
90 100 110 120 130 140
pF1KE4 KMEEFVCKVWEGRWRVIPHDVLPDWLKDNDFLLHGHRPPMPSFRACFKSIFRIHTETGNI
.. :.:: ..: :...:. :.:. :.
CCDS26 QVIQEPQLSLQPEPVFTVDRAEVPPLFWKPYIYAGYRPLHQTWRFYFRTLFQQHNEAVNV
20 30 40 50 60 70
150 160 170 180 190 200
pF1KE4 WTHLLGC-VFFLCLGIFYMFRPNISFVAPLQEKVVFGLFFLGAILCLSFSWLFHTVYCHS
:::::. :..: :..: ...: . . .: .. :... :::: : : . .:
CCDS26 WTHLLAALVLLLRLALFV---ETVDFWGDPHALPLF-IIVLASFTYLSFSALAHLLQAKS
80 90 100 110 120 130
210 220 230 240 250 260
pF1KE4 EGVSRLFSKLDYSGIALLIMGSFVPWLYYSFYCNPQPCFIYLIVICVLGIAAIIVSQWDM
: : ::: :.:. .:: . .::.. .: . . : .::... :
CCDS26 EFWHYSFFFLDYVGVAVYQFGSALAHFYYAI----EPAWHAQVQAVFLPMAAFLA--WLS
140 150 160 170 180
270 280 290 300 310
pF1KE4 FATPQY-RGVRAGVFLGLGLSGIIPTLHY------VISEGFLKAATIGQIGWLML---MA
: . .. .:: . . .: : :. . :... . :. ..
CCDS26 CIGSCYNKYIQKPGLLGRTCQEVPSVLAYALDISPVVHRIFVSSDPTTDDPALLYHKCQV
190 200 210 220 230 240
320 330 340 350 360 370
pF1KE4 SLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAFVHFHGVSNLQEFRFMIGG
... .::.... .:::.:::.: .. ..:::::::.: .......:. : :
CCDS26 VFFLLAAAFFSTFMPERWFPGSCHVFGQGHQLFHIFLVLCTLAQLEAVALDYEARRPIYE
250 260 270 280 290 300
380
pF1KE4 GCSEEDAL
CCDS26 PLHTHWPHNFSGLFLLTVGSSILTAFLLSQLVQRKLDQKTK
310 320 330 340
>>CCDS10232.1 PAQR5 gene_id:54852|Hs108|chr15 (330 aa)
initn: 286 init1: 130 opt: 261 Z-score: 307.8 bits: 65.5 E(32554): 8e-11
Smith-Waterman score: 261; 26.6% identity (55.4% similar) in 278 aa overlap (104-366:8-272)
80 90 100 110 120 130
pF1KE4 GFMGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDFLLHGHRPPMPSFR
:.. : .:. .... .: :.: :. :
CCDS10 MLSLKLPRLFSIDQIPQVFHEQG-ILFGYRHPQSSAT
10 20 30
140 150 160 170 180 190
pF1KE4 ACFKSIFRIHTETGNIWTHLLGCVFFL--CLGIFYMFR-PNISFVAPLQEKVVFGLFFLG
::. :.:.. .:: ::::::: :: . .:: : :. :. . . .
CCDS10 ACILSLFQMTNETLNIWTHLLPFWFFAWRFVTALYMTDIKNDSYSWPMLVYMCTSCVYPL
40 50 60 70 80 90
200 210 220 230 240
pF1KE4 AILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSF----YCNP-QP
. : :: :... .. :::... :. .:: . . :.: .:. .
CCDS10 VSSCA------HTFSSMSKNARHICYFLDYGAVNLFSLGSAIAYSAYTFPDALMCTTFHD
100 110 120 130 140 150
250 260 270 280 290 300
pF1KE4 CFIYLIVI-CVLGIAAIIVSQWDMFATPQY-RGVRAGVFLGLGLSGIIPTLHYVI---SE
.. : :. .:. . :.. . :. . .:. .: .: .. .. .:
CCDS10 YYVALAVLNTILSTGLSCYSRFLEIQKPRLCKVIRVLAFAYPYTWDSLPIFYRLFLFPGE
160 170 180 190 200 210
310 320 330 340 350
pF1KE4 GFLKAAT-IGQIGWLM-LMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAG
. . :: : .: :.::. ::.:..:::. ::. : :::::::. :. .
CCDS10 SAQNEATSYHQKHMIMTLLASF------LYSAHLPERLAPGRFDYIGHSHQLFHVCVILA
220 230 240 250 260
360 370 380
pF1KE4 AFVHFHGVSNLQEFRFMIGGGCSEEDAL
. ......
CCDS10 THMQMEAILLDKTLRKEWLLATSKPFSFSQIAGAILLCIIFSLSNIIYFSAALYRIPKPE
270 280 290 300 310 320
386 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 02:01:30 2016 done: Sun Nov 6 02:01:31 2016
Total Scan time: 2.460 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]
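The E(32554) values in this report can be reproduced approximately from the reported bit scores. The sketch below assumes the relation E = D * m * n * 2^(-bits), where D is the number of library sequences (32554), m the query length (386 aa), and n the library sequence length. This relation is an assumption checked only against the rows shown in this report, not a transcription of the FASTA source code, and because the bit scores are printed to one decimal place the result agrees with the reported E-values only to within a few percent. The function name `evalue_from_bits` is hypothetical.

```python
def evalue_from_bits(bits: float, query_len: int, subject_len: int,
                     db_entries: int) -> float:
    """Approximate E-value from a bit score.

    Assumed relation (consistent with the rows in this report):
        E = db_entries * query_len * subject_len * 2 ** (-bits)
    """
    return db_entries * query_len * subject_len * 2.0 ** (-bits)


# Rows from this report: (bits, library sequence length, reported E-value)
rows = [
    (595.7, 386, 2.4e-170),  # ADIPOR2
    (403.7, 375, 1.4e-112),  # ADIPOR1
    (119.9, 311, 3.1e-27),   # PAQR3
    (65.5, 330, 8e-11),      # PAQR5
]
for bits, subject_len, reported in rows:
    estimate = evalue_from_bits(bits, 386, subject_len, 32554)
    print(f"bits={bits:6.1f}  estimated E={estimate:.2g}  reported E={reported:.2g}")
```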