FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4368, 386 aa 1>>>pF1KE4368 386 - 386 aa - 386 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2726+/-0.000869; mu= 17.2052+/- 0.052 mean_var=73.9040+/-15.136, 0's: 0 Z-trim(107.0): 27 B-trim: 0 in 0/50 Lambda= 0.149190 statistics sampled from 9295 (9317) to 9295 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.663), E-opt: 0.2 (0.286), width: 16 Scan time: 2.460 The best scores are: opt bits E(32554) CCDS8511.1 ADIPOR2 gene_id:79602|Hs108|chr12 ( 386) 2724 595.7 2.4e-170 CCDS1430.1 ADIPOR1 gene_id:51094|Hs108|chr1 ( 375) 1832 403.7 1.4e-112 CCDS34020.1 PAQR3 gene_id:152559|Hs108|chr4 ( 311) 514 119.9 3.1e-27 CCDS4941.1 PAQR8 gene_id:85315|Hs108|chr6 ( 354) 302 74.3 1.9e-13 CCDS267.1 PAQR7 gene_id:164091|Hs108|chr1 ( 346) 296 73.0 4.5e-13 CCDS10232.1 PAQR5 gene_id:54852|Hs108|chr15 ( 330) 261 65.5 8e-11 >>CCDS8511.1 ADIPOR2 gene_id:79602|Hs108|chr12 (386 aa) initn: 2724 init1: 2724 opt: 2724 Z-score: 3171.9 bits: 595.7 E(32554): 2.4e-170 Smith-Waterman score: 2724; 100.0% identity (100.0% similar) in 386 aa overlap (1-386:1-386) 10 20 30 40 50 60 pF1KE4 MNEPTENRLGCSRTPEPDIRLRKGHQLDGTRRGDNDSHQGDLEPILEASVLSSHHKKSSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 MNEPTENRLGCSRTPEPDIRLRKGHQLDGTRRGDNDSHQGDLEPILEASVLSSHHKKSSE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 EHEYSDEAPQEDEGFMGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 EHEYSDEAPQEDEGFMGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 LLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 KVVFGLFFLGAILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSFY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 KVVFGLFFLGAILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSFY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 CNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGLSGIIPTLHYVISE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 CNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGLSGIIPTLHYVISE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 GFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS85 GFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAF 310 320 330 340 350 360 370 380 pF1KE4 VHFHGVSNLQEFRFMIGGGCSEEDAL :::::::::::::::::::::::::: CCDS85 VHFHGVSNLQEFRFMIGGGCSEEDAL 370 380 >>CCDS1430.1 ADIPOR1 gene_id:51094|Hs108|chr1 (375 aa) initn: 1880 init1: 1806 opt: 1832 Z-score: 2134.4 bits: 403.7 E(32554): 1.4e-112 Smith-Waterman score: 1832; 70.7% identity (87.4% similar) in 365 aa overlap (23-386:11-375) 10 20 30 40 50 60 pF1KE4 MNEPTENRLGCSRTPEPDIRLRKGHQLDGTRRGDNDSHQGDLEPILEASVLSSHHKKSSE .:. .. : . . ..: :.:: . . . CCDS14 MSSHKGSVVAQGNGAPASNREADTVELAELGPLLEEKGKRVIANPPKA 10 20 30 40 70 80 90 100 110 pF1KE4 EHEYSDEAPQEDEGFMGMSPL-LQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDND :.: . .:::.: . . : :::::::::::::: :::::::::::.::::::::::: CCDS14 EEEQTCPVPQEEEEEVRVLTLPLQAHHAMEKMEEFVYKVWEGRWRVIPYDVLPDWLKDND 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 FLLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQ .::::::::::::::::::::::::::::::::::: :.:: :::. :.:::. :.:::: CCDS14 YLLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGFVLFLFLGILTMLRPNMYFMAPLQ 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 EKVVFGLFFLGAILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSF ::::::.:::::.::::::::::::::::: ::: ::::::::::::::::::::::::: CCDS14 EKVVFGMFFLGAVLCLSFSWLFHTVYCHSEKVSRTFSKLDYSGIALLIMGSFVPWLYYSF 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 YCNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGLSGIIPTLHYVIS ::.::: .::: ..:::::.::::.::: ::::..: .::::::::::::..::.:..:. CCDS14 YCSPQPRLIYLSIVCVLGISAIIVAQWDRFATPKHRQTRAGVFLGLGLSGVVPTMHFTIA 230 240 250 260 270 280 300 310 320 330 340 350 pF1KE4 EGFLKAATIGQIGWLMLMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGA :::.::.:.::.::..::: .:::::.:::::::::::::: ::::.:::.::..:::.: CCDS14 EGFVKATTVGQMGWFFLMAVMYITGAGLYAARIPERFFPGKFDIWFQSHQIFHVLVVAAA 290 300 310 320 330 340 360 370 380 pF1KE4 FVHFHGVSNLQEFRFMIGGGCSEEDAL ::::.:::::::::. . :::... : CCDS14 FVHFYGVSNLQEFRYGLEGGCTDDTLL 350 360 370 >>CCDS34020.1 PAQR3 gene_id:152559|Hs108|chr4 (311 aa) initn: 448 init1: 197 opt: 514 Z-score: 602.4 bits: 119.9 E(32554): 3.1e-27 Smith-Waterman score: 514; 30.6% identity (63.6% similar) in 297 aa overlap (81-373:5-299) 60 70 80 90 100 pF1KE4 LSSHHKKSSEEHEYSDEAPQEDEGFMGMSPLLQAHHAME--KMEEFVCKVWEGRWRVIPH ::.. : .: ... . : .: :. . CCDS34 MHQKLLKSAHYIELGSYQYWPVLVPRG-IRLYTY 10 20 30 110 120 130 140 150 160 pF1KE4 DVLPDWLKDNDFLLHGHRPPMPSFRACFKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMF . .: :::: .. :.: .:: : :.::.: . .:: :::.:::: .:. :::. : CCDS34 EQIPGSLKDNPYITDGYRAYLPS-RLCIKSLFILSNETVNIWSHLLGFFLFFTLGIYDMT 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE4 RPNISFVAPLQEKVVFGLFFLGAILCLSFSWLFHTVYCH-SEGVSRLFSKLDYSGIALLI : : .. :. .. .. .:. : .: :: :: . : . :::.::.. : CCDS34 SVLPSASASREDFVICSICLFCFQVCMLCSVGYHLFSCHRSEKTCRRWMALDYAGISIGI 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE4 MGSFVPWLYYSFYCNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQYRGVRAGVFLGLGL .: .: ..:.:::: .:::.. .. .:..... . : :.. .:. .: ... CCDS34 LGCYVSGVFYAFYCNNYWRQVYLITVLAMILAVFFAQIHPNYLTQQWQRLRSIIFCSVSG 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE4 SGIIPTLHYVISEGFLKAATIGQIG-WLMLMASLYITGAALYAARIPERFFPGKCDIWFH :.:::::.: .: . : . ... ...: . . . .: ...:::.:::. . CCDS34 YGVIPTLHWVWLNGGIGAPIVQDFAPRVIVMYMIALLAFLFYISKVPERYFPGQLNYLGS 220 230 240 250 260 270 350 360 370 380 pF1KE4 SHQLFHIFVVAGAFVHFHGVSNLQEFRFMIGGGCSEEDAL :::..::..:. . ... ....: CCDS34 SHQIWHILAVVMLYWWHQSTVYVMQYRHSKPCPDYVSHL 280 290 300 310 >>CCDS4941.1 PAQR8 gene_id:85315|Hs108|chr6 (354 aa) initn: 281 init1: 123 opt: 302 Z-score: 355.0 bits: 74.3 E(32554): 1.9e-13 Smith-Waterman score: 302; 27.5% identity (55.4% similar) in 258 aa overlap (106-355:37-289) 80 90 100 110 120 130 pF1KE4 MGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDFLLHGHRPPMPSFRAC .:. .:. ... .. :.:: .: CCDS49 ERLSTLSVSGQQLRRLPKILEDGLPKMPCTVPETDVPQLFRE-PYIRTGYRPTGHEWRYY 10 20 30 40 50 60 140 150 160 170 180 190 pF1KE4 FKSIFRIHTETGNIWTHLLGCVFFLCLGIFYMFRPNISFVAPLQEKVVFGLFFLGAILCL : :.:. :.:. :.:::::. . : :. : .. ... . ::.:..: : CCDS49 FFSLFQKHNEVVNVWTHLLAALAVLLR--FWAFAEAEALPWASTHSLPLLLFILSSITYL 70 80 90 100 110 120 200 210 220 230 240 250 pF1KE4 SFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYS----FYCNPQPCFIYLI . : : : . .:: : .:: :... .:: . ..:: .: :. CCDS49 TCSLLAHLLQSKSELSHYTFYFVDYVGVSVYQYGSALAHFFYSSDQAWYDRFWLFFLPAA 130 140 150 160 170 180 260 270 280 290 300 pF1KE4 VIC-VLGIAAIIVSQWDMFAT-PQYRGVRAGVFLGLG-LSGIIPTLHYVISEGFLKAATI ..: :. :. ... . : .: . : ::. . : :. : : . :. CCDS49 AFCGWLSCAGCCYAKYRYRRPYPVMRKICQVVPAGLAFILDISPVAHRVALCHL--AGCQ 190 200 210 220 230 240 310 320 330 340 350 360 pF1KE4 GQIGWLMLMASLY-ITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAFVHFHGVS : .: . :. ...: ... .::..:::.::: :.::.:: :. CCDS49 EQAAWYHTLQILFFLVSAYFFSCPVPEKYFPGSCDIVGHGHQIFHAFLSICTLSQLEAIL 250 260 270 280 290 300 370 380 pF1KE4 NLQEFRFMIGGGCSEEDAL CCDS49 LDYQGRQEIFLQRHGPLSVHMACLSFFFLAACSAATAALLRHKVKARLTKKDS 310 320 330 340 350 >>CCDS267.1 PAQR7 gene_id:164091|Hs108|chr1 (346 aa) initn: 305 init1: 138 opt: 296 Z-score: 348.2 bits: 73.0 E(32554): 4.5e-13 Smith-Waterman score: 301; 27.2% identity (58.9% similar) in 265 aa overlap (120-373:46-300) 90 100 110 120 130 140 pF1KE4 KMEEFVCKVWEGRWRVIPHDVLPDWLKDNDFLLHGHRPPMPSFRACFKSIFRIHTETGNI .. :.:: ..: :...:. :.:. :. CCDS26 QVIQEPQLSLQPEPVFTVDRAEVPPLFWKPYIYAGYRPLHQTWRFYFRTLFQQHNEAVNV 20 30 40 50 60 70 150 160 170 180 190 200 pF1KE4 WTHLLGC-VFFLCLGIFYMFRPNISFVAPLQEKVVFGLFFLGAILCLSFSWLFHTVYCHS :::::. :..: :..: ...: . . .: .. :... :::: : : . .: CCDS26 WTHLLAALVLLLRLALFV---ETVDFWGDPHALPLF-IIVLASFTYLSFSALAHLLQAKS 80 90 100 110 120 130 210 220 230 240 250 260 pF1KE4 EGVSRLFSKLDYSGIALLIMGSFVPWLYYSFYCNPQPCFIYLIVICVLGIAAIIVSQWDM : : ::: :.:. .:: . .::.. .: . . : .::... : CCDS26 EFWHYSFFFLDYVGVAVYQFGSALAHFYYAI----EPAWHAQVQAVFLPMAAFLA--WLS 140 150 160 170 180 270 280 290 300 310 pF1KE4 FATPQY-RGVRAGVFLGLGLSGIIPTLHY------VISEGFLKAATIGQIGWLML---MA : . .. .:: . . .: : :. . :... . :. .. CCDS26 CIGSCYNKYIQKPGLLGRTCQEVPSVLAYALDISPVVHRIFVSSDPTTDDPALLYHKCQV 190 200 210 220 230 240 320 330 340 350 360 370 pF1KE4 SLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAGAFVHFHGVSNLQEFRFMIGG ... .::.... .:::.:::.: .. ..:::::::.: .......:. : : CCDS26 VFFLLAAAFFSTFMPERWFPGSCHVFGQGHQLFHIFLVLCTLAQLEAVALDYEARRPIYE 250 260 270 280 290 300 380 pF1KE4 GCSEEDAL CCDS26 PLHTHWPHNFSGLFLLTVGSSILTAFLLSQLVQRKLDQKTK 310 320 330 340 >>CCDS10232.1 PAQR5 gene_id:54852|Hs108|chr15 (330 aa) initn: 286 init1: 130 opt: 261 Z-score: 307.8 bits: 65.5 E(32554): 8e-11 Smith-Waterman score: 261; 26.6% identity (55.4% similar) in 278 aa overlap (104-366:8-272) 80 90 100 110 120 130 pF1KE4 GFMGMSPLLQAHHAMEKMEEFVCKVWEGRWRVIPHDVLPDWLKDNDFLLHGHRPPMPSFR :.. : .:. .... .: :.: :. : CCDS10 MLSLKLPRLFSIDQIPQVFHEQG-ILFGYRHPQSSAT 10 20 30 140 150 160 170 180 190 pF1KE4 ACFKSIFRIHTETGNIWTHLLGCVFFL--CLGIFYMFR-PNISFVAPLQEKVVFGLFFLG ::. :.:.. .:: ::::::: :: . .:: : :. :. . . . CCDS10 ACILSLFQMTNETLNIWTHLLPFWFFAWRFVTALYMTDIKNDSYSWPMLVYMCTSCVYPL 40 50 60 70 80 90 200 210 220 230 240 pF1KE4 AILCLSFSWLFHTVYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYYSF----YCNP-QP . : :: :... .. :::... :. .:: . . :.: .:. . CCDS10 VSSCA------HTFSSMSKNARHICYFLDYGAVNLFSLGSAIAYSAYTFPDALMCTTFHD 100 110 120 130 140 150 250 260 270 280 290 300 pF1KE4 CFIYLIVI-CVLGIAAIIVSQWDMFATPQY-RGVRAGVFLGLGLSGIIPTLHYVI---SE .. : :. .:. . :.. . :. . .:. .: .: .. .. .: CCDS10 YYVALAVLNTILSTGLSCYSRFLEIQKPRLCKVIRVLAFAYPYTWDSLPIFYRLFLFPGE 160 170 180 190 200 210 310 320 330 340 350 pF1KE4 GFLKAAT-IGQIGWLM-LMASLYITGAALYAARIPERFFPGKCDIWFHSHQLFHIFVVAG . . :: : .: :.::. ::.:..:::. ::. : :::::::. :. . CCDS10 SAQNEATSYHQKHMIMTLLASF------LYSAHLPERLAPGRFDYIGHSHQLFHVCVILA 220 230 240 250 260 360 370 380 pF1KE4 AFVHFHGVSNLQEFRFMIGGGCSEEDAL . ...... CCDS10 THMQMEAILLDKTLRKEWLLATSKPFSFSQIAGAILLCIIFSLSNIIYFSAALYRIPKPE 270 280 290 300 310 320 386 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 02:01:30 2016 done: Sun Nov 6 02:01:31 2016 Total Scan time: 2.460 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]