FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4126, 433 aa
1>>>pF1KB4126 433 - 433 aa - 433 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6347+/-0.00119; mu= 15.9945+/- 0.070
mean_var=100.7416+/-26.354, 0's: 0 Z-trim(102.1): 148 B-trim: 827 in 2/48
Lambda= 0.127782
statistics sampled from 6587 (6784) to 6587 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.551), E-opt: 0.2 (0.208), width: 16
Scan time: 2.800
The best scores are:                              opt  bits E(32554)
CCDS5744.1 GPR22 gene_id:2845|Hs108|chr7  ( 433) 2782 524.2 9.5e-149
CCDS3438.1 CCKAR gene_id:886|Hs108|chr4   ( 428)  339  73.8 3.6e-13
>>CCDS5744.1 GPR22 gene_id:2845|Hs108|chr7 (433 aa)
initn: 2782 init1: 2782 opt: 2782 Z-score: 2784.0 bits: 524.2 E(32554): 9.5e-149
Smith-Waterman score: 2782; 100.0% identity (100.0% similar) in 433 aa overlap (1-433:1-433)
               10        20        30        40        50        60
pF1KB4 MCFSPILEINMQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 MCFSPILEINMQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 TVLVLYCMKSNLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 TVLVLYCMKSNLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 CVSFASVSTAINVFAITLDRYDISVKPANRILTMGRAVMLMISIWIFSFFSFLIPFIEVN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 CVSFASVSTAINVFAITLDRYDISVKPANRILTMGRAVMLMISIWIFSFFSFLIPFIEVN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 FFSLQSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 FFSLQSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNI
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 RIGTRFSTGQKKKARKKKTISLTTQHEATDMSQSSGGRNVVFGVRTSVSVIIALRRAVKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 RIGTRFSTGQKKKARKKKTISLTTQHEATDMSQSSGGRNVVFGVRTSVSVIIALRRAVKR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB4 HRERRERQKRVFRMSLLIISTFLLCWTPISVLNTTILCLGPSDLLVKLRLCFLVMAYGTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 HRERRERQKRVFRMSLLIISTFLLCWTPISVLNTTILCLGPSDLLVKLRLCFLVMAYGTT
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB4 IFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFEDSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS57 IFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFEDSE
              370       380       390       400       410       420

              430
pF1KB4 IREKCLVPQVVTD
       :::::::::::::
CCDS57 IREKCLVPQVVTD
              430
>>CCDS3438.1 CCKAR gene_id:886|Hs108|chr4 (428 aa)
initn: 168 init1: 96 opt: 339 Z-score: 350.0 bits: 73.8 E(32554): 3.6e-13
Smith-Waterman score: 339; 23.9% identity (59.5% similar) in 348 aa overlap (41-375:44-378)
20 30 40 50 60 70
pF1KB4 MQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNLTVLVLYCMKS
:. : ....: ::: ..:.. :: ...
CCDS34 ITPPCELGLENETLFCLDQPRPSKEWQPAVQILLYSLIFLLSVLG--NTLVITVL--IRN
20 30 40 50 60
80 90 100 110 120 130
pF1KB4 NLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEACVSFASVSTA
. . .:.::. ..: : :...:. :.:.... ::. . .: .. . : ..:..
CCDS34 KRMRTVTNIFLLSLAVSDLMLCLFCMPFNLIPNLLKDFIFGSAVC---KTTTYFMGTSVS
70 80 90 100 110 120
140 150 160 170 180
pF1KB4 INVF---AITLDRYDISVKP-ANRIL-TMGRAVMLMISIWIFSFFSFLIPF-IEVNFFSL
...: ::.:.:: :: .:. : ..:. .. . : .:: ... :. : :. .
CCDS34 VSTFNLVAISLERYGAICKPLQSRVWQTKSHALKVIAATWCLSF-TIMTPYPIYSNLVPF
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 QSGNTWENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNIRIGT
..:. . . :. . . .: .. . .:.. .::...: : .:.. :
CCDS34 TKNNNQTANMCRFLLPNDV---MQQSWHTFLLLILFLIPGIVMMVAYGLI--SLELYQGI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 RFSTGQKKKARKKKTISLTT-QHEATD---MSQSSGGRNVVFGVRTSVSVIIALRRAVKR
.: ..:::.:...: . .. ..: .: .... :.. . .. : : : .
CCDS34 KFEASQKKSAKERKPSTTSSGKYEDSDGCYLQKTRPPRKLELRQLSTGSSSRANRIRSNS
250 260 270 280 290 300
310 320 330 340 350
pF1KB4 HRERRERQKRVFRMSLLIISTFLLCWTPISVLNT--TILCLGPSDLLVKLRLCF-LVMAY
.:::.:: ..:. :.::: :: :. . . : . : :...:
CCDS34 SAANLMAKKRVIRMLIVIVVLFFLCWMPIFSANAWRAYDTASAERRLSGTPISFILLLSY
310 320 330 340 350 360
360 370 380 390 400 410
pF1KB4 GTTIFHPLLYAFTRQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFE
.. .:..: : ..:.
CCDS34 TSSCVNPIIYCFMNKRFRLGFMATFPCCPNPGPPGARGEVGEEEEGGTTGASLSRFSYSH
370 380 390 400 410 420
433 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 14:27:57 2016 done: Thu Nov 3 14:27:57 2016
Total Scan time: 2.800 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]