FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7615, 304 aa
1>>>pF1KB7615 304 - 304 aa - 304 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.0899+/-0.000908; mu= 5.1121+/- 0.055
mean_var=219.5131+/-44.827, 0's: 0 Z-trim(114.1): 178 B-trim: 80 in 1/51
Lambda= 0.086565
statistics sampled from 14517 (14716) to 14517 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.781), E-opt: 0.2 (0.452), width: 16
Scan time: 2.550
The best scores are: opt bits E(32554)
CCDS34605.1 MEOX2 gene_id:4223|Hs108|chr7 ( 304) 2101 274.4 7.3e-74
CCDS11466.1 MEOX1 gene_id:4222|Hs108|chr17 ( 254) 525 77.5 1.1e-14
CCDS42343.1 MEOX1 gene_id:4222|Hs108|chr17 ( 139) 519 76.5 1.3e-14
>>CCDS34605.1 MEOX2 gene_id:4223|Hs108|chr7 (304 aa)
initn: 2101 init1: 2101 opt: 2101 Z-score: 1439.6 bits: 274.4 E(32554): 7.3e-74
Smith-Waterman score: 2101; 100.0% identity (100.0% similar) in 304 aa overlap (1-304:1-304)
10 20 30 40 50 60
pF1KB7 MEHPLFGCLRSPHATAQGLHPFSQSSLALHGRSDHMSYPELSTSSSSCIIAGYPNEEGMF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MEHPLFGCLRSPHATAQGLHPFSQSSLALHGRSDHMSYPELSTSSSSCIIAGYPNEEGMF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 ASQHHRGHHHHHHHHHHHHHQQQQHQALQTNWHLPQMSSPPSAARHSLCLQPDSGGPPEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ASQHHRGHHHHHHHHHHHHHQQQQHQALQTNWHLPQMSSPPSAARHSLCLQPDSGGPPEL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 GSSPPVLCSNSSSLGSSTPTGAACAPGDYGRQALSPAEAEKRSGGKRKSDSSDSQEGNYK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 GSSPPVLCSNSSSLGSSTPTGAACAPGDYGRQALSPAEAEKRSGGKRKSDSSDSQEGNYK
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 SEVNSKPRKERTAFTKEQIRELEAEFAHHNYLTRLRRYEIAVNLDLTERQVKVWFQNRRM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 SEVNSKPRKERTAFTKEQIRELEAEFAHHNYLTRLRRYEIAVNLDLTERQVKVWFQNRRM
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 KWKRVKGGQQGAAAREKELVNVKKGTLLPSELSGIGAATLQQTGDSIANEDSHDSDHSSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 KWKRVKGGQQGAAAREKELVNVKKGTLLPSELSGIGAATLQQTGDSIANEDSHDSDHSSE
250 260 270 280 290 300
pF1KB7 HAHL
::::
CCDS34 HAHL
>>CCDS11466.1 MEOX1 gene_id:4222|Hs108|chr17 (254 aa)
initn: 629 init1: 487 opt: 525 Z-score: 376.9 bits: 77.5 E(32554): 1.1e-14
Smith-Waterman score: 624; 47.8% identity (64.3% similar) in 255 aa overlap (4-249:17-233)
10 20 30
pF1KB7 MEHPLFGCLRSPHAT---AQGLHPFSQSSLALHGRSDHMS-----YP
:..::::.::. :.:: . . ...: . : .. ::
CCDS11 MDPAASSCMRSLQPPAPVWGCLRNPHSEGNGASGLPHYPPTPFSFHQKPDFLATATAAYP
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB7 ELSTSSSSCIIAGYPNEEGMFASQHHRGHHHHHHHHHHHHHQQQQHQALQTNWHLPQMSS
..:.: . . :.:: .:. :: :. :::.:
CCDS11 DFSASCLAATPHSLPQEEHIFTEQHPA-------------FPQSP------NWHFPV---
70 80 90
100 110 120 130 140 150
pF1KB7 PPSAARHSLCLQPDSGGPPELGSSPPVLCSNSSSLGSSTPTGAACAPGD-YGRQALSPAE
: ::. .:.:: : ::. ..:::: ::. ::: :: . . :
CCDS11 --SDARR----RPNSG--PAGGSKE----MGTSSLGLVDTTGG---PGDDYGVLGSTANE
100 110 120 130 140
160 170 180 190 200 210
pF1KB7 AEKRSGGKRKSDSSDSQEGNYKSEVNSKPRKERTAFTKEQIRELEAEFAHHNYLTRLRRY
.::.:. .:: .:::.::. : : .:: :::::::::::.:::::::::::::::::::
CCDS11 TEKKSSRRRK-ESSDNQENRGKPEGSSKARKERTAFTKEQLRELEAEFAHHNYLTRLRRY
150 160 170 180 190 200
220 230 240 250 260 270
pF1KB7 EIAVNLDLTERQVKVWFQNRRMKWKRVKGGQQGAAAREKELVNVKKGTLLPSELSGIGAA
::::::::.::::::::::::::::::::::
CCDS11 EIAVNLDLSERQVKVWFQNRRMKWKRVKGGQPISPNGQDPEDGDSTASPSSE
210 220 230 240 250
280 290 300
pF1KB7 TLQQTGDSIANEDSHDSDHSSEHAHL
>>CCDS42343.1 MEOX1 gene_id:4222|Hs108|chr17 (139 aa)
initn: 520 init1: 487 opt: 519 Z-score: 376.1 bits: 76.5 E(32554): 1.3e-14
Smith-Waterman score: 519; 70.8% identity (84.2% similar) in 120 aa overlap (131-249:3-118)
110 120 130 140 150
pF1KB7 PSAARHSLCLQPDSGGPPELGSSPPVLCSNSSSLGSSTPTGAACAPGD-YGRQALSPAEA
.:::: :: .::: :: . . :.
CCDS42 MGTSSLGLVDTTG---GPGDDYGVLGSTANET
10 20
160 170 180 190 200 210
pF1KB7 EKRSGGKRKSDSSDSQEGNYKSEVNSKPRKERTAFTKEQIRELEAEFAHHNYLTRLRRYE
::.:. .:...:::.::. : : .:: :::::::::::.::::::::::::::::::::
CCDS42 EKKSS-RRRKESSDNQENRGKPEGSSKARKERTAFTKEQLRELEAEFAHHNYLTRLRRYE
30 40 50 60 70 80
220 230 240 250 260 270
pF1KB7 IAVNLDLTERQVKVWFQNRRMKWKRVKGGQQGAAAREKELVNVKKGTLLPSELSGIGAAT
:::::::.::::::::::::::::::::::
CCDS42 IAVNLDLSERQVKVWFQNRRMKWKRVKGGQPISPNGQDPEDGDSTASPSSE
90 100 110 120 130
304 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 21:19:56 2016 done: Fri Nov 4 21:19:56 2016
Total Scan time: 2.550 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]