FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8945, 304 aa
1>>>pF1KB8945 304 - 304 aa - 304 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.3569+/-0.000822; mu= 9.5948+/- 0.050
mean_var=231.8799+/-48.456, 0's: 0 Z-trim(116.0): 157 B-trim: 113 in 1/52
Lambda= 0.084225
statistics sampled from 16366 (16542) to 16366 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.508), width: 16
Scan time: 2.860
The best scores are: opt bits E(32554)
CCDS3494.1 GSX2 gene_id:170825|Hs108|chr4 ( 304) 2106 267.9 6.7e-72
CCDS9326.1 GSX1 gene_id:219409|Hs108|chr13 ( 264) 516 74.7 8.8e-14
>>CCDS3494.1 GSX2 gene_id:170825|Hs108|chr4 (304 aa)
initn: 2106 init1: 2106 opt: 2106 Z-score: 1404.4 bits: 267.9 E(32554): 6.7e-72
Smith-Waterman score: 2106; 99.7% identity (100.0% similar) in 304 aa overlap (1-304:1-304)
10 20 30 40 50 60
pF1KB8 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPGCPSRKSGAFCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPGCPSRKSGAFCV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 CPLCVTSHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKSQFSSAPGDAQFCP
::::::::::::::::::::::::::::::::::::::::::::::.:::::::::::::
CCDS34 CPLCVTSHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKGQFSSAPGDAQFCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 RVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTYNV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTYNV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 ADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATYLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATYLN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 LSEKQVKIWFQNRRVKHKKEGKGTQRNSHAGCKCVGSQVHYARSEDEDSLSPASANDDKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 LSEKQVKIWFQNRRVKHKKEGKGTQRNSHAGCKCVGSQVHYARSEDEDSLSPASANDDKE
250 260 270 280 290 300
pF1KB8 ISPL
::::
CCDS34 ISPL
>>CCDS9326.1 GSX1 gene_id:219409|Hs108|chr13 (264 aa)
initn: 649 init1: 415 opt: 516 Z-score: 360.9 bits: 74.7 E(32554): 8.8e-14
Smith-Waterman score: 725; 45.0% identity (64.8% similar) in 318 aa overlap (1-302:1-261)
10 20 30 40 50
pF1KB8 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPG-CPSRKSGAFC
: ::: ::::....... :: : : : : ..::: .. .:: : .::.: .:
CCDS93 MPRSFLVDSLVLREAGEKKA--PEGSPPPLF--PYAVPPPHALHGLSPGACHARKAGLLC
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 VCPLCVT-SHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKSQFSSAPGDAQF
::::::: :.::. : ::::::..: : .:.
CCDS93 VCPLCVTASQLHGPPGP------------------------PALPLLKASFP--PFGSQY
60 70 80 90
120 130 140 150 160 170
pF1KB8 CPRVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTY
: : : .. :. .:: .: . ::::::::: . :.:
CCDS93 C----------HAPLGRQ--HSAVSPG----VAHGPAAAAAAAALYQ----------TSY
100 110 120
180 190 200 210 220 230
pF1KB8 NVADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATY
. :::.:::... .: ..:.:..:::::::::::::::::::.::::::::::::::::
CCDS93 PLPDPRQFHCISVDSS-SNQLPSSKRMRTAFTSTQLLELEREFASNMYLSRLRRIEIATY
130 140 150 160 170 180
240 250 260 270 280
pF1KB8 LNLSEKQVKIWFQNRRVKHKKEGKGTQR------------NSHAGCKCVG-SQVHYARSE
:::::::::::::::::::::::::... .. ::::.. :... ....
CCDS93 LNLSEKQVKIWFQNRRVKHKKEGKGSNHRGGGGGGAGGGGSAPQGCKCASLSSAKCSEDD
190 200 210 220 230 240
290 300
pF1KB8 DEDSLSPASAN-DDKEISPL
:: .::.:.. ::....
CCDS93 DELPMSPSSSGKDDRDLTVTP
250 260
304 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:36:52 2016 done: Fri Nov 4 16:36:52 2016
Total Scan time: 2.860 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]