FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7603, 297 aa
1>>>pF1KB7603 297 - 297 aa - 297 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.5786+/-0.000907; mu= 2.2217+/- 0.055
mean_var=221.0601+/-44.728, 0's: 0 Z-trim(113.6): 151 B-trim: 5 in 1/50
Lambda= 0.086262
statistics sampled from 14077 (14249) to 14077 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.766), E-opt: 0.2 (0.438), width: 16
Scan time: 2.180
The best scores are:                               opt  bits E(32554)
CCDS9728.1 OTX2 gene_id:5015|Hs108|chr14   ( 297) 2013 262.6 2.5e-70
CCDS41960.1 OTX2 gene_id:5015|Hs108|chr14  ( 289) 1916 250.5 1.1e-66
CCDS1873.1 OTX1 gene_id:5013|Hs108|chr2    ( 354)  758 106.5   3e-23
CCDS12706.1 CRX gene_id:1406|Hs108|chr19   ( 299)  564  82.3 4.9e-16
>>CCDS9728.1 OTX2 gene_id:5015|Hs108|chr14 (297 aa)
initn: 2013 init1: 2013 opt: 2013 Z-score: 1376.0 bits: 262.6 E(32554): 2.5e-70
Smith-Waterman score: 2013; 100.0% identity (100.0% similar) in 297 aa overlap (1-297:1-297)
10 20 30 40 50 60
pF1KB7 MMSYLKQPPYAVNGLSLTTSGMDLLHPSVGYPGPWASCPAATPRKQRRERTTFTRAQLDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 MMSYLKQPPYAVNGLSLTTSGMDLLHPSVGYPGPWASCPAATPRKQRRERTTFTRAQLDV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 LEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQQQQQNGGQNKVRPAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 LEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQQQQQNGGQNKVRPAK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 KKTSPAREVSSESGTSGQFTPPSSTSVPTIASSSAPVSIWSPASISPLSDPLSTSSSCMQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 KKTSPAREVSSESGTSGQFTPPSSTSVPTIASSSAPVSIWSPASISPLSDPLSTSSSCMQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 RSYPMTYTQASGYSQGYAGSTSYFGGMDCGSYLTPMHHQLPGPGATLSPMGTNAVTSHLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 RSYPMTYTQASGYSQGYAGSTSYFGGMDCGSYLTPMHHQLPGPGATLSPMGTNAVTSHLN
190 200 210 220 230 240
250 260 270 280 290
pF1KB7 QSPASLSTQGYGASSLGFNSTTDCLDYKDQTASWKLNFNADCLDYKDQTSSWKFQVL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS97 QSPASLSTQGYGASSLGFNSTTDCLDYKDQTASWKLNFNADCLDYKDQTSSWKFQVL
250 260 270 280 290
>>CCDS41960.1 OTX2 gene_id:5015|Hs108|chr14 (289 aa)
initn: 1928 init1: 1725 opt: 1916 Z-score: 1310.9 bits: 250.5 E(32554): 1.1e-66
Smith-Waterman score: 1916; 97.3% identity (97.3% similar) in 297 aa overlap (1-297:1-289)
               10        20        30        40        50        60
pF1KB7 MMSYLKQPPYAVNGLSLTTSGMDLLHPSVGYPGPWASCPAATPRKQRRERTTFTRAQLDV
       ::::::::::::::::::::::::::::::::        ::::::::::::::::::::
CCDS41 MMSYLKQPPYAVNGLSLTTSGMDLLHPSVGYP--------ATPRKQRRERTTFTRAQLDV
               10        20        30                40        50
70 80 90 100 110 120
pF1KB7 LEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQQQQQNGGQNKVRPAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQQQQQNGGQNKVRPAK
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB7 KKTSPAREVSSESGTSGQFTPPSSTSVPTIASSSAPVSIWSPASISPLSDPLSTSSSCMQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 KKTSPAREVSSESGTSGQFTPPSSTSVPTIASSSAPVSIWSPASISPLSDPLSTSSSCMQ
120 130 140 150 160 170
190 200 210 220 230 240
pF1KB7 RSYPMTYTQASGYSQGYAGSTSYFGGMDCGSYLTPMHHQLPGPGATLSPMGTNAVTSHLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RSYPMTYTQASGYSQGYAGSTSYFGGMDCGSYLTPMHHQLPGPGATLSPMGTNAVTSHLN
180 190 200 210 220 230
250 260 270 280 290
pF1KB7 QSPASLSTQGYGASSLGFNSTTDCLDYKDQTASWKLNFNADCLDYKDQTSSWKFQVL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 QSPASLSTQGYGASSLGFNSTTDCLDYKDQTASWKLNFNADCLDYKDQTSSWKFQVL
240 250 260 270 280
>>CCDS1873.1 OTX1 gene_id:5013|Hs108|chr2 (354 aa)
initn: 845 init1: 442 opt: 758 Z-score: 530.9 bits: 106.5 E(32554): 3e-23
Smith-Waterman score: 987; 56.1% identity (68.5% similar) in 337 aa overlap (1-269:1-321)
10 20 30 40 50 60
pF1KB7 MMSYLKQPPYAVNGLSLTTSGMDLLHPSVGYPGPWASCPAATPRKQRRERTTFTRAQLDV
::::::::::..:::.:. .::::::::::: :::::::::::::::.::::
CCDS18 MMSYLKQPPYGMNGLGLAGPAMDLLHPSVGYP--------ATPRKQRRERTTFTRSQLDV
10 20 30 40 50
70 80 90 100 110 120
pF1KB7 LEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQQQQQNGGQNKVRPAK
::::::::::::::::::::::::::::::::::::::::::::: :.:. .: ::::
CCDS18 LEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQQ---QSGSGTKSRPAK
60 70 80 90 100
130 140 150
pF1KB7 KKTSPAREVSSESGTSGQFTPP---SSTSVPTIASSSA--------------PV------
::.::.:: :: : .::::::: ::.: . ::::. ::
CCDS18 KKSSPVRE-SSGSESSGQFTPPAVSSSASSSSSASSSSANPAAAAAAGLGGNPVAAASSL
110 120 130 140 150 160
160 170 180 190
pF1KB7 ------SIWSPASISPLSDPLSTS----------SSCMQRS-----------YPMTYTQA
:::::::::: : : :.: .:::::: :::.: :.
CCDS18 STPAASSIWSPASISPGSAPASVSVPEPLAAPSNTSCMQRSVAAGAATAAASYPMSYGQG
170 180 190 200 210 220
200 210 220 230 240
pF1KB7 SGYSQGY-AGSTSYFGGMDCGSYLTPMH-HQLPGPGATLSPMGTNAVTSHLNQSPAS---
..:.::: . :.:::::.::.:::.::: :. : ::::. .....: .. : .
CCDS18 GSYGQGYPTPSSSYFGGVDCSSYLAPMHSHHHPHQ---LSPMAPSSMAGHHHHHPHAHHP
230 240 250 260 270 280
250 260 270 280 290
pF1KB7 LST-------------QGYGASSLGFNSTTDCLDYKDQTASWKLNFNADCLDYKDQTSSW
:: ::::.:.:.:::. ::::::.
CCDS18 LSQSSGHHHHHHHHHHQGYGGSGLAFNSA-DCLDYKEPGAAAASSAWKLNFNSPDCLDYK
290 300 310 320 330 340
pF1KB7 KFQVL
CCDS18 DQASWRFQVL
350
>>CCDS12706.1 CRX gene_id:1406|Hs108|chr19 (299 aa)
initn: 646 init1: 410 opt: 564 Z-score: 401.4 bits: 82.3 E(32554): 4.9e-16
Smith-Waterman score: 899; 50.2% identity (70.7% similar) in 317 aa overlap (1-297:1-299)
10 20 30 40 50
pF1KB7 MMSYLKQPP-YAVNGLSLTTSGMDLLHPSVGYPGPWASCPAATPRKQRRERTTFTRAQLD
::.:.. : :.::.:.:. ..::.: .: :: ..:::::::::::::.::.
CCDS12 MMAYMNPGPHYSVNALALSGPSVDLMHQAVPYP--------SAPRKQRRERTTFTRSQLE
10 20 30 40 50
60 70 80 90 100 110
pF1KB7 VLEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRAKCRQQ-----QQQQQNGGQN
::::::::.:::.. ::::::::::::::::::::::::::::: :::: :::
CCDS12 ELEALFAKTQYPDVYAREEVALKINLPESRVQVWFKNRRAKCRQQRQQQKQQQQPPGGQA
60 70 80 90 100 110
120 130 140 150 160
pF1KB7 KVRPAKKKTS----PAREVSSES-GTSGQFTPPSSTSVPTIASSSAPVSIWSPASISPLS
:.::::.:.. :. .: . : : ...:: . ... : :::::::: :::
CCDS12 KARPAKRKAGTSPRPSTDVCPDPLGISDSYSPPLPGPSGSPTTAVATVSIWSPASESPLP
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB7 DP-----LSTSSSCMQRSYPMTYTQASGYSQG---YAGSTSYFGGMDCGSYLTPMHHQLP
. .... : . : :::. ::.. .. :.. .:::.:.: ::.:: ::
CCDS12 EAQRAGLVASGPSLTSAPYAMTYAPASAFCSSPSAYGSPSSYFSGLD--PYLSPMVPQLG
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB7 GPGATLSPMGTNAVTSHLNQSPASLSTQGYGASSLGFNSTTDCLDYKDQTASWKLNFNA-
::. :::.. .: : :::.::: :.::: : .: :..:: :..::...:
CCDS12 GPA--LSPLSGPSVGPSLAQSPTSLSGQSYGAY-----SPVDSLEFKDPTGTWKFTYNPM
240 250 260 270 280
290
pF1KB7 DCLDYKDQTSSWKFQVL
: :::::: :.::::.:
CCDS12 DPLDYKDQ-SAWKFQIL
290
297 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 08:05:28 2016 done: Sat Nov 5 08:05:28 2016
Total Scan time: 2.180 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]