FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5452, 351 aa
1>>>pF1KE5452 351 - 351 aa - 351 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8204+/-0.000991; mu= 8.4525+/- 0.058
mean_var=129.0562+/-26.798, 0's: 0 Z-trim(108.3): 150 B-trim: 0 in 0/50
Lambda= 0.112898
statistics sampled from 9917 (10096) to 9917 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.677), E-opt: 0.2 (0.31), width: 16
Scan time: 1.700
The best scores are: opt bits E(32554)
CCDS4397.1 THOC3 gene_id:84321|Hs108|chr5 ( 351) 2440 409.0 3.1e-114
CCDS3012.1 WDR5B gene_id:54554|Hs108|chr3 ( 330) 338 66.6 3.4e-11
CCDS55859.1 POC1B gene_id:282809|Hs108|chr12 ( 436) 338 66.7 4.2e-11
CCDS31869.1 POC1B gene_id:282809|Hs108|chr12 ( 478) 338 66.7 4.6e-11
>>CCDS4397.1 THOC3 gene_id:84321|Hs108|chr5 (351 aa)
initn: 2440 init1: 2440 opt: 2440 Z-score: 2164.3 bits: 409.0 E(32554): 3.1e-114
Smith-Waterman score: 2440; 100.0% identity (100.0% similar) in 351 aa overlap (1-351:1-351)
10 20 30 40 50 60
pF1KE5 MAVPAAAMGPSALGQSGPGSMAPWCSVSSGPSRYVLGMQELFRGHSKTREFLAHSAKVHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MAVPAAAMGPSALGQSGPGSMAPWCSVSSGPSRYVLGMQELFRGHSKTREFLAHSAKVHS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 VAWSCDGRRLASGSFDKTASVFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPDLFVTASG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 VAWSCDGRRLASGSFDKTASVFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPDLFVTASG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 DKTIRIWDVRTTKCIATVNTKGENINICWSPDGQTIAVGNKDDVVTFIDAKTHRSKAEEQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 DKTIRIWDVRTTKCIATVNTKGENINICWSPDGQTIAVGNKDDVVTFIDAKTHRSKAEEQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 FKFEVNEISWNNDNNMFFLTNGNGCINILSYPELKPVQSINAHPSNCICIKFDPMGKYFA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FKFEVNEISWNNDNNMFFLTNGNGCINILSYPELKPVQSINAHPSNCICIKFDPMGKYFA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 TGSADALVSLWDVDELVCVRCFSRLDWPVRTLSFSHDGKMLASASEDHFIDIAEVETGDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 TGSADALVSLWDVDELVCVRCFSRLDWPVRTLSFSHDGKMLASASEDHFIDIAEVETGDK
250 260 270 280 290 300
310 320 330 340 350
pF1KE5 LWEVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPNDS
:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LWEVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPNDS
310 320 330 340 350
>>CCDS3012.1 WDR5B gene_id:54554|Hs108|chr3 (330 aa)
initn: 297 init1: 114 opt: 338 Z-score: 314.4 bits: 66.6 E(32554): 3.4e-11
Smith-Waterman score: 338; 26.3% identity (61.2% similar) in 289 aa overlap (51-330:37-320)
30 40 50 60 70 80
pF1KE5 MAPWCSVSSGPSRYVLGMQELFRGHSKTREFLAHSAKVHSVAWSCDGRRLASGSFDKTAS
...:. : :: .: .:. :::.: :.
CCDS30 RDAKAQLALSSSANQSKEVPENPNYALKCTLVGHTEAVSSVKFSPNGEWLASSSADRLII
10 20 30 40 50 60
90 100 110 120 130 140
pF1KE5 VFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPDLFVTASGDKTIRIWDVRTTKCIATVNT
.. . :.. ::. .... : :. . .:.:: :::...::::. ::. :..
CCDS30 IWGAYDGKY--EKTLYGHNLEISDVAWS-SDSSRLVSASDDKTLKLWDVRSGKCLKTLKG
70 80 90 100 110 120
150 160 170 180 190
pF1KE5 KGENINIC-WSPDGQTIAVGNKDDVVTFIDAKTHRS-KAEEQFKFEVNEISWNNDNNMFF
... . : ..: .. : :. :..: . ..:: . :. . :. . .: .....
CCDS30 HSNYVFCCNFNPPSNLIISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSGSLIV
130 140 150 160 170 180
200 210 220 230 240 250
pF1KE5 LTNGNG-C--INILSYPELKPVQSINAHPSNCICIKFDPMGKYFATGSADALVSLWDVDE
. .: : . : :: . . . : . .::.: :::. :.. : ..::: ..
CCDS30 SGSYDGLCRIWDAASGQCLKTLVDDDNPPVSF--VKFSPNGKYILTATLDNTLKLWDYSR
190 200 210 220 230 240
260 270 280 290 300 310
pF1KE5 LVCVRCFS--RLDWPVRTLSFS-HDGKMLASASEDHFIDIAEVETGDKLWEVQCESPT-F
:.. .. . . .:: :: ..:.:::... : ...: . . ..: .. . .
CCDS30 GRCLKTYTGHKNEKYCIFANFSVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVVI
250 260 270 280 290 300
320 330 340 350
pF1KE5 TVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPNDS
..: :: . :.: : ..:
CCDS30 SAACHPTENLIASAALENDKTIKLWMSNH
310 320 330
>>CCDS55859.1 POC1B gene_id:282809|Hs108|chr12 (436 aa)
initn: 374 init1: 128 opt: 338 Z-score: 312.7 bits: 66.7 E(32554): 4.2e-11
Smith-Waterman score: 338; 29.9% identity (61.9% similar) in 278 aa overlap (43-309:50-317)
20 30 40 50 60 70
pF1KE5 LGQSGPGSMAPWCSVSSGPSRYVLGMQELFRGHSKTREFLAHSAKVHSVAWSCDGRRLAS
:: : :: ::.: :.:: .: ::. ::.
CCDS55 VVTSVQFSPHGNLLASASRDRTVRLWIPDKRG--KFSEFKAHTAPVRSVDFSADGQFLAT
20 30 40 50 60 70
80 90 100 110 120 130
pF1KE5 GSFDKTASVFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPD--LFVTASGDKTIRIWDVR
.: ::. .:. . ..:.. . :: : : : . : :: :.:. : ::::.:::.
CCDS55 ASEDKSIKVWSMYRQRFLY-SLYR-HTHWVR--CAKFS-PDGRLIVSCSEDKTIKIWDTT
80 90 100 110 120 130
140 150 160 170 180
pF1KE5 TTKCIATV-NTKGENINICWSPDGQTIAVGNKDDVVTFIDAKTHRSKAEEQFKFE-VNEI
. .:. . .. : . ..:.: :: ...:..: :..... . : . :: :
CCDS55 NKQCVNNFSDSVGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNKLLQHYQVHSGGVNCI
140 150 160 170 180 190
190 200 210 220 230 240
pF1KE5 SWNNDNNMFFLTNGNGCINILSYPELKPVQSINAHPSNCICIKFDPMGKYFATGSADALV
:.. ..:... ....: ..::. : . . ....: . . ..:. :. ::.:.::. :
CCDS55 SFHPSGNYLITASSDGTLKILDLLEGRLIYTLQGHTGPVFTVSFSKGGELFASGGADTQV
200 210 220 230 240 250
250 260 270 280 290 300
pF1KE5 SLW--DVDELVCVRCFSRLDWPVRTLSFSHDGKML---ASASEDHFIDIAEVETGDKLW-
:: . ::: : . ... . .. : :. ..: . . : . :: . ::
CCDS55 LLWRTNFDELHC-KGLTKRN--LKRLHFDSPPHLLDIYPRTPHPHEEKVETVEINPKLEV
260 270 280 290 300
310 320 330 340 350
pF1KE5 -EVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPNDS
..: .:
CCDS55 IDLQISTPPVMDILSFDSTTTTETSGRTLPDKGEEACGYFLNPSLMSPECLPTTTKKKTE
310 320 330 340 350 360
>>CCDS31869.1 POC1B gene_id:282809|Hs108|chr12 (478 aa)
initn: 402 init1: 128 opt: 338 Z-score: 312.2 bits: 66.7 E(32554): 4.6e-11
Smith-Waterman score: 338; 29.9% identity (61.9% similar) in 278 aa overlap (43-309:92-359)
20 30 40 50 60 70
pF1KE5 LGQSGPGSMAPWCSVSSGPSRYVLGMQELFRGHSKTREFLAHSAKVHSVAWSCDGRRLAS
:: : :: ::.: :.:: .: ::. ::.
CCDS31 VVTSVQFSPHGNLLASASRDRTVRLWIPDKRG--KFSEFKAHTAPVRSVDFSADGQFLAT
70 80 90 100 110
80 90 100 110 120 130
pF1KE5 GSFDKTASVFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPD--LFVTASGDKTIRIWDVR
.: ::. .:. . ..:.. . :: : : : . : :: :.:. : ::::.:::.
CCDS31 ASEDKSIKVWSMYRQRFLY-SLYR-HTHWVR--CAKFS-PDGRLIVSCSEDKTIKIWDTT
120 130 140 150 160 170
140 150 160 170 180
pF1KE5 TTKCIATV-NTKGENINICWSPDGQTIAVGNKDDVVTFIDAKTHRSKAEEQFKFE-VNEI
. .:. . .. : . ..:.: :: ...:..: :..... . : . :: :
CCDS31 NKQCVNNFSDSVGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNKLLQHYQVHSGGVNCI
180 190 200 210 220 230
190 200 210 220 230 240
pF1KE5 SWNNDNNMFFLTNGNGCINILSYPELKPVQSINAHPSNCICIKFDPMGKYFATGSADALV
:.. ..:... ....: ..::. : . . ....: . . ..:. :. ::.:.::. :
CCDS31 SFHPSGNYLITASSDGTLKILDLLEGRLIYTLQGHTGPVFTVSFSKGGELFASGGADTQV
240 250 260 270 280 290
250 260 270 280 290 300
pF1KE5 SLW--DVDELVCVRCFSRLDWPVRTLSFSHDGKML---ASASEDHFIDIAEVETGDKLW-
:: . ::: : . ... . .. : :. ..: . . : . :: . ::
CCDS31 LLWRTNFDELHC-KGLTKRN--LKRLHFDSPPHLLDIYPRTPHPHEEKVETVEINPKLEV
300 310 320 330 340 350
310 320 330 340 350
pF1KE5 -EVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPNDS
..: .:
CCDS31 IDLQISTPPVMDILSFDSTTTTETSGRTLPDKGEEACGYFLNPSLMSPECLPTTTKKKTE
360 370 380 390 400 410
351 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 00:54:21 2016 done: Tue Nov 8 00:54:21 2016
Total Scan time: 1.700 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]