FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9705, 335 aa
1>>>pF1KB9705 335 - 335 aa - 335 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.7207+/-0.000897; mu= 4.0220+/- 0.055
mean_var=287.4366+/-60.265, 0's: 0 Z-trim(114.9): 79 B-trim: 0 in 0/56
Lambda= 0.075649
statistics sampled from 15345 (15422) to 15345 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.771), E-opt: 0.2 (0.474), width: 16
Scan time: 2.470
The best scores are:                                       opt bits E(32554)
CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2          ( 343) 2236 256.8 1.9e-68
CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7          ( 388)  931 114.4 1.6e-25
CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12         ( 330)  814 101.5 9.7e-22
CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17       ( 284)  669  85.6 5.1e-17
>>CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 (343 aa)
initn: 2236 init1: 2236 opt: 2236 Z-score: 1342.3 bits: 256.8 E(32554): 1.9e-68
Smith-Waterman score: 2236; 100.0% identity (100.0% similar) in 335 aa overlap (1-335:9-343)
                       10        20        30        40        50
pF1KB9         MDGLRADGGGAGGAPASSSSSSVAAAAASGQCRGFLSAPVFAGTHSGRAAAA
               ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MSRAGSWDMDGLRADGGGAGGAPASSSSSSVAAAAASGQCRGFLSAPVFAGTHSGRAAAA
               10        20        30        40        50        60

             60        70        80        90       100       110
pF1KB9 AAAAAAAAAAASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKECPAPTPAAAAAAPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 AAAAAAAAAAASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKECPAPTPAAAAAAPP
               70        80        90       100       110       120

            120       130       140       150       160       170
pF1KB9 SAPALGYGYHFGNGYYSCRMSHGVGLQQNALKSSPHASLGGFPVEKYMDVSGLASSSVPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SAPALGYGYHFGNGYYSCRMSHGVGLQQNALKSSPHASLGGFPVEKYMDVSGLASSSVPA
              130       140       150       160       170       180

            180       190       200       210       220       230
pF1KB9 NEVPARAKEVSFYQGYTSPYQHVPGYIDMVSTFGSGEPRHEAYISMEGYQSWTLANGWNS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 NEVPARAKEVSFYQGYTSPYQHVPGYIDMVSTFGSGEPRHEAYISMEGYQSWTLANGWNS
              190       200       210       220       230       240

            240       250       260       270       280       290
pF1KB9 QVYCTKDQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 QVYCTKDQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKF
              250       260       270       280       290       300

            300       310       320       330
pF1KB9 INKDKRRRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTVS
       :::::::::::::::::::::::::::::::::::::::::::
CCDS22 INKDKRRRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTVS
              310       320       330       340

>>CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 (388 aa)
initn: 1043 init1: 401 opt: 931 Z-score: 571.9 bits: 114.4 E(32554): 1.6e-25
Smith-Waterman score: 931; 50.1% identity (71.1% similar) in 353 aa overlap (6-333:53-387)
10 20
pF1KB9 MDGLRADGGG--------AGGAPASSSSSSVAAAA
: ::: ::: . ......::::
CCDS54 GGGLVADELNKNMEGAAAAAAAAAAAAAAGAGGGGFPHPAAAAAGGNFSVAAAAAAAAAA
30 40 50 60 70 80
30 40 50 60 70 80
pF1KB9 ASGQCRGFLS--APVFAGTHSGRAAAAAAAAAAAAAAASGFAYPGTSERTGSSSSSSSSA
:..:::.... ::. :. :. ..: . : .:::::.. : ... ..:::.. . :
CCDS54 AANQCRNLMAHPAPLAPGAASAYSSAPGEAPPSAAAAAAAAAAAAAAAAAASSSGGPGPA
90 100 110 120 130 140
90 100 110 120 130 140
pF1KB9 VVAARPEAPPAKECPAPTPAAAAAAPPSAPA-LGYGYHFGNGYYSC-RMSHGVGLQQNAL
:. : ::.: .: .::: :.:: : ::: ::.::: : :: : . ::.
CCDS54 GPAG---AEAAKQC---SPCSAAAQSSSGPAALPYGY-FGSGYYPCARM----GPHPNAI
150 160 170 180 190
150 160 170 180 190
pF1KB9 KSSPH----ASLGGFPVEKYMDVSGLASSSVPANEVPARAKEVSFY-QGYTS-PYQH---
:: . :. ..: ..::::..: : :.: .:::: .:: :::.. ::.:
CCDS54 KSCAQPASAAAAAAF-ADKYMDTAGPA-----AEEFSSRAKEFAFYHQGYAAGPYHHHQP
200 210 220 230 240
200 210 220 230 240 250
pF1KB9 VPGYIDM--VSTFGS-GEPRHEAY-ISMEGYQSWTLANGWNSQVYCTKDQPQGSHFWKSS
.:::.:: : .:. :: ::: . ::.:: :.: ::::.:.:: :.: : :.:::.
CCDS54 MPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCPKEQAQPPHLWKST
250 260 270 280 290 300
260 270 280 290 300 310
pF1KB9 FPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKFINKDKRRRISAATNLSER
.: ::. . : :::::::::::::.:::::: ::: ::::.:::::::::.::::::
CCDS54 LP-DVVSHPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKRRRISATTNLSER
310 320 330 340 350 360
320 330
pF1KB9 QVTIWFQNRRVKDKKIVSKLKDTVS
::::::::::::.::...::: :
CCDS54 QVTIWFQNRRVKEKKVINKLKTTS
370 380
>>CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 (330 aa)
initn: 888 init1: 392 opt: 814 Z-score: 503.7 bits: 101.5 E(32554): 9.7e-22
Smith-Waterman score: 898; 48.3% identity (70.8% similar) in 329 aa overlap (8-331:29-323)
10 20 30
pF1KB9 MDGLRADGGGAGGAPASSSSSSVAAAAASGQCRGFLSAP
:::.::. ..... :.: : : ..:
CCDS88 MTTSLLLHPRWPESLMYVYEDSAAESGIGGGGGGGGGGTGG-------AGGGCSG--ASP
10 20 30 40 50
40 50 60 70 80 90
pF1KB9 VFAGTHSGRAAAAAAAAAAAAAAASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKEC
: . .: ... :. .. : . :. ... . . : ::: :..:
CCDS88 GKAPSMDGLGSSCPASHCRDLLPHPVLGRPPAP--LGAPQGAVYTDIPA--PEA--ARQC
60 70 80 90 100
100 110 120 130 140 150
pF1KB9 PAPTPAAAAAAPP--SAPALGYGYHFGNGYYSCRMSHGVGLQQNALKSSPHASLGGFPVE
:: :: :: :. .::::: ::..::.::.::.:.:::. : : : .
CCDS88 -APPPA-----PPTSSSATLGYGYPFGGSYYGCRLSHNVNLQQK-----PCAY---HPGD
110 120 130 140 150
160 170 180 190 200 210
pF1KB9 KYMDVSGLASSSVPANEVPARAKEVSFYQGYTSPYQHVPGYIDMVSTFG-SG--EPRHEA
:: . :: ..:.... .:::: .:: ...: :: .:::.:. . : :: ::::.:
CCDS88 KYPEPSG----ALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVSVVPGISGHPEPRHDA
160 170 180 190 200
220 230 240 250 260 270
pF1KB9 YISMEGYQSWTLANGWNSQVYCTKDQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVP
: .:::: :.:.:::.:::::.:.: :..:.::: :: ::. ::.. ::::::::::
CCDS88 LIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFP-DVVPLQPEVSSYRRGRKKRVP
210 220 230 240 250 260
280 290 300 310 320 330
pF1KB9 YTKLQLKELENEYAINKFINKDKRRRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTV
:::.::::::.::: .:::.:.:::::::.::::::::::::::::::.::.::: :
CCDS88 YTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNRRVKEKKVVSKSKAPH
270 280 290 300 310 320
pF1KB9 S
CCDS88 LHST
330
>>CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 (284 aa)
initn: 726 init1: 568 opt: 669 Z-score: 419.0 bits: 85.6 E(32554): 5.1e-17
Smith-Waterman score: 749; 47.8% identity (74.1% similar) in 247 aa overlap (93-335:57-283)
70 80 90 100 110 120
pF1KB9 ASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKECPAPTPAAAAAAPPSAPALGYGYH
: : :.: : :.. .. : :: . :::
CCDS11 LVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCH-PCPGVPQGTSP-AP-VPYGY-
30 40 50 60 70 80
130 140 150 160 170 180
pF1KB9 FGNGYYSCRMSHGVGLQQNALKSSPHA-SLGGFPVEKYMDVSGLASSSVPANEVPARAKE
::.::::::.: ...:: .: .:...:.: . . ..: :.: :
CCDS11 FGGGYYSCRVS------RSSLKPCAQAATLAAYPAE----------TPTAGEEYPSRPTE
90 100 110 120
190 200 210 220 230
pF1KB9 VSFYQGYTSPYQHVPGYIDM--VSTFGS-GEPRHEAYISMEGYQSWTLANGWNSQVYCTK
.:: :: . :: . .:.:. :.:.:. :::::.. . ...::::.::.:::::. :
CCDS11 FAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQG
130 140 150 160 170 180
240 250 260 270 280 290
pF1KB9 DQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKFINKDKR
.: . :::..: . . . :: :..:::::::.::.: ::.::: ::: ::::.::::
CCDS11 EQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKR
190 200 210 220 230 240
300 310 320 330
pF1KB9 RRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTVS
:.:::::.:::::.::::::::::.::...:.:....
CCDS11 RKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP
250 260 270 280
335 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:26:34 2016 done: Fri Nov 4 18:26:34 2016
Total Scan time: 2.470 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]