FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7749, 443 aa
1>>>pF1KB7749 443 - 443 aa - 443 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 13.9168+/-0.00117; mu= -16.4737+/- 0.071
mean_var=687.4279+/-141.307, 0's: 0 Z-trim(118.1): 102 B-trim: 259 in 2/53
Lambda= 0.048917
statistics sampled from 18885 (18988) to 18885 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.583), width: 16
Scan time: 2.910
The best scores are: opt bits E(32554)
CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7 ( 443) 3151 236.7 3.5e-62
CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 ( 431) 959 82.0 1.3e-15
CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2 ( 432) 883 76.7 5.2e-14
CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 ( 358) 746 66.9 3.7e-11
>>CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7 (443 aa)
initn: 3151 init1: 3151 opt: 3151 Z-score: 1229.9 bits: 236.7 E(32554): 3.5e-62
Smith-Waterman score: 3151; 100.0% identity (100.0% similar) in 443 aa overlap (1-443:1-443)
10 20 30 40 50 60
pF1KB7 MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 HPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCA
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 GDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLTERQIKIWFQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLTERQIKIWFQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 NRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPYEPQSPPPFSKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPYEPQSPPPFSKP
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 PQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGLQGNGSYGTPHI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGLQGNGSYGTPHI
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB7 QGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHHHGPGPGEPHPT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHHHGPGPGEPHPT
370 380 390 400 410 420
430 440
pF1KB7 YTDLTGHHPSQGRIQEAPKLTHL
:::::::::::::::::::::::
CCDS54 YTDLTGHHPSQGRIQEAPKLTHL
430 440
>>CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 (431 aa)
initn: 831 init1: 605 opt: 959 Z-score: 394.1 bits: 82.0 E(32554): 1.3e-15
Smith-Waterman score: 1415; 50.4% identity (70.6% similar) in 476 aa overlap (1-443:1-431)
10 20 30 40 50
pF1KB7 MQKATYYDSSA--IYGGYP-YQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSS
::::::::..: ..::: : ..:::.... :: : .:: .:.:.: :::::: ..
CCDS11 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVPPQP-PFQAATHLEGDYQRSACSLQSLGN
10 20 30 40 50
60 70 80 90 100 110
pF1KB7 AGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPA
:. : :..::. .:.: :.:. : : .:::. ::. :: . . .
CCDS11 AAPHAKSKELNGSCMR---------PGLA------PEPLSAPPGSPPPSAAPTSATSNSS
60 70 80 90 100
120 130 140 150 160 170
pF1KB7 APPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGE
::...::. . : :: :..::::::::::::..: :..: ...:
CCDS11 NGGGPSKSGPPKCG------------PGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAE
110 120 130 140 150
180 190 200 210
pF1KB7 SCAG-----------------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNR
.:.: ::::::.:.::::::::::::::::::::::::
CCDS11 GCGGGGGGGGGGGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNR
160 170 180 190 200 210
220 230 240 250 260 270
pF1KB7 YLCRPRRVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--G
::::::::::::::::.::::::::::::::::::::.::. .:::: ::. :: : .
CCDS11 YLCRPRRVEMANLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQS
220 230 240 250 260 270
280 290 300 310 320 330
pF1KB7 AGGYLNSMHSLVNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAA
..:..:..::.. : :: ::: :.: :..:.:: ..: : .:. : ..: .
CCDS11 TAGFMNALHSMTPS--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPT
280 290 300 310 320
340 350 360 370 380
pF1KB7 GAGAGGTPDYDPHAHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLP
: :.:.::. ::.:: .:::: .:::::.:::. :..:. .::.:.::.::
CCDS11 PA-----PEYEPHV--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLS
330 340 350 360 370
390 400 410 420 430 440
pF1KB7 HAASGAMDYGGAGPLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL
: :: .::.:: :.. ..:::: :::::::::..:: : :::::::::::::
CCDS11 HHPSGNLDYNGAPPMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL
380 390 400 410 420 430
>>CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2 (432 aa)
initn: 989 init1: 570 opt: 883 Z-score: 365.1 bits: 76.7 E(32554): 5.2e-14
Smith-Waterman score: 1401; 52.0% identity (72.4% similar) in 450 aa overlap (1-443:17-432)
10 20 30 40
pF1KB7 MQKATYYDSSAIYGGYPY-QAANGFAYNANQQPYPASAALGA-D
::::.::.. ...::: : .... ..:.. .:::: :: .. :
CCDS22 MLFEQGQQALELPECTMQKAAYYENPGLFGGYGYSKTTDTYGYSTPHQPYPPPAAASSLD
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB7 GEYHRPACSLQS--PSSAGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPP
.: :::.:: : : .: :. ::. .:.: :. : : :: .
CCDS22 TDYPGSACSIQSSAPLRAPAH-KGAELNGSCMR-----PGTGNSQGGGGGSQPPGLNSE-
70 80 90 100 110
110 120 130 140 150 160
pF1KB7 APQPPQPAPQPPAPTPAAPPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMK
::::: : ::. :..: :... : .. ...: ::..: : :..::::::::
CCDS22 -QQPPQPPPPPPTLPPSSPTNPGGGVPAKKPKGGP---NASSS----SATISKQIFPWMK
120 130 140 150 160
170 180 190 200 210 220
pF1KB7 ESRQNTKQKTSSSSSGESCAGDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPR
:::::.:::.: ...:::: :::::: :: ::.::::::::::::::::::::::::::
CCDS22 ESRQNSKQKNSCATAGESCE-DKSPPGPAS-KRVRTAYTSAQLVELEKEFHFNRYLCRPR
170 180 190 200 210 220
230 240 250 260 270 280
pF1KB7 RVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSM
:::::::::::::::::::::::::::::::.::.: : ..::: ::: ::.:..
CCDS22 RVEMANLLNLTERQIKIWFQNRRMKYKKDQKAKGILHSPASQSPERSPPLGGAAGHVAYS
230 240 250 260 270 280
290 300 310 320 330
pF1KB7 HSL--VNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGG
.: : .. :. ::: :.: . ::: :.: : : :: : ::::.:
CCDS22 GQLPPVPGLAYDAPSPPAFAKSQPNMYGL--AAYTAPLSSCLP---QQKRYAA-------
290 300 310 320 330
340 350 360 370 380 390
pF1KB7 TPDYDPHAHGLQGNGSYGTPHIQGSPVFVGGSYVEPMS-NSGPALFGLTHLPHAASGAMD
:...:: . .:.: ... ..:::::.:::..:: :. ::: .:.: :: : .:...:
CCDS22 -PEFEPHPMASNGGG-FASANLQGSPVYVGGNFVESMAPASGP-VFNLGHLSHPSSASVD
340 350 360 370 380
400 410 420 430 440
pF1KB7 YGGAGPLGSGHHHGPGPGEPHPTYTDLTGHHPSQGRIQEAPKLTHL
:. :. . ..::::: .::::::::..:: ::::. ::::::::
CCDS22 YSCAAQIPGNHHHGPC--DPHPTYTDLSAHHSSQGRLPEAPKLTHL
390 400 410 420 430
>>CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 (358 aa)
initn: 831 init1: 605 opt: 746 Z-score: 313.8 bits: 66.9 E(32554): 3.7e-11
Smith-Waterman score: 1229; 53.9% identity (74.3% similar) in 373 aa overlap (105-443:3-358)
80 90 100 110 120 130
pF1KB7 LSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPPP---PSSASPPQNA
: ::.: . :..::: :.::. ..
CCDS82 MRPGLAPEPLSAPPGSPPPSAAPTSATSNSSN
10 20 30
140 150 160 170 180
pF1KB7 SNNPTPANAAK-SPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCAG---------
...:. .. : .: :: :..::::::::::::..: :..: ...:.:.:
CCDS82 GGGPSKSGPPKCGPGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGGG
40 50 60 70 80 90
190 200 210 220
pF1KB7 --------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANL
::::::.:.:::::::::::::::::::::::::::::::::::::
CCDS82 SGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANL
100 110 120 130 140 150
230 240 250 260 270 280
pF1KB7 LNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--GAGGYLNSMHSLVN
:::.::::::::::::::::::::.::. .:::: ::. :: : ...:..:..::..
CCDS82 LNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMTP
160 170 180 190 200 210
290 300 310 320 330 340
pF1KB7 SVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPH
: :: ::: :.: :..:.:: ..: : .:. : ..: . : :.:.::
CCDS82 S--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPTPA-----PEYEPH
220 230 240 250
350 360 370 380 390 400
pF1KB7 AHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLPHAASGAMDYGGAG
. ::.:: .:::: .:::::.:::. :..:. .::.:.::.:: : :: .::.::
CCDS82 V--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAP
260 270 280 290 300 310
410 420 430 440
pF1KB7 PLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL
:.. ..:::: :::::::::..:: : :::::::::::::
CCDS82 PMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL
320 330 340 350
443 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 09:22:49 2016 done: Fri Nov 4 09:22:49 2016
Total Scan time: 2.910 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]