FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9733, 483 aa
1>>>pF1KB9733 483 - 483 aa - 483 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.7020+/-0.00104; mu= -5.4194+/- 0.063
mean_var=561.1444+/-116.033, 0's: 0 Z-trim(118.1): 29 B-trim: 189 in 1/54
Lambda= 0.054142
statistics sampled from 18893 (18922) to 18893 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.581), width: 16
Scan time: 3.030
The best scores are:                                      opt  bits E(32554)
CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16        ( 483) 3386 278.7 9.6e-75
CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16        ( 482) 3366 277.1 2.8e-74
CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5         ( 471) 1134 102.8 8.5e-22
CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16        ( 501)  802  76.9 5.6e-14
CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5          ( 519)  705  69.3 1.1e-11
>>CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16 (483 aa)
initn: 3386 init1: 3386 opt: 3386 Z-score: 1455.4 bits: 278.7 E(32554): 9.6e-75
Smith-Waterman score: 3386; 99.8% identity (99.8% similar) in 483 aa overlap (1-483:1-483)
10 20 30 40 50 60
pF1KB9 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAASGCERLQGPPTPAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAASGCERLQGPPTPAG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 KETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP
:::::::::::::: :::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KETEGSLSDSDFKEPPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB9 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB9 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM
430 440 450 460 470 480
pF1KB9 SDI
:::
CCDS10 SDI
>>CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16 (482 aa)
initn: 1908 init1: 1908 opt: 3366 Z-score: 1446.9 bits: 277.1 E(32554): 2.8e-74
Smith-Waterman score: 3366; 99.6% identity (99.6% similar) in 483 aa overlap (1-483:1-482)
10 20 30 40 50 60
pF1KB9 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAASGCERLQGPPTPAG
::::::::::::::::::::::::::::::::::::::: ::::::::::::::::::::
CCDS58 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAG-AEQKAASGCERLQGPPTPAG
190 200 210 220 230
250 260 270 280 290 300
pF1KB9 KETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP
:::::::::::::: :::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KETEGSLSDSDFKEPPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP
240 250 260 270 280 290
310 320 330 340 350 360
pF1KB9 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP
300 310 320 330 340 350
370 380 390 400 410 420
pF1KB9 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH
360 370 380 390 400 410
430 440 450 460 470 480
pF1KB9 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM
420 430 440 450 460 470
pF1KB9 SDI
:::
CCDS58 SDI
480
>>CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5 (471 aa)
initn: 970 init1: 655 opt: 1134 Z-score: 504.8 bits: 102.8 E(32554): 8.5e-22
Smith-Waterman score: 1246; 51.5% identity (69.8% similar) in 443 aa overlap (1-424:1-405)
10 20 30 40 50
pF1KB9 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSP-
:::::::::: .::::::::::..:....::..::.::.:::::::: ::.:::: .
CCDS38 MSYPQGYLYQAPGSLALYSCPAYGASALAAPRSEELARSASGSAFSPYPGSAAFTAQAAT
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 GYNSHLQYGADPAAAAAAAFSSYVGSPYD-HTPGMAGSLGYHPYAAPLGSYPY--GDPAY
:..: :::.:: ::::::.: ::.:.::: :: ::.:...::::.. ..::: .::::
CCDS38 GFGSPLQYSAD-AAAAAAGFPSYMGAPYDAHTTGMTGAISYHPYGS--AAYPYQLNDPAY
70 80 90 100 110
120 130 140 150 160 170
pF1KB9 RKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 RKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENK
120 130 140 150 160 170
180 190 200 210 220
pF1KB9 MTWTPRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQ---------KAAS
:::.:::.::::.:.:. : .. .. :.: .. . . . : . . .: :
CCDS38 MTWAPRNKSEDEDEDEG-DATRSKDESPDKAQEGTETSAEDEGISLHVDSLTDHSCSAES
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB9 GCERLQ---GPPT-PAGKETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAARL
:.: : : .:.: . . .: . : .: .: .::. :: : :
CCDS38 DGEKLPCRAGDPLCESGSECKDKYDDLEDDEDDDEEGERGL-APPKPVTSSPLTGLEAPL
240 250 260 270 280 290
290 300 310 320 330 340
pF1KB9 AEDPAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATS
: : .:: : ..: : .. : :::: .:::::::::::::
CCDS38 LSPP----PEAAPRGG-----RKTPQGS------RTSPGAPPPA--SKPKLWSLAEIATS
300 310 320 330
350 360 370 380 390 400
pF1KB9 SDKVKDGGGGNEGSPCPPCPG-PIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPL
. : . : : : : :: : :. ::: . .: . :.:.. .:.:::
CCDS38 DLKQPSLGPG-----CGP-PGLP----------AAAAPASTGAPPGGSPYPASPLLGRPL
340 350 360 370 380
410 420 430 440 450 460
pF1KB9 YYTAPFYPGYTNYGSFGH-LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLR
:::.::: .:::::... :.:.
CCDS38 YYTSPFYGNYTNYGNLNAALQGQGLLRYNSAAAAPGEALHTAPKAASDAGKAGAHPLESH
390 400 410 420 430 440
>>CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16 (501 aa)
initn: 730 init1: 522 opt: 802 Z-score: 364.4 bits: 76.9 E(32554): 5.6e-14
Smith-Waterman score: 811; 38.4% identity (56.3% similar) in 497 aa overlap (1-465:1-479)
10 20 30 40
pF1KB9 MSYPQ-GYLY----QPSASLALYSCPAYSTSVISG--PRTDELGRSSSGSAF------SP
::.:: :: : :: . . . :... .: ..::. :.: : .:
CCDS10 MSFPQLGYQYIRPLYPSERPGAAGGSGGSAGARGGLGAGASELNASGSLSNVLSSVYGAP
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB9 YAGSTAFTAPSPGYNSHLQYGAD-PAAAAAAAFSSYVGSPYDHTPGMAGSLGY-HPYAAP
::...: .: . ::.. : :.:. : .: :: . :. :... . :: :
CCDS10 YAAAAA-AAAAQGYGAFLPYAAELPIFPQLGAQYELKDSPGVQHPAAAAAFPHPHPAFYP
70 80 90 100 110
110 120 130 140 150 160
pF1KB9 LGSYPYGDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFA
:.: .:::. :::::..:.:::::::::::::::::::::::::::::::::::::::
CCDS10 YGQYQFGDPSRPKNATRESTSTLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFA
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB9 NARRRLKKENKMTWTPRNRSEDEEEEENIDLEKNDEDEPQKPED---KGDPEGPEAGGAE
::::::::::::::.::.:.. :: . :...::: . :: . . : : :: :
CCDS10 NARRRLKKENKMTWAPRSRTD--EEGNAYGSEREEEDEEEDEEDGKRELELEEEELGGEE
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB9 QKAASGCERLQGPPTPAGKETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAAR
. .: : : .. : .: . : : : .: : : : :: .
CCDS10 ED--TGGEGLADD----DEDEEIDLENLDGAATEPE---LSLAGAARRDGDLGLGPISDS
240 250 260 270 280
290 300 310 320 330
pF1KB9 LAEDPAPH--------YPAGAPAPGPHPAAGEVPPGPGGP-SVIHSPPPPPPPAVLAKPK
: :. . ::.: :.: : :. : :. : : : ..: :::
CCDS10 KNSDSEDSSEGLEDRPLPVLSLAPAPPPVAVASPSLPSPPVSLDPCAPAPAPASALQKPK
290 300 310 320 330 340
340 350 360 370 380 390
pF1KB9 LWSLAEIATSSDKVKDGGGGNEGSPCPPCPGP-IAGQALGGSRASPAPAPSRSPSAQC-P
.::::: ::: :. . . : ::: :: .: .:: : :. : : : ::
CCDS10 IWSLAETATSPDNPRRSPPGAGGSP----PGAAVAPSALQLSPAAAAAAAHRLVSAPLGK
350 360 370 380 390 400
400 410 420 430 440
pF1KB9 FPGGTVLSRPLYYTAP---FYPGYTNYGSFGHLHGHPGPGPGPTTGPGSHFNGLNQTVLN
::. : .::. : ..: .. :: : :: . :... . . . .
CCDS10 FPAWT--NRPFPGPPPGPRLHPLSLLGSAPPHLLGLPGAAGHPAAAAAFARPAEPEGGTD
410 420 430 440 450 460
450 460 470 480
pF1KB9 RADALAKDPKMLRSQSQLDLCKDSPYELKKGMSDI
: .:: . :.:.. :
CCDS10 RCSALEVEKKLLKTAFQPVPRRPQNHLDAALVLSALSSS
470 480 490 500
>>CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5 (519 aa)
initn: 562 init1: 451 opt: 705 Z-score: 323.2 bits: 69.3 E(32554): 1.1e-11
Smith-Waterman score: 705; 38.7% identity (59.6% similar) in 406 aa overlap (11-394:39-422)
10 20 30
pF1KB9 MSYPQGYLYQPSASL-ALYSCPAYSTSVISGPRTDELGRS
:.:: : ::.: . ... : . . .
CCDS38 PYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAA
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB9 SSGSAFSPYAGSTAFTAPSPGYNSHLQYGADPAA-AAAAAFSSYVGSPYDHTPGMA-GSL
. : .::.:: ::.... ::.. .: . .:.: :: : :.: ..
CCDS38 ALGVYGGPYGGS-------QGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHG-GLAPAAA
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB9 GYHPYAAPLGSYPYG------DPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI
.:.:: ::.::: . . ::::::..:.::::::.::::::::::::::::::
CCDS38 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI
130 140 150 160 170 180
160 170 180 190 200
pF1KB9 ITKMTLTQVSTWFANARRRLKKENKMTWTPRNRSEDE-----EEEENIDLEKNDEDEPQK
:::::::::::::::::::::::::::: :::. :: : ::. :.. ..:: :
CCDS38 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK
190 200 210 220 230 240
210 220 230 240 250 260
pF1KB9 PEDKGDPEGPEAGGAEQKAASGCERLQG-PPTPAGKETEGSLSDSDFKETPS--EGRLDA
...: : : : . . . :.. ::. : :: :. ....:. .: .
CCDS38 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSL-DGGLERVPAAPDGPVKE
250 260 270 280 290
270 280 290 300 310 320
pF1KB9 LQGPPRTGGPSPAGPAAARLAED-PAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSP--
.: : : :. ..: : :: . . : ::.: .: . :::.: ..
CCDS38 ASGALRM---SLAAGGGAALDEDLERARSCLRSAAAGPEP----LPGAEGGPQVCEAKLG
300 310 320 330 340 350
330 340 350 360 370
pF1KB9 --PPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASP
: .. :::..::::. ::.. . . . .: : : : : . :.
CCDS38 FVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTE---FPSCMLKRQGPA---APAAV
360 370 380 390 400
380 390 400 410 420 430
pF1KB9 APAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSH
. ::. :::. : :
CCDS38 SSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLD
410 420 430 440 450 460
483 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 04:32:16 2016 done: Sun Nov 6 04:32:16 2016
Total Scan time: 3.030 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]
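
The "The best scores are:" summary near the top of this report lists one hit per line as accession, description, library sequence length in parentheses, then the opt score, bit score, and E-value. A minimal sketch of reading those rows follows. It assumes the rows keep exactly that shape; the names `Hit`, `HIT_ROW`, and `parse_best_scores` are hypothetical helpers introduced for illustration and are not part of the FASTA package.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the best-scores summary (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row shape: "<accession> <description> ( <len>) <opt> <bits> <E>"
HIT_ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<e>\d[\d.]*(?:[eE][-+]?\d+)?)\s*$"
)


def parse_best_scores(report: str) -> List[Hit]:
    """Collect the summary rows that follow the 'The best scores are:' header."""
    hits: List[Hit] = []
    in_table = False
    for line in report.splitlines():
        if line.startswith("The best scores are:"):
            in_table = True
            continue
        if not in_table:
            continue
        if not line.strip():
            if hits:
                break      # a blank line after at least one row ends the table
            continue       # tolerate blank lines right after the header
        match = HIT_ROW.match(line)
        if match is None:
            break          # first non-row line (e.g. a '>>' alignment header)
        hits.append(
            Hit(
                accession=match["acc"],
                description=match["desc"],
                length=int(match["len"]),
                opt=int(match["opt"]),
                bits=float(match["bits"]),
                evalue=float(match["e"]),
            )
        )
    return hits
```

Calling `parse_best_scores(text)` on the full text of this report should yield five `Hit` records, which can then be filtered by `evalue` or sorted by `bits`. The alignment sections that begin with `>>` are deliberately left unparsed in this sketch.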