FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9733, 483 aa 1>>>pF1KB9733 483 - 483 aa - 483 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.7020+/-0.00104; mu= -5.4194+/- 0.063 mean_var=561.1444+/-116.033, 0's: 0 Z-trim(118.1): 29 B-trim: 189 in 1/54 Lambda= 0.054142 statistics sampled from 18893 (18922) to 18893 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.581), width: 16 Scan time: 3.030 The best scores are: opt bits E(32554) CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16 ( 483) 3386 278.7 9.6e-75 CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16 ( 482) 3366 277.1 2.8e-74 CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5 ( 471) 1134 102.8 8.5e-22 CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16 ( 501) 802 76.9 5.6e-14 CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5 ( 519) 705 69.3 1.1e-11 >>CCDS10751.1 IRX5 gene_id:10265|Hs108|chr16 (483 aa) initn: 3386 init1: 3386 opt: 3386 Z-score: 1455.4 bits: 278.7 E(32554): 9.6e-75 Smith-Waterman score: 3386; 99.8% identity (99.8% similar) in 483 aa overlap (1-483:1-483) 10 20 30 40 50 60 pF1KB9 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAASGCERLQGPPTPAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAASGCERLQGPPTPAG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 KETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP :::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::: CCDS10 KETEGSLSDSDFKEPPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB9 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB9 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM 430 440 450 460 470 480 pF1KB9 SDI ::: CCDS10 SDI >>CCDS58462.1 IRX5 gene_id:10265|Hs108|chr16 (482 aa) initn: 1908 init1: 1908 opt: 3366 Z-score: 1446.9 bits: 277.1 E(32554): 2.8e-74 Smith-Waterman score: 3366; 99.6% identity (99.6% similar) in 483 aa overlap (1-483:1-482) 10 20 30 40 50 60 pF1KB9 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSPG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 YNSHLQYGADPAAAAAAAFSSYVGSPYDHTPGMAGSLGYHPYAAPLGSYPYGDPAYRKNA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 TRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQKAASGCERLQGPPTPAG ::::::::::::::::::::::::::::::::::::::: :::::::::::::::::::: CCDS58 PRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAG-AEQKAASGCERLQGPPTPAG 190 200 210 220 230 250 260 270 280 290 300 pF1KB9 KETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP :::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::: CCDS58 KETEGSLSDSDFKEPPSEGRLDALQGPPRTGGPSPAGPAAARLAEDPAPHYPAGAPAPGP 240 250 260 270 280 290 310 320 330 340 350 360 pF1KB9 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 HPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCP 300 310 320 330 340 350 370 380 390 400 410 420 pF1KB9 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 PCPGPIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGH 360 370 380 390 400 410 430 440 450 460 470 480 pF1KB9 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLRSQSQLDLCKDSPYELKKGM 420 430 440 450 460 470 pF1KB9 SDI ::: CCDS58 SDI 480 >>CCDS3868.1 IRX2 gene_id:153572|Hs108|chr5 (471 aa) initn: 970 init1: 655 opt: 1134 Z-score: 504.8 bits: 102.8 E(32554): 8.5e-22 Smith-Waterman score: 1246; 51.5% identity (69.8% similar) in 443 aa overlap (1-424:1-405) 10 20 30 40 50 pF1KB9 MSYPQGYLYQPSASLALYSCPAYSTSVISGPRTDELGRSSSGSAFSPYAGSTAFTAPSP- :::::::::: .::::::::::..:....::..::.::.:::::::: ::.:::: . CCDS38 MSYPQGYLYQAPGSLALYSCPAYGASALAAPRSEELARSASGSAFSPYPGSAAFTAQAAT 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 GYNSHLQYGADPAAAAAAAFSSYVGSPYD-HTPGMAGSLGYHPYAAPLGSYPY--GDPAY :..: :::.:: ::::::.: ::.:.::: :: ::.:...::::.. ..::: .:::: CCDS38 GFGSPLQYSAD-AAAAAAGFPSYMGAPYDAHTTGMTGAISYHPYGS--AAYPYQLNDPAY 70 80 90 100 110 120 130 140 150 160 170 pF1KB9 RKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS38 RKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENK 120 130 140 150 160 170 180 190 200 210 220 pF1KB9 MTWTPRNRSEDEEEEENIDLEKNDEDEPQKPEDKGDPEGPEAGGAEQ---------KAAS :::.:::.::::.:.:. : .. .. :.: .. . . . : . . .: : CCDS38 MTWAPRNKSEDEDEDEG-DATRSKDESPDKAQEGTETSAEDEGISLHVDSLTDHSCSAES 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB9 GCERLQ---GPPT-PAGKETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAARL :.: : : .:.: . . .: . : .: .: .::. :: : : CCDS38 DGEKLPCRAGDPLCESGSECKDKYDDLEDDEDDDEEGERGL-APPKPVTSSPLTGLEAPL 240 250 260 270 280 290 290 300 310 320 330 340 pF1KB9 AEDPAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSPPPPPPPAVLAKPKLWSLAEIATS : : .:: : ..: : .. : :::: .::::::::::::: CCDS38 LSPP----PEAAPRGG-----RKTPQGS------RTSPGAPPPA--SKPKLWSLAEIATS 300 310 320 330 350 360 370 380 390 400 pF1KB9 SDKVKDGGGGNEGSPCPPCPG-PIAGQALGGSRASPAPAPSRSPSAQCPFPGGTVLSRPL . : . : : : : :: : :. ::: . .: . :.:.. .:.::: CCDS38 DLKQPSLGPG-----CGP-PGLP----------AAAAPASTGAPPGGSPYPASPLLGRPL 340 350 360 370 380 410 420 430 440 450 460 pF1KB9 YYTAPFYPGYTNYGSFGH-LHGHPGPGPGPTTGPGSHFNGLNQTVLNRADALAKDPKMLR :::.::: .:::::... :.:. CCDS38 YYTSPFYGNYTNYGNLNAALQGQGLLRYNSAAAAPGEALHTAPKAASDAGKAGAHPLESH 390 400 410 420 430 440 >>CCDS10750.1 IRX3 gene_id:79191|Hs108|chr16 (501 aa) initn: 730 init1: 522 opt: 802 Z-score: 364.4 bits: 76.9 E(32554): 5.6e-14 Smith-Waterman score: 811; 38.4% identity (56.3% similar) in 497 aa overlap (1-465:1-479) 10 20 30 40 pF1KB9 MSYPQ-GYLY----QPSASLALYSCPAYSTSVISG--PRTDELGRSSSGSAF------SP ::.:: :: : :: . . . :... .: ..::. :.: : .: CCDS10 MSFPQLGYQYIRPLYPSERPGAAGGSGGSAGARGGLGAGASELNASGSLSNVLSSVYGAP 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 YAGSTAFTAPSPGYNSHLQYGAD-PAAAAAAAFSSYVGSPYDHTPGMAGSLGY-HPYAAP ::...: .: . ::.. : :.:. : .: :: . :. :... . :: : CCDS10 YAAAAA-AAAAQGYGAFLPYAAELPIFPQLGAQYELKDSPGVQHPAAAAAFPHPHPAFYP 70 80 90 100 110 110 120 130 140 150 160 pF1KB9 LGSYPYGDPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFA :.: .:::. :::::..:.::::::::::::::::::::::::::::::::::::::: CCDS10 YGQYQFGDPSRPKNATRESTSTLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVSTWFA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB9 NARRRLKKENKMTWTPRNRSEDEEEEENIDLEKNDEDEPQKPED---KGDPEGPEAGGAE ::::::::::::::.::.:.. :: . :...::: . :: . . : : :: : CCDS10 NARRRLKKENKMTWAPRSRTD--EEGNAYGSEREEEDEEEDEEDGKRELELEEEELGGEE 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB9 QKAASGCERLQGPPTPAGKETEGSLSDSDFKETPSEGRLDALQGPPRTGGPSPAGPAAAR . .: : : .. : .: . : : : .: : : : :: . CCDS10 ED--TGGEGLADD----DEDEEIDLENLDGAATEPE---LSLAGAARRDGDLGLGPISDS 240 250 260 270 280 290 300 310 320 330 pF1KB9 LAEDPAPH--------YPAGAPAPGPHPAAGEVPPGPGGP-SVIHSPPPPPPPAVLAKPK : :. . ::.: :.: : :. : :. : : : ..: ::: CCDS10 KNSDSEDSSEGLEDRPLPVLSLAPAPPPVAVASPSLPSPPVSLDPCAPAPAPASALQKPK 290 300 310 320 330 340 340 350 360 370 380 390 pF1KB9 LWSLAEIATSSDKVKDGGGGNEGSPCPPCPGP-IAGQALGGSRASPAPAPSRSPSAQC-P .::::: ::: :. . . : ::: :: .: .:: : :. : : : :: CCDS10 IWSLAETATSPDNPRRSPPGAGGSP----PGAAVAPSALQLSPAAAAAAAHRLVSAPLGK 350 360 370 380 390 400 400 410 420 430 440 pF1KB9 FPGGTVLSRPLYYTAP---FYPGYTNYGSFGHLHGHPGPGPGPTTGPGSHFNGLNQTVLN ::. : .::. : ..: .. :: : :: . :... . . . . CCDS10 FPAWT--NRPFPGPPPGPRLHPLSLLGSAPPHLLGLPGAAGHPAAAAAFARPAEPEGGTD 410 420 430 440 450 460 450 460 470 480 pF1KB9 RADALAKDPKMLRSQSQLDLCKDSPYELKKGMSDI : .:: . :.:.. : CCDS10 RCSALEVEKKLLKTAFQPVPRRPQNHLDAALVLSALSSS 470 480 490 500 >>CCDS3867.1 IRX4 gene_id:50805|Hs108|chr5 (519 aa) initn: 562 init1: 451 opt: 705 Z-score: 323.2 bits: 69.3 E(32554): 1.1e-11 Smith-Waterman score: 705; 38.7% identity (59.6% similar) in 406 aa overlap (11-394:39-422) 10 20 30 pF1KB9 MSYPQGYLYQPSASL-ALYSCPAYSTSVISGPRTDELGRS :.:: : ::.: . ... : . . . CCDS38 PYSSAPQFLMATNSLSTCCESGGRTLADSGPAASAQAPVYCPVYESRLLATARHELNSAA 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB9 SSGSAFSPYAGSTAFTAPSPGYNSHLQYGADPAA-AAAAAFSSYVGSPYDHTPGMA-GSL . : .::.:: ::.... ::.. .: . .:.: :: : :.: .. CCDS38 ALGVYGGPYGGS-------QGYGNYVTYGSEASAFYSLNSFDSKDGSGSAHG-GLAPAAA 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB9 GYHPYAAPLGSYPYG------DPAYRKNATRDATATLKAWLNEHRKNPYPTKGEKIMLAI .:.:: ::.::: . . ::::::..:.::::::.:::::::::::::::::: CCDS38 AYYPYEPALGQYPYDRYGTMDSGTRRKNATRETTSTLKAWLQEHRKNPYPTKGEKIMLAI 130 140 150 160 170 180 160 170 180 190 200 pF1KB9 ITKMTLTQVSTWFANARRRLKKENKMTWTPRNRSEDE-----EEEENIDLEKNDEDEPQK :::::::::::::::::::::::::::: :::. :: : ::. :.. ..:: : CCDS38 ITKMTLTQVSTWFANARRRLKKENKMTWPPRNKCADEKRPYAEGEEEEGGEEEAREEPLK 190 200 210 220 230 240 210 220 230 240 250 260 pF1KB9 PEDKGDPEGPEAGGAEQKAASGCERLQG-PPTPAGKETEGSLSDSDFKETPS--EGRLDA ...: : : : . . . :.. ::. : :: :. ....:. .: . CCDS38 SSKNAEPVGKEEKELELSDLDDFDPLEAEPPACELKPPFHSL-DGGLERVPAAPDGPVKE 250 260 270 280 290 270 280 290 300 310 320 pF1KB9 LQGPPRTGGPSPAGPAAARLAED-PAPHYPAGAPAPGPHPAAGEVPPGPGGPSVIHSP-- .: : : :. ..: : :: . . : ::.: .: . :::.: .. CCDS38 ASGALRM---SLAAGGGAALDEDLERARSCLRSAAAGPEP----LPGAEGGPQVCEAKLG 300 310 320 330 340 350 330 340 350 360 370 pF1KB9 --PPPPPPAVLAKPKLWSLAEIATSSDKVKDGGGGNEGSPCPPCPGPIAGQALGGSRASP : .. :::..::::. ::.. . . . .: : : : : . :. CCDS38 FVPAGASAGLEAKPRIWSLAHTATAAAAAATSLSQTE---FPSCMLKRQGPA---APAAV 360 370 380 390 400 380 390 400 410 420 430 pF1KB9 APAPSRSPSAQCPFPGGTVLSRPLYYTAPFYPGYTNYGSFGHLHGHPGPGPGPTTGPGSH . ::. :::. : : CCDS38 SSAPATSPSVALPHSGALDRHQDSPVTSLRNWVDGVFHDPILRHSTLNQAWATAKGALLD 410 420 430 440 450 460 483 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:32:16 2016 done: Sun Nov 6 04:32:16 2016 Total Scan time: 3.030 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]