FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1250, 506 aa 1>>>pF1KE1250 506 - 506 aa - 506 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.6353+/-0.000978; mu= -7.3128+/- 0.059 mean_var=440.3934+/-90.812, 0's: 0 Z-trim(117.2): 21 B-trim: 114 in 1/53 Lambda= 0.061116 statistics sampled from 17949 (17964) to 17949 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.815), E-opt: 0.2 (0.552), width: 16 Scan time: 4.070 The best scores are: opt bits E(32554) CCDS46603.1 TOX2 gene_id:84969|Hs108|chr20 ( 506) 3423 315.7 7.6e-86 CCDS13324.1 TOX2 gene_id:84969|Hs108|chr20 ( 464) 3127 289.6 5.1e-78 CCDS42875.1 TOX2 gene_id:84969|Hs108|chr20 ( 488) 1812 173.7 4.2e-43 CCDS34897.1 TOX gene_id:9760|Hs108|chr8 ( 526) 906 93.8 5e-19 CCDS54008.1 TOX3 gene_id:27324|Hs108|chr16 ( 571) 829 87.1 5.8e-17 CCDS54009.1 TOX3 gene_id:27324|Hs108|chr16 ( 576) 829 87.1 5.9e-17 CCDS32043.1 TOX4 gene_id:9878|Hs108|chr14 ( 621) 749 80.0 8.2e-15 >>CCDS46603.1 TOX2 gene_id:84969|Hs108|chr20 (506 aa) initn: 3423 init1: 3423 opt: 3423 Z-score: 1654.7 bits: 315.7 E(32554): 7.6e-86 Smith-Waterman score: 3423; 100.0% identity (100.0% similar) in 506 aa overlap (1-506:1-506) 10 20 30 40 50 60 pF1KE1 MDVRLYPSAPAVGARPGAEPAGLAHLDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MDVRLYPSAPAVGARPGAEPAGLAHLDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNGQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 SENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 SENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 SNMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 SNMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GIRSSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 GIRSSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 KDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQAYKRKTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 KDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQAYKRKTE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 AAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 AAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 LQAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LQAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 VSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDW 430 440 450 460 470 480 490 500 pF1KE1 DSSYPSGECGISTCSLLPRDKSLYLT :::::::::::::::::::::::::: CCDS46 DSSYPSGECGISTCSLLPRDKSLYLT 490 500 >>CCDS13324.1 TOX2 gene_id:84969|Hs108|chr20 (464 aa) initn: 3127 init1: 3127 opt: 3127 Z-score: 1514.2 bits: 289.6 E(32554): 5.1e-78 Smith-Waterman score: 3127; 100.0% identity (100.0% similar) in 464 aa overlap (43-506:1-464) 20 30 40 50 60 70 pF1KE1 GARPGAEPAGLAHLDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNGQSENNEDYEIPPI :::::::::::::::::::::::::::::: CCDS13 MSDGNPELLSTSQTYNGQSENNEDYEIPPI 10 20 30 80 90 100 110 120 130 pF1KE1 TPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMVSNMLAQDSHLLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMVSNMLAQDSHLLS 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE1 GQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQMGIRSSIAHSSPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQMGIRSSIAHSSPS 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE1 PPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 PPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSA 160 170 180 190 200 210 260 270 280 290 300 310 pF1KE1 YALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQAYKRKTEAAKKEYLKALAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 YALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQAYKRKTEAAKKEYLKALAA 220 230 240 250 260 270 320 330 340 350 360 370 pF1KE1 YRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQAFRSGASPAS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 YRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQAFRSGASPAS 280 290 300 310 320 330 380 390 400 410 420 430 pF1KE1 LARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVSMSPAPQPPVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVSMSPAPQPPVL 340 350 360 370 380 390 440 450 460 470 480 490 pF1KE1 PTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 PTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGIS 400 410 420 430 440 450 500 pF1KE1 TCSLLPRDKSLYLT :::::::::::::: CCDS13 TCSLLPRDKSLYLT 460 >>CCDS42875.1 TOX2 gene_id:84969|Hs108|chr20 (488 aa) initn: 1792 init1: 1743 opt: 1812 Z-score: 887.2 bits: 173.7 E(32554): 4.2e-43 Smith-Waterman score: 2976; 94.3% identity (94.3% similar) in 474 aa overlap (33-506:42-488) 10 20 30 40 50 60 pF1KE1 VRLYPSAPAVGARPGAEPAGLAHLDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNGQSE :::::::::::::::::::::::::::::: CCDS42 AFSRCLGFCGMRLGLLLLARHWCIAGVFPQKFDGDSAYVGMSDGNPELLSTSQTYNGQSE 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE1 NNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMVSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 NNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIMVSN 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE1 MLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQMGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGPLLGRPAMLASHMSALSQSQLISQMGI 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE1 RSSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKKKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 RSSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAKNPKKKKKKD 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE1 PNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQAYKRKTEAA ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQ--------- 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 KKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQ :::::::::::::::::::::::::::::::::::::::::: CCDS42 ------------------SSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQ 310 320 330 340 370 380 390 400 410 420 pF1KE1 AFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 AFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVS 350 360 370 380 390 400 430 440 450 460 470 480 pF1KE1 MSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDS 410 420 430 440 450 460 490 500 pF1KE1 SYPSGECGISTCSLLPRDKSLYLT :::::::::::::::::::::::: CCDS42 SYPSGECGISTCSLLPRDKSLYLT 470 480 >>CCDS34897.1 TOX gene_id:9760|Hs108|chr8 (526 aa) initn: 1110 init1: 629 opt: 906 Z-score: 455.1 bits: 93.8 E(32554): 5e-19 Smith-Waterman score: 1389; 46.1% identity (69.5% similar) in 544 aa overlap (1-506:1-526) 10 20 30 40 50 pF1KE1 MDVRLYPSAPAVGARPGAEPAGLAH-LDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNG ::::.:: .: : : : . :: :. .::::.. :..:.. . . . .::.: : CCDS34 MDVRFYPPPAQPAAAPDAPCLGPSPCLDPYYCNKFDGENMYMSMTEPSQDYVPASQSYPG 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 QSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNGLLPAYSYQAMDLPAIM : ..::..:::::::.::. ::.::.. :..:::::: .. ::::: . : :::: : CCDS34 PSLESEDFNIPPITPPSLPDHSLVHLNEVESGYHSLCHPMNHNGLLP-FHPQNMDLPEIT 70 80 90 100 110 120 130 140 150 160 pF1KE1 VSNMLAQDSHLLSGQ---LPTI------QEMVHSEVAAY-DSGRPGPLLGRPAMLA-SHM :::::.::. :::.. .: : : : ..::. :.:. . .:.:. ... CCDS34 VSNMLGQDGTLLSNSISVMPDIRNPEGTQYSSHPQMAAMRPRGQPADIRQQPGMMPHGQL 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE1 SALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISG-EKRP ....:::: .:.:. :.. :.:::::::::::::::::..:.:.. ::.: :::: CCDS34 TTINQSQLSAQLGLNMGGSNVPHNSPSPPGSKSATPSPSSSVHEDEGDDTSKINGGEKRP 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE1 SADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMW ..: ::: :.:::::::::::::::::::::::::::::::::::.::::.::::::::: CCDS34 ASDMGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMW 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE1 DSLGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQ :.:::::::.::.::::::::::: :::::::::::: . ..:..: :: ... : CCDS34 DGLGEEQKQVYKKKTEAAKKEYLKQLAAYRASLVSKSYSEPVDVKTSQ--PP-QLINSKP 300 310 320 330 340 350 350 360 370 380 pF1KE1 PMYAMPGLA-SFLTPSDLQAFRSGASP------ASLARTLGSK--SLLP------GLSAS .. :. : : : :. . : .: :: :... : . .: ....: CCDS34 SVFHGPSQAHSALYLSSHYHQQPGMNPHLTAMHPSLPRNIAPKPNNQMPVTVSIANMAVS 360 370 380 390 400 410 390 400 410 420 430 440 pF1KE1 PPPPPSFPLSPTLHQQLSLPPHAQGALLSP-----PVSMSPAPQPPVLPTPMALQ--VQL :::: . .:: :::.:.. : .. .: :.... : . :.. ..:: : CCDS34 PPPP--LQISPPLHQHLNMQQHQPLTMQQPLGNQLPMQVQSALHSPTMQQGFTLQPDYQT 420 430 440 450 460 470 450 460 470 480 490 500 pF1KE1 AMSPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKS ..:. . : . :. :. : : .:. ::...: : . . . :::. CCDS34 IINPTSTAAQVVTQAMEYVRSG--CRNPPPQPV---DWNNDY----C---SSGGMQRDKA 480 490 500 510 520 pF1KE1 LYLT :::: CCDS34 LYLT >>CCDS54008.1 TOX3 gene_id:27324|Hs108|chr16 (571 aa) initn: 856 init1: 738 opt: 829 Z-score: 418.0 bits: 87.1 E(32554): 5.8e-17 Smith-Waterman score: 990; 42.9% identity (67.1% similar) in 441 aa overlap (33-452:25-448) 10 20 30 40 50 60 pF1KE1 VRLYPSAPAVGARPGAEPAGLAHLDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNGQSE :: ... :..:...: ....:.:.. : CCDS54 MKCQPRSGARRIEERLHYLITTYLKFGNNNNYMNMAEANNAFFAASETFHTPSL 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 NNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQAMDLPAIMV ..:..:::::::: .:.: . : ...: : .: . : . :..:::.: . CCDS54 GDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITI 60 70 80 90 100 110 130 140 150 160 pF1KE1 S-NMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP-----------AMLASH : :.. ::. : :. : : :..:. : : : :. : .: .. CCDS54 SRNLVEQDGVLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAARSGVMPPAQ 120 130 140 150 160 170 180 190 200 210 220 pF1KE1 MSALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRP .....:::: .:.:. .:. :.:::::.::::::::::: .::... . :::: CCDS54 LTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRA 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE1 SADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMW . : ::: :.:::::::::::::::::::::::::::::::::::.::::.::::::::: CCDS54 APDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMW 230 240 250 260 270 280 290 300 310 320 330 340 pF1KE1 DSLGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQ :::::::::.::::::::::::::::::::::::::.. ...:... .. .: CCDS54 DSLGEEQKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSV--------QQ 290 300 310 320 330 340 350 360 370 380 390 400 pF1KE1 PMYAMPGLASFLTPSDL-QAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLH . . .:.: . : : .::: .: ..: .:. : . : .. : :. CCDS54 TLASTNLTSSLLLNTPLSQHGTVSASPQTLQQSL-PRSIAPKPLTMRLPMNQIVTSVTI- 350 360 370 380 390 410 420 430 440 450 460 pF1KE1 QQLSLPPHAQGALLSP--PVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFP ..: . . :.: . .. ::. : :. .. : :. .. . : CCDS54 -AANMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQ 400 410 420 430 440 450 470 480 490 500 pF1KE1 SSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT CCDS54 QLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHM 460 470 480 490 500 510 >>CCDS54009.1 TOX3 gene_id:27324|Hs108|chr16 (576 aa) initn: 855 init1: 738 opt: 829 Z-score: 417.9 bits: 87.1 E(32554): 5.9e-17 Smith-Waterman score: 1029; 42.6% identity (66.2% similar) in 477 aa overlap (1-452:1-453) 10 20 30 40 50 pF1KE1 MDVRLYPSAPAVGARPGAEPAGLAH---LDYYHGGKFDGDSAYVGMSDGNPELLSTS-QT ::::.::.: ...::.: : :: .:: ... :..:...: ....: :: CCDS54 MDVRFYPAA-------AGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQT 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 YNGQSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQAMD .. : ..:..:::::::: .:.: . : ...: : .: . : . :..: CCDS54 FHTPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLD 60 70 80 90 100 110 120 130 140 150 160 pF1KE1 LPAIMVS-NMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP----------- ::.: .: :.. ::. : :. : : :..:. : : : :. : CCDS54 LPSITISRNLVEQDGVLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAARSG 120 130 140 150 160 170 180 190 200 210 pF1KE1 AMLASHMSALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKI .: .......:::: .:.:. .:. :.:::::.::::::::::: .::... . CCDS54 VMPPAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRA 170 180 190 200 210 220 220 230 240 250 260 270 pF1KE1 SGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSK :::: . : ::: :.:::::::::::::::::::::::::::::::::::.::::.::: CCDS54 IGEKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSK 230 240 250 260 270 280 280 290 300 310 320 330 pF1KE1 IVASMWDSLGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAK :::::::::::::::.::::::::::::::::::::::::::.. ...:... .. CCDS54 IVASMWDSLGEEQKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSV---- 290 300 310 320 330 340 340 350 360 370 380 390 pF1KE1 MLPPKQPMYAMPGLASFLTPSDL-QAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFP .: . . .:.: . : : .::: .: ..: .:. : . : .. CCDS54 ----QQTLASTNLTSSLLLNTPLSQHGTVSASPQTLQQSL-PRSIAPKPLTMRLPMNQIV 350 360 370 380 390 400 410 420 430 440 450 pF1KE1 LSPTLHQQLSLPPHAQGALLSP--PVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFP : :. ..: . . :.: . .. ::. : :. .. : :. .. . : CCDS54 TSVTIAA--NMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQM 400 410 420 430 440 450 460 470 480 490 500 pF1KE1 HISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT CCDS54 QQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQL 460 470 480 490 500 510 >>CCDS32043.1 TOX4 gene_id:9878|Hs108|chr14 (621 aa) initn: 699 init1: 528 opt: 749 Z-score: 379.4 bits: 80.0 E(32554): 8.2e-15 Smith-Waterman score: 811; 36.1% identity (61.6% similar) in 477 aa overlap (36-471:6-477) 10 20 30 40 50 60 pF1KE1 YPSAPAVGARPGAEPAGLAHLDYYHGGKFDGDSAYVGMSDGNPELLSTSQTYNGQSENNE :.. :. .. . .:: ..:.. : ..: CCDS32 MEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDE 10 20 30 70 80 90 100 110 120 pF1KE1 DYEIPPITPPNLPEPSLLHLGDHEASYHSLCH-GLTPNGLLPA-YSYQAMDLPAIMVSNM ..:::::. . .::: ..: . . .: . . .: . : :. :..:.:. :. .. CCDS32 EFEIPPISLDS--DPSLA-VSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGMTHGL 40 50 60 70 80 90 130 140 150 160 170 pF1KE1 LAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPG----PLLGRPAMLASH--MSALSQSQLI . : . :::: : ... :: . :... : :. . : .: .....::.: CCDS32 MEQGGGLLSGGL--TMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELS 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE1 SQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESE-VHFKISGEKRPSADPGKKAK ::.:. ..: . :: :.::::.:: .:. : . .. ..: .. ::: : CCDS32 SQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHEDGVEDFRRQLPSQKTVVVEAGKKQK 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE1 NPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQ :::.:::::::::::::::::::::::::::::::.::::.:::::::::::::::::: CCDS32 APKKRKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQKQ 220 230 240 250 260 270 300 310 320 330 340 pF1KE1 AYKRKTEAAKKEYLKALAAYRASLVSKSSP-----DQGETKSTQANPPAKMLPPKQPMYA .:::::::::::::::::::. . ... : . ..: . :: . : .: : CCDS32 VYKRKTEAAKKEYLKALAAYKDNQECQATVETVELDPAPPSQTPSPPPMATVDPASPAPA 280 290 300 310 320 330 350 360 370 380 390 pF1KE1 M---PGLA-SFLTPSDLQAF-----RSGAS-PASLARTLGSKSLLP--------GLSASP :.:. :... : :... :::. .... . .:..:: :. . CCDS32 SIEPPALSPSIVVNSTLSSYVANQASSGAGGQPNITKLIITKQMLPSSITMSQGGMVTVI 340 350 360 370 380 390 400 410 420 430 440 pF1KE1 PPPPSFPLSPTLHQQ--LSLPPHAQGALLSPPVSMSPAPQPPV----LPTPMALQVQLAM : . : : .. : :. ... : .. : . :: : : . CCDS32 PATVVTSRGLQLGQTSTATIQPSQQAQIVTRSVLQAAAAAAAAASMQLPPPRLQPPPLQQ 400 410 420 430 440 450 450 460 470 480 490 500 pF1KE1 SPSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLY :.:: :. ... : .. .: : CCDS32 MPQPPTQQQVTILQQPPPLQAMQQPPPQKVRINLQQQPPPLQIKSVPLPTLKMQTTLVPP 460 470 480 490 500 510 506 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 09:46:03 2016 done: Sun Nov 6 09:46:04 2016 Total Scan time: 4.070 Total Display time: 0.080 Function used was FASTA [36.3.4 Apr, 2011]