GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:29:21

Sequence gi568815578f:5027093_5290231 : 263139 bp : 43.91% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  41357  41410   54  1  0   86   70    85 0.771   7.78
 1.02 Intr +  43959  44042   84  2  0   29   58   106 0.335   1.72
 1.03 Intr +  56782  56865   84  0  0   94   95   -10 0.024   0.32
 1.04 Intr +  67784  67842   59  1  2   61  101    23 0.050  -1.42
 1.05 Term +  70141  70279  139  1  1   69   38   123 0.282   2.84
 1.06 PlyA +  72101  72106    6                               1.05

 2.07 PlyA -  72781  72776    6                               1.05
 2.06 Term -  73839  73699  141  0  0   91   42   260 0.999  19.63
 2.05 Intr -  79218  79096  123  0  0   94   86   105 0.978  11.68
 2.04 Intr -  82353  82240  114  0  0   46   99   138 0.651  11.34
 2.03 Intr -  85459  85360  100  0  1  101   17    37 0.582  -1.99
 2.02 Intr -  85848  85651  198  2  0   41   57   111 0.551   1.77
 2.01 Init -  85936  85869   68  1  2   49  107    97 0.986   6.34
 2.00 Prom -  86604  86565   40                              -8.76

 3.07 PlyA -  87888  87883    6                               1.05
 3.06 Term -  88270  88191   80  2  2  112   48    80 0.910   4.33
 3.05 Intr -  88480  88357  124  1  1  110   91    59 0.996   8.56
 3.04 Intr -  90572  90378  195  2  0   72   87   145 0.999  12.41
 3.03 Intr -  91585  91518   68  2  2   64   81    68 0.990   2.32
 3.02 Intr -  91774  91677   98  0  2   93  108    89 0.994  11.05
 3.01 Init -  92706  92486  221  0  2   90   99   417 0.987  41.10
 3.00 Prom - 105416 105377   40                              -3.26

 4.00 Prom + 115720 115759   40                              -3.46
 4.01 Init + 139316 139318    3  1  0   71  101     0 0.132  -0.40
 4.02 Intr + 143125 143292  168  0  0   72   43    80 0.185   2.04
 4.03 Intr + 146431 146567  137  0  2   78   86    87 0.645   6.87
 4.04 Intr + 148091 148187   97  2  1   79   97    16 0.666   1.61
 4.05 Intr + 149556 149653   98  2  2  106  107   102 0.988  12.71
 4.06 Intr + 151725 151864  140  0  2  111   57    45 0.900   3.81
 4.07 Intr + 155295 155353   59  1  2   77   58    55 0.842   0.00
 4.08 Intr + 155969 156051   83  1  2   87   85    35 0.899   1.54
 4.09 Intr + 157766 157853   88  2  1   99   94    45 0.957   6.17
 4.10 Intr + 158666 158734   69  1  0   95   82    76 0.992   7.08
 4.11 Intr + 159595 159747  153  0  0   93   71   269 0.997  25.97
 4.12 Intr + 161975 162094  120  1  0   87   19   161 0.994   9.79
 4.13 Intr + 162643 162746  104  0  2  101   80   166 0.995  16.07
 4.14 Term + 163010 163142  133  2  1   78   50   219 0.997  14.66
 4.15 PlyA + 163292 163297    6                              -0.45

 5.03 PlyA - 163770 163765    6                               1.05
 5.02 Term - 164740 164628  113  2  2   48   49   109 0.650   1.72
 5.01 Init - 167489 167408   82  2  1   96  110    13 0.546   5.43
 5.00 Prom - 167733 167694   40                              -5.56

 6.03 PlyA - 167757 167752    6                               1.05
 6.02 Term - 169539 169434  106  2  1   85   44   107 0.928   3.88
 6.01 Init - 172711 172542  170  1  2   59  110    66 0.537   4.91
 6.00 Prom - 172796 172757   40                              -2.46

 7.00 Prom + 173090 173129   40                              -6.46
 7.01 Sngl + 173458 174111  654  0  0   43   48   243 0.979  11.98
 7.02 PlyA + 174143 174148    6                               1.05

 8.02 PlyA - 178698 178693    6                               1.05
 8.01 Sngl - 198971 198825  147  2  0   78   54   120 0.667   1.64
 8.00 Prom - 209154 209115   40                              -1.96

 9.02 PlyA - 209911 209906    6                               1.05
 9.01 Sngl - 213778 213380  399  1  0   64   54   339 0.967  24.36
 9.00 Prom - 218690 218651   40                              -3.46

10.00 Prom + 230312 230351   40                              -2.26
10.01 Init + 240887 240935   49  1  1   74  100    21 0.021   3.01
10.02 Term + 250827 252106 1280  1  2   49   43   541 0.223  37.46
10.03 PlyA + 253746 253751    6                               1.05

11.00 Prom + 257027 257066   40                              -5.96
11.01 Sngl + 259617 260117  501  2  0   17   37   248 0.807   8.64
11.02 PlyA + 260549 260554    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_1|139_aa
MVEGKGEAGTFFTAHQDRVNGNGVVVRGYSILKSQLIMSLDAPEMMVQGIHVQVCYVGKL
CIVGVWYTDYFVTQLQGQLVLLPRFLMVTSYFLITSQFLAEQKCLWYDHVKLNQETHQSG
NGSEISIICRTAEATGEGF
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_1|420_bp
atggtggaaggcaaaggagaagcaggcaccttcttcacagcgcaccaggacagagtgaat
ggcaatggagtagttgtccgagggtactcaatactcaagagtcagctgatcatgtcctta
gatgcaccagaaatgatggttcagggaatacatgtgcaagtttgttacgtgggtaaatta
tgtatcgtgggggtttggtatactgactatttcgtcacccagctacagggacagctggtc
ttactgcccaggttcctcatggtcacctcctatttcctaatcacctcacaattcctggca
gaacagaagtgcctctggtatgaccacgtgaagctcaaccaagagacccaccagagtggc
aacggatctgaaatttccatcatatgcagaacagctgaagcaactggagagggtttttag
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_2|247_aa
MQPWALPTVGELWVCGRPGAALRAGTEPSSRALGVSETALPAEIKLRVIRVGHSSPLAQL
ASFQKPVLFVLRLNSRRFLFLSLARSEDGILFAKSKHSSPLSLTPLRCIVLMRMYEQLMS
GDLCQRVMMPSRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIALATVLF
LIGAFLIIIGSLLLSGYISKGGADRAVPVLIIGILVFLPGFYHLRIAYYASKGYRGYSYD
DIPDFDD
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_2|744_bp
atgcaaccttgggcgctgccaaccgtgggcgagctctgggtgtgcgggcggcctggcgcg
gcgctccgggcggggacagaaccgtcctctcgggctctgggcgtgtccgagaccgcgctc
cccgccgaaatcaagctccgagtcatccgtgtggggcattcgtcccccctggcacagttg
gcctctttccagaagcccgttttgtttgttttacgtctaaattcgcgtcggttcttattt
ctctccctggcaaggtctgaagacggaatcctctttgccaagagtaaacattcgtctccc
ctctccctcacgcccctgcgatgtattgttttgatgcggatgtatgaacagttaatgtca
ggggacctgtgtcagcgtgttatgatgccgtcccgtaccaacctggctactggaatcccc
agtagtaaagtgaaatattcaaggctctccagcacagacgatggctacattgaccttcag
tttaagaaaacccctcctaagatcccttataaggccatcgcacttgccactgtgctgttt
ttgattggcgcctttctcattattataggctccctcctgctgtcaggctacatcagcaaa
gggggggcagaccgggccgttccagtgctgatcattggcattctggtgttcctacccgga
ttttaccacctgcgcatcgcttactatgcatccaaaggctaccgtggttactcctatgat
gacattccagactttgatgactag
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_3|261_aa
MFEARLVQGSILKKVLEALKDLINEACWDISSSGVNLQSMDSSHVSLVQLTLRSEGFDTY
RCDRNLAMGVNLTSMSKILKCAGNEDIITLRAEDNADTLALVFEAPNQEKVSDYEMKLMD
LDVEQLGIPEQEYSCVVKMPSGEFARICRDLSHIGDAVVISCAKDGVKFSASGELGNGNI
KLSQTSNVDKEEEAVTIEMNEPVQLTFALRYLNFFTKATPLSSTVTLSMSADVPLVVEYK
IADMGHLKYYLAPKIEDEEGS
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_3|786_bp
atgttcgaggcgcgcctggtccagggctccatcctcaagaaggtgttggaggcactcaag
gacctcatcaacgaggcctgctgggatattagctccagcggtgtaaacctgcagagcatg
gactcgtcccacgtctctttggtgcagctcaccctgcggtctgagggcttcgacacctac
cgctgcgaccgcaacctggccatgggcgtgaacctcaccagtatgtccaaaatactaaaa
tgcgccggcaatgaagatatcattacactaagggccgaagataacgcggataccttggcg
ctagtatttgaagcaccaaaccaggagaaagtttcagactatgaaatgaagttgatggat
ttagatgttgaacaacttggaattccagaacaggagtacagctgtgtagtaaagatgcct
tctggtgaatttgcacgtatatgccgagatctcagccatattggagatgctgttgtaatt
tcctgtgcaaaagacggagtgaaattttctgcaagtggagaacttggaaatggaaacatt
aaattgtcacagacaagtaatgtcgataaagaggaggaagctgttaccatagagatgaat
gaaccagttcaactaacttttgcactgaggtacctgaacttctttacaaaagccactcca
ctctcttcaacggtgacactcagtatgtctgcagatgtaccccttgttgtagagtataaa
attgcggatatgggacacttaaaatactacttggctcccaagatcgaggatgaagaagga
tcttag
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_4|483_aa
MKVFLDISPPRCAQRSKLMNVFSMYHLSCGKEHGFQVPACFQTLCGSCRWHSASFSGESE
SEAKVDGETASDSESRAESAPLPVSADDTPEVLNRALSNLSSRWKNWWVRGILTLAMIAF
FFIIIYLGPMVLMIIVMCVQIKCFHEIITIGYNVYHSYDLPWFRTLSWYFLLCVNYFFYG
ETVTDYFFTLVQREEPLRILSKYHRFISFTLYLIGFCMFVLSLVKKHYRLQFYMFGWTHV
TLLIVVTQSHLVIHNLFEGMIWFIVPISCVICNDIMAYMFGFFFGRTPLIKLSPKKTWEG
FIGGFFATVVFGLLLSYVMSGYRCFVCPVEYNNDTNSFTVDCEPSDLFRLQEYNIPGVIQ
SVIGWKTVRMYPFQIHSIALSTFASLIGPFGGFFASGFKRAFKIKDFANTIPGHGGIMDR
FDCQYLMATFVNVYIASFIRGPNPSKLIQQFLTLRPDQQLHIFNTLRSHLIDKGMLTSTT
EDE
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_4|1452_bp
atgaaagtttttcttgacatctccccacctcgctgtgcccagaggagcaaactgatgaat
gtgttcagcatgtaccacctttcttgtgggaaggagcatgggttccaggtccctgcctgc
ttccagaccctctgtggaagctgccgctggcactcagcttccttttccggggagtcagag
tcagaagcaaaggtagatggagagactgcatcggacagtgagagccgggcagaatccgca
cccctgccagtctctgcagatgataccccggaggtcctcaatagggccctttccaacttg
tcttcaagatggaagaactggtgggtgagaggcatcctgactttggccatgattgcattt
ttcttcatcatcatttacctgggaccaatggttttgatgataatcgtgatgtgcgttcag
attaagtgtttccatgagataatcactattggctacaacgtctaccactcatatgatctg
ccctggttcaggacgctcagctggtactttctcctgtgtgtaaactatttcttctatggt
gagacagtgacggattacttcttcaccctggtccagagagaagagcctttgcggattctc
agtaaataccaccggttcatttcctttactctctatctaataggattctgcatgtttgta
ctgagtctggtcaagaagcattatcgactgcagttctacatgtttggctggacccatgtg
acattgctgattgttgtaacacagtcacatcttgttatccacaacctatttgaaggaatg
atctggttcattgtccccatatcttgtgtgatctgtaatgacatcatggcctatatgttt
ggctttttctttggtcggaccccactcatcaagctgtccccgaagaagacctgggaaggc
ttcattgggggcttctttgctactgtggtgtttggccttctgctgtcctatgtgatgtcc
gggtacagatgctttgtctgccctgtggagtacaacaatgacaccaacagcttcactgtg
gactgtgagccctcggacctgtttcgcctgcaggagtacaacattcctggggtgatccag
tcagtcattggctggaaaacggtccggatgtaccccttccagattcacagcatcgctctc
tccacctttgcctcgctcattggcccctttggaggattcttcgcaagtggattcaaacga
gcctttaaaatcaaagactttgccaataccattcctggccatggaggcatcatggatcgc
tttgactgccagtatctgatggccacctttgtcaatgtatacatcgccagttttatcaga
ggccctaacccaagcaaactgattcagcagttcctgactttacggccagatcagcagctc
cacatcttcaacacgctgcggtctcatctgatcgacaaagggatgctgacatccaccaca
gaggacgagtag
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_5|64_aa
MVPIQTGDSLNQETAPDENSLHPPERPGALDLNKRSHEAEMEIKVQRKEAASAPHTYGKS
LKDV
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_5|195_bp
atggtccctatacaaacaggtgacagccttaatcaagagacagccccagatgagaacagc
ctccatccacctgagaggccaggggccttggacctgaataagagaagccacgaggctgaa
atggaaataaaagttcagcggaaagaagccgcttcagccccacacacatatgggaagtca
ctcaaggatgtgtga
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_6|91_aa
MIDWIKKMWHIYTVEYHAAIKKDEFMSFVGTWMKLETIILSKLSQEQKTKHRMFSRRKAS
IQEPSAGVKKRHPNCEDRGGYVDDYRQTQFH
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_6|276_bp
atgatagactggattaagaaaatgtggcacatatacaccgtggaataccacgcagccata
aaaaaggatgagttcatgtcctttgtagggacatggatgaagctggaaaccatcattctc
agcaaactatcgcaagaacaaaaaaccaaacaccgcatgttctcacgcagaaaagccagc
atacaagaacctagcgcaggtgtcaaaaaaaggcatccaaactgcgaggacagaggtggc
tatgtggatgattatcggcaaacacagtttcactag
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_7|217_aa
MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGSFNIRKSINVIQYINRTNEKNHM
IISIDAEKAFDKIQQCFMLKTLNKLGIDGMYLKIIRAIYDKPTASIILNGQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLRKEEVKLSLFADDMIVYLENPIVSA
QNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTAKS
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_7|654_bp
atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctcattcaac
atacgcaaatcaataaatgtaatccagtatataaacagaaccaacgaaaaaaaccatatg
attatctcaatagatgcagaaaaggcctttgacaaaattcaacaatgcttcatgctaaaa
acgctcaataaattaggtattgatgggatgtacctcaaaataataagagctatctatgac
aaacccacagccagtatcatactgaatgggcaaaaactggaagcattccctttgaaaact
ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg
gccagggcaatcaggcaggagaaagaaataaagggtattcagttgagaaaagaggaagtc
aaattgtccctctttgcagatgacatgattgtgtatctagaaaaccccatcgtctcagcc
caaaatctccttaagctgataggcaacttcagcaaagtctcaggatacaaaatcaatgtg
caaaaatcacaagcattcttatacaccaataacagacaaacagccaaatcatga
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_8|48_aa
MEYYAAIKKDESMSFAGTWMKLETIIIVSKLSQGQKTKHRMFSLGIEQ
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_8|147_bp
atggaatactatgcagccataaaaaaggatgagtccatgtcctttgcagggacatggatg
aagttggaaaccatcatcattgtcagcaaactatcacaaggacagaaaaccaaacaccgc
atgttctcactgggaattgaacaatga
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_9|132_aa
MRKNQCKKAENSKNQNASSPPKDHNSSPAREQDWTGNEFDELTEVGFRRWVITNSSKLKE
HVLTQCKEVKNLEKRLDKLLTGINSLEKNINDLMELKNTARELREAYTSINSRIDQAEER
ISEIEDQLNEIK
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_9|399_bp
atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct
ccaaaggatcacaactcctcgccagcaagggaacaagactggacggggaatgagtttgat
gaactgacagaagtaggcttcagaaggtgggtaatcacaaactcctccaagctaaaggag
catgttctaacccaatgcaaggaagttaagaaccttgaaaaaaggttagacaaattgcta
actggaataaacagtttagagaagaacataaatgacctgatggagcttaaaaacacagca
cgagaacttcgtgaagcatacacaagtatcaatagccgaattgatcaagcagaagaaaga
atatcagagattgaagatcaacttaatgaaataaagtga
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_10|442_aa
MQNPVVPSWCSHREDEAPHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSTIKLELRI
KKLTQNRSITWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWNTFKAVCRGKF
IALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKI
NESRSWFFERINKIDRLLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLY
ANKLENLEVMDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAE
FYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNID
AKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISI
DAEKAFDKIQQPFMLKTLNSLF
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_10|1329_bp
atgcagaatcctgtggtcccatcttggtgtagccacagggaagatgaagcaccacaccac
acctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaaga
acagaaattataacaaactatctctcagaccacagtacaatcaaactagaactcaggatt
aagaaactcactcaaaaccgctcaattacgtggaaactgaacaacctgctcctgaacgac
tactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaac
aaagacacaacataccagaatctctggaacacattcaaagcagtgtgtagagggaaattt
atagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatca
caattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaa
ataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatt
aatgaatccaggagctggttttttgaaaggatcaacaaaattgatagactgctagcaaga
ctaataaagaaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggat
atcaccactgatcccacagaaatacaaactaccatcagagaatactacaaacacctctac
gcaaataaactagaaaatctagaagtaatggataaattccttgacacatacactctccca
agactaaaccaggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtg
gcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaa
ttctaccagaggtacaaggaggaactggtaccattccttctgaaactcttccaatcaata
gaaaaagagggaatcctccctaactcattttatgaggccagcatcatcctgataccaaag
ccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgat
gcaaaaatcctcaataaaatactggcaaacagaatccagcagcacatcaaaaagcttatc
caccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatca
ataaatgtaatccagcatataaacagaaccaaagataaaaaccacatgattatctcaata
gatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaactctcaactca
cttttttag
>gi568815578f:5027093_5290231|GENSCAN_predicted_peptide_11|166_aa
MIISIDADKAFNKIQHRFMIKTLSKIGIQGTYLNVIKAIYDKPTANIILMGEKLKAFPLR
MGTRQGCPLSLLLFNIVLEVLARVIRQEKEIKGIQIGKEEVKLSFFADDMIIYLENPKNS
SRNLLNLIKEFSKASGYKINVHKSVALLYTNSDQADNQIRNSFLLQ
>gi568815578f:5027093_5290231|GENSCAN_predicted_CDS_11|501_bp
atgatcatctcaatagatgcagacaaagcattcaacaaaatccagcatcgctttatgatt
aaaactctcagcaaaatcggcatacaagggacatacctcaatgtaataaaagccatctat
gacaaacccacagccaacataatactaatgggggaaaagttgaaagctttccctttgaga
atgggaacaagacaaggatgcccactctcgctactcctcttcaacatagtactggaagtc
ctagccagagtaatcagacaagagaaagaaataaagggcatccaaattggtaaagaggaa
gtcaaactgtcattttttgctgacgatatgatcatttacctcgaaaaccctaaaaactcc
tccagaaatctcctaaacctgataaaagaattcagcaaagcttccggatacaagattaat
gtacacaaatcagtagctcttctatacaccaacagcgaccaagcggataatcaaatcagg
aactcattccttttacaatag