GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:16:54 Sequence gi568815593r:73452800_73662881 : 210082 bp : 40.64% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4249 4382 134 2 2 105 44 114 0.645 5.97 1.02 PlyA + 5104 5109 6 1.05 2.00 Prom + 7734 7773 40 -5.75 2.01 Init + 17143 17297 155 0 2 97 48 162 0.250 12.60 2.02 Term + 29663 29834 172 2 1 59 40 107 0.004 -0.68 2.03 PlyA + 32246 32251 6 1.05 3.00 Prom + 39656 39695 40 -4.65 3.01 Init + 40733 40793 61 1 1 66 97 40 0.034 4.17 3.02 Intr + 45322 45593 272 2 2 50 26 238 0.001 10.14 3.03 Intr + 46335 46403 69 2 0 91 106 116 0.994 12.16 3.04 Intr + 49689 49775 87 2 0 89 49 105 0.983 5.95 3.05 Intr + 50117 50318 202 1 1 118 82 184 0.996 18.74 3.06 Intr + 51548 51604 57 0 0 75 91 130 0.927 9.94 3.07 Term + 55962 56455 494 1 2 58 49 426 0.117 29.48 3.08 PlyA + 59947 59952 6 1.05 4.09 PlyA - 62612 62607 6 1.05 4.08 Term - 67175 67101 75 2 0 78 42 94 0.073 0.66 4.07 Intr - 79572 79364 209 1 2 87 53 171 0.072 11.37 4.06 Intr - 100687 100607 81 2 0 47 103 81 0.666 4.19 4.05 Intr - 101589 101523 67 0 1 100 87 19 0.742 0.56 4.04 Intr - 102187 102062 126 1 0 49 106 99 0.991 7.76 4.03 Intr - 102786 102689 98 1 2 52 73 110 0.982 4.71 4.02 Intr - 104875 104776 100 1 1 18 85 103 0.614 1.76 4.01 Init - 110082 109834 249 0 0 81 36 196 0.492 11.21 4.00 Prom - 110422 110383 40 -10.25 5.00 Prom + 111171 111210 40 -7.65 5.01 Init + 112227 112698 472 2 1 53 -28 397 0.566 20.33 5.02 Intr + 113203 113387 185 2 2 59 72 56 0.465 -0.31 5.03 Intr + 115436 115528 93 1 0 57 89 52 0.718 1.44 5.04 Intr + 115621 115805 185 0 2 78 69 143 0.998 9.06 5.05 Intr + 116698 116876 179 1 2 93 77 66 0.879 4.64 5.06 Intr + 117787 117912 126 2 0 35 80 189 0.394 12.53 5.07 Intr + 119690 119801 112 0 1 67 17 15 0.061 -9.18 5.08 Intr + 124153 124237 85 1 1 71 92 23 0.145 -0.10 5.09 Intr + 125057 125206 150 1 0 44 35 147 0.312 4.44 5.10 Intr + 125952 126053 102 2 0 80 76 88 0.973 6.25 5.11 Intr + 126218 126351 134 1 2 58 95 102 0.977 6.42 5.12 Intr + 126518 126576 59 2 2 74 111 25 0.987 1.01 5.13 Term + 127078 127295 218 2 2 114 36 122 0.971 5.92 5.14 PlyA + 128888 128893 6 1.05 6.00 Prom + 147882 147921 40 -5.25 6.01 Init + 158274 158505 232 2 1 69 75 137 0.653 8.97 6.02 Intr + 165273 165451 179 1 2 46 58 113 0.536 2.82 6.03 Term + 168133 168258 126 0 0 84 53 86 0.893 2.00 6.04 PlyA + 168316 168321 6 1.05 7.04 PlyA - 168501 168496 6 1.05 7.03 Term - 173864 173357 508 1 1 126 50 298 0.467 22.69 7.02 Intr - 178319 178162 158 2 2 111 26 74 0.112 1.39 7.01 Init - 193698 193657 42 0 0 85 48 72 0.349 3.37 7.00 Prom - 195216 195177 40 -6.85 8.00 Prom + 198915 198954 40 -5.55 8.01 Init + 200067 200207 141 2 0 66 80 89 0.574 6.08 8.02 Term + 201063 201134 72 2 0 128 44 40 0.686 0.53 8.03 PlyA + 201461 201466 6 -0.45 9.04 PlyA - 202768 202763 6 1.05 9.03 Term - 204070 203888 183 1 0 61 48 221 0.996 11.96 9.02 Intr - 206338 206217 122 2 2 65 76 78 0.718 3.59 9.01 Init - 208830 208689 142 0 1 78 70 74 0.712 4.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 30317 30138 180 2 0 40 48 234 0.851 7.93 S.002 Sngl - 45545 45216 330 2 0 41 52 375 0.806 22.97 S.003 Init + 45869 46000 132 1 0 48 86 171 0.993 11.10 S.004 Init + 86771 86839 69 1 0 91 110 67 0.896 10.30 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_1|44_aa XPHSVIPTPYATAVVISVLWNNLAMEILEKMRYIKKAVDFIAAL >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_1|135_bp ngccctcatagtgtcattcctactccatatgccactgctgtggtaatcagtgtgctgtgg aacaacctggccatggaaattttggaaaagatgaggtacatcaagaaagcagttgacttt attgcagcactttga >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_2|108_aa MSESVKRPGASKQDIRLMRTYEWGGPMAHEDLQTEAVLWPVRTYIQGWSRGLEVLERGCS AAAVHFKVVPAYRAEPSVNQAQVFSLSVNCQLVGNSPLSHRCRLGEIK >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_2|327_bp atgtccgagagcgtgaagaggcctggagctagcaaacaagacataaggcttatgaggact tatgaatggggtggtcccatggcccatgaggacttacaaacagaggcggtcctatggcct gtgaggacttacatacaggggtggtcccgtggcctagaagttctggaaaggggctgtagt gctgcagctgtccactttaaggtggtgcctgcatatcgtgcagaaccatctgtaaaccag gcccaagtcttctctttgtctgtgaactgtcaacttgtgggaaactccccattaagccat agatgcaggctgggagagataaaatga >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_3|413_aa MEQTARVCVSALPLTSTVALGNVNRPFLRRAQNFRIRAQWRLSGLNPNCAQDRALRPLCG LSQRYGLRPKPAVSRRAYSRQPLEQEEENRDSLDHHKRCRACAGRNRVRGEMKETIMNQE KLAKLQAQVRIGGKGTARRKKKVVHRTATADDKKLQFSLKKLGVNMFTNQGTVIHFNNPK VQASLAANTFTITGHAETKQLTEMLPSILNQLGADSLTSLRRLAEALPKQSVDGKAPLAT GEDDDDEVPDPLRVSSRDKLTEMAASSQGNIEGNFESLDLTEFAKKQPWWRKLFGQESGP SAEKYSMATQLFIGGVTGWCMGFIFQKVGKLAATAVGGGFSLLQLANHTGYIKVDWQRVE KDMKKAKEQLKIRKSNQIPTEVRSKAEEVVSFVKKNVLVTGGFFGGFLLGMAS >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_3|1242_bp atggagcagactgccagggtctgtgtttcagctttgccacttactagcactgtggcctta ggaaacgtgaataggccctttcttcggagggcacagaacttcaggatcagggcccaatgg agactgagcggacttaaccctaactgtgcacaggaccgcgctctccgacccctttgtggc ctctcccagaggtacgggctgcgccccaagccagcagtctctcgtagggcctacagccgg cagccgctggagcaggaggaggagaatagggactcgctggaccatcacaaacgctgtcgc gcgtgcgcagggaggaaccgagttcgcggcgaaatgaaagaaacaatcatgaaccaggaa aaactcgccaaactgcaggcacaagtgcgcattggtgggaaaggaactgctcgcagaaag aagaaggtggttcatagaacagccacagcagatgacaaaaaacttcagttctccttaaag aagttaggggtgaatatgtttacaaaccaaggaacagtgatccactttaacaaccctaaa gttcaggcatctctggcagcgaacactttcaccattacaggccatgctgagacaaagcag ctgacagaaatgctacccagcatcttaaaccagcttggtgcggatagtctgactagttta aggagactggccgaagctctgcccaaacaatctgtggatggaaaagcaccacttgctact ggagaggatgatgatgatgaagttccagatcctctacgtgtgtcctcgcgagacaagctc accgaaatggccgcgtccagtcaaggaaacattgagggaaattttgagtcactggacctt acggaatttgctaagaagcagccatggtggcgtaagctgttcgggcaggaatctggacct tcagcagaaaagtatagcatggcaacccagctgttcattggaggtgtcactggatggtgc atgggtttcatattccagaaggttggaaagttggctgcaacagctgtgggaggtggattt tctctccttcagcttgcaaaccatactgggtatatcaaagttgactggcaacgagtggag aaggacatgaagaaagccaaagagcagctgaagatccgtaagagcaatcagatacctact gaggtcaggagcaaagctgaggaggtggtgtcatttgtgaagaagaatgttctagtgact gggggatttttcggaggctttctgcttggcatggcatcctaa >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_4|334_aa MDTSTNLDIGAQLIVEECPSTYSLTGMPDIKIEHPLDPNSEEGSAQGVAMGMKFILPNRF DMNVCSRFVKSLNEEDSKNIQDQIIHEAFYDTLSALSVHQLAAQGEMLYLATRIEQENVI NHTDEEGFTPLMWAAAHGQIAVVEFLLQNGADPQLLGKGRESALSLACSKGYTDIVKMLL DCGVDVNEYDWNGGTPLLYAVHGNHVKCVKMLLESGADPTIETDSGYNSMDLAVALGYRS DQHYILTGARPYCKLHNWLVPYENLMPDDLRWNSFIPKPSSSSSSSLLPPSPLLSVEKLS STKPVPGVKKTLGPRCSEQAARVVVIGPCAEDLK >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_4|1005_bp atggatacatcaacaaatctggatattggagcccagcttatcgtggaagagtgtcccagc acttatagcctaactggcatgccagacattaaaatagaacatccactggacccaaattca gaagaagggtcagctcagggtgttgccatgggaatgaaattcatattgcctaaccgattt gatatgaatgtgtgttctcgatttgtgaagtccttaaatgaagaagatagtaaaaatatt caagatcagataattcatgaagcattttatgatactctttcagctttgtctgttcaccag ttggctgctcagggagagatgctctatctggctactcgtatcgaacaagaaaatgttatc aatcacacggatgaagaaggatttactcctctgatgtgggctgcagcacacgggcaaata gctgtggtagagttcctacttcagaatggtgctgatccccaacttttaggaaaaggtcga gaaagtgcactgtcgttggcctgtagtaaaggctacacagatattgtcaaaatgctgctt gattgtggagttgatgtaaatgaatatgattggaatggaggaacacctctgctttatgct gtacatggaaatcatgtgaaatgtgtaaagatgctcttagaaagtggggctgatccaaca attgaaactgactctggatataattctatggatctagctgtagccctaggctatagaagt gatcagcattacatcctcacaggagcacgaccctactgtaaactgcacaactggcttgtt ccttatgagaatctaatgcctgatgatctgaggtggaacagtttcatcccaaaaccatca tcatcctcctcctcgtccctcctcccaccctcacccctcctgtccgtggaaaaattatct tccacgaaaccagtccctggtgtcaaaaagaccttgggtccacgatgctccgagcaagca gcaagggtggtggttattggcccgtgtgctgaagatctcaaatga >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_5|699_aa MLVPVDANNRSPSFLPDPGIILQGEEGAMKINDASQATGVPASRDRSPLSPAGGETLAGK KVRSVATGQLCPKSPRKKLSLIRGERIPGTGAGGRPGESARSRDHRRQQTASRAFPSEAS FSTGLQARACFGRDVTSDFFFRLLAKKRSAAMQLKLSGYIGEWPNSLTGSSSPQLSFDLG SSLRLHPFLSTVVSSRSNQQLSKAMDTFPSLPLLTRLPQTPVQIKEFGAVSKVDFSPQPP YNYAVTASSRIHIYGRYSQEPIKTFSRFKDTAYCATFRQDGRLLVAGSEDGGVQLFDISG RAPLRQFEGHTKAVHTVDFTADKYHVVSGADDYTVKLWDIPNSKEILTFKEHSDYVRCGC ASKLNPDLFITGSYDHTVKMFDARTSESVLSVEHGQPVESVLLFPSGGLLVSAGGRYVKV WDMLKGGQLLVSLKNHHKTVTCLCLSSSGQRKVKVYSTTSYKVVHSFDYAASILSLALAH EDETIVVGMTNGILSVKHRKSEAKKESLPRRRRPAYRTFIKGKNYMKQRDDILINRPAKK HLELYDRDLKHFRISKALDRVLDPTCTIKTPEITVSIIKELNRRGVLANALAGRDEKEIS HVLNFLIRNLSQPRFAPVLINAAEIIIDIYLPVIGQSPVVDKKFLLLQGLVEKEIDYQRE LLETLGMMDMLFATMRRKEGTSVLEHTSDGFPENKKIES >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_5|2100_bp atgctagttcctgtggatgctaataacaggagcccctcgtttcttcctgatcctggaatt attctacaaggagaggaaggtgctatgaaaataaatgatgcctcacaggcgacaggtgtc ccagcgtcccgagacaggagcccgctgtctcccgctggaggggagactcttgcaggaaag aaagtccgttctgtcgccaccggccagctgtgcccgaaaagtcccaggaagaaactctcg ctcatccgcggcgagcggatcccgggcacaggggctggaggccgacccggtgagagtgca aggtctcgcgaccaccgacgacagcagacagcgagtcgggccttcccatctgaggccagc ttcagtacaggcctccaagcccgggcttgttttggccgcgacgtcacttccgatttcttc tttcgccttctggctaaaaaacgttccgcagcaatgcagctgaaactttcgggctatata ggagagtggccaaattctttaaccggctcaagttcccctcaattaagcttcgacctaggt tcttctctgcgattacaccctttcctgtccaccgttgttagttcccggagtaatcagcaa ctctccaaggcaatggacacatttccatctttgcctttgcttacgaggctccctcagacc cctgttcagattaaggaatttggtgcagtttcaaaagtagacttttctcctcagcctcca tataattatgctgtcacagcttcctcaagaattcacatttatggccgatactcccaagaa cctataaaaaccttttctcgatttaaagacacagcatactgtgctacttttcgacaagat ggtagattgcttgtggctggcagtgaagatggtggagttcaactttttgatataagtggg agggctcccctcaggcagtttgaaggccatacaaaagcagttcatacagtagattttaca gctgacaaatatcacgtggtctctggggctgatgattatacagttaaattatgggatatt ccaaactccaaagaaattttgacatttaaagaacactctgattatgtgaggtgtggatgt gctagcaaacttaatccggatctctttataacaggatcatatgatcatactgtgaagatg tttgatgcacgaacgagtgagagtgttctctccgttgagcatgggcagccagtggagagt gtcctacttttcccctctggaggtcttctggtgtcagcaggaggtcgttatgttaaagtc tgggacatgttaaaaggaggacaattgctagtatctttgaaaaatcatcacaaaaccgtg acatgtttatgtctaagcagctctggacagaggaaggtgaaagtatacagcacaacttcc tacaaagtagtccacagttttgattatgcagcttcaattttgagtcttgcccttgcacat gaagatgagacaatagttgtaggaatgaccaatggaatactgagtgttaaacatcggaaa tctgaagcaaagaaggaatcacttcccagaagaagaaggcctgcatatcgaacctttatt aaaggaaaaaattacatgaagcaacgggatgacattttgattaacaggccagcaaagaag cacctagaattgtatgacagggatctgaaacattttcggatctctaaggcactcgataga gttcttgatcccacttgtacaataaagacacccgagattacggtgtccatcataaaggag ttaaatcgaagaggagtccttgcaaatgcgcttgcaggtcgggatgagaaggaaatcagt catgttcttaattttttgataaggaatctttctcagccaagatttgcccctgttttaatc aatgctgctgaaataattattgatatatatctgcctgtaattggtcagtcccctgtagtt gataaaaagtttttactacttcaaggacttgtagaaaaagagattgattaccaaagagaa ttgttagaaaccttggggatgatggatatgctttttgccaccatgagaaggaaggaaggc acttctgtgttggaacacacatctgatggatttccagagaataagaagatagaatcatag >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_6|178_aa MRKIQKRKPLIKPSDLMRLIHYRENSMGETAPMIQIISHQVPLTTHGNYGSTIQDEIWVG TRRNHIIVLSICPASLESLLSYKVHFFHGASLASPAFIGLSPSNFQALTYRTIWQEAVPP SAGHYWLSVWVCLSDSSDVQEGWFPLYPKPFETGLCNRTFSDDGNILDLCCPMGTCGY >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_6|537_bp atgaggaagatacaaaagcggaaacccctgataaaaccatcagatctcatgagacttatt cactaccgtgagaacagtatgggggaaactgcccccatgattcaaattatctcccaccag gtccctctcacaacacatgggaattatggaagtacaattcaagatgagatttgggtgggg actcggcgaaaccatattattgtccttagtatctgccctgcttcattagaatccttactg tcctacaaagtccacttcttccatggagcatctttggcttctccagccttcattggcctc tctccttccaacttccaggccttaacataccgcacaatttggcaggaagccgtaccaccg agcgctggtcactattggctctccgtgtgggtctgcctttctgactcttctgatgttcag gaagggtggttccctctctacccaaagccctttgaaacaggactgtgcaacagaactttc tctgatgatggaaatattcttgatctgtgctgtccaatgggaacatgtggctattga >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_7|235_aa MESGNTKEHAAKKQSYTTPLLHEGFFRRRGLGQPYGPHKESGLACLKSLWTKSRKITPSH LNLMTVWSAVYFPVSRRTLAGLSPAQPPALSAPERPRAPQLPPPHEAAPPSPLVAAPEGA GGGAQPSRRSACSGSPVTSADPRIPGDPGESGPADANSGGQTFPPRAAGKRRRWSPKEHL PGAMAPQPPEPRRGSAPAPCLRAPRNAPPASRRPESGRLSPAMPTPGGGRAEAKV >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_7|708_bp atggagagcgggaacaccaaggaacatgctgcaaaaaagcagagctataccacccctctt ctacatgaggggttcttcagaagaagaggactagggcagccatatggcccacacaaggaa tctggacttgcatgtctgaaatctctttggactaagtccaggaagataactccaagtcat ttaaacctgatgactgtctggagtgctgtttatttccctgtctcccgaaggacactggcg gggctttcacccgcccagcccccggctctcagtgcgcccgagcgaccccgcgctccgcag ctgccgccgccgcacgaggctgcgccgccctctccacttgttgccgctcccgagggagcg ggaggcggcgcccagccctcgcgccgcagcgcctgcagcggttccccggtcacctctgct gaccctcgcatccctggggaccctggagaaagtggccctgcggatgcgaactcaggtgga caaacttttccgcccagagctgcagggaaacgcagacgatggagtcccaaagagcattta cctggagcgatggccccgcagcctccagaacccaggcgaggctctgcccccgcgccctgc ctgagggcgcctcggaacgcgcccccagccagccgccgcccggagtccggaaggctctcc ccggcgatgccaacgcctggcggcggccgggctgaagccaaggtttag >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_8|70_aa MEEPKSPHSWTIGFSVMPQGARDTSSIVKGTTSWKERCRQCVVNVYQEDSTEVEVYMWYA GILNRPPWVE >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_8|213_bp atggaggaacccaaatctcctcattcctggaccattggtttttcagtgatgcctcagggt gcaagggacacatcttccattgttaagggaactacttcttggaaagaaagatgccgtcag tgtgtagttaatgtgtatcaggaagatagcacagaggtggaagtgtatatgtggtatgca gggatactcaatcggccaccctgggtagaatga >gi568815593r:73452800_73662881|GENSCAN_predicted_peptide_9|148_aa MASKCSGERKSCMSLTSNQKQEIIELSEKGTSKAKIAESRPLVPNSQAAYTLIQNTDSKM GTQKAVSIQSLGQAGLLAKLQEIYSPLPEQQEGQYVCSRVKWRVIEDEDTEENEGSIMQG LVGPPKDVDFSSEYAEDPFEGSELSDII >gi568815593r:73452800_73662881|GENSCAN_predicted_CDS_9|447_bp atggcctctaagtgttcaggtgaaaggaagagttgcatgtctctcacttcaaatcaaaag caagaaattattgagcttagtgagaaaggcacatcaaaagccaagatagctgaaagcagg cctcttgtgccaaacagtcaagcagcctataccttgattcagaacactgactcaaagatg ggaacacagaaggcagtcagtatacagagtttggggcaagcaggcctattagccaagtta caagaaatatacagtcccctgcctgagcagcaagaaggccagtatgtctgcagcagagtg aagtggagagtaatagaagatgaagacacagaggaaaatgaaggatctattatgcaaggc cttgtaggtcctcctaaggacgttgacttttcttctgagtatgcagaggatccatttgag ggttctgagctgagtgacataatctga