GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:46:22 Sequence gi568815584r:60546286_60749189 : 202904 bp : 40.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 50 45 6 1.05 1.02 Term - 4549 4248 302 2 2 61 42 168 0.639 3.90 1.01 Init - 8053 7906 148 1 1 85 68 64 0.354 4.40 1.00 Prom - 17698 17659 40 -5.05 2.00 Prom + 24563 24602 40 -4.95 2.01 Init + 59296 59680 385 0 1 78 31 214 0.636 11.76 2.02 Intr + 63185 63313 129 0 0 98 78 73 0.068 7.05 2.03 Intr + 70424 70528 105 0 0 90 98 71 0.984 7.57 2.04 Term + 71843 72003 161 0 2 73 41 181 0.971 9.02 2.05 PlyA + 72185 72190 6 1.05 3.07 PlyA - 75011 75006 6 1.05 3.06 Term - 91545 91334 212 1 2 81 49 177 0.393 9.57 3.05 Intr - 92308 92251 58 1 1 60 97 15 0.593 -2.86 3.04 Intr - 95443 95278 166 0 1 69 96 171 0.932 15.04 3.03 Intr - 96009 95816 194 0 2 74 80 80 0.774 3.17 3.02 Intr - 100292 100016 277 1 1 84 3 164 0.612 4.10 3.01 Init - 102904 102345 560 1 2 85 107 568 0.975 50.91 3.00 Prom - 105127 105088 40 -7.05 4.06 PlyA - 105627 105622 6 -0.45 4.05 Term - 106474 106089 386 2 2 73 48 145 0.430 3.07 4.04 Intr - 109232 109019 214 2 1 47 37 156 0.517 3.77 4.03 Intr - 110478 110194 285 0 0 72 59 168 0.658 8.91 4.02 Intr - 111000 110791 210 0 0 107 76 165 0.475 15.39 4.01 Init - 111613 111392 222 1 0 57 59 189 0.609 9.53 4.00 Prom - 113151 113112 40 -4.55 5.03 PlyA - 113454 113449 6 1.05 5.02 Term - 142592 142408 185 0 2 35 44 234 0.838 10.42 5.01 Init - 150003 149955 49 0 1 86 58 40 0.435 -0.04 5.00 Prom - 164522 164483 40 -6.35 6.05 PlyA - 165641 165636 6 1.05 6.04 Term - 167918 167122 797 0 2 100 43 573 0.946 46.45 6.03 Intr - 174160 173421 740 0 2 80 87 410 0.630 30.25 6.02 Intr - 175860 175468 393 2 0 -6 0 319 0.463 6.54 6.01 Init - 177726 176927 800 0 2 35 105 867 0.556 75.32 6.00 Prom - 178620 178581 40 -8.45 7.00 Prom + 184421 184460 40 -5.15 7.01 Sngl + 184475 184861 387 1 0 71 39 186 0.897 7.96 7.02 PlyA + 185540 185545 6 1.05 8.03 PlyA - 186296 186291 6 1.05 8.02 Term - 188690 188381 310 1 1 116 49 326 0.983 25.05 8.01 Intr - 202479 202332 148 1 1 11 25 173 0.091 1.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 67687 67741 55 0 1 59 60 59 0.892 1.70 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_1|149_aa MMKSVTGNMMKKRSSLPHCESFTNKRPSISVWFLPWVIEQKEVFLKACTDMNPGTLSSFV STSQPEKQPHGMPHGTPLANMPPGQPSSCTSMSQAREAALWPTPSRHTPMPAEKPCDHVL GLRNSPVGCPLWTCLQASQATTCPCALPE >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_1|450_bp atgatgaagtcagtgacggggaacatgatgaagaagagatcttccttgcctcactgtgag tccttcaccaacaagcggccatccatttctgtctggttcctgccctgggtgattgaacaa aaggaggtttttttgaaagcatgcactgacatgaatccaggcacgctgagcagctttgta tccacatcccagcctgagaaacagccccatgggatgccccatgggacacccctggcaaat atgcccccaggtcaaccaagcagctgtacctccatgtcccaagctagagaagcagccctg tggcccacccccagcaggcacacacccatgccagctgagaagccctgtgaccatgtcctg ggcctgagaaacagccctgtgggctgccccttgtggacatgcctccaggccagccaagca actacatgcccatgtgctctgccagagtaa >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_2|259_aa MVSCDPAASAPAVAIRGQCTAQAAHTAQAIASEGARPKPWWLTCGVEPVGAQKSRIEVWE PLSRFQRMYGNAWMSRQKFAAGVESSWKTSAGVVWKVKVRLEAPNGVPTGALGGRAVRRG PPPFRPQNAILPLSQEVKFVTSNHKSPEFFHTNKHGATGESETYRKEEVKRGPLRAVFPH LPPPFSNKVQNQQSVLTSTCKRLRAPGFSLLTFQHVPTHRSSYIPHLVVSDTQTGYRVCA NKMTTKPDLPLGTLEPVEV >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_2|780_bp atggtgtcctgtgacccagctgcctcagctccagctgtggctataaggggccaatgtaca gctcaggctgcacatacagctcaggccattgcctcagagggtgcaagacccaagccttgg tggcttacatgtggtgttgaacctgtgggtgcacagaagtcaagaattgaggtttgggaa cctctgtctagatttcagaggatgtatggaaatgcctggatgtccaggcagaagtttgct gcaggggtggagtcctcatggaaaacttctgctggggtagtgtggaaagtaaaggtgagg ttggaggccccaaacggagtccccactggggcactgggtggcagagctgtgagaagaggg ccaccacccttcagaccccagaatgctattcttcctctttctcaagaggtcaagtttgtt accagcaatcacaagagccctgagttcttccacaccaacaaacatggtgccactggggaa tctgagacatacagaaaggaggaggtgaaaagaggccccttaagagctgtcttcccacac cttccaccccctttctccaataaagtccaaaatcagcaatcagtgttgacatcaacgtgc aaacggcttagagccccaggtttttccctactaacttttcagcatgttcccacccatcga agttcttatattccacatttagtagtatcggatacacaaacaggctacagagtctgtgca aataaaatgactaccaagcctgatcttccattgggaactctagagcccgtggaagtttga >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_3|488_aa MSMLPSFGFTQEQVACVCEVLQQGGNLERLGRFLWSLPACDHLHKNESVLKAKAVVAFHR GNFRELYKILESHQFSPHNHPKLQQLWLKAHYVEAEKLRGRPLGAVGKYRVRRKFPLPRT IWDGEETSYCFKEKSRGVLREWYAHNPYPSPREKRELAEATGLTTTQVSNWFKNRRQRDR AAEAKERENTENNNSSSNKQNQLSPLEGGKPLMSSSEEEFSPPQSPDQNSVLLLQGNMGH ARSSNYSLPGLTASQPSHGLQTHQHQLQDSLLGPLTSSLAPRFPHRADGAGLGSWFGGGL GPAADSPVPFLLQTPPVSFFASWGLGFGFRTQTLHPRALRRGASFPLEGTLKTLGDYLLQ LPEASALVCVPTAPPWTIHTEGPEGSVSSPARVFRDLSQFPQVLNPTKMRIPQTKKLQGS TRSGKEKPLPPALRRPLQASAASALGAWAARPPGHEGPEAPEGLSGGPAQPLRPLLARLE APGLQASS >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_3|1467_bp atgtcgatgctgccgtcgtttggctttacgcaggagcaagtggcgtgcgtgtgcgaggtt ctgcagcaaggcggaaacctggagcgcctgggcaggttcctgtggtcactgcccgcctgc gaccacctgcacaagaacgagagcgtactcaaggccaaggcggtggtcgccttccaccgc ggcaacttccgtgagctctacaagatcctggagagccaccagttctcgcctcacaaccac cccaaactgcagcaactgtggctgaaggcgcattacgtggaggccgagaagctgcgcggc cgacccctgggcgccgtgggcaaatatcgggtgcgccgaaaatttccactgccgcgcacc atctgggacggcgaggagaccagctactgcttcaaggagaagtcgaggggtgtcctgcgg gagtggtacgcgcacaatccctacccatcgccgcgtgagaagcgggagctggccgaggcc accggcctcaccaccacccaggtcagcaactggtttaagaaccggaggcaaagagaccgg gccgcggaggccaaggaaagggagaacaccgaaaacaataactcctcctccaacaagcag aaccaactctctcctctggaagggggcaagccgctcatgtccagctcagaagaggaattc tcacctccccaaagtccagaccagaactcggtccttctgctgcagggcaatatgggccac gccaggagctcaaactattctctcccgggcttaacagcctcgcagcccagtcacggcctg cagacccaccagcatcagctccaagactctctgctcggccccctcacctccagtctggcc ccgcgcttcccgcacagggcagatggcgcaggccttggatcctggttcggaggcggccta ggccctgctgccgattctccagtgcccttcttgcttcagacacctcctgtctccttcttc gcttcctggggtcttggctttggttttcggacccaaactcttcacccgcgcgctctgagg aggggtgccagctttcccttggaggggactctaaaaactctcggagactacctccttcag ctgcccgaagccagcgcactagtgtgtgtccccaccgcgcccccatggacaattcacact gagggacccgaaggctcagtctcatcgcctgcacgcgtgttccgcgaccttagccagttt ccccaagttctaaatcctaccaagatgagaatcccacaaactaagaaattgcaagggtca acccggagtgggaaggaaaagccgctgcctccggccctgcgtcgacccctgcaggctagc gcggcctctgctctaggggcttgggccgcgcggcctccaggccatgagggcccagaggcc cccgagggcctcagcggaggcccagcccagcctttgcggcccctacttgcccgcttggag gctccaggactccaggccagctcttag >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_4|438_aa MRGQKHCHLLRRFNANDWLGSRLPAADSTKPRSQLGFMKGDPDFPARLGQIEAPHSPAKR RRPYPNASSFWREKAPAPLPSSPDACGPHAPPEGTTPESAGPARRDSATSGYKPGRSRPR RRRGAQELPHSPGREFLVQAAGRREGFVKHCSGVGNLVAKSKPKRVGETRRGAWMSGHLP STLYPVSLFPLLRGLVIQTNAARPFWEQRRSSFCLRKSGSFRTRAICSCENAKTDPTAWG EFGPGKWGKEKRVAKGFRVAAPRVQTSALPSRPACCGSHAVRVSLSFSSNQPPHLEVISS REETCPPGISGRVESSPSFWMVKEFQESGPGAPNYLATSQKGNKWVPARGVCIKACQPSR ETLATANAVPLNAFSSTKAHAAVTCTEEALESRNLASGAPCQRCSLPSRAAARAAAGWLE LLISFKSPTHLCFSCDLH >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_4|1317_bp atgcgcggccaaaagcattgccacttgctgcggcgcttcaacgcgaatgactggctgggg tcgcggcttccggccgcagattccacgaaaccgagaagccaactcggcttcatgaaagga gatcccgatttccccgctcgcctcgggcaaattgaggcacctcattcgccagcgaagcgt cgccgtccttatcccaacgcctccagtttttggagggaaaaagccccggcccctcttccc tcaagccccgacgcctgtgggccccatgcaccgcccgagggaacaaccccagagagcgcc gggccagcccggcgagatagcgcgacctctggctacaagcctgggcggagtcggccccgg cgcagacgcggcgcccaagagctcccgcacagcccggggagggagtttctggtgcaggcg gcggggcgacgggagggctttgtcaagcactgcagcggtgttggaaaccttgttgcaaag tccaagcctaagcgagtgggcgagacgcgcagaggggcatggatgagtgggcaccttccc agcaccctctacccagtatctcttttcccactgctcagaggactggtgattcagacaaac gccgctcggcccttttgggaacagaggcggtcgagcttctgtctgcgaaaatctggatcc tttagaacccgagcaatatgctcgtgtgagaatgcaaaaacagatcccacggcttggggt gagtttgggcctggaaaatgggggaaagaaaagagagtagcgaaaggttttcgggttgcg gcgccccgcgtgcagacgtctgctctccccagccgcccagcctgctgtggcagccacgct gtgcgcgtgtctttatccttcagttcaaaccagcccccacatcttgaagtaatcagcagc cgagaagagacttgtccccctggcatctcaggtagagtagaatcctcaccgtcgttctgg atggtgaaagaattccaagaaagcggcccgggagcccctaactatcttgctacaagccag aagggaaacaagtgggtcccagctaggggtgtttgcatcaaggcctgtcaacccagccga gaaaccctagcaacagctaatgctgtcccactgaatgctttttcatcgactaaagcgcac gcggccgttacctgcacagaggaggcactggagagccgaaacctggcatcgggcgctcct tgccagcggtgttccctcccgagccgcgctgcagcgagggcagcagccggctggctagag ttgctcatttcctttaaatctccaacgcatctctgcttttcgtgcgatttgcattga >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_5|77_aa MGFLHVGQAGLELLTSGSWRFEQRIGQNVQQSKERMMQRKNKCRDLLQMKVRSTAWERPE QRPKGSRYRIFSGPNNG >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_5|234_bp atggggtttctccatgttggtcaggctggtctcgaactcctgacctcaggttcttggcgt tttgaacaaagaattggacaaaatgtacagcagagcaaggaaagaatgatgcagcgaaag aacaaatgcagagatttattgcaaatgaaagtacgctccacagcgtgggagcggccagag cagcggcccaagggctccagatacagaatcttctccggtccaaacaacggctag >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_6|909_aa MESASEGQEAHREVAGGAAVGLSPPAPAPFPLEPGDAATAAARVSGEEGAVAAAAAGAAA DQVQLHSELLGRHHHAAAAAAQTPLAFSPDHVACVCEALQQGGNLDRLARFLWSLPQSDL LRGNESLLKARALVAFHQGIYPELYSILESHSFESANHPLLQQLWYKARYTEAERARGRP LGAVDKYRLRRKFPLPRTIWDGEETVYCFKEKSRNALKELYKQNRYPSPAEKRHLAKITG LSLTQVSNWFKNRRQRDRNPSETQSKRRERIQKQLEGSSRGSKLGAGDPARLRQVKVITS AFQSRSQPALAASGPLWGSDGSGSGVPRPRLGGRLGALSGPRGNQNKRCTGMCVRLPQIC GEVQRRRFGYAEPPALRPLEAADPALGNQRNPQGRGRGESDGNPSTEDESSKGHEDLSPH PLSSSSDGITNLSLSSHMEPVYMQQIGNAKISLSSSGVLLNGSLVPASTSPVFLNGNSFI QGPSGVILNGLNVGNTQAVALNPPKMSSNIVSNGISMTDILGSTSQDVKEFKVLQSSANS ATTTSYSPSVPVSFPGLIPSTEVKREGIQTVASQDGGSVVTFTTPVQINQYGIVQIPNSG ANSQFLNGSIGFSPLQLPPVSVAASQGNNLIWYLNAPANVFISCCNISVSSSTSDGSTFT SESTTVQQGKVFLSSLAPSAVVYTVPNTGQTIGSVKQEGLERSLVFSQLMPVNQNAQVNA NLSSENISGSGLHPLASSLVNVSPTHNFSLSPSTLLNPTELNRDIADSQPMSAPVASKST VTSVSNTNYATLQNCSLITGQDLLSVPMTQAALGEIVPTAEDQVGHPSPAVHQDFVQEHR LVLQSVANMKENFLSNSESKATSSLMMLDSKSKYVLDGMVDTVCEDLETDKKELAKLQTV QLDEDMQDL >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_6|2730_bp atggaaagcgcctcggaagggcaggaggcgcaccgagaagtggcggggggcgcggcggta gggctgagccccccggctccagccccttttcccctggagccgggggacgccgcgaccgct gccgccagggtgagcggagaggaaggggcagtggcggcggcggcggccggagcggcggcg gatcaggtacaactccactcggaacttctgggcaggcaccaccacgccgccgccgccgcc gcgcagaccccgctggccttctcgcccgaccacgtcgcctgcgtgtgcgaggcactgcag caggggggcaacctggaccgcctggcccggttcctgtggtccctgccccagagcgacctg ctacgtggcaacgagagcctgctgaaggcgcgggcgctcgtggccttccaccagggcatc taccccgagctctacagcatcctcgagagccacagcttcgagtcggccaaccacccgctg ctgcagcagctctggtacaaggcgcgctacaccgaggccgagcgagcccgcggccggccg ctgggagccgtagacaagtaccggctgcgcaggaaattccccctgccccgcaccatctgg gacggcgaggagacggtgtattgtttcaaggagaagtcgcgcaacgcgctcaaggagctc tacaagcagaatcgctacccttcgcccgccgagaagcggcacctggccaagatcaccggc ctctccctcacccaggtcagcaactggttcaagaaccgccggcagcgcgacaggaacccc tccgagacccagtccaaaaggagagaaagaatccaaaagcagctcgaaggttcttctcgg ggaagcaaactgggagccggggatccagcccgcctgcgccaggtgaaggtgatcaccagc gcattccagagccggtctcagcccgcccttgccgcttctgggcccctgtgggggtccgac ggctcgggctccggcgttcctcgcccaaggctgggagggaggcttggtgccctatccggc cctcgcggtaaccaaaacaaaaggtgcaccgggatgtgcgtgcgccttccgcagatatgc ggagaggtccagagaaggcgctttggttacgccgagccacctgccctgcgcccactagag gccgcggatcccgcgctcggaaaccaacggaatccgcagggccggggcagaggtgagtca gatggcaaccccagcactgaagatgaatccagcaagggacatgaggatttatctcctcac ccactctccagttcatctgatggcatcaccaacctcagcctttccagtcatatggagcca gtatatatgcaacaaattggaaatgctaagatatcattaagctcttctggagttctgttg aatggaagcttggtacctgcaagtacttcacctgtcttccttaatggaaattcttttatt cagggacccagtggagttatccttaatggattaaatgtgggaaatacacaggcagtggca ttgaacccaccaaaaatgtcatcaaacattgtgagcaatggtatatccatgactgacata ctggggtctacttcccaggacgtgaaggaattcaaagtcctccagagttctgctaactca gcaaccaccacgtcctacagccccagtgtccctgtctcattcccaggcctgatacccagc actgaggtgaaaagagaaggcattcaaacagtggcttcccaagatggagggtctgtagtg acttttactacaccagtgcaaattaaccagtatggcattgtccagatccccaattccgga gcaaacagccagttccttaatgggagcattggattctctccactgcagctgccccctgtg tcagtggcagcttcacaaggtaacaatctcatttggtaccttaatgcaccagcaaatgtg ttcatcagctgctgtaatatctcagtaagctcaagcacttcagatggaagcacatttaca agtgagtctaccacagtccagcaaggaaaggttttcttgagctctcttgctcccagtgca gtggtatacacggttcctaatacaggccagactataggatctgtgaaacaggaaggcttg gaaaggagcctggtattttctcagttgatgcctgtcaatcagaatgcacaagtaaatgca aacctgtcttctgaaaacatctcggggagtggcctgcatccactggcctcctcattagtt aatgtatctccaactcacaatttttctctcagtccctctacactactaaatcccactgag ctaaaccgcgacattgccgatagccaaccaatgtctgcaccggtggcaagcaaatctact gtgacatctgtcagcaacactaactatgcaactcttcagaactgctcccttattactggt caagacctattgtcagtccctatgactcaggctgcccttggggaaatagttcctacagct gaagatcaggtaggtcacccctccccagcagtacatcaggattttgtccaagaacatcgt ttggttctgcaatcggtagctaacatgaaagagaatttcttatcaaattctgagagcaaa gcaacaagtagcttaatgatgctggactctaaatccaagtatgtcttagatggcatggtt gatactgtctgtgaagacctggaaacagacaaaaaagagcttgccaagctccagactgtc cagctggatgaagatatgcaagacttatga >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_7|128_aa MGKDFMTKTPKAMATKAKIDKWDLMKLKSFCTAKETTIRMNRQPTDWEKIFGIYSSDKGL ISRIYKELKQIYKKKVKQPHQKVDKGYEQTLFQRRDLCSQQTHETMLIITGHQRNANQNH NEIPPHTS >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_7|387_bp atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagctaaaatagac aaatgggatctaatgaaactaaagagcttctgcacagcaaaagaaactaccatcagaatg aacaggcaacctacagactgggagaaaatttttggaatctactcatctgacaaagggcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaagtcaaacaaccccat caaaaagtggacaaaggatatgaacagacacttttccaaagaagagatttatgcagccaa cagacacatgaaacaatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccac aatgagataccacctcacaccagttag >gi568815584r:60546286_60749189|GENSCAN_predicted_peptide_8|152_aa XSADHSGSILEEGIVITEDDNFMHVIALEDSPMRKNVERGDSDIDDPDPVESTAAQLTEC VRTHSPSASRRGSDIWWSYTEGNPDRPWRFPSYRLSQAPVSARATYKPPQTRPSRFLPTG AWQRRSLRERAPARGEALQHKRSGSQFPRSCI >gi568815584r:60546286_60749189|GENSCAN_predicted_CDS_8|459_bp ncctcagcagatcattccggaagtattctagaagaaggcattgttatcacagaagatgac aacttcatgcatgttattgctctcgaagactctccaatgagaaaaaacgtggagagggga gacagtgatattgatgatcctgaccctgtggaatccactgccgcccaactcacagagtgt gtccgcacacattcaccatcagcttcaaggaggggttccgatatttggtggtcttacacc gagggcaaccctgatcgtccatggcggtttccctcctacagactctcgcaggcgcctgtt tcagccagagccacctacaagccccctcagacgcgaccaagcaggttcctaccaacaggc gcttggcagagacggtcccttcgcgaaagagcaccggcaaggggcgaggcgctgcaacac aaacgttccggcagtcagttcccccggtcttgcatctag