GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:04:37

Sequence gi568815584r:60613410_60824074 : 210665 bp : 40.01% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    563    617   55  1  1   59   60    59 0.714   1.70
 1.02 Intr +   3300   3404  105  1  0   90   98    71 0.984   7.57
 1.03 Term +   4719   4879  161  1  2   73   41   181 0.971   9.02
 1.04 PlyA +   5061   5066    6                               1.05

 2.07 PlyA -   7887   7882    6                               1.05
 2.06 Term -  24421  24210  212  2  2   81   49   177 0.393   9.57
 2.05 Intr -  25184  25127   58  2  1   60   97    15 0.593  -2.86
 2.04 Intr -  28319  28154  166  1  1   69   96   171 0.932  15.04
 2.03 Intr -  28885  28692  194  1  2   74   80    80 0.774   3.17
 2.02 Intr -  33168  32892  277  2  1   84    3   164 0.612   4.10
 2.01 Init -  35780  35221  560  2  2   85  107   568 0.975  50.91
 2.00 Prom -  38003  37964   40                              -7.05

 3.06 PlyA -  38503  38498    6                              -0.45
 3.05 Term -  39350  38965  386  0  2   73   48   145 0.430   3.07
 3.04 Intr -  42108  41895  214  0  1   47   37   156 0.517   3.77
 3.03 Intr -  43354  43070  285  1  0   72   59   168 0.658   8.91
 3.02 Intr -  43876  43667  210  1  0  107   76   165 0.475  15.39
 3.01 Init -  44489  44268  222  2  0   57   59   189 0.609   9.53
 3.00 Prom -  46027  45988   40                              -4.55

 4.03 PlyA -  46330  46325    6                               1.05
 4.02 Term -  75468  75284  185  1  2   35   44   234 0.838  10.42
 4.01 Init -  82879  82831   49  1  1   86   58    40 0.435  -0.04
 4.00 Prom -  97398  97359   40                              -6.35

 5.05 PlyA -  98517  98512    6                               1.05
 5.04 Term - 100794  99998  797  1  2  100   43   573 0.946  46.45
 5.03 Intr - 107036 106297  740  1  2   80   87   410 0.630  30.25
 5.02 Intr - 108736 108344  393  0  0   -6    0   319 0.463   6.54
 5.01 Init - 110602 109803  800  1  2   35  105   867 0.556  75.32
 5.00 Prom - 111496 111457   40                              -8.45

 6.00 Prom + 117297 117336   40                              -5.15
 6.01 Sngl + 117351 117737  387  2  0   71   39   186 0.897   7.96
 6.02 PlyA + 118416 118421    6                               1.05

 7.03 PlyA - 119172 119167    6                               1.05
 7.02 Term - 121566 121257  310  2  1  116   49   326 0.983  25.05
 7.01 Init - 128466 128393   74  0  2   59   92    20 0.572   0.09
 7.00 Prom - 142406 142367   40                              -3.15

 8.00 Prom + 143409 143448   40                              -2.85
 8.01 Init + 166569 166848  280  2  1   97   65   262 0.642  22.02
 8.02 Term + 167041 167186  146  2  2   41   49   103 0.360  -1.21
 8.03 PlyA + 169436 169441    6                               1.05

 9.00 Prom + 175476 175515   40                              -7.05
 9.01 Init + 178225 178292   68  0  2   48   97    55 0.976   3.00
 9.02 Intr + 182808 182960  153  0  0  116   95    97 0.983  11.47
 9.03 Intr + 184678 184751   74  1  2   70   84    69 0.986   2.83
 9.04 Intr + 194916 195019  104  1  2   78   28    86 0.979   0.57
 9.05 Intr + 198578 198718  141  1  0   56   84   180 0.991  14.03
 9.06 Intr + 205313 205438  126  1  0   60  103    56 0.251   4.26
 9.07 Intr + 209610 209741  132  2  0   74   90    35 0.077   2.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_1|106_aa
MGTPPTTTNGVCIASWQAGPLRAVFPHLPPPFSNKVQNQQSVLTSTCKRLRAPGFSLLTF
QHVPTHRSSYIPHLVVSDTQTGYRVCANKMTTKPDLPLGTLEPVEV
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_1|321_bp
atgggtactccaccaactaccactaatggtgtttgtattgccagctggcaggcaggcccc
ttaagagctgtcttcccacaccttccaccccctttctccaataaagtccaaaatcagcaa
tcagtgttgacatcaacgtgcaaacggcttagagccccaggtttttccctactaactttt
cagcatgttcccacccatcgaagttcttatattccacatttagtagtatcggatacacaa
acaggctacagagtctgtgcaaataaaatgactaccaagcctgatcttccattgggaact
ctagagcccgtggaagtttga
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_2|488_aa
MSMLPSFGFTQEQVACVCEVLQQGGNLERLGRFLWSLPACDHLHKNESVLKAKAVVAFHR
GNFRELYKILESHQFSPHNHPKLQQLWLKAHYVEAEKLRGRPLGAVGKYRVRRKFPLPRT
IWDGEETSYCFKEKSRGVLREWYAHNPYPSPREKRELAEATGLTTTQVSNWFKNRRQRDR
AAEAKERENTENNNSSSNKQNQLSPLEGGKPLMSSSEEEFSPPQSPDQNSVLLLQGNMGH
ARSSNYSLPGLTASQPSHGLQTHQHQLQDSLLGPLTSSLAPRFPHRADGAGLGSWFGGGL
GPAADSPVPFLLQTPPVSFFASWGLGFGFRTQTLHPRALRRGASFPLEGTLKTLGDYLLQ
LPEASALVCVPTAPPWTIHTEGPEGSVSSPARVFRDLSQFPQVLNPTKMRIPQTKKLQGS
TRSGKEKPLPPALRRPLQASAASALGAWAARPPGHEGPEAPEGLSGGPAQPLRPLLARLE
APGLQASS
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_2|1467_bp
atgtcgatgctgccgtcgtttggctttacgcaggagcaagtggcgtgcgtgtgcgaggtt
ctgcagcaaggcggaaacctggagcgcctgggcaggttcctgtggtcactgcccgcctgc
gaccacctgcacaagaacgagagcgtactcaaggccaaggcggtggtcgccttccaccgc
ggcaacttccgtgagctctacaagatcctggagagccaccagttctcgcctcacaaccac
cccaaactgcagcaactgtggctgaaggcgcattacgtggaggccgagaagctgcgcggc
cgacccctgggcgccgtgggcaaatatcgggtgcgccgaaaatttccactgccgcgcacc
atctgggacggcgaggagaccagctactgcttcaaggagaagtcgaggggtgtcctgcgg
gagtggtacgcgcacaatccctacccatcgccgcgtgagaagcgggagctggccgaggcc
accggcctcaccaccacccaggtcagcaactggtttaagaaccggaggcaaagagaccgg
gccgcggaggccaaggaaagggagaacaccgaaaacaataactcctcctccaacaagcag
aaccaactctctcctctggaagggggcaagccgctcatgtccagctcagaagaggaattc
tcacctccccaaagtccagaccagaactcggtccttctgctgcagggcaatatgggccac
gccaggagctcaaactattctctcccgggcttaacagcctcgcagcccagtcacggcctg
cagacccaccagcatcagctccaagactctctgctcggccccctcacctccagtctggcc
ccgcgcttcccgcacagggcagatggcgcaggccttggatcctggttcggaggcggccta
ggccctgctgccgattctccagtgcccttcttgcttcagacacctcctgtctccttcttc
gcttcctggggtcttggctttggttttcggacccaaactcttcacccgcgcgctctgagg
aggggtgccagctttcccttggaggggactctaaaaactctcggagactacctccttcag
ctgcccgaagccagcgcactagtgtgtgtccccaccgcgcccccatggacaattcacact
gagggacccgaaggctcagtctcatcgcctgcacgcgtgttccgcgaccttagccagttt
ccccaagttctaaatcctaccaagatgagaatcccacaaactaagaaattgcaagggtca
acccggagtgggaaggaaaagccgctgcctccggccctgcgtcgacccctgcaggctagc
gcggcctctgctctaggggcttgggccgcgcggcctccaggccatgagggcccagaggcc
cccgagggcctcagcggaggcccagcccagcctttgcggcccctacttgcccgcttggag
gctccaggactccaggccagctcttag
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_3|438_aa
MRGQKHCHLLRRFNANDWLGSRLPAADSTKPRSQLGFMKGDPDFPARLGQIEAPHSPAKR
RRPYPNASSFWREKAPAPLPSSPDACGPHAPPEGTTPESAGPARRDSATSGYKPGRSRPR
RRRGAQELPHSPGREFLVQAAGRREGFVKHCSGVGNLVAKSKPKRVGETRRGAWMSGHLP
STLYPVSLFPLLRGLVIQTNAARPFWEQRRSSFCLRKSGSFRTRAICSCENAKTDPTAWG
EFGPGKWGKEKRVAKGFRVAAPRVQTSALPSRPACCGSHAVRVSLSFSSNQPPHLEVISS
REETCPPGISGRVESSPSFWMVKEFQESGPGAPNYLATSQKGNKWVPARGVCIKACQPSR
ETLATANAVPLNAFSSTKAHAAVTCTEEALESRNLASGAPCQRCSLPSRAAARAAAGWLE
LLISFKSPTHLCFSCDLH
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_3|1317_bp
atgcgcggccaaaagcattgccacttgctgcggcgcttcaacgcgaatgactggctgggg
tcgcggcttccggccgcagattccacgaaaccgagaagccaactcggcttcatgaaagga
gatcccgatttccccgctcgcctcgggcaaattgaggcacctcattcgccagcgaagcgt
cgccgtccttatcccaacgcctccagtttttggagggaaaaagccccggcccctcttccc
tcaagccccgacgcctgtgggccccatgcaccgcccgagggaacaaccccagagagcgcc
gggccagcccggcgagatagcgcgacctctggctacaagcctgggcggagtcggccccgg
cgcagacgcggcgcccaagagctcccgcacagcccggggagggagtttctggtgcaggcg
gcggggcgacgggagggctttgtcaagcactgcagcggtgttggaaaccttgttgcaaag
tccaagcctaagcgagtgggcgagacgcgcagaggggcatggatgagtgggcaccttccc
agcaccctctacccagtatctcttttcccactgctcagaggactggtgattcagacaaac
gccgctcggcccttttgggaacagaggcggtcgagcttctgtctgcgaaaatctggatcc
tttagaacccgagcaatatgctcgtgtgagaatgcaaaaacagatcccacggcttggggt
gagtttgggcctggaaaatgggggaaagaaaagagagtagcgaaaggttttcgggttgcg
gcgccccgcgtgcagacgtctgctctccccagccgcccagcctgctgtggcagccacgct
gtgcgcgtgtctttatccttcagttcaaaccagcccccacatcttgaagtaatcagcagc
cgagaagagacttgtccccctggcatctcaggtagagtagaatcctcaccgtcgttctgg
atggtgaaagaattccaagaaagcggcccgggagcccctaactatcttgctacaagccag
aagggaaacaagtgggtcccagctaggggtgtttgcatcaaggcctgtcaacccagccga
gaaaccctagcaacagctaatgctgtcccactgaatgctttttcatcgactaaagcgcac
gcggccgttacctgcacagaggaggcactggagagccgaaacctggcatcgggcgctcct
tgccagcggtgttccctcccgagccgcgctgcagcgagggcagcagccggctggctagag
ttgctcatttcctttaaatctccaacgcatctctgcttttcgtgcgatttgcattga
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_4|77_aa
MGFLHVGQAGLELLTSGSWRFEQRIGQNVQQSKERMMQRKNKCRDLLQMKVRSTAWERPE
QRPKGSRYRIFSGPNNG
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_4|234_bp
atggggtttctccatgttggtcaggctggtctcgaactcctgacctcaggttcttggcgt
tttgaacaaagaattggacaaaatgtacagcagagcaaggaaagaatgatgcagcgaaag
aacaaatgcagagatttattgcaaatgaaagtacgctccacagcgtgggagcggccagag
cagcggcccaagggctccagatacagaatcttctccggtccaaacaacggctag
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_5|909_aa
MESASEGQEAHREVAGGAAVGLSPPAPAPFPLEPGDAATAAARVSGEEGAVAAAAAGAAA
DQVQLHSELLGRHHHAAAAAAQTPLAFSPDHVACVCEALQQGGNLDRLARFLWSLPQSDL
LRGNESLLKARALVAFHQGIYPELYSILESHSFESANHPLLQQLWYKARYTEAERARGRP
LGAVDKYRLRRKFPLPRTIWDGEETVYCFKEKSRNALKELYKQNRYPSPAEKRHLAKITG
LSLTQVSNWFKNRRQRDRNPSETQSKRRERIQKQLEGSSRGSKLGAGDPARLRQVKVITS
AFQSRSQPALAASGPLWGSDGSGSGVPRPRLGGRLGALSGPRGNQNKRCTGMCVRLPQIC
GEVQRRRFGYAEPPALRPLEAADPALGNQRNPQGRGRGESDGNPSTEDESSKGHEDLSPH
PLSSSSDGITNLSLSSHMEPVYMQQIGNAKISLSSSGVLLNGSLVPASTSPVFLNGNSFI
QGPSGVILNGLNVGNTQAVALNPPKMSSNIVSNGISMTDILGSTSQDVKEFKVLQSSANS
ATTTSYSPSVPVSFPGLIPSTEVKREGIQTVASQDGGSVVTFTTPVQINQYGIVQIPNSG
ANSQFLNGSIGFSPLQLPPVSVAASQGNNLIWYLNAPANVFISCCNISVSSSTSDGSTFT
SESTTVQQGKVFLSSLAPSAVVYTVPNTGQTIGSVKQEGLERSLVFSQLMPVNQNAQVNA
NLSSENISGSGLHPLASSLVNVSPTHNFSLSPSTLLNPTELNRDIADSQPMSAPVASKST
VTSVSNTNYATLQNCSLITGQDLLSVPMTQAALGEIVPTAEDQVGHPSPAVHQDFVQEHR
LVLQSVANMKENFLSNSESKATSSLMMLDSKSKYVLDGMVDTVCEDLETDKKELAKLQTV
QLDEDMQDL
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_5|2730_bp
atggaaagcgcctcggaagggcaggaggcgcaccgagaagtggcggggggcgcggcggta
gggctgagccccccggctccagccccttttcccctggagccgggggacgccgcgaccgct
gccgccagggtgagcggagaggaaggggcagtggcggcggcggcggccggagcggcggcg
gatcaggtacaactccactcggaacttctgggcaggcaccaccacgccgccgccgccgcc
gcgcagaccccgctggccttctcgcccgaccacgtcgcctgcgtgtgcgaggcactgcag
caggggggcaacctggaccgcctggcccggttcctgtggtccctgccccagagcgacctg
ctacgtggcaacgagagcctgctgaaggcgcgggcgctcgtggccttccaccagggcatc
taccccgagctctacagcatcctcgagagccacagcttcgagtcggccaaccacccgctg
ctgcagcagctctggtacaaggcgcgctacaccgaggccgagcgagcccgcggccggccg
ctgggagccgtagacaagtaccggctgcgcaggaaattccccctgccccgcaccatctgg
gacggcgaggagacggtgtattgtttcaaggagaagtcgcgcaacgcgctcaaggagctc
tacaagcagaatcgctacccttcgcccgccgagaagcggcacctggccaagatcaccggc
ctctccctcacccaggtcagcaactggttcaagaaccgccggcagcgcgacaggaacccc
tccgagacccagtccaaaaggagagaaagaatccaaaagcagctcgaaggttcttctcgg
ggaagcaaactgggagccggggatccagcccgcctgcgccaggtgaaggtgatcaccagc
gcattccagagccggtctcagcccgcccttgccgcttctgggcccctgtgggggtccgac
ggctcgggctccggcgttcctcgcccaaggctgggagggaggcttggtgccctatccggc
cctcgcggtaaccaaaacaaaaggtgcaccgggatgtgcgtgcgccttccgcagatatgc
ggagaggtccagagaaggcgctttggttacgccgagccacctgccctgcgcccactagag
gccgcggatcccgcgctcggaaaccaacggaatccgcagggccggggcagaggtgagtca
gatggcaaccccagcactgaagatgaatccagcaagggacatgaggatttatctcctcac
ccactctccagttcatctgatggcatcaccaacctcagcctttccagtcatatggagcca
gtatatatgcaacaaattggaaatgctaagatatcattaagctcttctggagttctgttg
aatggaagcttggtacctgcaagtacttcacctgtcttccttaatggaaattcttttatt
cagggacccagtggagttatccttaatggattaaatgtgggaaatacacaggcagtggca
ttgaacccaccaaaaatgtcatcaaacattgtgagcaatggtatatccatgactgacata
ctggggtctacttcccaggacgtgaaggaattcaaagtcctccagagttctgctaactca
gcaaccaccacgtcctacagccccagtgtccctgtctcattcccaggcctgatacccagc
actgaggtgaaaagagaaggcattcaaacagtggcttcccaagatggagggtctgtagtg
acttttactacaccagtgcaaattaaccagtatggcattgtccagatccccaattccgga
gcaaacagccagttccttaatgggagcattggattctctccactgcagctgccccctgtg
tcagtggcagcttcacaaggtaacaatctcatttggtaccttaatgcaccagcaaatgtg
ttcatcagctgctgtaatatctcagtaagctcaagcacttcagatggaagcacatttaca
agtgagtctaccacagtccagcaaggaaaggttttcttgagctctcttgctcccagtgca
gtggtatacacggttcctaatacaggccagactataggatctgtgaaacaggaaggcttg
gaaaggagcctggtattttctcagttgatgcctgtcaatcagaatgcacaagtaaatgca
aacctgtcttctgaaaacatctcggggagtggcctgcatccactggcctcctcattagtt
aatgtatctccaactcacaatttttctctcagtccctctacactactaaatcccactgag
ctaaaccgcgacattgccgatagccaaccaatgtctgcaccggtggcaagcaaatctact
gtgacatctgtcagcaacactaactatgcaactcttcagaactgctcccttattactggt
caagacctattgtcagtccctatgactcaggctgcccttggggaaatagttcctacagct
gaagatcaggtaggtcacccctccccagcagtacatcaggattttgtccaagaacatcgt
ttggttctgcaatcggtagctaacatgaaagagaatttcttatcaaattctgagagcaaa
gcaacaagtagcttaatgatgctggactctaaatccaagtatgtcttagatggcatggtt
gatactgtctgtgaagacctggaaacagacaaaaaagagcttgccaagctccagactgtc
cagctggatgaagatatgcaagacttatga
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_6|128_aa
MGKDFMTKTPKAMATKAKIDKWDLMKLKSFCTAKETTIRMNRQPTDWEKIFGIYSSDKGL
ISRIYKELKQIYKKKVKQPHQKVDKGYEQTLFQRRDLCSQQTHETMLIITGHQRNANQNH
NEIPPHTS
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_6|387_bp
atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagctaaaatagac
aaatgggatctaatgaaactaaagagcttctgcacagcaaaagaaactaccatcagaatg
aacaggcaacctacagactgggagaaaatttttggaatctactcatctgacaaagggcta
atatccagaatctacaaagaacttaaacaaatttacaagaaaaaagtcaaacaaccccat
caaaaagtggacaaaggatatgaacagacacttttccaaagaagagatttatgcagccaa
cagacacatgaaacaatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccac
aatgagataccacctcacaccagttag
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_7|127_aa
MARPSEVESPRDRHKLVESKWMGARESTAAQLTECVRTHSPSASRRGSDIWWSYTEGNPD
RPWRFPSYRLSQAPVSARATYKPPQTRPSRFLPTGAWQRRSLRERAPARGEALQHKRSGS
QFPRSCI
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_7|384_bp
atggcaagacccagtgaagtagagtctcccagagacagacacaaattggttgaaagtaaa
tggatgggggccagggaatccactgccgcccaactcacagagtgtgtccgcacacattca
ccatcagcttcaaggaggggttccgatatttggtggtcttacaccgagggcaaccctgat
cgtccatggcggtttccctcctacagactctcgcaggcgcctgtttcagccagagccacc
tacaagccccctcagacgcgaccaagcaggttcctaccaacaggcgcttggcagagacgg
tcccttcgcgaaagagcaccggcaaggggcgaggcgctgcaacacaaacgttccggcagt
cagttcccccggtcttgcatctag
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_8|141_aa
MALQLSREQGITLRGSAEIVDEFFSFGINSILYQRGIYPSEIFTRVQKYGLTLLVTTDLE
LIKYLNNVVEQLKDWLYKCSVQKLVVVISNIESDKDLVVPEKWEESGPQFITNSEEVCLC
SFTTTIHKVNSMVAYKIPVND
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_8|426_bp
atggcgctgcagctctcccgggagcagggaatcaccctgcgtgggagtgccgaaatcgtg
gacgagttcttctcattcggcatcaacagcattttatatcagcgtggcatttatccatct
gaaatctttactcgagtgcagaaatacggactcaccttgcttgtaactactgatcttgag
ctcataaaatacctaaataatgtggtggaacaactaaaagattggttatacaagtgttca
gttcagaaactggttgtagttatctcaaatattgaaagtgacaaagatttggttgtacct
gaaaaatgggaagagtcgggaccacagtttattaccaattctgaggaagtctgcctttgt
tcatttactactacaatccacaaagtaaatagcatggtggcctacaaaattcctgtcaat
gactga
>gi568815584r:60613410_60824074|GENSCAN_predicted_peptide_9|266_aa
MIIVDKVNWIPWLIGLKSMYANNCESCVDLLFVRGAGNCPECGTPLRKSNFRVQLFEDPT
VDKEVEIRKKVLKIYNKREEDFPSLREYNDFLEEVEEIVFNLTNNVDLDNTKKKMEIYQK
ENKDVIQKNKLKLTREQEELEEALEVERQENEQRRLFIQKEEQLQQILKRKNKQAFLDEL
ESSDLPVALLLAQHKDRSTQLEMQLEKPKPVKPVTFSTGIKMSVGITLCPAEEHTFNDER
SFNLSSSWSKFNKPPSTLPFKVLDEE
>gi568815584r:60613410_60824074|GENSCAN_predicted_CDS_9|798_bp
atgattatcgtagacaaagttaactggatcccgtggctaattggcttgaaatccatgtat
gcaaataactgtgaaagttgtgtagatttactgtttgtgagaggagctggaaactgccct
gagtgtggtactccactcagaaagagcaacttcagggtacaactctttgaagatcccact
gttgacaaggaggttgagatcaggaaaaaagtgctaaagatatacaataaaagggaagaa
gattttcctagtctaagagaatacaatgatttcttggaagaagtggaagaaattgttttc
aacttgaccaacaatgtggatttggacaacaccaaaaagaaaatggagatataccaaaag
gaaaacaaagatgttattcagaaaaataaattaaagctgactcgagaacaggaagaactg
gaagaagctttagaagtggaacgacaggaaaatgaacaaagaagattatttatacaaaaa
gaagaacaactgcagcagattctaaaaaggaagaataagcaggcttttttagatgagctg
gagagttctgatctccctgttgctctgcttttggctcagcataaagatagatctacccaa
ttagaaatgcaacttgagaaacccaaacctgtaaaaccagtgacgttttccacaggcatc
aaaatgagtgttgggattacactgtgcccggctgaggagcacacttttaatgatgaaaga
agtttcaatctcagctcatcttggagcaaatttaataaacctccctctactctgcctttt
aaagtattggatgaagag