GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:47:06 Sequence gi568815597f:226123682_226325809 : 202128 bp : 42.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 644 688 45 1 0 91 94 40 0.518 5.65 1.02 Intr + 1626 1685 60 2 0 75 100 126 0.933 10.51 1.03 Intr + 5001 5186 186 2 0 78 73 62 0.175 2.66 1.04 Intr + 9689 9773 85 1 1 72 110 30 0.664 2.07 1.05 Term + 11438 11637 200 0 2 79 50 237 0.513 15.58 1.06 PlyA + 14259 14264 6 1.05 2.11 PlyA - 17068 17063 6 1.05 2.10 Term - 23140 22929 212 2 2 109 42 181 0.993 12.07 2.09 Intr - 28938 28654 285 1 0 50 70 318 0.756 22.59 2.08 Intr - 31152 30966 187 0 1 95 100 251 0.999 25.34 2.07 Intr - 35677 35503 175 0 1 60 96 269 0.999 23.92 2.06 Intr - 38008 37850 159 0 0 85 106 281 0.999 27.78 2.05 Intr - 42319 42178 142 2 1 115 77 121 0.997 12.19 2.04 Intr - 63068 62709 360 0 0 -19 105 408 0.041 26.07 2.03 Intr - 73527 73478 50 2 2 61 75 135 0.334 6.91 2.02 Intr - 76552 76388 165 0 0 46 67 117 0.180 3.55 2.01 Init - 85919 85756 164 2 2 76 56 168 0.061 9.74 2.00 Prom - 92682 92643 40 -5.45 3.00 Prom + 95568 95607 40 -7.45 3.01 Init + 100001 100393 393 1 0 99 94 384 0.934 36.88 3.02 Term + 101826 102131 306 2 0 96 55 215 0.516 13.13 3.03 PlyA + 105888 105893 6 1.05 4.21 PlyA - 107498 107493 6 1.05 4.20 Term - 108925 108820 106 0 1 95 48 72 0.968 0.80 4.19 Intr - 109512 109415 98 0 2 84 91 73 0.990 5.09 4.18 Intr - 109842 109663 180 0 0 55 94 184 0.957 14.84 4.17 Intr - 115415 115290 126 2 0 82 107 78 0.969 9.06 4.16 Intr - 121385 121245 141 2 0 77 72 36 0.450 0.53 4.15 Intr - 127238 127158 81 2 0 99 91 43 0.780 4.62 4.14 Intr - 131559 131449 111 0 0 112 0 70 0.346 0.26 4.13 Intr - 141953 141852 102 2 0 117 83 79 0.997 9.75 4.12 Intr - 142651 142532 120 1 0 70 92 114 0.999 9.77 4.11 Intr - 144409 144276 134 2 2 86 99 121 0.999 12.44 4.10 Intr - 154251 154094 158 2 2 48 111 113 0.989 8.33 4.09 Intr - 162777 162652 126 2 0 88 83 47 0.649 3.07 4.08 Intr - 164116 163983 134 1 2 88 44 65 0.227 0.62 4.07 Intr - 172265 172161 105 2 0 32 91 62 0.046 0.39 4.06 Intr - 184936 184821 116 2 2 106 39 39 0.018 0.05 4.05 Intr - 186123 185945 179 2 2 64 74 132 0.767 8.04 4.04 Intr - 187181 187074 108 1 0 111 44 54 0.595 1.78 4.03 Intr - 192751 192580 172 2 1 57 63 63 0.691 -1.22 4.02 Intr - 194884 194329 556 1 1 -35 71 521 0.113 29.59 4.01 Init - 201983 201837 147 2 0 78 72 41 0.221 1.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 62994 62709 286 0 1 93 105 329 0.956 32.39 S.002 Sngl - 92752 92492 261 1 0 59 43 199 0.831 7.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:226123682_226325809|GENSCAN_predicted_peptide_1|191_aa MEHLGEQQRLSSWHQEGLRSPQTQMRPLDGAKASKIPAPTPAWNAHSAFITALNSTWFPK AMSPHTTSQGYNTLGFSDTVNTKQHELQDTQDSWRGAEPHETHTKDGPAFQGSLDPWFLS WKRGAGCQLLDSRARAGLHARLRQRLAPGAQVLQECVFNELLLNAAEPCSTLLPRPGVWE PDDSPAREGRT >gi568815597f:226123682_226325809|GENSCAN_predicted_CDS_1|576_bp atggagcacctgggagagcagcagcgactcagctcctggcaccaggaagggctgcggtcc ccgcagacccagatgcgtccactagatggcgccaaagccagtaagattccagcccccacc ccagcctggaacgcccacagtgcttttatcacagctcttaactccacttggttccccaaa gctatgagcccacataccacctcccagggctacaacactttgggtttctcagacactgtc aacactaagcagcatgaactgcaggacacacaggacagctggagaggggcagaaccgcat gagacacacacgaaagacggccctgccttccagggctcgctagatccttggttcctgtcg tggaagagaggggcaggatgtcagttacttgacagcagggcccgtgctgggctccatgcc cggcttcgccagcgtctggcgccgggagcgcaggtgctccaagagtgtgtgttcaatgaa ttgttactgaatgctgcagaaccctgctccacgctgcttcctcgccctggcgtttgggag cctgatgacagccctgcccgtgaaggccgcacctag >gi568815597f:226123682_226325809|GENSCAN_predicted_peptide_2|632_aa MTLGVRPLPARSAGASWLRGAFQPSAGGSFRERKKSGAVLSCRNSADRRRRPNACLNNKT RPCPRRRRRMRRRRRRRREGEEGEGGGGGRRRGRRVGGGKKRKEEIKRGSPISTERLGQR YRLRPAAEVRGQQEVDTWLPSVPAEEVQQPEMAAVLNAERLEVSVDGLTLSPDPEERPGA EGAPLLPPPLPPPSPPGSGRGPGASGEQPEPGEAAAGGAAEEARRLEQRWGFGLEELYGL ALRFFKEKDGKAFHPTYEEKLKLVALHKQVLMGPYNPDTCPEVGFFDVLGNDRRKEEEER RRREEEERERLQKEEEKRRREEEERLRREEEERRRIEEERLRLEQQKQQIMAALNSQTAV QFQQYAAQQYPGNYEQQQILIRQLQEQHYQQYMQQLYQVQLAQQQAALQKQQEVVVAGSS LPTSSKVNATVPSNMMSVNGQAKTHTDSSEKELEPEAAEEALENGPKESLPVIAAPSMWT RPQIKDFKEKIQQDADSVITVGRGEVVTVRVPTHEEGSYLFWEFATDNYDIGFGVYFEWT DSPNTAVSVHVSESSDDDEEEEENIGCEEKAKKNANKPLLDEIVPVYRRDCHEEVYAGSH QYPGRGVYLLKFDNSYSLWRSKSVYYRVYYTR >gi568815597f:226123682_226325809|GENSCAN_predicted_CDS_2|1899_bp atgaccttgggagtccgaccacttcccgcacgtagcgcaggtgcctcctggctcagaggg gcatttcagccatctgccggaggcagcttcagagaacggaaaaagagcggcgccgtgttg agttgcaggaacagcgcggacagacggagacggccaaatgcctgcctgaacaataaaaca agaccctgtccgaggagaaggagaagaatgagaagaaggaggagaagaaggagagaagga gaagaaggagaaggaggaggaggaggaagaagaagaggaagaagagtaggaggaggaaag aaaagaaaagaggaaataaaaagaggaagccccataagcactgagcggctggggcagcgc taccgtctccggccagcagcggaggtcagaggtcagcaggaagtcgatacgtggctgccg tctgtccccgctgaggaggtgcagcagccggagatggcggcggtgctgaacgcagagcga ctcgaggtgtccgtcgacggcctcacgctcagcccggacccggaggagcggcctggggcg gagggcgccccgctgctgccgccaccgctgccaccgccctcgccacctggatccggtcgc ggcccgggcgcctcaggggagcagcccgagcccggggaggcggcggctgggggcgcggcg gaggaggcgcggcggctggagcagcgctggggtttcggcctggaggagttgtacggcctg gcactgcgcttcttcaaagaaaaagatggcaaagcatttcatccaacttatgaagaaaaa ttgaagcttgtggcactgcataagcaagttcttatgggcccatataatccagacacttgt cctgaggttggattctttgatgtgttggggaatgacaggaggaaggaggaagaggagcga aggcggcgtgaagaggaagaaagagaacgtctgcaaaaggaggaagagaaacgtaggaga gaagaagaggaaaggcttcgacgggaggaagaggaaaggagacggatagaagaagaaagg cttcggttggagcagcaaaagcagcagataatggcagctttaaactcccagactgccgtg cagttccagcagtatgcagcccaacagtatccagggaactacgaacagcagcaaattctc atccgccagttgcaggagcaacactatcagcagtacatgcagcagttgtatcaagtccag cttgcacagcaacaggcagcattacagaaacaacaggaagtagtagtggctgggtcttcc ttgcctacatcatcaaaagtgaatgcaactgtaccaagtaatatgatgtcagttaatgga caggccaaaacacacactgacagctccgaaaaagaactggaaccagaagctgcagaagaa gccctggagaatggaccaaaagaatctcttccagtaatagcagctccatccatgtggaca cgacctcagatcaaagacttcaaagagaagattcagcaggatgcagattccgtgattaca gtgggccgaggagaagtggtcactgttcgagtacccacccatgaagaaggatcatatctc ttttgggaatttgccacagacaattatgacattgggtttggggtgtattttgaatggaca gactctccaaacactgctgtcagcgtgcatgtcagtgagtccagcgatgacgacgaggag gaagaagaaaacatcggttgtgaagagaaagccaaaaagaatgccaacaagcctttgctg gatgagattgtgcctgtgtaccgacgggactgtcatgaggaggtgtatgctggcagccat caatatccagggagaggagtctatctcctcaagtttgacaactcctactctttgtggcgg tcaaaatcagtctactacagagtctattatactagataa >gi568815597f:226123682_226325809|GENSCAN_predicted_peptide_3|232_aa MATAESRALQFAEGAAFPAYRAPHAGGALLPPPSPAAALLPAPPAGPGPATFAGFLGRDP GPAPPPPASLGSPAPPKGAAAPSASQRRKRTSFSAEQLQLLELVFRRTRYPDIHLRERLA ALTLLPESRIQVWFQNRRAKSRRQSGKSFQPLARPEIILNHCAPGTETKCLKPQLPLEVD VNCLPEPNGVGGGISDSSSQGQNFETCSPLSEDIGSKLDSWEEHIFSAFGNF >gi568815597f:226123682_226325809|GENSCAN_predicted_CDS_3|699_bp atggccacagccgagtcccgtgcgctccagtttgccgagggcgccgcgtttccagcgtac cgggccccccacgccggcggggcgctcctgccgcccccgagccctgcggcagccctgctc cctgcgccgcccgcgggccccggcccagcgacctttgcgggcttcctcggccgggacccc gggccggccccgccgccccccgccagcctgggctcgcctgcgccccccaaaggcgcggcc gccccgtcggcgtcgcagcgccgcaagcgcacgtctttcagcgccgaacagctgcagctg ctggagctcgtcttccgccggacccggtaccccgacatccacttgcgcgagcgcctggcc gcgctcaccctgctccccgagtccaggatccaggtatggttccagaacaggcgtgccaag tctcggcgtcagagtgggaaatccttccaacctttggctaggccggagattatcctcaac cactgtgctcctggaactgaaacgaaatgtctgaagccccagctgcctcttgaggtagat gtgaactgcctgcccgaaccaaacggggttggagggggcatctctgactctagctcccaa ggtcagaattttgaaacctgttcccctctctctgaagacattggttcaaagctggactca tgggaggaacacatcttttctgcctttggtaacttttga >gi568815597f:226123682_226325809|GENSCAN_predicted_peptide_4|999_aa MEWAGLSSSAMRSYWLGAAWGEHGVDMNTVAELRDSSWRLEISVAVMVKNPRKYLGCVGT GETVESDAEGEKGVEAADVTALVEFHCKAVNVEQAVTIMEAIHVLGVLHTITSGITRIVS VGKRRKDRRVLPKARPNKTGPSSGEGSWRRFLPYYLKRPYGRGPQCSSPPAQGAEMEDAD SQGAGAPGRPVRQDMYVAIGHDSTGALLAKDSLERMVMKRIKRNQGDETPGQQQDVIILW PSGQLTSKIPQASQKAIAMTFALDWSTFAVTGPLPPLGCHCLDCALSSGSYCMAAPSYPR HRAGISNETSFSSHPVTHPILYFPRLERSRSPAARSSASNIRERRARIARGRWDRVPDVA TRRGLLRPAVSGKVRGRRRPSNVGFGNSPASTGTCDLHCPPHPPTGVSTPGSVSVSEGWP LPLGPQPFRNSKRSRLFSDEDDRQINTRSPKRNQRVAMVPQKFTATMSTPDKKASQKIGF RLRNLLKLPKAHKWCIYEWFYSNIDKPLFEGDNDFCVCLKESFPNLKTRKLTRVEWGKIR RLMGKPRRCSSAFFEEERSALKQKRQKIRLLQQRKVADVSQFKDLPDEIPLPLVIGTKVT ARLRGVHDGLFTGQIDAVDTLNATYRVTFDRTGLGTHTIPDYEVLSNEPHETMPIAAFGQ KQRPSRFFMTPPRLHYTPPLQSPIIDNDPLLGQSPWRSKISGSDTETLGGFPVEFLIQVE ARGSISAVFTKNQIKPLEVKLTKYAFQKSLLTGSRRTRLSKILMIKKEHIKKLREMNTEA EKLVLWQYLSRFKMLKSFDLAVALLETSTLMFAFVQNDYMFIAACLQDLKKSYSMPISIE FQRRYATIVLELEQLNKDLNKVLHKVQQYCYELAPDQGLQPADQPTDMRRRCEEEAQEIV RHANSSTGQPCVENENLTDLISRLTAILLQIKCLAEGGDLNSFEFKSLTDSLNDIKSTID ASNISCFQNNVEIHVAHIQSGLSQMGNLHAFAANNTNRD >gi568815597f:226123682_226325809|GENSCAN_predicted_CDS_4|3000_bp atggaatgggctggtttgagcagctctgccatgcgcagttattggctaggagcagcctgg ggagagcatggtgttgatatgaatactgtggcagagcttagggacagcagttggaggctg gagatctcagtggctgtgatggttaagaaccccaggaagtaccttggctgtgtaggaact ggagagactgtggagtctgacgctgaaggagaaaagggtgtggaagcagcagatgtcaca gccctggtggaattccactgcaaggcagtaaacgtggagcaggccgtaaccattatggaa gcaatccacgtcctgggggtcctccacacaattaccagcggaattaccagaatagtgagc gtggggaaaagaaggaaggataggagagtgctcccaaaggccaggcccaacaagactggc ccttcctcaggagaaggttcctggagaaggttcctgccctactacctgaagagaccctat gggcgtggaccacagtgttccagccctccagcgcagggagcggagatggaggatgctgac agccagggtgcaggagcaccaggtagaccagtgaggcaagatatgtatgtggctataggc catgattccactggggccctcttagccaaagacagcctagagaggatggtgatgaagagg ataaaaagaaatcaaggagatgagaccccaggtcagcagcaagatgttatcattctttgg ccctccggacagttaacaagtaaaatacctcaagcatcccaaaaagctattgccatgacc tttgctcttgactggtccacttttgctgtgactggaccacttccacctcttggttgccat tgtttagattgtgctttgtcttcaggatcatactgcatggcagcacccagttacccaagg cacagagctggcatcagcaatgagacttccttctcctcacacccagtgactcacccaatc ctttattttccacgtcttgaaaggtctcgctccccggctgcacgcagctcagcttccaac attcgggaaaggcgtgcccggatcgcgaggggccgctgggaccgagtcccggacgtcgcg acgcgcagggggctgctgcggccggcggtctcgggaaaagtgaggggacggaggcgccca agcaacgtgggtttcgggaactctcctgcctctacgggcacttgtgatcttcactgtcct ccccacccccccaccggtgtgtccacccctggaagtgtttccgtgtcagagggttggccc cttcccctgggaccacagcctttcagaaattcaaaacgaagtcgacttttttctgatgaa gatgataggcaaataaatacaaggtcacctaaaagaaaccagagggttgcaatggttcca cagaaatttacagcaacaatgtcaacaccagataagaaagcttcacagaagattggtttt cgattacgtaatctgctcaagcttcctaaagcacataaatggtgtatatacgagtggttc tattcaaatatagataaaccactttttgaaggtgataatgacttctgtgtatgtctaaag gaatcttttcctaatttgaaaacaagaaagttaacaagagtagaatggggaaaaattcgg cggcttatgggaaaaccacggagatgttcttctgcattttttgaggaagagagatcagca ttaaaacagaaacggcagaaaataaggctcttacaacaaaggaaagttgcagatgtttca caattcaaagatctcccagatgaaattcctttgcctctggttattggaacgaaagttaca gcacgattacgtggtgttcatgatggtttgttcactggacaaatagatgctgtggatact cttaatgctacttatagagtaacttttgataggacagggcttggaacccataccatccct gactatgaagttctcagtaatgaacctcatgagacaatgccaattgctgcctttggacaa aaacagcggccttctcgattttttatgaccccaccacggttacattatactcctcctctc cagtcaccaattatagataatgatcctttattaggacagtcgccgtggagaagtaaaatt tctggctctgacactgaaacattaggtggttttccagtagaatttcttatccaagtggaa gcaagagggagtatttctgcagtattcactaagaaccagattaagcccctggaagtaaaa ctcacaaagtatgcttttcaaaaaagcctcctaactgggtcccgtagaaccagattatca aaaattctcatgattaaaaaggaacatatcaagaaattaagggaaatgaacacagaagca gaaaaattggtgctttggcaatatctatcaagatttaaaatgctcaaatcctttgattta gcagttgcactcctggaaacttctacacttatgtttgcctttgtccaaaatgactacatg ttcattgcagcatgtttgcaagatctgaagaaatcatattccatgcccatcagcattgaa tttcagcggagatatgcaacaattgttctggagcttgaacagctgaacaaggacctaaac aaagttttgcataaagttcaacagtattgctatgagcttgctccagaccaggggctccag cctgcagatcagccaacagatatgagacgcaggtgtgaggaagaagcacaggaaattgtt cggcatgcaaattcctcaacaggacagccctgcgttgaaaatgaaaatctgacagactta atttccaggcttacagctattttgttacaaattaagtgtctagcagaaggaggagacctg aattcctttgaattcaaatcacttacagactcattaaatgatatcaagagtacaatagac gcttctaatatcagttgctttcagaataatgtagaaatccatgttgcacatattcagagt ggcctgagccagatgggaaacttacatgcctttgcagcaaataacaccaacagagactga