GENSCAN 1.0    Date run: 3-Nov-116    Time: 12:02:55

Sequence gi568815588r:101479456_101687360 : 207905 bp : 45.02% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10837  10957  121  0  1  125   85   -25 0.049   0.97
 1.02 Intr +  36454  36554  101  2  2   78   73    28 0.197   0.13
 1.03 Intr +  42184  42415  232  0  1   80   91    92 0.690   6.05
 1.04 Intr +  46558  46697  140  2  2   43   37   142 0.562   4.78
 1.05 Intr +  52840  52977  138  0  0  125  100    33 0.952   8.86
 1.06 Intr +  53497  53615  119  0  2   48   75    90 0.894   2.96
 1.07 Intr +  55206  55455  250  0  1   66   85   176 0.679  12.54
 1.08 Intr +  55899  56017  119  2  2  116   99    15 0.945   4.66
 1.09 Intr +  57088  57198  111  1  0  110   92   124 0.991  14.49
 1.10 Intr +  58838  58916   79  2  1   99   83    26 0.882   2.75
 1.11 Term +  71244  71405  162  2  0   82   36   139 0.751   6.04
 1.12 PlyA +  71711  71716    6                               1.05

 2.00 Prom +  82705  82744   40                              -7.46
 2.01 Sngl +  87028  87342  315  0  0   40   47   256 0.820  10.65
 2.02 PlyA +  92390  92395    6                               1.05

 3.11 PlyA -  96858  96853    6                              -0.45
 3.10 Term - 100362  99998  365  1  2   56   46   429 0.411  30.23
 3.09 Intr - 100961 100793  169  2  1  125   70   130 0.993  14.42
 3.08 Intr - 103436 103308  129  2  0   91   72   199 0.933  19.49
 3.07 Intr - 104226 104053  174  0  0  113   42   192 0.995  17.24
 3.06 Intr - 105464 105147  318  2  0   54   80   339 0.974  25.85
 3.05 Intr - 105912 105861   52  2  1  125   71     9 0.525   1.81
 3.04 Intr - 106701 106407  295  1  1   54   52   244 0.955  13.47
 3.03 Intr - 107927 107791  137  1  2   37  105    78 0.542   4.61
 3.02 Intr - 108863 108749  115  0  1   49  119     9 0.705  -0.39
 3.01 Init - 109073 108893  181  2  1   34  -50   273 0.465   5.45
 3.00 Prom - 109271 109232   40                              -9.06

 4.00 Prom + 109906 109945   40                              -3.06
 4.01 Init + 115244 115283   40  1  1   64  111   108 0.965  11.05
 4.02 Intr + 121283 121407  125  0  2   96   63   103 0.645   8.90
 4.03 Intr + 121715 121881  167  1  2   76   81   212 0.345  18.06
 4.04 Intr + 128460 128576  117  0  0    7   82    95 0.203   0.38
 4.05 Intr + 129380 129482  103  2  1  107   47   149 0.994  12.88
 4.06 Term + 129912 130016  105  2  0   70   33   157 0.997   6.81
 4.07 PlyA + 130818 130823    6                              -0.45

 5.13 PlyA - 131231 131226    6                               1.05
 5.12 Term - 131955 131836  120  0  0  104   47    94 0.997   5.37
 5.11 Intr - 132314 132173  142  1  1  125  113   249 0.998  31.46
 5.10 Intr - 133022 132882  141  1  0   94  117   159 0.993  18.87
 5.09 Intr - 145355 145290   66  1  0  103  110     2 0.000   1.92
 5.08 Intr - 151532 151411  122  2  2   78   81    53 0.004   2.89
 5.07 Intr - 159450 159342  109  2  1   46   88    72 0.018   3.29
 5.06 Intr - 163450 163281  170  1  2   54   52    79 0.009  -0.36
 5.05 Intr - 165569 165430  140  0  2    2   65   135 0.115   2.78
 5.04 Intr - 176210 176145   66  0  0   87   89    32 0.024   2.08
 5.03 Intr - 188638 188431  208  1  1  101  110     7 0.326   2.85
 5.02 Intr - 193592 193460  133  1  1   93  100   131 0.960  15.45
 5.01 Intr - 194218 194033  186  0  0   88  110   102 0.838  11.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 149444 149538   95  0  2   66   42   123 0.882   3.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:101479456_101687360|GENSCAN_predicted_peptide_1|523_aa
FYTTQKFEEEREYIKMYSAPVFRHYIVLAMCYLFGFSCEPGVGVTLRVTSDLKRDTKGAS
ESRDDLFLDLGVGYTKLANGTSSMIVPKQRKLSASYEKEKELCVKYFEQWSESDQVEFVE
HLISQMCHYQHGHINSYLKPMLQRDFITALPARGLDHIAENILSYLDAKSLCAAELVCKE
WYRVTSDGMLWKKLIERMTIESNWRCGRHSLQRIHCRSETSKGVYCLQYDDQKIVSGLRD
NTIKIWDKNTLECKRILTGHTGSVLCLQYDERVIITGSSDSTVRVWDVNTGEMLNTLIHH
CEAVLHLRFNNGMMVTCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDKYIVSA
SGDRTIKVWNTSTCEFVRTLNGHKRGIACLQYRDRLVVSGSSDNTIRLWDIECGACLRVL
EGHEELVRCIRFDNKRIVSGAYDGKIKVWDLVAALDPRAPAGTLCLRTLVEHSGRVFRLQ
FDEFQIVSSSHDDTILIWDFLNDPAAQAEPPRSPSRTYTYISR

>gi568815588r:101479456_101687360|GENSCAN_predicted_CDS_1|1572_bp
ttttatactactcagaagtttgaggaggagagagaatacattaaaatgtattcagcccca
gtgttcaggcactatatagtgctagctatgtgttacttatttggattctcatgtgaacct
ggtgtaggagtcactctcagggtgactagtgacctgaaaagggacacaaagggggctagt
gagtctcgtgatgatctgtttcttgatctgggtgttggttatacaaaacttgccaatggc
acttccagtatgattgtgcccaagcaacggaaactctcagcaagctatgaaaaggaaaag
gaactgtgtgtcaaatactttgagcagtggtcagagtcagatcaagtggaatttgtggaa
catcttatatcccaaatgtgtcattaccaacatgggcacataaactcgtatcttaaacct
atgttgcagagagatttcataactgctctgccagctcggggattggatcatattgctgag
aacattctgtcatacctggatgccaaatcactatgtgctgctgaacttgtgtgcaaggaa
tggtaccgagtgacctctgatggcatgctgtggaagaagcttatcgagagaatgacaata
gaatctaattggagatgtggaagacatagtttacagagaattcactgccgaagtgaaaca
agcaaaggagtttactgtttacagtatgatgatcagaaaatagtaagcggccttcgagac
aacacaatcaagatctgggataaaaacacattggaatgcaagcgaattctcacaggccat
acaggttcagtcctctgtctccagtatgatgagagagtgatcataacaggatcatcggat
tccacggtcagagtgtgggatgtaaatacaggtgaaatgctaaacacgttgattcaccat
tgtgaagcagttctgcacttgcgtttcaataatggcatgatggtgacctgctccaaagat
cgttccattgctgtatgggatatggcctccccaactgacattaccctccggagggtgctg
gtcggacaccgagctgctgtcaatgttgtagactttgatgacaagtacattgtttctgca
tctggggatagaactataaaggtatggaacacaagtacttgtgaatttgtaaggacctta
aatggacacaaacgaggcattgcctgtttgcagtacagggacaggctggtagtgagtggc
tcatctgacaacactatcagattatgggacatagaatgtggtgcatgtttacgagtgtta
gaaggccatgaggaattggtgcgttgtattcgatttgataacaagaggatagtcagtggg
gcctatgatggaaaaattaaagtgtgggatcttgtggctgctttggacccccgtgctcct
gcagggacactctgtctacggacccttgtggagcattccggaagagtttttcgactacag
tttgatgaattccagattgtcagtagttcacatgatgacacaatcctcatctgggacttc
ctaaatgatccagctgcccaagctgaacccccccgttccccttctcgaacatacacctac
atctccagataa

>gi568815588r:101479456_101687360|GENSCAN_predicted_peptide_2|104_aa
MCMCARLCSCTPPLPLSAALMTLAQRKVMTNARYQRTRPMTNGRAPEQPHYAVITEDGWP
GAPPPPPRAGNGSRSCSARPGREPSRCRRAANHVGAQDAQTEEP

>gi568815588r:101479456_101687360|GENSCAN_predicted_CDS_2|315_bp
atgtgcatgtgtgcgcgcctgtgctcatgcacgccccccctcccgctgtcagccgcgctg
atgacattggcgcagaggaaagtgatgacaaatgcccgttatcagcgaacgaggccgatg
acaaatgggcgggcgcccgagcagccccattacgctgtaataacagaagatggatggcct
ggagcccccccacccccaccccgtgccggtaacgggagccgatcctgctccgcccgcccc
gggagggagcccagccgctgccgccgggccgccaatcacgtcggggcccaagacgcccag
accgaggagccgtga

>gi568815588r:101479456_101687360|GENSCAN_predicted_peptide_3|644_aa
MQDLARLTAETLPALPTQALRPFPEGRTLREKNEGARGDPRVTVLQQRSLLGCPQTLQPA
PRPAAAMATVPEGHFRLTRKLFWPFALLPPFVRSHWLCWPGSSHTSMDPRGILKAFPKRQ
KIHADASSKVLAKIPRREEGEEAEEWLSSLRAHVVRTGIGRARAELFEKQIVQHGGQLCP
AQGPGVTHIVVDEGMDYERALRLLRLPQLPPGAQLVKSAWLSLCLQERRLVDVAGFSIFI
PSRPVSPPQKAKEAPNTQAQPISDDEASDGEETQVSAADLEALISGHYPTSLEGDCEPSP
APAVLDKWVCAQPSSQKATNHNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHK
PVTSYQEACSIPGIGKRMAEKIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQM
WYQQGFRSLEDIRSQASLTTQQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLL
CVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQ
KYLGVCRLPGPGRRHRRLDIIVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHA
LSTAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW

>gi568815588r:101479456_101687360|GENSCAN_predicted_CDS_3|1935_bp
atgcaggacctggcccggctgaccgccgagacccttccagctctgccgacccaggccctg
aggcccttcccggagggccggaccctgagggaaaaaaacgaaggagcccgtggggaccct
cgagttaccgtcctgcagcagcgcagtcttctgggctgtccgcagactctccaaccagcc
ccacgcccagccgctgccatggcaaccgttccagagggtcacttccggctgactcggaag
ctattctggccatttgccctccttccccccttcgtccgctctcattggctctgctggccg
ggatccagccatacttcaatggatcccaggggtatcttgaaggcatttcccaagcggcag
aaaattcatgctgatgcatcatcaaaagtacttgcaaagattcctaggagggaagaggga
gaagaagcagaagagtggctgagctcccttcgggcccatgttgtgcgcactggcattgga
cgagcccgggcagaactctttgagaagcagattgttcagcatggcggccagctatgccct
gcccagggcccaggtgtcactcacattgtggtggatgaaggcatggactatgagcgagcc
ctccgccttctcagactaccccagctgcccccgggtgctcagctggtgaagtcagcctgg
ctgagcttgtgccttcaggagaggaggctggtggatgtagctggattcagcatcttcatc
cccagtaggcctgtgtctcctccccaaaaggcaaaagaggcaccaaacacccaagcccag
cccatctctgatgatgaagccagtgatggggaagaaacccaggttagtgcagctgatctg
gaagccctcatcagtggccactaccccacctcccttgagggagattgtgagcctagccca
gcccctgctgtcctggataagtgggtctgtgcacagccctcaagccagaaggcgaccaat
cacaacctccatatcacagagaagctggaagttctggccaaagcctacagtgttcaggga
gacaagtggagggccctgggctatgccaaggccatcaatgccctcaagagcttccataag
cctgtcacctcgtaccaggaggcctgcagtatccctgggattgggaagcggatggctgag
aaaatcatagagatcctggagagcgggcatttgcggaagctggaccatatcagtgagagc
gtgcctgtcttggagctcttctccaacatctggggagctgggaccaagactgcccagatg
tggtaccaacagggcttccgaagtctggaagacatccgcagccaggcctccctgacaacc
cagcaggccatcggcctgaagcattacagtgacttcctggaacgtatgcccagggaggag
gctacagagattgagcagacagtccagaaagcagcccaggcctttaactctgggctgctg
tgtgtggcatgtggttcataccgacggggaaaggcgacctgtggtgatgtcgacgtgctc
atcactcacccagatggccggtcccaccggggtatcttcagccgcctccttgacagtctt
cggcaggaagggttcctcacagatgacttggtgagccaagaggagaatggtcagcaacag
aagtacttgggggtgtgccggctcccagggccagggcggcggcaccggcgcctggacatc
atcgtggtgccctatagcgagtttgcctgtgccctgctctacttcaccggctctgcacac
ttcaaccgctccatgcgagccctggccaaaaccaagggcatgagtctgtcagaacatgcc
ctcagcactgctgtggtccggaacacccatggctgcaaggtggggcctggccgagtgctg
cccactcccactgagaaggatgtcttcaggctcttaggcctcccctaccgagaacctgct
gagcgggactggtga

>gi568815588r:101479456_101687360|GENSCAN_predicted_peptide_4|218_aa
MAEEYDEKTSELLVRKWRVKSALGAMGQWQLEVGDPAPLGAGNLGPELIKESNANEQSSS
WICLLQPIFMRKDTKMSFQWRIRNLPYPKDVYSVSVDQKERCIIVRTTNKNKDQTKDKES
YSNLDSHPYRQLAVKQGKIFHLSEPEFPFRYYKKFSIPDLDRHQLPLDDALLSFAHANCT
LIISYQKPKEVVVAESELQKELKKVKTAHSNDGDCKTQ

>gi568815588r:101479456_101687360|GENSCAN_predicted_CDS_4|657_bp
atggctgaagaatatgacgagaagacgagtgaactacttgtgagaaagtggcgtgtgaaa
agtgccctgggagccatgggccagtggcagcttgaagtaggagacccagcgcccctagga
gcagggaacctggggcctgaactcatcaaggaaagcaatgccaatgaacagtcctcgagt
tggatttgccttctgcagcctatcttcatgcgcaaggacaccaagatgagtttccagtgg
cggattcgaaacctcccctatcctaaggatgtctatagtgtctctgtggaccagaaggag
cgctgcatcattgtcagaacaaccaacaagaataaggatcagaccaaggacaaggaatct
tactctaatcttgactcccatccctaccgacaactggccgtgaaacaaggcaagatattt
catctctctgagcccgagttcccttttaggtactacaagaagttctccattcctgatcta
gatagacaccagctacctctggatgacgccttgctgagctttgcccacgccaactgcacc
ctgatcatctcttaccagaagccaaaggaggttgtggtggccgagtctgagctacagaag
gaactaaagaaggtgaagacagcccacagcaacgatggggactgcaagacccagtag

>gi568815588r:101479456_101687360|GENSCAN_predicted_peptide_5|534_aa
XQMPWMQLEDDSLYISQANFILAYQFRPDGASLNRRPLGVFAGHDEDVCHFVLANSHIVS
AGGDGKIGIHKIHSTFTVKYSAHEQEVNCVDCKGGIIVSGSRDRTAKGIKDSPFLHLQQS
PLPEHLSSPLIPGNANESISSPVSRCGLWPQAGWGSAYTPSRLKTESGPLLSAHYSAAAL
AVLEKPGSFLLCLNEKRTNLNAEQTGKKCDLTTTYKPCAKLLLQPMAMIAPEYSSPRLSE
TPLYVITGSVGITPSPALNLGTLKEGLVALMVGMALIQALPWPQCSPVAHDVFGLYSNVP
KRVVMKQKFGGKHCQSCRKTGILIGSCSQAFTTQKAIEMPLWMVGSTSMALSTHHRCDSS
SFWSCALEGIQVDVSPPASSFVTGTACCGHFSPLRIWDLNSGQLMTHLGSDFPPGAGVLD
VMYESPFTLLSCGYDTYVRYWDLRTSVRKCVMEWEEPHDSTLYCLQTDGNHLLATGSSYY
GVVRLWDRRQRACLHAFPLTSTPLSSPVYCLRLTTKHLYAALSYNLHVLDFQNP

>gi568815588r:101479456_101687360|GENSCAN_predicted_CDS_5|1605_bp
nntcagatgccctggatgcagctagaggatgattctctgtacatatcccaggctaatttc
atcctggcctaccagttccgtccagatggtgccagcttgaatcgtcggcctctgggagtc
tttgctgggcatgatgaggacgtttgccactttgtgctggccaactcgcatattgttagt
gcaggaggggatgggaagattggcattcataagattcacagcaccttcactgtcaagtac
tcggctcatgaacaggaggtgaactgtgtggattgcaaagggggcatcattgtgagtggc
tccagggacaggacggccaagggcataaaggattccccatttctccacctccaacagtca
cctctcccggagcacctctcctccccactgatcccaggcaatgccaacgagagcatctcc
tctcctgtttctaggtgtggcctttggcctcaggccggctggggcagtgcttacacacca
tccagactgaagaccgagtctggtccattgctatcagcccattactcagctgcagcttta
gctgtcctagagaaacctgggtctttcctcctgtgcctgaatgagaagagaacaaatctc
aatgccgagcaaacaggcaaaaagtgtgacctgaccaccacctacaaaccatgtgccaaa
ctgctcctacagcccatggccatgattgcccctgaatatagctcacctaggctgtcagag
actcctctgtatgtgatcacaggtagtgtaggcatcacgcccagccctgccctaaatctt
ggcactttaaaggagggcctggtggcgctgatggtgggcatggcattgattcaggcactg
ccctggccccagtgctcccctgtggctcatgatgtgtttggcctttattccaatgttcca
aagagagtggttatgaagcagaagtttgggggaaagcactgtcagagctgccggaaaact
ggaattctcataggcagctgttcccaggctttcaccactcagaaagccatcgagatgcca
ctgtggatggtaggttccacaagcatggccctgtccactcatcacagatgtgactcgagc
agcttctggagctgcgctctagagggcattcaggtggatgtttccccacctgcaagctct
tttgtgacagggacggcttgttgcgggcacttctcacccctgagaatctgggacctcaac
agtgggcagctgatgacacacttgggcagtgactttcccccaggggctggggtgctggat
gtcatgtatgagtcccctttcacactgctgtcctgtggctatgacacctatgttcgctac
tgggacctccgcaccagcgtccggaaatgtgtcatggagtgggaggagccccacgacagc
accctgtactgcctgcagacagatggcaaccacctgctggccacaggttcctcctactac
ggtgttgtacggctgtgggaccggcgtcaaagggcctgcctgcacgccttcccgctgacg
tcgactcccctcagcagccctgtgtactgcctgcgtctcaccaccaagcatctctatgct
gccctgtcttacaacctccacgtcctggattttcaaaacccatga