GENSCAN 1.0 Date run: 5-Nov-116 Time: 15:08:04 Sequence gi568815588f:101488337_101709468 : 221132 bp : 44.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1956 2076 121 2 1 125 85 -25 0.073 0.97 1.02 Intr + 27573 27673 101 1 2 78 73 28 0.198 0.13 1.03 Intr + 33303 33534 232 2 1 80 91 92 0.692 6.05 1.04 Intr + 37677 37816 140 1 2 43 37 142 0.563 4.78 1.05 Intr + 43959 44096 138 2 0 125 100 33 0.952 8.86 1.06 Intr + 44616 44734 119 2 2 48 75 90 0.894 2.96 1.07 Intr + 46325 46574 250 2 1 66 85 176 0.679 12.54 1.08 Intr + 47018 47136 119 1 2 116 99 15 0.945 4.66 1.09 Intr + 48207 48317 111 0 0 110 92 124 0.991 14.49 1.10 Intr + 49957 50035 79 1 1 99 83 26 0.882 2.75 1.11 Term + 62363 62524 162 1 0 82 36 139 0.751 6.04 1.12 PlyA + 62830 62835 6 1.05 2.00 Prom + 73824 73863 40 -7.46 2.01 Sngl + 78147 78461 315 2 0 40 47 256 0.820 10.65 2.02 PlyA + 83509 83514 6 1.05 3.11 PlyA - 87977 87972 6 -0.45 3.10 Term - 91481 91117 365 0 2 56 46 429 0.411 30.23 3.09 Intr - 92080 91912 169 1 1 125 70 130 0.993 14.42 3.08 Intr - 94555 94427 129 1 0 91 72 199 0.933 19.49 3.07 Intr - 95345 95172 174 2 0 113 42 192 0.995 17.24 3.06 Intr - 96583 96266 318 1 0 54 80 339 0.974 25.85 3.05 Intr - 97031 96980 52 1 1 125 71 9 0.525 1.81 3.04 Intr - 97820 97526 295 0 1 54 52 244 0.955 13.47 3.03 Intr - 99046 98910 137 0 2 37 105 78 0.542 4.61 3.02 Intr - 99982 99868 115 2 1 49 119 9 0.705 -0.39 3.01 Init - 100192 100012 181 1 1 34 -50 273 0.465 5.45 3.00 Prom - 100390 100351 40 -9.06 4.00 Prom + 101025 101064 40 -3.06 4.01 Init + 106363 106402 40 0 1 64 111 108 0.965 11.05 4.02 Intr + 112402 112526 125 2 2 96 63 103 0.645 8.90 4.03 Intr + 112834 113000 167 0 2 76 81 212 0.345 18.06 4.04 Intr + 119579 119695 117 2 0 7 82 95 0.203 0.38 4.05 Intr + 120499 120601 103 1 1 107 47 149 0.994 12.88 4.06 Term + 121031 121135 105 1 0 70 33 157 0.997 6.81 4.07 PlyA + 121937 121942 6 -0.45 5.14 PlyA - 122350 122345 6 1.05 5.13 Term - 123074 122955 120 2 0 104 47 94 0.997 5.37 5.12 Intr - 123433 123292 142 0 1 125 113 249 0.998 31.46 5.11 Intr - 124141 124001 141 0 0 94 117 159 0.993 18.87 5.10 Intr - 136474 136409 66 0 0 103 110 2 0.000 1.92 5.09 Intr - 142651 142530 122 1 2 78 81 53 0.004 2.89 5.08 Intr - 150569 150461 109 1 1 46 88 72 0.018 3.29 5.07 Intr - 154569 154400 170 0 2 54 52 79 0.009 -0.36 5.06 Intr - 156688 156549 140 2 2 2 65 135 0.115 2.78 5.05 Intr - 167329 167264 66 2 0 87 89 32 0.024 2.08 5.04 Intr - 179757 179550 208 0 1 101 110 7 0.326 2.85 5.03 Intr - 184711 184579 133 0 1 93 100 131 0.960 15.45 5.02 Intr - 185337 185152 186 2 0 88 110 102 0.816 11.20 5.01 Init - 206847 206045 803 0 2 94 69 631 0.147 53.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 140563 140657 95 2 2 66 42 123 0.882 3.49 S.002 Sngl - 206847 206041 807 0 0 94 47 641 0.848 54.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:101488337_101709468|GENSCAN_predicted_peptide_1|523_aa FYTTQKFEEEREYIKMYSAPVFRHYIVLAMCYLFGFSCEPGVGVTLRVTSDLKRDTKGAS ESRDDLFLDLGVGYTKLANGTSSMIVPKQRKLSASYEKEKELCVKYFEQWSESDQVEFVE HLISQMCHYQHGHINSYLKPMLQRDFITALPARGLDHIAENILSYLDAKSLCAAELVCKE WYRVTSDGMLWKKLIERMTIESNWRCGRHSLQRIHCRSETSKGVYCLQYDDQKIVSGLRD NTIKIWDKNTLECKRILTGHTGSVLCLQYDERVIITGSSDSTVRVWDVNTGEMLNTLIHH CEAVLHLRFNNGMMVTCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDKYIVSA SGDRTIKVWNTSTCEFVRTLNGHKRGIACLQYRDRLVVSGSSDNTIRLWDIECGACLRVL EGHEELVRCIRFDNKRIVSGAYDGKIKVWDLVAALDPRAPAGTLCLRTLVEHSGRVFRLQ FDEFQIVSSSHDDTILIWDFLNDPAAQAEPPRSPSRTYTYISR >gi568815588f:101488337_101709468|GENSCAN_predicted_CDS_1|1572_bp ttttatactactcagaagtttgaggaggagagagaatacattaaaatgtattcagcccca gtgttcaggcactatatagtgctagctatgtgttacttatttggattctcatgtgaacct ggtgtaggagtcactctcagggtgactagtgacctgaaaagggacacaaagggggctagt gagtctcgtgatgatctgtttcttgatctgggtgttggttatacaaaacttgccaatggc acttccagtatgattgtgcccaagcaacggaaactctcagcaagctatgaaaaggaaaag gaactgtgtgtcaaatactttgagcagtggtcagagtcagatcaagtggaatttgtggaa catcttatatcccaaatgtgtcattaccaacatgggcacataaactcgtatcttaaacct atgttgcagagagatttcataactgctctgccagctcggggattggatcatattgctgag aacattctgtcatacctggatgccaaatcactatgtgctgctgaacttgtgtgcaaggaa tggtaccgagtgacctctgatggcatgctgtggaagaagcttatcgagagaatgacaata gaatctaattggagatgtggaagacatagtttacagagaattcactgccgaagtgaaaca agcaaaggagtttactgtttacagtatgatgatcagaaaatagtaagcggccttcgagac aacacaatcaagatctgggataaaaacacattggaatgcaagcgaattctcacaggccat acaggttcagtcctctgtctccagtatgatgagagagtgatcataacaggatcatcggat tccacggtcagagtgtgggatgtaaatacaggtgaaatgctaaacacgttgattcaccat tgtgaagcagttctgcacttgcgtttcaataatggcatgatggtgacctgctccaaagat cgttccattgctgtatgggatatggcctccccaactgacattaccctccggagggtgctg gtcggacaccgagctgctgtcaatgttgtagactttgatgacaagtacattgtttctgca tctggggatagaactataaaggtatggaacacaagtacttgtgaatttgtaaggacctta aatggacacaaacgaggcattgcctgtttgcagtacagggacaggctggtagtgagtggc tcatctgacaacactatcagattatgggacatagaatgtggtgcatgtttacgagtgtta gaaggccatgaggaattggtgcgttgtattcgatttgataacaagaggatagtcagtggg gcctatgatggaaaaattaaagtgtgggatcttgtggctgctttggacccccgtgctcct gcagggacactctgtctacggacccttgtggagcattccggaagagtttttcgactacag tttgatgaattccagattgtcagtagttcacatgatgacacaatcctcatctgggacttc ctaaatgatccagctgcccaagctgaacccccccgttccccttctcgaacatacacctac atctccagataa >gi568815588f:101488337_101709468|GENSCAN_predicted_peptide_2|104_aa MCMCARLCSCTPPLPLSAALMTLAQRKVMTNARYQRTRPMTNGRAPEQPHYAVITEDGWP GAPPPPPRAGNGSRSCSARPGREPSRCRRAANHVGAQDAQTEEP >gi568815588f:101488337_101709468|GENSCAN_predicted_CDS_2|315_bp atgtgcatgtgtgcgcgcctgtgctcatgcacgccccccctcccgctgtcagccgcgctg atgacattggcgcagaggaaagtgatgacaaatgcccgttatcagcgaacgaggccgatg acaaatgggcgggcgcccgagcagccccattacgctgtaataacagaagatggatggcct ggagcccccccacccccaccccgtgccggtaacgggagccgatcctgctccgcccgcccc gggagggagcccagccgctgccgccgggccgccaatcacgtcggggcccaagacgcccag accgaggagccgtga >gi568815588f:101488337_101709468|GENSCAN_predicted_peptide_3|644_aa MQDLARLTAETLPALPTQALRPFPEGRTLREKNEGARGDPRVTVLQQRSLLGCPQTLQPA PRPAAAMATVPEGHFRLTRKLFWPFALLPPFVRSHWLCWPGSSHTSMDPRGILKAFPKRQ KIHADASSKVLAKIPRREEGEEAEEWLSSLRAHVVRTGIGRARAELFEKQIVQHGGQLCP AQGPGVTHIVVDEGMDYERALRLLRLPQLPPGAQLVKSAWLSLCLQERRLVDVAGFSIFI PSRPVSPPQKAKEAPNTQAQPISDDEASDGEETQVSAADLEALISGHYPTSLEGDCEPSP APAVLDKWVCAQPSSQKATNHNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHK PVTSYQEACSIPGIGKRMAEKIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQM WYQQGFRSLEDIRSQASLTTQQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLL CVACGSYRRGKATCGDVDVLITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQ KYLGVCRLPGPGRRHRRLDIIVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHA LSTAVVRNTHGCKVGPGRVLPTPTEKDVFRLLGLPYREPAERDW >gi568815588f:101488337_101709468|GENSCAN_predicted_CDS_3|1935_bp atgcaggacctggcccggctgaccgccgagacccttccagctctgccgacccaggccctg aggcccttcccggagggccggaccctgagggaaaaaaacgaaggagcccgtggggaccct cgagttaccgtcctgcagcagcgcagtcttctgggctgtccgcagactctccaaccagcc ccacgcccagccgctgccatggcaaccgttccagagggtcacttccggctgactcggaag ctattctggccatttgccctccttccccccttcgtccgctctcattggctctgctggccg ggatccagccatacttcaatggatcccaggggtatcttgaaggcatttcccaagcggcag aaaattcatgctgatgcatcatcaaaagtacttgcaaagattcctaggagggaagaggga gaagaagcagaagagtggctgagctcccttcgggcccatgttgtgcgcactggcattgga cgagcccgggcagaactctttgagaagcagattgttcagcatggcggccagctatgccct gcccagggcccaggtgtcactcacattgtggtggatgaaggcatggactatgagcgagcc ctccgccttctcagactaccccagctgcccccgggtgctcagctggtgaagtcagcctgg ctgagcttgtgccttcaggagaggaggctggtggatgtagctggattcagcatcttcatc cccagtaggcctgtgtctcctccccaaaaggcaaaagaggcaccaaacacccaagcccag cccatctctgatgatgaagccagtgatggggaagaaacccaggttagtgcagctgatctg gaagccctcatcagtggccactaccccacctcccttgagggagattgtgagcctagccca gcccctgctgtcctggataagtgggtctgtgcacagccctcaagccagaaggcgaccaat cacaacctccatatcacagagaagctggaagttctggccaaagcctacagtgttcaggga gacaagtggagggccctgggctatgccaaggccatcaatgccctcaagagcttccataag cctgtcacctcgtaccaggaggcctgcagtatccctgggattgggaagcggatggctgag aaaatcatagagatcctggagagcgggcatttgcggaagctggaccatatcagtgagagc gtgcctgtcttggagctcttctccaacatctggggagctgggaccaagactgcccagatg tggtaccaacagggcttccgaagtctggaagacatccgcagccaggcctccctgacaacc cagcaggccatcggcctgaagcattacagtgacttcctggaacgtatgcccagggaggag gctacagagattgagcagacagtccagaaagcagcccaggcctttaactctgggctgctg tgtgtggcatgtggttcataccgacggggaaaggcgacctgtggtgatgtcgacgtgctc atcactcacccagatggccggtcccaccggggtatcttcagccgcctccttgacagtctt cggcaggaagggttcctcacagatgacttggtgagccaagaggagaatggtcagcaacag aagtacttgggggtgtgccggctcccagggccagggcggcggcaccggcgcctggacatc atcgtggtgccctatagcgagtttgcctgtgccctgctctacttcaccggctctgcacac ttcaaccgctccatgcgagccctggccaaaaccaagggcatgagtctgtcagaacatgcc ctcagcactgctgtggtccggaacacccatggctgcaaggtggggcctggccgagtgctg cccactcccactgagaaggatgtcttcaggctcttaggcctcccctaccgagaacctgct gagcgggactggtga >gi568815588f:101488337_101709468|GENSCAN_predicted_peptide_4|218_aa MAEEYDEKTSELLVRKWRVKSALGAMGQWQLEVGDPAPLGAGNLGPELIKESNANEQSSS WICLLQPIFMRKDTKMSFQWRIRNLPYPKDVYSVSVDQKERCIIVRTTNKNKDQTKDKES YSNLDSHPYRQLAVKQGKIFHLSEPEFPFRYYKKFSIPDLDRHQLPLDDALLSFAHANCT LIISYQKPKEVVVAESELQKELKKVKTAHSNDGDCKTQ >gi568815588f:101488337_101709468|GENSCAN_predicted_CDS_4|657_bp atggctgaagaatatgacgagaagacgagtgaactacttgtgagaaagtggcgtgtgaaa agtgccctgggagccatgggccagtggcagcttgaagtaggagacccagcgcccctagga gcagggaacctggggcctgaactcatcaaggaaagcaatgccaatgaacagtcctcgagt tggatttgccttctgcagcctatcttcatgcgcaaggacaccaagatgagtttccagtgg cggattcgaaacctcccctatcctaaggatgtctatagtgtctctgtggaccagaaggag cgctgcatcattgtcagaacaaccaacaagaataaggatcagaccaaggacaaggaatct tactctaatcttgactcccatccctaccgacaactggccgtgaaacaaggcaagatattt catctctctgagcccgagttcccttttaggtactacaagaagttctccattcctgatcta gatagacaccagctacctctggatgacgccttgctgagctttgcccacgccaactgcacc ctgatcatctcttaccagaagccaaaggaggttgtggtggccgagtctgagctacagaag gaactaaagaaggtgaagacagcccacagcaacgatggggactgcaagacccagtag >gi568815588f:101488337_101709468|GENSCAN_predicted_peptide_5|801_aa MVSPPPGAAAVAARAGLRVGPAPRPLMGSQGRSGPPGNGGPGEGEGGEARKLQEGRVARG KRRKGKGKGKARAGQGGRGSGAEGKPGPQTAKEAAGPGADAGARACPREEAEGGRSVEEG ARGIVKGVEGSAGAGKEAQGREYGKKEEWRVRARRREGARPGRAQGRGGQAWADIAGTGV AMAAAAGEEEEEEEAARESAARPAAGPALWRLPEELLLLICSYLDMRALGRLAQVCRWLR RFTSCDLLWRRIARASLNSGFTRLGTDLQMPWMQLEDDSLYISQANFILAYQFRPDGASL NRRPLGVFAGHDEDVCHFVLANSHIVSAGGDGKIGIHKIHSTFTVKYSAHEQEVNCVDCK GGIIVSGSRDRTAKGIKDSPFLHLQQSPLPEHLSSPLIPGNANESISSPVSRCGLWPQAG WGSAYTPSRLKTESGPLLSAHYSAAALAVLEKPGSFLLCLNEKRTNLNAEQTGKKCDLTT TYKPCAKLLLQPMAMIAPEYSSPRLSETPLYVITGSVGITPSPALNLGTLKEGLVALMVG MALIQALPWPQCSPVAHDVFGLYSNVPKRVVMKQKFGGKHCQSCRKTGILIGSCSQAFTT QKAIEMPLWMVGSTSMALSTHHRCDSSSFWSCALEGIQVDVSPPASSFVTGTACCGHFSP LRIWDLNSGQLMTHLGSDFPPGAGVLDVMYESPFTLLSCGYDTYVRYWDLRTSVRKCVME WEEPHDSTLYCLQTDGNHLLATGSSYYGVVRLWDRRQRACLHAFPLTSTPLSSPVYCLRL TTKHLYAALSYNLHVLDFQNP >gi568815588f:101488337_101709468|GENSCAN_predicted_CDS_5|2406_bp atggtgtcgcctcctcccggggcggcggcggtggcggctcgggctgggctccgcgtcggg ccggccccgcggccgctcatgggcagccagggccgctcggggccccccgggaacggcggg cccggcgagggcgagggcggagaggcgaggaagctgcaggaagggagggtggcgaggggg aagcgaaggaaggggaaggggaagggaaaagcgagagcggggcaaggcggaagaggaagc ggggcggaagggaagcccgggccgcagacggcgaaggaggcagccgggccgggggctgac gcgggagcgagggcatgcccaagggaggaagcagagggaggcagaagcgtggaggaaggg gcgagaggcatcgtcaagggagtcgaggggagcgcaggggccgggaaggaggcacaagga agagagtatgggaagaaggaggaatggagggtcagggctaggcggcgggagggcgccagg ccgggaagagcacagggacgagggggtcaggcttgggccgacatcgcggggacaggggtg gccatggcggcggcggccggggaggaggaggaggaggaggaggcggctcgggagtcggct gcccgcccggccgcggggcctgcgctctggcgcctgccggaggagctgctgctgctcatc tgctcctacctggacatgcgggccctcggccgcctggcccaggtgtgccgctggctgcgg cgcttcaccagctgcgatctgctctggcgccggatagcccgggcctcgctcaactccggc ttcacgcggctcggcaccgaccttcagatgccctggatgcagctagaggatgattctctg tacatatcccaggctaatttcatcctggcctaccagttccgtccagatggtgccagcttg aatcgtcggcctctgggagtctttgctgggcatgatgaggacgtttgccactttgtgctg gccaactcgcatattgttagtgcaggaggggatgggaagattggcattcataagattcac agcaccttcactgtcaagtactcggctcatgaacaggaggtgaactgtgtggattgcaaa gggggcatcattgtgagtggctccagggacaggacggccaagggcataaaggattcccca tttctccacctccaacagtcacctctcccggagcacctctcctccccactgatcccaggc aatgccaacgagagcatctcctctcctgtttctaggtgtggcctttggcctcaggccggc tggggcagtgcttacacaccatccagactgaagaccgagtctggtccattgctatcagcc cattactcagctgcagctttagctgtcctagagaaacctgggtctttcctcctgtgcctg aatgagaagagaacaaatctcaatgccgagcaaacaggcaaaaagtgtgacctgaccacc acctacaaaccatgtgccaaactgctcctacagcccatggccatgattgcccctgaatat agctcacctaggctgtcagagactcctctgtatgtgatcacaggtagtgtaggcatcacg cccagccctgccctaaatcttggcactttaaaggagggcctggtggcgctgatggtgggc atggcattgattcaggcactgccctggccccagtgctcccctgtggctcatgatgtgttt ggcctttattccaatgttccaaagagagtggttatgaagcagaagtttgggggaaagcac tgtcagagctgccggaaaactggaattctcataggcagctgttcccaggctttcaccact cagaaagccatcgagatgccactgtggatggtaggttccacaagcatggccctgtccact catcacagatgtgactcgagcagcttctggagctgcgctctagagggcattcaggtggat gtttccccacctgcaagctcttttgtgacagggacggcttgttgcgggcacttctcaccc ctgagaatctgggacctcaacagtgggcagctgatgacacacttgggcagtgactttccc ccaggggctggggtgctggatgtcatgtatgagtcccctttcacactgctgtcctgtggc tatgacacctatgttcgctactgggacctccgcaccagcgtccggaaatgtgtcatggag tgggaggagccccacgacagcaccctgtactgcctgcagacagatggcaaccacctgctg gccacaggttcctcctactacggtgttgtacggctgtgggaccggcgtcaaagggcctgc ctgcacgccttcccgctgacgtcgactcccctcagcagccctgtgtactgcctgcgtctc accaccaagcatctctatgctgccctgtcttacaacctccacgtcctggattttcaaaac ccatga