GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:17:05

Sequence gi568815596r:9485289_9730452 : 245164 bp : 44.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.17 PlyA -    839    834    6                               1.05
 1.16 Term -   5230   4889  342  1  0   84   28   350 0.876  23.11
 1.15 Intr -   5863   5813   51  1  0  116   84     8 0.750   2.30
 1.14 Intr -   7698   7610   89  1  2   56  103    10 0.864  -1.01
 1.13 Intr -   8537   8459   79  2  1  106   74    74 0.947   6.92
 1.12 Intr -   9479   9349  131  0  2   89   82    73 0.996   7.21
 1.11 Intr -  11960  11826  135  0  0  104   89   127 0.960  14.94
 1.10 Intr -  16988  16885  104  1  2  100   79    64 0.920   6.52
 1.09 Intr -  20077  19878  200  1  2   72   94   225 0.974  19.65
 1.08 Intr -  24843  24691  153  0  0   27   95   105 0.794   5.37
 1.07 Intr -  32701  32613   89  2  2   69   95    20 0.572   0.49
 1.06 Intr -  32959  32815  145  1  1   55  110    20 0.692   0.76
 1.05 Intr -  36028  35915  114  1  0    1   92   122 0.866   4.54
 1.04 Intr -  38050  37961   90  1  0   51   68   105 0.936   4.99
 1.03 Intr -  40976  40823  154  1  1   42   83   115 0.486   6.47
 1.02 Intr -  57997  57865  133  2  1   70   86    13 0.080  -1.00
 1.01 Init -  70317  70221   97  0  1   94   28   135 0.277   6.61
 1.00 Prom -  79035  78996   40                              -4.96

 2.11 PlyA -  80531  80526    6                               1.05
 2.10 Term - 100057  99998   60  1  0   92   35    56 0.600  -1.50
 2.09 Intr - 102221 102126   96  2  0   32   79   135 0.989   7.21
 2.08 Intr - 103040 102877  164  0  2   91  107    47 0.996   6.59
 2.07 Intr - 106227 106104  124  0  1   82  111   104 0.814  12.26
 2.06 Intr - 135473 135354  120  2  0  106   84    46 0.857   6.69
 2.05 Intr - 145246 144871  376  0  1  100  105   724 0.279  70.32
 2.04 Intr - 148356 148340   17  0  2  119   93     4 0.023  -1.66
 2.03 Intr - 153947 153846  102  2  0   91   53    52 0.025   2.37
 2.02 Intr - 164100 163895  206  1  2  100   64    59 0.016   3.62
 2.01 Init - 175337 175220  118  2  1   21   61   102 0.003   0.95
 2.00 Prom - 181335 181296   40                              -3.76

 3.00 Prom + 191477 191516   40                              -2.96
 3.01 Init + 207381 207755  375  2  0   59   72   105 0.583   2.96
 3.02 Term + 208600 208725  126  0  0   64   48   119 0.750   3.78
 3.03 PlyA + 208873 208878    6                               1.05

 4.03 PlyA - 208985 208980    6                               1.05
 4.02 Term - 212396 212270  127  1  1  131   42    14 0.532  -1.14
 4.01 Init - 214286 214195   92  2  2   76   63    82 0.314   4.68
 4.00 Prom - 220142 220103   40                              -3.06

 5.00 Prom + 225747 225786   40                              -2.66
 5.01 Init + 235600 235623   24  0  0  106   72    35 0.502   3.39
 5.02 Intr + 239329 239504  176  0  2  123   66    24 0.598   2.44
 5.03 Intr + 240003 240136  134  0  2   65   76    65 0.411   3.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:9485289_9730452|GENSCAN_predicted_peptide_1|701_aa
MRQSLLFLTSVVPFVLAPRPPDDPGFGPHQRLEKLDSLLSDYDILSLSNIQQHSVRKRDL
QTSTHVETLLTFSALKSAYKFILELVHRVKRRADPDPMKNTCKLLVVADHRFYRYMGRGE
ESTTTNYLIELIDRVDDIYRNTSWDNAGFKGYGIQIEQIRILKSPQEVKPGEKHYNMAKS
YPNEEKDAWDVKMLLEQFSFDIAEEASKVCLAHLFTYQDFDMGTLGLAYVGSPRANSHGG
VCPKAYYSPVGKKNIYLNSGLTSTKNYGKTILTKEADLVTTHELGHNFGAEHDPDGLAEC
APNEDQGGKYVMYPIAVSGDHENNKMFSNCSKQSIYKTIESKAQECFQERSNKVCGNSRV
DEGEECDPGIMYLNNDTCCNSDCTLKEGVQCSDRNSPCCKNCQFETAQKKCQEAINATCK
GVSYCTGNSSECPPPGNAEDDTVCLDLGKCKDGKCIPFCEREQQLESCACNETDNSCKVC
CRDLSGRCVPYVDAEQKNLFLRKGKPCTVGFCDMNGKCEKRVQDVIERFWDFIDQLSINT
FGKFLADNIVGSVLVFSLIFWIPFSILVHCVDKKLDKQYESLSLFHPSNVEMLSSMDSAS
VRIIKPFPAPQTPGRLQPAPVIPSAPAAPKLDHQRMDTIQEDPSTDSHMDEDGFEKDPFP
NSSTAAKSFEDLTDHPVTRSEKAASFKLQRQNRVDSKETEC

>gi568815596r:9485289_9730452|GENSCAN_predicted_CDS_1|2106_bp
atgaggcagtctctcctattcctgaccagcgtggttcctttcgtgctggcgccgcgacct
ccggatgacccgggcttcggcccccaccagagactcgagaagcttgattctttgctctca
gactacgatattctctctttatctaatatccagcagcattcggtaagaaaaagagatcta
cagacttcaacacatgtagaaacactactaactttttcagctttgaaaagtgcatacaaa
ttcatattagagcttgttcatcgagtgaaaagaagagctgacccagatcccatgaagaac
acgtgtaaattattggtggtagcagatcatcgcttctacagatacatgggcagaggggaa
gagagtacaactacaaattacttaatagagctaattgacagagttgatgacatctatcgg
aacacttcatgggataatgcaggttttaaaggctatggaatacagatagagcagattcgc
attctcaagtctccacaagaggtaaaacctggtgaaaagcactacaacatggcaaaaagt
tacccaaatgaagaaaaggatgcttgggatgtgaagatgttgctagagcaatttagcttt
gatatagctgaggaagcatctaaagtttgcttggcacaccttttcacataccaagatttt
gatatgggaactcttggattagcttatgttggctctcccagagcaaacagccatggaggt
gtttgtccaaaggcttattatagcccagttgggaagaaaaatatctatttgaatagtggt
ttgacgagcacaaagaattatggtaaaaccatccttacaaaggaagctgacctggttaca
actcatgaattgggacataattttggagcagaacatgatccggatggtctagcagaatgt
gccccgaatgaggaccagggagggaaatatgtcatgtatcccatagctgtgagtggcgat
cacgagaacaataagatgttttcaaactgcagtaaacaatcaatctataagaccattgaa
agtaaggcccaggagtgttttcaagaacgcagcaataaagtttgtgggaactcgagggtg
gatgaaggagaagagtgtgatcctggcatcatgtatctgaacaacgacacctgctgcaac
agcgactgcacgttgaaggaaggtgtccagtgcagtgacaggaacagtccttgctgtaaa
aactgtcagtttgagactgcccagaagaagtgccaggaggcgattaatgctacttgcaaa
ggcgtgtcctactgcacaggtaatagcagtgagtgcccgcctccaggaaatgctgaagat
gacactgtttgcttggatcttggcaagtgtaaggatgggaaatgcatccctttctgcgag
agggaacagcagctggagtcctgtgcatgtaatgaaactgacaactcctgcaaggtgtgc
tgcagggacctttctggccgctgtgtgccctatgtcgatgctgaacaaaagaacttattt
ttgaggaaaggaaagccctgtacagtaggattttgtgacatgaatggcaaatgtgagaaa
cgagtacaggatgtaattgaacgattttgggatttcattgaccagctgagcatcaatact
tttggaaagtttttagcagacaacatcgttgggtctgtcctggttttctccttgatattt
tggattcctttcagcattcttgtccattgtgtggataagaaattggataaacagtatgaa
tctctgtctctgtttcaccccagtaacgtcgaaatgctgagcagcatggattctgcatcg
gttcgcattatcaaaccctttcctgcgccccagactccaggccgcctgcagcctgcccct
gtgatcccttcggcgccagcagctccaaaactggaccaccagagaatggacaccatccag
gaagaccccagcacagactcacatatggacgaggatgggtttgagaaggaccccttccca
aatagcagcacagctgccaagtcatttgaggatctcacggaccatccggtcaccagaagt
gaaaaggctgcctcctttaaactgcagcgtcagaatcgtgttgacagcaaagaaacagag
tgctaa

>gi568815596r:9485289_9730452|GENSCAN_predicted_peptide_2|460_aa
MRAARPPPPAAERPSRYLQAGIQPALERFRHKCREQPRTESSSLLLSPGEGEEGQSSPEP
HTPSQVIANIYHVFAKCQDNSKHLKGISSFYICRNPRRQVQFPDEDVQEVCVTVVAAGFR
LNTFQQFVCCRILQSEAGHISLELLHTSSRGSSRCGSALALALLALRPGPGPAPAMEKTE
LIQKAKLAEQAERYDDMATCMKAVTEQGAELSNEERNLLSVAYKNVVGGRRSAWRVISSI
EQKTDTSDKKLQLIKDYREKVESELRSICTTVLEYRFEDQENHRCDNEERRIVDIRHWVI
KVPVNSWKPAKDLELLDKYLIANATNPESKVFYLKMKGDYFRYLAEVACGDDRKQTIDNS
QGAYQEAFDISKKEMQPTHPIRLGLALNFSVFYYEILNNPELACTLAKTAFDEAIAELDT
LNEDSYKDSTLIMQLLRDNLTLWTSDSAGEECDAAEGAEN

>gi568815596r:9485289_9730452|GENSCAN_predicted_CDS_2|1383_bp
atgagggcagccagacccccgcctcctgctgctgaacgaccctcgcgctacctgcaggcg
ggaattcaaccagctttggaaaggttcagacataaatgcagggagcagccccgaacagaa
agctcttcactgctgctcagccctggagagggggaggaggggcaaagctcccccgaaccc
cacaccccctcgcaagtgattgctaatatttatcatgtgtttgccaagtgccaggacaat
agcaaacacctcaaaggcattagctcattttatatttgcaggaaccctaggaggcaagtc
cagtttccagatgaggacgttcaggaagtttgtgtgacggttgtagcagctggattccgt
ctgaacacattccagcaatttgtctgctgcaggatccttcagagtgaggctggccacatc
agtctggagcttcttcataccagctctcgaggctcctcccgctgcgggtcggcgctcgcc
ctcgctctcctcgccctccgccccggccccggccccgcgcccgccatggagaagactgag
ctgatccagaaggccaagctggccgagcaggccgagcgctacgacgacatggccacctgc
atgaaggcagtgaccgagcagggcgccgagctgtccaacgaggagcgcaacctgctctcc
gtggcctacaagaacgtggtcgggggccgcaggtccgcctggagggtcatctctagcatc
gagcagaagaccgacacctccgacaagaagttgcagctgattaaggactatcgggagaaa
gtggagtccgagctgagatccatctgcaccacggtgctggaatataggtttgaagaccag
gaaaatcacagatgtgataatgaagaaaggagaatagttgatattagacattgggtaata
aaagtccctgttaattcctggaaaccagctaaagatctggaattgttggataaatattta
atagccaatgcaactaatccagagagtaaggtcttctatctgaaaatgaagggtgattac
ttccggtaccttgctgaagttgcgtgtggtgatgatcgaaaacaaacgatagataattcc
caaggagcttaccaagaggcatttgatataagcaagaaagagatgcaacccacacaccca
atccgcctggggcttgctcttaacttttctgtattttactatgagattcttaataaccca
gagcttgcctgcacgctggctaaaacggcttttgatgaggccattgctgaacttgataca
ctgaatgaagactcatacaaagacagcaccctcatcatgcagttgcttagagacaaccta
acactttggacatcagacagtgcaggagaagaatgtgatgcggcagaaggggctgaaaac
taa

>gi568815596r:9485289_9730452|GENSCAN_predicted_peptide_3|166_aa
MLDTIRFSWGEEGGNGCWKGSTGDSYSSLLDLLPPNSLKLVEQEEDEKSCLYVVPLESLK
NPQYPDLRNEHHRESKAVVGSCQACEQRDGIHAQGFLTAGPRPPHSTSALCPAMVQRQGV
VKGPRVRVSHFTALSCASAAQVFECSVASHPCLSVGGFCEASEKQE

>gi568815596r:9485289_9730452|GENSCAN_predicted_CDS_3|501_bp
atgctggacacaatcaggttctcctggggggaggagggagggaatggctgctggaagggt
tctaccggggactcctacagcagcttgttagatctccttcctcccaattccctgaagttg
gtggaacaggaagaggacgagaagtcatgtttgtatgttgtccccttggaatctctcaaa
aatccgcaatatcccgatttgagaaatgagcatcacagagaaagcaaagcggtagttgga
agctgccaagcttgtgagcaacgagacggaatccacgcccagggctttctaacggcgggg
ccacgccctccacactccacctcggccctctgccctgcgatggtgcagagacagggtgtg
gtcaaggggcctagggtccgtgtgtcccatttcactgcactgtcctgtgccagcgctgcc
caagtcttcgagtgctccgtggcctcccacccctgcctctccgtgggcggcttctgtgag
gcctccgagaagcaggaatga

>gi568815596r:9485289_9730452|GENSCAN_predicted_peptide_4|72_aa
MMHTAREHTVLPRATQCVTRDAEQCEILAPRRRLEVSSQEKSKSGSWDWGMQAQLRTYTL
TTDGRFKRIHEC

>gi568815596r:9485289_9730452|GENSCAN_predicted_CDS_4|219_bp
atgatgcatacagctcgagaacacacggttctgccccgggccactcagtgtgtcactcgg
gatgctgaacaatgtgagatcctggctccaaggagaagattggaggtctcttctcaggag
aagagtaaaagtggttcttgggattgggggatgcaggcacagttgaggacatacactctc
accacagatggcaggtttaagcgaatacatgaatgctga

>gi568815596r:9485289_9730452|GENSCAN_predicted_peptide_5|112_aa
MAPRSIGQAHWQPCPSSSTRMSRPLPGCPLTASTQTPAFQGSLAQAPPKWNLCVPTPQLF
HSHICTRMVRWLSAGKRVWVKVQGFHAELTQLHPSDSTSLSLSFFICRAVSX

>gi568815596r:9485289_9730452|GENSCAN_predicted_CDS_5|336_bp
atggctccacgctccatcggccaggcacactggcaaccttgcccttcctcgagcacgcgg
atgtccagacctctccctggctgccccctcactgcctccacccaaaccccagcctttcag
ggcagccttgcccaggcacctcccaagtggaacctgtgcgtgcctacgccccagctgttc
cactcccacatctgcactcgcatggtccgctggcttagtgcaggcaaacgggtctgggtc
aaagtgcaaggcttccacgcggagttaacacagctgcatccaagtgactcaacctctctg
agcctcagtttcttcatctgtagagcagttagtgnn