GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:28:12 Sequence gi568815586r:92044083_92245535 : 201453 bp : 39.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2904 3030 127 1 1 69 55 107 0.178 4.32 1.02 Term + 7808 7835 28 2 1 130 47 28 0.347 -0.53 1.03 PlyA + 8060 8065 6 1.05 2.00 Prom + 13829 13868 40 -5.15 2.01 Sngl + 13883 14143 261 1 0 71 38 226 0.699 10.81 2.02 PlyA + 16917 16922 6 1.05 3.02 PlyA - 17225 17220 6 1.05 3.01 Sngl - 39568 39419 150 1 0 73 47 201 0.784 8.42 3.00 Prom - 41229 41190 40 -7.15 4.00 Prom + 45221 45260 40 -3.25 4.01 Init + 50023 50030 8 0 2 103 108 0 0.865 4.05 4.02 Term + 50647 50803 157 1 1 126 52 46 0.830 1.22 4.03 PlyA + 51109 51114 6 1.05 5.07 PlyA - 51128 51123 6 1.05 5.06 Term - 58208 58099 110 0 2 92 34 80 0.069 0.69 5.05 Intr - 77647 77594 54 2 0 102 94 50 0.354 5.03 5.04 Intr - 100365 100003 363 1 0 108 42 394 0.037 31.03 5.03 Intr - 101096 100978 119 1 2 53 47 136 0.529 5.19 5.02 Intr - 101550 101179 372 2 0 90 58 294 0.821 19.85 5.01 Init - 102482 102259 224 2 2 79 -37 252 0.647 8.88 5.00 Prom - 120350 120311 40 -3.65 6.00 Prom + 122158 122197 40 -8.95 6.01 Init + 124318 124587 270 0 0 88 32 154 0.703 6.81 6.02 Term + 124646 124798 153 1 0 17 39 198 0.834 4.74 6.03 PlyA + 124937 124942 6 1.05 7.11 PlyA - 125041 125036 6 1.05 7.10 Term - 132895 132788 108 1 0 108 47 138 0.935 9.23 7.09 Intr - 137390 137312 79 1 1 46 66 43 0.054 -3.47 7.08 Intr - 143128 142984 145 2 1 66 22 140 0.300 3.62 7.07 Intr - 157892 157809 84 0 0 86 100 51 0.272 4.97 7.06 Intr - 159218 159185 34 2 1 96 30 40 0.079 -4.12 7.05 Intr - 169915 169725 191 2 2 56 76 113 0.064 5.28 7.04 Intr - 180992 180967 26 1 2 92 113 21 0.004 1.75 7.03 Intr - 190283 190148 136 0 1 36 58 97 0.009 0.21 7.02 Intr - 197774 197689 86 1 2 59 116 50 0.507 3.44 7.01 Init - 198693 198641 53 0 2 53 113 28 0.473 2.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100365 99998 368 1 2 108 44 411 0.953 32.48 S.002 Init + 174537 174604 68 2 2 85 119 61 0.853 9.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_1|51_aa XVNALDKSPAEYNKTENLTEEKPLYLYIYKPKHNCPCRMKMHMCCIFTSNP >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_1|156_bp natgtcaacgctctagacaaaagccctgctgagtacaacaaaaccgaaaacctgacagag gaaaaacctttatatctttatatctacaagcccaaacacaattgtccctgcaggatgaaa atgcatatgtgctgcatcttcacatccaatccataa >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_2|86_aa MGKDFMTRTPKAIATKAKTEKRDLIKLKSFCTAKEIIIRVNGQPTEWEEIFAIYPSDKGL ISRIYKELKHIYKRMSFGIKQTCVNK >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_2|261_bp atgggcaaagacttcatgacaagaacgccaaaagcaattgcaacgaaagccaaaactgaa aaacgggatctaattaaactaaagagcttctgcacagcaaaagaaattatcatcagagtg aacgggcaacctacagaatgggaggaaatttttgcaatctatccatctgacaaagggcta atatccagaatttataaggaacttaaacatatttacaagaggatgagctttggaatcaaa cagacctgcgttaataagtag >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_3|49_aa MVADSDTCFITGFEEGRRRLQAKECGYPLEAAKARMVSPLEPPEKAQAS >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_3|150_bp atggtagcagactcagacacgtgcttcatcactggctttgaagaaggaagaagaaggctg caggccaaggaatgcggctaccctctagaagctgcaaaggcaagaatggtttctcctcta gagcctccagaaaaagcccaggctagctga >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_4|54_aa MPNKPIAIAFVSAMKRTQIDGRDTTVEETTHTADTPTQEGQSHSHWLNVYSPVC >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_4|165_bp atgcccaacaagcctattgctattgcttttgtgtcagccatgaaaagaacacaaatagat ggcagagacaccacggtggaagaaacaacacacactgcagacacaccaactcaggaagga cagagtcactctcattggctaaatgtgtacagtccagtttgctga >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_5|413_aa MAPRRWQLVCEGTLAPRGCSPFSWTERPAGPFPPLSPALRQRTASTHSAGARIAPHHTRL RLEEPVTETRSGNEILPPGEEARGGCSAVGAAPPSPSRPGPPPHAAPMHPFYTRAATMIG EIAAAVSFISKFLRTKGLTSERQLQTFSQSLQELLAGEQGEGVGGTRAPHPWVRVRPSGL REPRQRPGAGRAWAPPRDRWQNGEEKTTRTTKKLKPTEAGAAVRGPSADLGTKEPWGWEH YKHHWFPEKPCKGSGYRCIRINHKMDPLIGQAAQRIGLSSQELFRLLPSELTLWVDPYEV SYRIGEDGSICVLYEASPAGGSTQNSTNVQMVDSRISCKEELLLGRTSPSKNYNMMTVSV TRITEKNIATTKCPAFQQNRNLLFILSFAVSSKFLDEIHKELVESTSLFDESW >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_5|1242_bp atggctccgcggcgttggcagctcgtctgcgaggggaccctggctccccgaggctgcagt cctttttcttggacagaacgccccgcaggaccgttccctccactttcccccgcactaagg cagcgcaccgccagcacacacagcgctggcgcccgcatcgcgccgcaccacacgcgtctc cggcttgaggagccagtcaccgagacccggagcgggaacgagatccttccgcccggtgag gaagcccggggtggctgctccgccgtcggggccgcgccgccgagccccagccgccccggg ccgcccccgcacgccgcccccatgcatcccttctacacccgggccgccaccatgataggc gagatcgccgccgccgtgtccttcatctccaagtttctccgcaccaaggggctcacgagc gagcgacagctgcagaccttcagccagagcctgcaggagctgctggcaggtgagcagggc gagggcgtcggagggacgcgggccccacatccctgggtcagagtccggccgtcggggctg cgggaacctcggcagcgccccggggccggtcgcgcttgggccccgccgcgggaccgctgg caaaacggagaggagaagacaacacgcacaacaaagaaacttaagcccaccgaggcggga gctgcggtccgaggaccgtcggcggatttggggacaaaggagccgtggggctgggaacat tataaacatcactggttcccagaaaagccatgcaagggatcgggttaccgttgtattcgc atcaaccataaaatggatcctctgattggacaggcagcacagcggattggactgagcagt caggagctgttcaggcttctcccaagtgaactcacactctgggttgacccctatgaagtg tcctacagaattggagaggatggctccatctgtgtgctgtatgaagcctcaccagcagga ggtagcactcaaaacagcaccaacgtgcaaatggtagacagccgaatcagctgtaaggag gaacttctcttgggcagaacgagcccttccaaaaactacaatatgatgactgtatcagtt acccgaattactgaaaaaaatattgcaaccactaagtgtccagctttccagcagaatagg aatctattatttatcctgtcctttgctgtgtcctctaaattcctggatgaaattcacaag gagcttgtagagtctacttcgctgtttgatgagagttggtga >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_6|140_aa MMNTTGKMRGTPYVIFRPFRKHEVVPLAMYMQICKKGDIIDIKGVGTVQKGMPHKCYHGK AERVYLVPQHAAGIVVNKQLKGKIPAKRINENDQKKKEAKENGTWVQLKRQPPPPRKADF VRTNGKKPELLDPLPYEFMA >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_6|423_bp atgatgaacacaacgggaaagatgagaggcaccccatatgtgatttttaggccttttaga aaacatgaagttgttcctttggccatgtacatgcaaatctgtaagaaaggtgatatcata gacatcaaaggagtgggtactgttcaaaaaggaatgccccacaagtgttaccatgggaaa gctgaaagagtctaccttgttccccagcatgctgctggcattgttgtaaacaaacaactt aagggcaagattcctgccaagagaattaatgaaaatgatcagaaaaagaaagaagccaaa gagaatggtacctgggttcaactgaagcgccagcctcctccacccagaaaagcagacttt gtgagaaccaatgggaaaaagcctgagctgctagaccctcttccttatgaattcatggca tag >gi568815586r:92044083_92245535|GENSCAN_predicted_peptide_7|313_aa MLEQFYICTAQYGNHLATELILYGLLPCPLASGWVQPVKGSSGNLKGAVLLCLVDKFALA PSGNLLKMQILEYYPRLNDPETQQRASDKPFSISGSDGWKENIIHLSLIKATVDALSEYL SCLTPKKLAAIIQAVIQGSSLLHPHGCASSMLASKAAERKEQSTNLPLDENRLDPARNTD PHQEQSFDFLLNDTCCITWDSFPSDPNVRSRLRFTGPSAIVSEAEHRIFEDEKCQSEPVG GYTTEKTSVRGHSLGMRRENKFAALLQSSRITVAVQTMRGIRKGKAGLWQTTAHFIGRLE EAVSDLRRAHGLV >gi568815586r:92044083_92245535|GENSCAN_predicted_CDS_7|942_bp atgctggaacagttctatatctgtactgcccaatatggtaaccacttagccacggagctg atactctatggactgcttccttgccccctggcttctggttgggtccagccagtgaagggt tctagtgggaacttgaaaggtgctgtgttactgtgcctagtggataagtttgccctagca ccatctgggaacttattgaaaatgcaaattcttgagtactacccaagacttaatgatcca gaaactcagcaacgtgcatctgacaagcccttcagcatttctggatctgatggctggaaa gaaaatattatccatctctccttaattaaggcaacagtagatgctttatcagaatatcta agttgtttaacgccaaagaagcttgctgctatcatccaagcagtgattcagggatccagt ctcctacatcctcatggttgtgcatcttcaatgttagcatctaaagctgcagaaagaaaa gagcagagcacgaatttgccgttggatgaaaacagacttgacccagcaaggaacacagat ccacaccaggagcaaagttttgactttcttctgaacgatacctgctgtatcacttgggac tcttttcccagtgatcctaatgtgaggtcaagattgagatttactggcccatctgccatt gtcagtgaggcagaacatcgcatatttgaagatgaaaagtgccaatcagaaccagttggt ggatacactactgaaaaaaccagcgttagaggccattctctgggaatgagaagagaaaat aagtttgcagcccttttgcaatcatctaggatcacagttgcagtccaaacgatgaggggc atccgaaagggaaaagctggcctatggcagaccacagcacattttataggcaggcttgag gaggcagtatcggatttacgtagggcccatggattggtttga