GENSCAN 1.0	Date run: 4-Nov-116	Time: 03:13:01

Sequence gi568815586r:110392008_110596204 : 204197 bp : 44.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.23 Intr -   3213   3094  120  0  0   46  106    52 0.854   3.49
 1.22 Intr -   4423   4259  165  1  0   73   91    84 0.539   7.36
 1.21 Intr -   9451   9346  106  0  1   57   81    44 0.509   0.82
 1.20 Intr -  11813  11520  294  1  0   29   86   234 0.585  13.42
 1.19 Intr -  13601  13469  133  0  1   84   74    40 0.625   1.90
 1.18 Intr -  13964  13854  111  0  0   79   91    34 0.677   3.15
 1.17 Intr -  16528  16447   82  1  1   75  115    66 0.944   7.21
 1.16 Intr -  44197  44103   95  2  2   60  116    30 0.906   2.68
 1.15 Intr -  44676  44550  127  0  1   77   95    41 0.980   3.95
 1.14 Intr -  45145  45077   69  1  0  120   86   -20 0.562   0.38
 1.13 Intr -  48381  48305   77  1  2   77   94   140 0.995  12.73
 1.12 Intr -  53544  53445  100  0  1   77   89    21 0.518   0.78
 1.11 Intr -  61864  61736  129  1  0   38   86   139 0.995   9.59
 1.10 Intr -  63675  63579   97  2  1   70  107    61 0.997   6.21
 1.09 Intr -  63923  63808  116  2  2   96   81   104 0.991   9.75
 1.08 Intr -  65627  65503  125  0  2   47   47    74 0.890  -0.50
 1.07 Intr -  67855  67688  168  2  0   76   68   210 0.929  17.72
 1.06 Intr -  73207  73099  109  1  1   83  119   163 0.619  18.76
 1.05 Intr -  76980  76720  261  0  0   58   52   225 0.455  13.58
 1.04 Intr - 100115 100028   88  1  1  104    9    66 0.008   0.27
 1.03 Intr - 101224 100989  236  1  2   60   92   121 0.911   6.19
 1.02 Intr - 104196 104005  192  0  0   23   49   151 0.135   4.39
 1.01 Init - 110341 110282   60  1  0   81   49    86 0.059   5.48
 1.00 Prom - 112656 112617   40                              -2.06

 2.00 Prom + 113285 113324   40                              -3.86
 2.01 Init + 126209 126216    8  1  2   40  103     0 0.317  -2.87
 2.02 Intr + 127787 127909  123  2  0   69  116    36 0.793   4.20
 2.03 Intr + 130170 130404  235  0  1   73   93    67 0.825   3.39
 2.04 Term + 135365 135514  150  1  0  103   53    77 0.759   3.61
 2.05 PlyA + 135636 135641    6                               1.05

 3.07 PlyA - 138980 138975    6                               1.05
 3.06 Term - 145088 145030   59  0  2   97   37    61 0.700  -0.25
 3.05 Intr - 146266 146137  130  1  1   72   99   114 0.997  11.17
 3.04 Intr - 147938 147815  124  1  1  106   86   119 0.779  14.09
 3.03 Intr - 154071 153873  199  1  1   59   89   177 0.997  13.31
 3.02 Intr - 159961 159782  180  2  0  101  123    22 0.880   6.84
 3.01 Init - 191024 190802  223  2  1   68  109   563 0.892  53.32
 3.00 Prom - 201471 201432   40                              -2.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  36834  36990  157  2  1   80   53   130 0.805   8.85
S.002 Term -  43210  43148   63  1  0   72   49    42 0.834  -3.41
S.003 Init -  68373  68346   28  0  1   34   89     1 0.855  -5.44
S.004 Sngl +  68981  69229  249  1  0   75   41   356 0.997  24.68
S.005 Term -  72532  72522   11  2  2  108   40    12 0.925  -3.24
S.006 Intr +  94318  94419  102  0  0   95   32   101 0.836   5.57
S.007 Term +  94527  94697  171  2  0   58   54    93 0.825   0.73
S.008 Term - 100115  99998  118  1  1  104   42    71 0.966   2.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:110392008_110596204|GENSCAN_predicted_peptide_1|1020_aa
MAAIRPKKQARKARLSSTAQLVLVLGDLHIPHRCNSLPAKFKKLLVPGKIQHILCTGNLC
TKESYDYLKTLAGDVHIVRGDFDENLNYPEQKVVTVGQFKIGLIHGHQVIPWGDMASLAL
LQRQFDVDILISGHTHKFEAFEHENKFYINPGSATGAYNALETNIIPSFVLMDIQASTVV
TYVYQLIGDDVKRAALQKSYAPSSRTPDPGRASPRRRDREPCAGAAVPASLPRQHACVRL
LVFRRGGSDAGFSPLRSRSDTARTASGERGGQIRDPGVTSTYCATMVQHCEALNRSVQVV
NLDPAAEHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGH
VEDDYILFDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILA
ALSAMISLEIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKA
ICGLIDDYSMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKAYHSSLMDPDTKL
IGNMALLPIRSQFKGPAPRETKDTDIVDEAIYYFKANVFFKNYEIKNEADRTLIYITLYI
SECLKKLQKCNSKSQGEKEMYTLGITNFPIPGEPGFPLNAIYAKPANKQEDEVMRAYLQQ
LRQETGLRLCEKVFDPQNDKPSKWLKTDTARSPRKPTGPSQTLWVTLTVEGKVEWTNGLL
KTHLTKLSLQLKKDSVKDRAPKLTNQAGSHTPPPIPLEAALKNIAHYLSIPPPKIFAAAT
LYDYFVLFFLLTTTTGAGTRVSFYSGTSVVTSCLGVVPRVDPMDPGDAAILESSLRILYR
LFESVLPPLPAALQSRMNVIDHVRDMAAAGLHSNVRLLSSLLLTMSNNNPTLGYSHFSTL
RITHQLKAQWLCAISTHMLIECVEEKYQLLVYHADSLFHDKEYRNAVSKYTMALQQKKAL
SKTSKVRPSTGNSASTPQSQCLPSEIEVKYKMAECYTMLKQDKDAIAILDGIPSRQRTPK

>gi568815586r:110392008_110596204|GENSCAN_predicted_CDS_1|3060_bp
atggctgccatccgccctaaaaagcaggctcggaaggcgcgcctcagctcgacggcccag
ttggtgttggtattaggagatctgcacatcccacaccggtgcaacagtttgccagctaaa
ttcaaaaaactcctggtgccaggaaaaattcagcacattctctgcacaggaaacctttgc
accaaagagagttatgactatctcaagactctggctggtgatgttcatattgtgagagga
gacttcgatgagaatctgaattatccagaacagaaagttgtgactgttggacagttcaaa
attggtctgatccatggacatcaagttattccatggggagatatggccagcttagccctg
ttgcagaggcaatttgatgtggacattcttatctcgggacacacacacaaatttgaagca
tttgagcatgaaaataaattctacattaatccaggttctgccactggggcatataatgcc
ttggaaacaaacattattccatcatttgtgttgatggatatccaggcttctacagtggtc
acctatgtgtatcagctaattggagatgatgtgaaacgggcggctctgcagaagagctac
gctccgtccagtcggaccccggaccctggccgggcatctccgcggcgccgagaccgcgag
ccgtgtgcgggagcagctgtcccagcatccctcccccgacagcacgcgtgcgtcaggctg
ctggtattccggagaggcggctcggatgctggcttctcgcccctccgaagccgttcggac
actgcccgaacagcttcgggagagcggggcgggcagatccgcgaccccggagttacgagc
acctactgtgccaccatggtccagcactgtgaagccctcaaccggtctgtccaagttgta
aacctggatccagcagcagaacacttcaactactccgtgatggctgacatccgggaactg
atcgaggtggatgatgtaatggaggatgattctctgcgattcggtcccaacggaggattg
gtattttgcatggagtactttgccaataattttgactggctggagaactgtcttggccat
gtagaggacgactatatcctttttgattgtccaggtcagattgagttgtacactcacctg
cctgtgatgaaacagctggtccagcagctcgagcagtgggagttccgagtctgtggagtt
tttcttgttgattctcagttcatggtggagtcattcaagtttatttctggcatcttggca
gccctgagtgccatgatctctctagaaattccgcaagtcaacatcatgacaaaaatggat
ctgctgagtaaaaaagcaaaaaaggaaattgagaaatttttagatccagacatgtattct
ttattagaagattctacaagtgacttaagaagcaaaaaattcaagaaactgactaaagct
atatgtggactgattgatgactacagcatggttcgatttttaccttacgatcagtcagat
gaagaaagcatgaacattgtattgcagcatattgattttgccattcaatatggagaagac
ctagaatttaaagaaccaaaggcttaccactcttctctcatggatcctgataccaaactc
atcggaaacatggcactgttgcctatcagaagtcaattcaaaggacctgcccccagagag
acaaaagatacagatattgtggatgaagccatctattacttcaaggccaatgtcttcttc
aaaaactatgaaattaagaatgaagctgataggaccttgatatatataactctctacatt
tctgaatgtctgaagaaactgcaaaagtgcaattccaaaagccaaggtgagaaagaaatg
tatacgctgggaatcactaattttcccattcctggagagcctggttttccacttaacgca
atttatgccaaacctgcaaacaaacaggaagatgaagtgatgagagcctatttacaacag
ctaaggcaagagactggactgagactttgtgagaaagttttcgaccctcagaatgataaa
cccagcaagtggctgaagactgacactgcccgatcacctcggaagcctacaggaccatca
cagacgctttgggtaactcttacagtggaaggaaaggtagaatggactaatggtctttta
aaaacacacctcaccaagctcagcctccaacttaaaaaggactctgtcaaggatagagcc
ccaaaactcaccaaccaagcaggttcccacacaccacccccaatcccacttgaagcagcc
ctgaaaaacatcgcccattatctctccataccacccccaaaaattttcgctgccgcaaca
ctttacgactatttcgttttattcttcttgttaacgaccacgacaggagcagggactcga
gtttctttttattcgggcacttcagtagttacctcttgcttgggggtggtgccgcgtgtg
gacccgatggaccccggcgacgccgccattttggagtcttccctaaggatcctctaccgg
cttttcgagtcagtgctgccgccgctgcccgcggctttgcagagcaggatgaatgtgata
gaccacgtgcgggacatggcggccgcggggctgcactccaacgtgcggctcctcagcagc
ttgttacttacaatgagtaataacaacccgactctcgggtactcccacttttctaccctt
cgcatcacccaccaattgaaagcgcagtggctgtgcgcaataagcactcacatgcttatt
gagtgtgttgaggaaaagtaccagcttttggtgtatcatgcagattctctctttcatgat
aaggaatatcggaatgctgtgagtaagtataccatggctttacagcagaagaaagcgcta
agtaaaacttcaaaagtgagaccttcaactggaaattctgcatctactccacaaagtcag
tgtcttccatctgaaattgaagtgaaatacaaaatggctgaatgttatacaatgctaaaa
caagataaagatgccattgctatacttgatgggatcccttcaagacaaagaactcccaaa

>gi568815586r:110392008_110596204|GENSCAN_predicted_peptide_2|171_aa
MCIPLALSIDDMLVEANFILATLADEQSRASSPQSLCLSQKRKRSDLIEKKAGKNVTGQA
LECISKKAAPRRLYPKETLTNISALENCGSPAMKRVDGDVSEVSESSVSNTEEVPGSLCL
RKCDITSSEVHGLSTWTSGNHFPACHSQYHGYLESRFNSCKASLAEGIFTF

>gi568815586r:110392008_110596204|GENSCAN_predicted_CDS_2|516_bp
atgtgcatacctctggctttgagtattgatgatatgttagtggaagctaactttattttg
gccacattagctgatgaacaaagtagagcatcttcaccacagtcactgtgtctttcacag
aaacgaaaaaggtcagatctgattgaaaaaaaggctggcaaaaatgtaactggccaggcc
ctggaatgtatttcaaaaaaagcagcaccaagaaggctttatcctaaggagactctcaca
aacatatctgcattggaaaactgtggcagccctgcaatgaaaagagtggatggagatgtc
agtgaagtatcagaaagcagtgtcagcaacacagaggaagtgccagggtctctgtgtctc
agaaagtgtgacataacatcttcagaggttcatgggcttagtacatggacctctgggaac
cattttcctgcctgccacagccagtatcacggctatctggagagcaggttcaacagctgc
aaggcaagccttgcagagggcatcttcaccttctga

>gi568815586r:110392008_110596204|GENSCAN_predicted_peptide_3|304_aa
MFSVLSYGRLVARAVLGGLSQTDPRAGGGGGGDYGLVTAGCGFGKDFRKGLLKKGACYGD
DACFVARHRSADVLGVADGVGGWRDYGVDPSQFSGTLMRTCERLVKEGRFVPSNPIGILT
TSYCELLQNKVPLLGSSTACIVVLDRTSHRLHTANLGDSGFLVVRGGEVVHRSDEQQHYF
NTPFQLSIAPPEAEGVVLSDSPDAADSTSFDVQLGDIILTATDGLFDNMPDYMILQELKK
LKNSNYESIQQTARSIAEQAHELAYDPNYMSPFAQFACDNGLNVRGGKPDDITVLLSIVA
EYTD

>gi568815586r:110392008_110596204|GENSCAN_predicted_CDS_3|915_bp
atgttctcggtcctctcgtacgggcggctggtggcccgcgccgtgctcggcggcctctcg
cagaccgaccccagggccggcggcggcggcggcggcgactacggactggtgacggccggc
tgcggcttcgggaaggacttccgtaagggcctcctcaagaagggcgcgtgctacggggac
gacgcgtgcttcgtggcccggcaccgttccgcggacgtgctcggggttgcagatggtgta
ggaggctggagagactatggagttgatccatctcaattctcagggactttaatgcggacg
tgtgaacgtttagtaaaagaaggacggttcgtacctagtaatcccattggaattctcacc
acaagctactgtgagttgctgcaaaataaagtccctttgctcggtagcagcaccgcctgc
attgtggtgctggacagaaccagccaccgcttacacacagcaaacctgggcgattcaggc
ttcctggttgtcaggggtggtgaagtcgtgcaccgatcagatgagcagcagcattacttc
aacactccattccagctctcaatcgctccccctgaagccgagggagtcgtcttgagcgac
agtccggatgctgctgatagcacgtctttcgatgtccagctaggagacattatcctgacg
gcaacagatggactctttgacaacatgcctgattatatgattcttcaggagctaaaaaag
ttaaagaattcaaattatgagagtatacaacagactgccagaagcattgctgagcaagct
catgagctggcctatgacccaaattatatgtcaccttttgcacagtttgcatgtgacaat
ggattgaatgtgagaggtggaaagccagatgacatcaccgtccttctttcaatagtggct
gagtatacagactag