GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:29:16 Sequence gi568815592r:117463257_117702288 : 239032 bp : 39.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 19526 19637 112 1 1 88 92 280 0.992 26.82 1.02 Intr + 40511 40723 213 0 0 126 95 137 0.845 15.96 1.03 Intr + 43350 43457 108 1 0 64 39 82 0.009 0.24 1.04 Intr + 56560 56694 135 2 0 78 75 126 0.061 9.92 1.05 Intr + 66942 67045 104 1 2 71 23 111 0.120 1.87 1.06 Intr + 69004 69137 134 0 2 79 83 160 0.175 13.12 1.07 Intr + 71772 71934 163 0 1 57 33 79 0.449 -1.64 1.08 Intr + 73311 73371 61 2 1 107 63 65 0.682 3.29 1.09 Intr + 75364 75579 216 2 0 94 61 167 0.974 12.15 1.10 Intr + 75999 76123 125 1 2 33 116 91 0.998 5.78 1.11 Intr + 77412 77559 148 2 1 62 97 142 0.983 11.39 1.12 Intr + 77662 77769 108 2 0 96 80 75 0.968 6.84 1.13 Intr + 79868 79955 88 0 1 116 25 122 0.974 6.81 1.14 Intr + 81272 81321 50 2 2 77 77 90 0.976 4.21 1.15 Intr + 81921 82112 192 1 0 95 -7 149 0.819 4.64 1.16 Intr + 82222 82341 120 2 0 53 111 100 0.991 8.35 1.17 Term + 84651 85183 533 1 2 81 47 757 0.864 64.32 1.18 PlyA + 85514 85519 6 1.05 2.10 PlyA - 85792 85787 6 1.05 2.09 Term - 100128 99998 131 1 2 104 32 122 0.989 5.56 2.08 Intr - 103778 103598 181 2 1 105 93 260 0.994 26.72 2.07 Intr - 106480 106316 165 1 0 72 99 169 0.999 15.64 2.06 Intr - 110376 110211 166 2 1 16 50 139 0.859 1.94 2.05 Intr - 112096 111921 176 1 2 37 93 96 0.488 2.82 2.04 Intr - 115808 115644 165 2 0 61 111 165 0.777 15.34 2.03 Intr - 134161 134010 152 2 2 50 59 111 0.028 3.36 2.02 Intr - 136915 136840 76 1 1 77 56 53 0.481 -0.83 2.01 Init - 139032 138748 285 0 0 98 83 146 0.539 11.22 2.00 Prom - 147552 147513 40 -7.05 3.00 Prom + 148904 148943 40 -5.05 3.01 Init + 157500 157561 62 2 2 54 94 29 0.161 0.87 3.02 Intr + 171144 171350 207 0 0 69 113 173 0.877 15.17 3.03 Intr + 171593 171774 182 2 2 66 73 155 0.498 10.39 3.04 Intr + 171776 171903 128 0 2 -7 32 130 0.287 -2.62 3.05 Intr + 178800 178900 101 2 2 75 21 63 0.496 -3.71 3.06 Term + 181383 181809 427 0 1 127 38 304 0.880 23.09 3.07 PlyA + 184240 184245 6 1.05 4.00 Prom + 187950 187989 40 -3.65 4.01 Init + 198786 198849 64 2 1 56 115 89 0.155 9.76 4.02 Term + 206233 206531 299 2 2 -11 49 312 0.003 11.84 4.03 PlyA + 207830 207835 6 -0.45 5.00 Prom + 211208 211247 40 -5.85 5.01 Init + 212415 212829 415 2 1 45 100 406 0.946 32.18 5.02 Intr + 225386 225463 78 0 0 25 75 112 0.666 2.20 5.03 Intr + 228477 228521 45 1 0 42 121 66 0.831 2.76 5.04 Intr + 229786 229911 126 2 0 81 121 93 0.968 11.63 5.05 Intr + 230775 230924 150 1 0 89 83 147 0.980 13.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 201570 201498 73 0 1 72 60 79 0.905 4.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:117463257_117702288|GENSCAN_predicted_peptide_1|869_aa MVPGARGGGALARAAGRGLLALLLAVSAPLRLQAEELGDGCGHLVTYQDSGTMTSKNYPG TYPNHTVCEKTITVPKGKRLILRLGDLDIESQTCASDYLLFTSSSDQYGMQKEEETEVLC LSVAGAQRVDIPVQLLPSFLEGWKGPYCGSMTVPKELLLNTSEVTVRFESGSHISGRGFL LTYASSDHPESQGDRPSEKTLDQQSRTFLATGTTFVKDSFSTDGTSLLCKAAIHAGIIAD ELGGQISVLQRKGISRYEGILANGVLSREFEIFREQLFSSVLFYSWGNTVHAVIELMFPH MIVWHSGKTREGSIAAEEEGVPKLYLVIQKQELVQDLVLVATVGCSRSLSFEPDGQIRAS SSWQSVNESGDQVHWSPGQARLQDQGPSWASGDSSNNHKPREWLEIDLGEKKKITGIRTT GSTQSNFNFYVKSFVMNFKNNNSKWKTYKGIVNNEEKVFQGNSNFRDPVQNNFIPPIVAR YVRVVPQTWHQRIALKVELIGCQITQGNDSLVWRKTSQSTSVSTKKEDETITRPIPSEET STGINITTVAIPLVLLVVLVFAGMGIFAAFRKKKKKGSPYGSAEAQKTDAMPVQIVGDHT QMISQRENLGPDEGKIPFKGTAESMVRVVFAVVVNDLGMLFLAHTPEEDIDHYCWKQIKY PFARHQSAEFTISYDNEKEMTQKLDLITSDMADYQQPLMIGTGTVTRKGSTFRPMDTDAE EAGVSTDAGGHYDCPQRAGRHEYALPLAPPEPEYATPIVERHVLRAHTFSAQSGYRVPGP QPGHKHSLSSGGFSPVAGVGAQDGDYQRPHSAQPADRGYDRPKAVSALATESGHPDSQKP PTHPGTSDSYSAPRDCLTPLNQTAMTALL >gi568815592r:117463257_117702288|GENSCAN_predicted_CDS_1|2610_bp atggtgcccggcgcccgcggcggcggcgcactggcgcgggctgccgggcggggcctcctg gctttgctgctcgcggtctccgccccgctccggctgcaggcggaggagctgggtgatggc tgtggacacctagtgacttatcaggatagtggcacaatgacatctaagaattatcccggg acctaccccaatcacactgtttgcgaaaagacaattacagtaccaaaggggaaaagactg attctgaggttgggagatttggatatcgaatcccagacctgtgcttctgactatcttctc ttcaccagctcttcagatcaatatggaatgcagaaggaggaggagacagaagtgctttgt ctttcagtggctggcgctcagagagtggacattcctgtgcagctgttgcccagcttcctg gaagggtggaagggtccatactgtggaagtatgactgttcccaaagaactcttgttgaac acaagtgaagtaaccgtccgctttgagagtggatcccacatttctggccggggttttttg ctgacctatgcgagcagcgaccatccagaatcacaaggtgacagaccttcagaaaagaca ctagaccagcagtcccgaacctttttggcaacagggaccacttttgtgaaagacagtttt tccacagacgggacctctttattgtgcaaagctgccatccatgcaggaataattgctgat gaactaggtggccagatcagtgtgcttcagcgcaaagggatcagtcgatatgaagggatt ctggccaatggtgttctttcgagggaatttgaaatcttcagggagcagttgttttcatct gtgttgttttactcctggggaaacactgtacatgcggtcattgaacttatgttcccacac atgattgtttggcattcaggaaaaaccagagaaggaagtattgcagctgaggaagaagga gtccctaaactgtaccttgtcatccagaagcaggagctggttcaagacttggtgctggtt gctacagtaggttgcagcagatccttgagttttgaacctgacgggcaaatcagagcttct tcctcatggcagtcggtcaatgagagtggagaccaagttcactggtctcctggccaagcc cgacttcaggaccaaggcccatcatgggcttcgggcgacagtagcaacaaccacaaacca cgagagtggctggagatcgatttgggggagaaaaagaaaataacaggaattaggaccaca ggatctacacagtcgaacttcaacttttatgttaagagttttgtgatgaacttcaaaaac aataattctaagtggaagacctataaaggaattgtgaataatgaagaaaaggtgtttcag ggtaactctaactttcgggacccagtgcaaaacaatttcatccctcccatcgtggccaga tatgtgcgggttgtcccccagacatggcaccagaggatagccttgaaggtggagctcatt ggttgccagattacacaaggtaatgattcattggtgtggcgcaagacaagtcaaagcacc agtgtttcaactaagaaagaagatgagacaatcacaaggcccatcccctcggaagaaaca tccacaggaataaacattacaacggtggctattccattggtgctccttgttgtcctggtg tttgctggaatggggatctttgcagcctttagaaagaagaagaagaaaggaagtccgtat ggatcagcagaggctcagaaaacagatgccatgccagtgcagattgtcggagaccatacc cagatgatctcacaaagggagaatctgggacctgatgagggcaaaataccttttaaaggc acagcggaaagcatggttagagtagtgtttgctgttgtggttaatgaccttggcatgctg ttcttagcacacacacctgaggaggacattgatcactactgttggaagcagattaaatat ccctttgccagacatcagtcagctgagtttaccatcagctatgataatgagaaggagatg acacaaaagttagatctcatcacaagtgatatggcagattaccagcagcccctcatgatt ggcaccgggacagtcacgaggaagggctccaccttccggcccatggacacggatgccgag gaggcaggggtgagcaccgatgccggcggccactatgactgcccgcagcgggccggccgc cacgagtacgcgctgcccctggcgcccccggagcccgagtacgccacgcccatcgtggag cggcacgtgctgcgcgcccacacgttctctgcgcagagcggctaccgcgtcccagggccc cagcccggccacaaacactccctctcctcgggcggcttctcccccgtagcgggtgtgggc gcccaggacggagactatcaaaggccacacagcgcacagcctgcggacaggggctacgac cggcccaaagctgtcagcgccctcgccaccgaaagcgggcaccctgactctcagaagccc ccaacgcatcccgggacgagtgacagctattctgcccccagagactgcctcacacccctc aaccagacggccatgactgcccttttgtga >gi568815592r:117463257_117702288|GENSCAN_predicted_peptide_2|498_aa MSAGGPCPAAAGGGPGGASCSVGAPGGVSMFRWLEVLEKEFDKAFVDVDLLLGEIDPDQA DITYEGRQKMTSLSSCFAQLCHKAQSVSQINHKLEIPCQQEGSRLLEIWPLDLGLLCLHN LLEVLARAIRQEKEIKDIQMVKEEVRLLLFADDMIVYLQSPKDSSIKLLELAQLVDLKSE LTETQAEKVVLEKEVHDQLLQLHSIQLQLHAKTGQSADSGTIKAKLERELEANKKEKMKE AQLEAEVKLLRKENEALRRHIAVLQAEVYGARLAAKYLDKELAGRVQQIQLLGRDMKGPA HDKLWNQLEAEIHLHRHKTVIRACRGRNDLKRPMQAPPGHGGKEHGVPILISEIHPGQPA DRCGGLHVGDAILAVNGVNLRDTKHKEAVTILSQQRGEIEFEVVYVAPEVDSDDENVEYE DESGHRYRLYLDELEGGGNPGASCKDTSGEIKVLQGFNKKAVTDTHENGDLGTASETPLD DGASKLDDLHTLYHKKSY >gi568815592r:117463257_117702288|GENSCAN_predicted_CDS_2|1497_bp atgtcggcgggcggtccatgcccagcagcagccggagggggcccagggggcgcctcctgc tccgtgggggcccctggcggggtatccatgttccggtggctggaggtgctggagaaggag ttcgacaaagcttttgtggatgtggatctgctcctgggagagatcgatccagaccaagcg gacatcacttatgaggggcgacagaagatgaccagcctgagctcctgctttgcacagctt tgccacaaagcccagtctgtgtctcaaatcaaccacaagctggagatcccctgccagcaa gaaggctctcgacttctcgagatctggcccctcgaccttggacttctctgtctccataac ttactagaagtcctagccagagcaatcagacaagagaaagaaataaaggacatccaaatg gttaaagaggaagtcagactgttgctgtttgctgatgacatgatagtatacctacaaagc cctaaggattcatccataaagctcctagaactggcacagttggtggatctgaaatctgaa ctgacagaaacccaagcagagaaagttgttttggagaaagaagtacatgatcagctttta cagctgcactctattcagctgcagcttcatgctaaaactggtcaaagtgctgactctggt accattaaggcaaaattggaaagagagcttgaggcaaacaaaaaagaaaaaatgaaagaa gcacaacttgaagctgaagtgaaattgttgagaaaagagaatgaagcccttcgtagacat atagctgttctccaggctgaagtatatggggcgagactagctgccaagtacttggataag gaactggcaggaagggtccaacagatacaattgctaggacgagatatgaagggacctgct catgataagctttggaaccaattagaagctgaaatacatttgcatcgtcacaaaactgtg atccgagcctgcagaggacgtaatgacttgaaacgaccaatgcaagcaccaccaggccat ggtgggaaagaacatggtgttccaatcctcatctctgagatccatccggggcaacctgct gatagatgcggagggctgcacgttggggatgctattttggcagtcaacggagttaaccta agggacacaaagcataaagaagctgtaactattctttctcagcagagaggagagattgaa tttgaagtagtttatgtggctcctgaagtggattctgatgatgaaaacgtagagtatgaa gatgagagtggacatcgttaccgtttgtaccttgatgagttagaaggaggtggtaaccct ggtgctagttgcaaagacacaagtggggaaatcaaagtattacaaggatttaataagaag gcagtaactgacacacatgaaaatggagacctgggcactgcaagtgaaactccgctagat gacggtgcttcaaaattagatgatctgcacactctgtatcataaaaaatcttattaa >gi568815592r:117463257_117702288|GENSCAN_predicted_peptide_3|368_aa MPNKFKAQNQIAAEFKSTLFRLPVKLQKLDCSNNLIQRVTAQDFQDLQDLKHLILDNNNA SFFEAGALQRCSQLSNLALEQNLLLSIPLSEFLVSLASRIKPQTLTVSVTALKDGASGVC SFRCSDVSGVSSFLWVRGLADFKSEAADLRMSVTAHKSSVDPKSEQQQDLLQAKEQSFHS KERDATGLLLLAKTQNQSFLQEALVLSSKDWNSAAKICVLGVLMSLGLPGTLTRLDLKSN VIQNIAEREIKDLKQLHVLNLRNNKISALDLKALEGLPHLRHLYLDGNPWNCTFSLLKAR EVLMAKGTDVRGGQCAAPTEQHGESWMSSKEIMRQCKHHFHLTEKSKETKKKSKPEDPSS IRINMDDG >gi568815592r:117463257_117702288|GENSCAN_predicted_CDS_3|1107_bp atgccaaataagtttaaagcacaaaatcaaatagctgcagaatttaaatccacgttgttc cgactaccagtaaagctacaaaaacttgattgtagcaataatctgattcagagagtaaca gcacaagacttccaggacctccaagacttgaaacatttgattctggacaacaacaatgcg agtttcttcgaagctggagccctgcagaggtgttctcagctctccaacctggcgctggag cagaatctgctgttgtccatacccctaagtgagttcttggtctcgctggcttcaagaata aagccgcagaccctcacagtgagcgttacagctcttaaagatggtgcgtctggagtttgt tccttcagatgttcagatgtgtccggagtttcttccttcttgtgggttcgtggtcttgct gacttcaagagtgaagctgcagaccttcgcatgagtgttacagctcataaaagtagcgtg gacccaaagagtgagcagcagcaagatttactgcaagcgaaagaacaaagcttccacagc aaggaaagagacgcaaccgggttgctgctgctggctaagacacagaatcagtcatttctc caagaagctctggttctttccagtaaagattggaattcagcagctaagatctgtgtgctt ggtgtgctcatgtcattggggctcccagggaccttaaccaggctggacctaaaaagcaat gtcatccagaatattgctgaacgggagatcaaggacctcaagcagcttcatgttctaaac ctgaggaacaataagatctctgccttagacctaaaagccttagagggtctgcctcacctc aggcacctgtacctggatggaaatccctggaattgcaccttcagtctcttaaaagcgaga gaagtcctgatggccaagggcacagatgtaaggggaggacaatgtgcagcaccaactgaa cagcatggggagagctggatgtcttccaaggagatcatgaggcaatgtaagcatcacttt catctgaccgagaaaagtaaagagaccaaaaagaaatcaaaacctgaagacccctccagc atcagaatcaacatggatgatggatga >gi568815592r:117463257_117702288|GENSCAN_predicted_peptide_4|120_aa MGKSLDEVMRAVPEALRQHSPDLWNYELGKHDLGYLVEEISKWQSVQEEAEPESSKRLQP DNAVEKKNPFSGEKFKLAAEICISNEKPNANDPDKVSGWKCLQGMSENFTAAPPITGQEA >gi568815592r:117463257_117702288|GENSCAN_predicted_CDS_4|363_bp atgggaaagagtttggatgaagtaatgagggctgtccctgaagccttgcggcagcacagc ccagatctgtggaactatgaacttgggaaacacgatttagggtatctggtagaagaaatt tctaagtggcaaagtgttcaagaggaagctgagcctgaaagttcgaaacgtttgcagccc gacaatgcagtagaaaagaaaaacccattttctggggagaaattcaagctggctgcagaa atttgcataagtaacgagaagccaaatgctaatgacccagacaaggtgtcggggtggaag tgtctccagggcatgtcagagaacttcacagcagcccctcccatcacaggccaggaggct tag >gi568815592r:117463257_117702288|GENSCAN_predicted_peptide_5|272_aa MTGLYELVWRVLHALLCLHRTLTSWLRVRFGTWNWIWRRCCRAASAAVLAPLGFTLRKPP AVGRNRRHHRHPRGGSCLAAAHHRMRWRADGRSLEKLPVHMGLVITEVEQEPSFSDIASL VVWCMAVGISYISVYDHQVFRELALLDKPPRFGFVHSRMPTELQEYSPEFANSNDKDDQG IFKRNNSRLMDEILKQQQELLGLDCSKYSPEFANSNDKDDQVLNCHLAVKVLSPEDGKAD IVRAAQDFCQLVAQKQKRPTDLDVDTLASLLX >gi568815592r:117463257_117702288|GENSCAN_predicted_CDS_5|816_bp atgacggggctgtacgagctggtgtggcgggtgctgcacgcgctgctctgtctgcaccgc acgctcacctcctggctccgcgttcggttcggcacctggaactggatctggcggcgctgc tgccgcgccgcctctgccgcggtcctagcgccgctcggcttcacgctccgcaagcccccg gcagtcggcaggaaccgccgtcaccaccggcacccgcgcggggggtcgtgcctggcagcc gcacaccaccggatgcgctggcgcgcggacggtcgttccttggagaagctgcctgtgcat atgggcctggtgatcaccgaggtggagcaggaacccagcttctcggacatcgcgagcctc gtggtgtggtgtatggccgtgggcatctcctacattagcgtctacgaccaccaagtgttc cgagagctagctttattagataagccgccaaggtttggctttgttcactctcggatgcct acagaacttcaagaatactcaccagaatttgcaaatagtaatgacaaagatgatcaaggt attttcaaaagaaataattccagattgatggatgaaattttaaaacaacagcaagaactt ctgggcctagattgttcaaaatactcaccagaatttgcaaatagtaatgacaaagatgat caagttttaaattgccatttggcagtgaaggtgctgtctccggaagatggaaaagcagat attgtaagagctgctcaggacttttgccagttagtagcccagaagcaaaagagacccaca gatttggatgtagatacgttagccagtttacttann