GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:29:31 Sequence gi568815597f:116284013_116503976 : 219964 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 13966 14514 549 0 0 75 42 188 0.601 7.01 1.02 PlyA + 15968 15973 6 1.05 2.00 Prom + 16731 16770 40 -4.36 2.01 Init + 25410 25468 59 2 2 66 92 32 0.450 2.19 2.02 Intr + 33312 33402 91 0 1 59 75 68 0.216 2.60 2.03 Intr + 42765 42849 85 2 1 62 69 67 0.089 1.59 2.04 Intr + 51431 51475 45 0 0 65 100 50 0.072 2.28 2.05 Intr + 75499 75622 124 2 1 40 19 94 0.018 -2.66 2.06 Intr + 79711 79785 75 1 0 78 64 111 0.111 6.33 2.07 Intr + 89463 89511 49 0 1 85 84 42 0.000 2.18 2.08 Intr + 100002 100112 111 2 0 120 115 147 0.994 21.18 2.09 Intr + 100771 100830 60 0 0 118 79 41 0.971 5.13 2.10 Intr + 103276 103479 204 0 0 74 82 67 0.940 4.10 2.11 Intr + 104119 104232 114 0 0 107 81 37 0.977 5.54 2.12 Intr + 104626 104760 135 0 0 46 65 179 0.999 12.16 2.13 Intr + 104890 105007 118 0 1 87 102 55 0.999 6.84 2.14 Intr + 105427 105695 269 2 2 86 107 224 0.999 21.35 2.15 Intr + 106201 106399 199 0 1 78 79 220 0.988 19.02 2.16 Intr + 106770 106879 110 1 2 85 91 -22 0.959 -2.20 2.17 Intr + 108842 108976 135 1 0 69 41 123 0.974 6.46 2.18 Intr + 109519 109711 193 0 1 49 57 270 0.954 19.07 2.19 Intr + 111098 111273 176 0 2 99 67 233 0.901 21.96 2.20 Intr + 112586 112722 137 1 2 99 110 220 0.999 24.67 2.21 Intr + 113876 114026 151 2 1 97 76 166 0.997 16.46 2.22 Intr + 114609 114777 169 2 1 65 100 219 0.999 20.32 2.23 Intr + 114918 115072 155 1 2 64 101 73 0.999 5.99 2.24 Intr + 115408 115531 124 0 1 99 99 186 0.999 20.96 2.25 Intr + 116849 116994 146 0 2 60 92 244 0.999 22.00 2.26 Intr + 117118 117248 131 0 2 96 100 141 0.992 15.59 2.27 Intr + 117542 117643 102 2 0 79 108 37 0.915 4.09 2.28 Intr + 122223 122386 164 0 2 30 111 43 0.502 0.42 2.29 Term + 122866 122903 38 2 2 108 54 34 0.291 -0.50 2.30 PlyA + 123258 123263 6 1.05 3.00 Prom + 129748 129787 40 -2.96 3.01 Init + 134546 134634 89 1 2 53 9 178 0.592 4.52 3.02 Term + 137481 137571 91 0 1 90 39 60 0.484 -1.51 3.03 PlyA + 137752 137757 6 1.05 4.00 Prom + 144823 144862 40 -2.16 4.01 Init + 160527 160589 63 2 0 76 82 72 0.148 6.55 4.02 Term + 161571 161636 66 2 0 80 41 51 0.069 -2.46 4.03 PlyA + 163640 163645 6 1.05 5.03 PlyA - 164566 164561 6 1.05 5.02 Term - 174204 174103 102 0 0 31 48 162 0.487 4.78 5.01 Init - 180982 180959 24 1 0 69 119 7 0.398 1.50 5.00 Prom - 181162 181123 40 -4.46 6.03 PlyA - 181511 181506 6 1.05 6.02 Term - 183054 182916 139 2 1 75 42 166 0.620 8.14 6.01 Init - 185273 185245 29 2 2 84 73 29 0.599 0.07 6.00 Prom - 187690 187651 40 -5.36 7.07 PlyA - 189200 189195 6 1.05 7.06 Term - 189409 189258 152 2 2 80 47 62 0.087 -0.63 7.05 Intr - 194160 194040 121 0 1 117 119 13 0.104 7.37 7.04 Intr - 212344 212267 78 1 0 125 31 32 0.315 0.95 7.03 Intr - 213721 213695 27 1 0 124 79 1 0.464 1.11 7.02 Intr - 214891 214790 102 1 0 46 72 74 0.368 1.97 7.01 Init - 216772 216755 18 1 0 46 100 44 0.530 1.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 2301 2230 72 0 0 142 37 34 0.876 1.61 S.002 Init - 89335 89038 298 1 1 65 41 238 0.916 12.20 S.003 Term - 93725 93559 167 0 2 107 48 140 0.977 9.98 S.004 Init - 95420 95357 64 2 1 84 83 39 0.953 4.45 S.005 Sngl - 156433 156290 144 1 0 34 54 169 0.835 2.01 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_1|182_aa MPGSVGLGARSRGWGRSGRLGLRGNLGEKGERLGMAEHLGMAGCRSRAVPFREAAEAQRE FKNCAGRPVVLGDLAHPLQLLAWVLSPSLPGASGMGWPLRVRGLPSLHPPTRNLLWPASA VRSPGSCQCLSLHTSPQAEGASSSLGQPREGLPQCSGGLKGSSSAARVDTEAKEAQRVSE GC >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_1|549_bp atgccagggtcagtgggactgggtgcccggagcagggggtggggccggtcggggaggctg gggctgcgtgggaacttgggggagaagggggagcgcttgggcatggcagagcacttgggt atggcgggctgcaggtcccgagccgtgccctttagggaggcggctgaggcccagcgagaa ttcaagaactgtgcaggtaggccagtagtgctgggagacctggcgcaccctctgcagctg ctggcctgggtgctaagcccctcattgcctggggccagtggcatgggctggcctctccga gtgcggggcctaccaagcctgcacccacccacccggaacttgctctggcccgcgagcgct gtgcgcagccccggttcctgccagtgcctctccctccacacctccccacaagcagaggga gccagctccagcctcggccagcccagagaggggctcccacagtgcagcggcgggctgaag ggctcctcaagcgctgccagagtggacaccgaggccaaggaggcacagagagtgagcgag ggctgctag >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_2|1222_aa MRELAAQADAKRLASSITLRKCKFSFTNENDQNGFDNEENVILHTLRSPKFWLATALSID KEHHQCTANLGPKYTKQTAPECADSRLSSINITDNMYVHQLRLKRKVPTTLKTITGPGLP KAAQETFSLEIWSFVFYGSQDGHGVELSNITQQPEEPLCSDRTRRRALSTATMGKGVGRD KYEPAAVSEQGDKKGKKGKKDRDMDELKKEVSMDDHKLSLDELHRKYGTDLSRGLTSARA AEILARDGPNALTPPPTTPEWIKFCRQLFGGFSMLLWIGAILCFLAYSIQAATEEEPQND NLYLGVVLSAVVIITGCFSYYQEAKSSKIMESFKNMVPQQALVIRNGEKMSINAEEVVVG DLVEVKGGDRIPADLRIISANGCKVDNSSLTGESEPQTRSPDFTNENPLETRNIAFFSTN CVEGTARGIVVYTGDRTVMGRIATLASGLEGGQTPIAAEIEHFIHIITGVAVFLGVSFFI LSLILEYTWLEAVIFLIGIIVANVPEGLLATVTVCLTLTAKRMARKNCLVKNLEAVETLG STSTICSDKTGTLTQNRMTVAHMWFDNQIHEADTTENQSGVSFDKTSATWLALSRIAGLC NRAVFQANQENLPILKRAVAGDASESALLKCIELCCGSVKEMRERYAKIVEIPFNSTNKY QLSIHKNPNTSEPQHLLVMKGAPERILDRCSSILLHGKEQPLDEELKDAFQNAYLELGGL GERVLGFCHLFLPDEQFPEGFQFDTDDVNFPIDNLCFVGLISMIDPPRAAVPDAVGKCRS AGIKVIMVTGDHPITAKAIAKGVGIISEGNETVEDIAARLNIPVSQVNPRDAKACVVHGS DLKDMTSEQLDDILKYHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGDGVNDSPALKKA DIGVAMGIAGSDVSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIAYTLTSNIPEIT PFLIFIIANIPLPLGTVTILCIDLGTDMVPAISLAYEQAESDIMKRQPRNPKTDKLVNER LISMAYGQIGMIQALGGFFTYFVILAENGFLPIHLLGLRVDWDDRWINDVEDSYGQQWTY EQRKIVEFTCHTAFFVSIVVVQWADLVICKTRRNSVFQQGMKNKILIFGLFEETALAAFL SYCPGMGVALRMYPLNSKKPWSSGKPWREQGKRKALNFTPDFPPEHFCPFLCDNPLNRKF YSQTFDLGRAAPMNTSPPKLPG >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_2|3669_bp atgagagagctggcagcacaggcagatgctaaaagacttgcctccagtattaccttaaga aaatgtaaattttcatttacaaatgaaaatgatcaaaatggcttcgacaatgaggaaaat gtcattttacacacgttaagaagtcccaagttctggttggctacagctctgagtattgac aaagagcatcatcaatgtactgctaaccttgggccaaaatacaccaagcagacagctcct gaatgtgctgacagcaggcttagcagcatcaatatcacggataacatgtacgttcatcag ttgcggctgaaaagaaaagttcccaccaccctgaagaccatcactggccctggacttcct aaagcagcccaagaaacattttctctggaaatctggtcttttgtcttctacgggagccaa gatggccatggagtggagctgagcaacatcacccagcagcctgaggagcccctgtgcagc gacaggacccggcgccgggcactgagcaccgccaccatggggaagggggttggacgtgat aagtatgagcctgcagctgtttcagaacaaggtgataaaaagggcaaaaagggcaaaaaa gacagggacatggatgaactgaagaaagaagtttctatggatgatcataaacttagcctt gatgaacttcatcgtaaatatggaacagacttgagccggggattaacatctgctcgtgca gctgagatcctggcgcgagatggtcccaacgccctcactccccctcccactactcctgaa tggatcaagttttgtcggcagctctttggggggttctcaatgttactgtggattggagcg attctttgtttcttggcttatagcatccaagctgctacagaagaggaacctcaaaacgat aatctgtacctgggtgtggtgctatcagccgttgtaatcataactggttgcttctcctac tatcaagaagctaaaagttcaaagatcatggaatccttcaaaaacatggtccctcagcaa gcccttgtgattcgaaatggtgagaaaatgagcataaatgcggaggaagttgtggttggg gatctggtggaagtaaaaggaggagaccgaattcctgctgacctcagaatcatatctgca aatggctgcaaggtggataactcctcgctcactggtgaatcagaaccccagactaggtct ccagatttcacaaatgaaaaccccctggagacgaggaacattgccttcttttcaaccaat tgtgttgaaggcaccgcacgtggtattgttgtctacactggggatcgcactgtgatggga agaattgccacacttgcttctgggctggaaggaggccagacccccattgctgcagaaatt gaacattttatccacatcatcacgggtgtggctgtgttcctgggtgtgtctttcttcatc ctttctctcatccttgagtacacctggcttgaggctgtcatcttcctcatcggtatcatc gtagccaatgtgccggaaggtttgctggccactgtcacggtctgtctgacacttactgcc aaacgcatggcaaggaaaaactgcttagtgaagaacttagaagctgtggagaccttgggg tccacgtccaccatctgctctgataaaactggaactctgactcagaaccggatgacagtg gcccacatgtggtttgacaatcaaatccatgaagctgatacgacagagaatcagagtggt gtctcttttgacaagacttcagctacctggcttgctctgtccagaattgcaggtctttgt aacagggcagtgtttcaggctaaccaggaaaacctacctattcttaagcgggcagttgca ggagatgcctctgagtcagcactcttaaagtgcatagagctgtgctgtggttccgtgaag gagatgagagaaagatacgccaaaatcgtcgagatacccttcaactccaccaacaagtac cagttgtctattcataagaaccccaacacatcggagccccaacacctgttggtgatgaag ggcgccccagaaaggatcctagaccgttgcagctctatcctcctccacggcaaggagcag cccctggatgaggagctgaaagacgcctttcagaacgcctatttggagctggggggcctc ggagaacgagtcctaggtttctgccacctctttctgccagatgaacagtttcctgaaggg ttccagtttgacactgacgatgtgaatttccctatcgataatctgtgctttgttgggctc atctccatgattgaccctccacgggcggccgttcctgatgccgtgggcaaatgtcgaagt gctggaattaaggtcatcatggtcacaggagaccatccaatcacagctaaagctattgcc aaaggtgtgggcatcatctcagaaggcaatgagaccgtggaagacattgctgcccgcctc aacatcccagtcagccaggtgaaccccagggatgccaaggcctgcgtagtacacggcagt gatctaaaggacatgacctccgagcagctggatgacattttgaagtaccacactgagata gtgtttgccaggacctcccctcagcagaagctcatcattgtggaaggctgccaaagacag ggtgctatcgtggctgtgactggtgacggtgtgaatgactctccagctttgaagaaagca gacattggggttgctatggggattgctggctcagatgtgtccaagcaagctgctgacatg attcttctggatgacaactttgcctcaattgtgactggagtagaggaaggtcgtctgatc tttgataacttgaagaaatccattgcttataccttaaccagtaacattcccgagatcacc ccgttcctgatatttattattgcaaacattccactaccactggggactgtcaccatcctc tgcattgacttgggcactgacatggttcctgccatctccctggcttatgagcaggctgag agtgacatcatgaagagacagcccagaaatcccaaaacagacaaacttgtgaatgagcgg ctgatcagcatggcctatgggcagattggaatgatccaggccctgggaggcttctttact tactttgtgattctggctgagaacggcttcctcccaattcacctgttgggcctccgagtg gactgggatgaccgctggatcaacgatgtggaagacagctacgggcagcagtggacctat gagcagaggaaaatcgtggagttcacctgccacacagccttcttcgtcagtatcgtggtg gtgcagtgggccgacttggtcatctgtaagaccaggaggaattcggtcttccagcagggg atgaagaacaagatcttgatatttggcctctttgaagagacagccctggctgctttcctt tcctactgccctggaatgggtgttgctcttaggatgtatcccctcaactctaagaagccc tggtcttctgggaaaccctggagggagcaaggcaaaaggaaggctctgaacttcacacct gattttccaccagaacatttctgcccttttctgtgtgacaatcctttgaatagaaagttc tacagccaaacatttgatttaggaagagcagctcccatgaacaccagtccaccaaaattg cctggttga >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_3|59_aa MRRSGGSFRGARATAWGSADSSLRYGGCRGCGRQNNGPEDDLHGIGKSGLQMELRLIIK >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_3|180_bp atgcgccgctcaggtgggagcttccggggagctcgcgcgacagcctgggggagcgcggat tcgtctctgcgctacggcggctgcagagggtgtggaagacagaataatggccccgaagac gacttacatggcatagggaaatcaggattgcagatggaattaaggttgataatcaagtga >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_4|42_aa MEPEDATHVDETHLANAQPENTNSTHEKIKPVSKLKAAPLTA >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_4|129_bp atggagccagaggatgcaactcatgtggatgagacccatttggccaatgcacagccggag aataccaactcaacacatgagaagattaagccagtatccaagctcaaggctgcaccccta acagcctga >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_5|41_aa MTGTRKKLRVGIPLALPQLHPECRQLKYTSALTELAGDVQL >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_5|126_bp atgactgggacaagaaagaagctgagggttgggattccactggccctgccccagctgcac cctgaatgccgtcagctcaagtacaccagcgcgctgacggagctggcaggtgacgtgcag ctgtga >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_6|55_aa MDNINAVIKGLIPIISTLPLSSQPLSIVTVITIIISTTIITVITITTIITTILPS >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_6|168_bp atggataacatcaatgcagtaattaaaggtcttatccccatcatctcaacactacctcta tcatcacaaccactgtcaattgtcaccgtcatcaccattatcatcagcactactatcatt actgtcatcaccatcaccaccatcattaccaccatcttaccatcatga >gi568815597f:116284013_116503976|GENSCAN_predicted_peptide_7|165_aa MGKIRKTVAPRNLLLPYLMGLNRRYIPRFTIATFRERLHQIQSYKQRQQDHADSVGPGPG STSQASEHVNLRTEQVLTSLSCFPLLQYGFDILIITATAKIPLCFINPEENTMSADPGTH SLDSKFHEDRVHVSFISEVPGPSKGPGTMQMLKKYCWMLNEWMNE >gi568815597f:116284013_116503976|GENSCAN_predicted_CDS_7|498_bp atgggaaaaatccggaagaccgtggcaccaaggaacctgctgctaccctacttgatgggg ctgaacaggagatatattccacgcttcacaattgccacctttcgggagcggcttcaccag attcagtcatataagcagaggcagcaggaccatgctgattctgtgggaccaggcccgggc tctaccagccaggcctctgaacatgtcaacttgagaactgaacaggttctcacatcgctt tcctgtttccctctcctgcaatatggatttgacatattaatcattacagcaactgctaaa attccactgtgcttcattaatccggaggaaaataccatgtcagctgaccctggcactcac tcattagattccaagttccatgaggacagggtccatgtctccttcatcagtgaggtccca gggcctagcaaagggcctggcacgatgcagatgctcaagaaatattgctggatgttgaat gaatggatgaatgaatga