GENSCAN 1.0	Date run: 8-Nov-116	Time: 17:36:33

Sequence gi568815589f:74530280_74785615 : 255336 bp : 38.39% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12048  12255  208  2  1   67   19   182 0.750   8.23
 1.02 Intr +  22575  22667   93  1  0   69   37   138 0.464   5.82
 1.03 Intr +  47758  47782   25  2  1   91  115    26 0.193   1.77
 1.04 Term +  54244  54391  148  1  1   99   39    86 0.176   1.19
 1.05 PlyA +  56284  56289    6                               1.05

 2.00 Prom +  59813  59852   40                              -6.85
 2.01 Init +  72891  73001  111  2  0   68   47   114 0.707   5.56
 2.02 Intr +  77023  77106   84  0  0   84   89    43 0.063   3.10
 2.03 Intr +  81704  81833  130  1  1   27   98    44 0.007  -1.35
 2.04 Intr + 100003 100088   86  2  2  106   86    61 0.367   6.32
 2.05 Intr + 104352 104493  142  2  1   99   95    18 0.501   2.61
 2.06 Intr + 112135 112536  402  2  0   81  123   493 0.218  45.57
 2.07 Intr + 130338 130459  122  1  2   94   73    99 0.065   8.29
 2.08 Intr + 132195 132327  133  2  1   64  115    96 0.995   9.20
 2.09 Intr + 135209 135316  108  0  0   86   97    61 0.177   6.14
 2.10 Intr + 137512 137622  111  2  0  112   91   109 0.387  13.03
 2.11 Intr + 141510 141622  113  1  2   98  100    51 0.060   6.48
 2.12 Intr + 155184 155243   60  2  0  118   68    75 0.074   6.51
 2.13 Intr + 160715 160891  177  1  0   85   67    36 0.045   0.39
 2.14 Intr + 160909 161119  211  0  1   21   77   139 0.134   3.66
 2.15 Intr + 164626 164781  156  2  0   21   41   160 0.305   3.66
 2.16 Term + 185578 185687  110  2  2  111   38    67 0.050   1.69
 2.17 PlyA + 185912 185917    6                               1.05

 3.04 PlyA - 186969 186964    6                               1.05
 3.03 Term - 194467 194334  134  2  2  128   42    67 0.391   3.37
 3.02 Intr - 198066 197960  107  2  2   87   67    40 0.449   0.74
 3.01 Init - 200489 200416   74  2  2   33   82   121 0.440   6.59
 3.00 Prom - 202284 202245   40                              -7.15

 4.15 PlyA - 203226 203221    6                               1.05
 4.14 Term - 206108 205981  128  0  2   40   37   145 0.593   2.06
 4.13 Intr - 208333 208128  206  0  2   99   55   193 0.729  15.02
 4.12 Intr - 209170 209088   83  1  2   87  116    66 0.997   6.82
 4.11 Intr - 209730 209444  287  1  2   72  110   140 0.696  10.74
 4.10 Intr - 212813 212732   82  2  1   49   48    94 0.411  -0.21
 4.09 Intr - 222089 221998   92  0  2   86   83    38 0.728   1.89
 4.08 Intr - 225194 225074  121  2  1   90   91   124 0.950  12.05
 4.07 Intr - 231529 231417  113  2  2   78  100    62 0.535   5.58
 4.06 Intr - 232855 231720 1136  0  2   89   78   653 0.768  52.49
 4.05 Intr - 240709 240590  120  0  0   49  103   108 0.957   7.09
 4.04 Intr - 241556 241424  133  0  1   67   93   193 0.999  16.48
 4.03 Intr - 245797 245604  194  0  2   76   97   205 0.920  18.41
 4.02 Intr - 247954 247726  229  2  1   69    0   148 0.122   0.11
 4.01 Intr - 252660 252400  261  1  0   82   66   184 0.185  12.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 123530 123322  209  0  2   53   43   185 0.871   7.02
S.002 Init + 130391 130459   69  1  0   78   73    67 0.930   5.30
S.003 Term + 133097 133251  155  0  2   64   45   162 0.820   6.60
S.004 Intr - 140788 140638  151  0  1  132   94   135 0.864  17.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:74530280_74785615|GENSCAN_predicted_peptide_1|157_aa
MNNTKRICKKAKTFAEMKNVLFEIEFQWTLNCKVDTTEERLSEDIILNAAQKGKELENLK
NKGREIEDEGQYNGERLAVMEELSEDLNQSGDLSERAFQTDQETEDLRRKCGMLGSKEQS
GTFTMPFIASERLSVGSQSPESTDLHFSPVSQRTESH

>gi568815589f:74530280_74785615|GENSCAN_predicted_CDS_1|474_bp
atgaataacaccaaacgtatttgcaaaaaagcaaaaacatttgcagaaatgaaaaatgta
ttgtttgaaatcgaatttcagtggacattaaattgcaaagtagacacaactgaagagaga
cttagtgaagacattatcctcaatgcagcacagaaaggcaaggagctagaaaatttaaaa
aacaaaggaagagaaatagaagatgaagggcaatataatggagaaagactggcagtcatg
gaagaactgtctgaagatctgaatcaaagcggggacctaagtgagagagcatttcagaca
gatcaggaaactgaggacctgagaaggaagtgtggtatgcttggttctaaagagcagtct
gggactttcaccatgccgtttattgcatcagaaaggttatctgttggttctcagagccca
gagtctactgatcttcatttcagcccagtatcacagaggacagaaagccattaa

>gi568815589f:74530280_74785615|GENSCAN_predicted_peptide_2|751_aa
MHMPSRGSQRGCPTFSDEENTGTHHVWAFAALFGAMAGNDFRLQGGRTNIIKAGLKFTTE
LRIVTTLGDTQREGFMPKELIVKVKEKHINRIIEEYQDQMSGGDIGSQAQIEVIPCKICG
DKSSGIHYGVITCEGCKGFFRRSQQNNASYSCPRQRNCLIDRTNRNRCQHCRLQKCLALG
MSRDAVKFGRMSKKQRDSLYAEVQKHQQRLQEQRQQQSGEAEALARVYSSSISNGLSNLN
NETSGTYANGHVIDLPKSEGYYNVDSGQPSPDQSGLDMTGIKQIKQEPIYDLTSVPNLFT
YSSFNNGQLAPGITMTEIDRIAQNIIKSHLETCQYTMEELHQLAWQTHTYEEIKAYQSKS
REALWQQCAIQITHAIQYVVEFAKRITGFMELCQNDQILLLKSGCLEVVLVRMCRAFNPL
NNTVLFEGKYGGMQMFKALGSDDLVNEAFDFAKNLCSLQLTEEEIALFSSAVLISPDRAW
LIEPRKVQKLQEKIYFALQHVIQKNHLDDETLAKLIAKIPTITAVCNLHGEKLQLYVKNL
FYSRDHVVLHCLEVQTMDSYAADGGCLFVHFFPKLLKFLRCEWAIVIFASCVTWIRESSS
VIVLMAKNKAQGLPELNEPCSGQHGASAPQLGSMILVFNYTLLAVVGGKHSRASVTTIRR
NTLAESPDIDMLFTISNEKESDGPLAMDIERMVFPFLQRFSQKQQEKEPESSLSEDDWAE
KPQGYFEIHLLKMTEPCQPVSQKDCVKQRTQ

>gi568815589f:74530280_74785615|GENSCAN_predicted_CDS_2|2256_bp
atgcatatgcccagtcgtggctctcagagaggatgcccaaccttcagtgatgaagaaaac
acaggcactcatcatgtctgggcttttgcagctctctttggtgcaatggcaggaaatgac
ttcagactgcaaggaggtagaacaaacatcattaaggcaggattgaagttcaccactgag
ttgaggattgtaacgacactgggagacacacaaagagagggttttatgcctaaggagctt
atagtcaaggtgaaagagaagcatataaacaggattattgaagagtaccaggaccaaatg
agtggtggggacataggatcccaggcacaaattgaagtgataccatgcaaaatttgtggc
gataagtcctctgggatccactacggagtcatcacatgtgaaggctgcaagggattcttt
aggaggagccagcagaacaatgcttcttattcctgcccaaggcagagaaactgtttaatt
gacagaacgaacagaaaccgttgccaacactgccgactgcagaagtgtcttgccctagga
atgtcaagagatgctgtgaagtttgggaggatgtccaagaagcaaagggacagcctgtat
gctgaggtgcagaagcaccagcagcggctgcaggaacagcggcagcagcagagtggggag
gcagaagcccttgccagggtgtacagcagcagcattagcaacggcctgagcaacctgaac
aacgagaccagcggcacttatgccaacgggcacgtcattgacctgcccaagtctgagggt
tattacaacgtcgattccggtcagccgtcccctgatcagtcaggacttgacatgactgga
atcaaacagataaagcaagaacctatctatgacctcacatccgtacccaacttgtttacc
tatagctctttcaacaatgggcagttagcaccagggataaccatgactgaaatcgaccga
attgcacagaacatcattaagtcccatttggagacatgtcaatacaccatggaagagctg
caccagctggcgtggcagacccacacctatgaagaaattaaagcatatcaaagcaagtcc
agggaagcactgtggcaacaatgtgccatccagatcactcacgccatccaatacgtggtg
gagtttgcaaagcggataacaggcttcatggagctctgtcaaaatgatcaaattctactt
ctgaagtcaggttgcttggaagtggttttagtgagaatgtgccgtgccttcaacccatta
aacaacactgttctgtttgaaggaaaatatggaggaatgcaaatgttcaaagccttaggt
tctgatgacctagtgaatgaagcatttgactttgcaaagaatttgtgttccttgcagctg
accgaggaggagatcgctttgttctcatctgctgttctgatatctccagaccgagcctgg
cttatagaaccaaggaaagtccagaagcttcaggaaaaaatttattttgcacttcaacat
gtgattcagaagaatcacctggatgatgagaccttggcaaagttaatagccaagatacca
accatcacggcagtttgcaacttgcacggggagaagctgcagttgtatgtgaagaattta
ttctatagtcgtgatcatgtagtcctccattgtttggaagtgcagaccatggattcctat
gcagcagatgggggttgtctgtttgtgcacttttttcctaagctgctgaaattcctccga
tgtgaatgggccatagtaatctttgcctcctgcgtgacttggatcagagaaagcagctca
gtgattgttctcatggccaaaaataaagcacaagggttaccggaattgaatgagccctgt
tcagggcagcacggggcatctgcaccccagctgggcagcatgatccttgtattcaactat
actcttctggcagtagttggtggcaagcattcgcgtgcctctgtcacaaccatacgtaga
aacacattagcagagtcaccagacatagacatgcttttcaccataagcaatgagaaggaa
agtgatggcccattggcaatggacattgaaagaatggtctttccctttctccaaagattt
agccaaaagcaacaagagaaggagcctgaaagtagcttatcggaggatgactgggctgag
aaaccccagggctactttgagatccatctgttgaagatgacagaaccctgtcagcctgta
tcccagaaagactgtgtgaagcagagaacccagtga

>gi568815589f:74530280_74785615|GENSCAN_predicted_peptide_3|104_aa
MGLTFRMPYVSEALATRSMGRGPTRSRGMVFGPANLGEDAIRNFIAKHHCNSCCRKLKLP
DLKRNDYSPERINSTFGLEIKIESAEEPPARETGRNSPEDDMQL

>gi568815589f:74530280_74785615|GENSCAN_predicted_CDS_3|315_bp
atggggctcacatttcgaatgccttacgtgtctgaagcacttgctacacgaagcatggga
cgtggaccaacaagatcaagaggaatggtgtttggaccggccaatttgggggaagatgca
attagaaacttcattgcaaaacatcattgtaactcctgctgccggaagctcaaactcccg
gatttaaaaagaaatgactattcccctgaaaggataaattccacctttggacttgagata
aaaatagaatcagctgaggagcctccagcaagggagacgggtagaaattccccagaagat
gatatgcaactataa

>gi568815589f:74530280_74785615|GENSCAN_predicted_peptide_4|1061_aa
XIVMMAAGVWFLLNFKSVVKFWLYYALLQTANMFYIVIIMAIVLLSFGVARKAILSPKEP
PSWSLARDIVFEPYWMIYGEVYAGEIDGRPLPSGALALKSPWWGCPFRCPLSISKALILD
LEAKCSTDTRLQFPLGQLEVSFLAPAQAAKPILASEHAERHFVGNVYLDMESISNNLWKY
NRYRYIMTYHEKPWLPPPLILLSHVGLLLRRLCCHRAPHDQEEGDVGLKLYLSKEDLKKL
HDFEEQCVEKYFHEKMEDVNCSCEERIRVTSESGVGGCFTEVGEDRIGEAETAITGGFWE
KYYGAGKLGMGTRVTEMYFQLKEMNEKVSFIKDSLLSLDSQVGHLQDLSALTVDTLKVLS
AVDTLQEDEALLAKRKHSTCKKLPHSWSNVICAEVLGSMEIAGEKKYQYYSMPSSLLRSL
AGGRHPPRVQRGALLEITNSKREATNVRNDQERQETQSSIVVSGVSPNRQAHSKYGQFLL
VPSNLKRVPFSAETVLPLSRPSVPDVLATEQDIQTEVLVHLTGQTPVVSDWASVDEPKEK
HEPIAHLLDGQDKAEQVLPTLSCTPEPMTMSSPLSQAKIMQTGGGYVNWAFSEGDETGVF
SIKKKWQTCLPSTCDSDSSRSEQHQKQAQDSSLSDNSTRSAQSSECSEVGPWLQPNTSFW
INPLRRYRPFARSHSFRFHKEEKLMKICKIKNLSGSSEIGQGAWVKAKMLTKDRRLSKKK
KNTQGLQVPIITVNACSQSDQLNPEPGENSISEEEYSKNWFTVSKFSHTGVEPYIHQKMK
TKEIGQCAIQISDYLKQSQEENADHSLLIKIGLRLSLEKEDELCKVNTGEEITVYRLEES
SPLNLDKSMSSWSQRGRAAMIQVLSREEMDGGLRKAMRVVSTWSEDDILKPGQVFIVKSF
LPEVVRTWHKIFQESTVLHLCLREIQQQRAAQKLIYTFNQVKPQTIPYTPRFLEVFLIYC
HSANQWLTIEKYMTGEFRKYNNNNGDEITPTNTLEELMLAFSHWTYEYTRGELLVLDLQV
FGEIMHIVWVVEVETEAHQQQLGGGALELQCRSLTSGSTVN

>gi568815589f:74530280_74785615|GENSCAN_predicted_CDS_4|3186_bp
ngtatagttatgatggctgctggtgtttggtttcttttgaacttcaaatctgtggtaaaa
ttctggttgtactatgctcttttgcagacagcaaacatgttctatattgtgatcatcatg
gccatagtcctgctgagctttggagtggcacgcaaggccatcctttcgccaaaagagcca
ccatcttggagtctagctcgagatattgtatttgagccatactggatgatatacggagaa
gtctatgctggagaaatagatgggcgaccccttccttctggagccttagctctgaaaagc
ccctggtgggggtgccctttcagatgccccctttccatttcaaaggctctgattctcgat
cttgaagccaaatgcagcaccgacactcggcttcagtttccactgggacagctggaggtc
tcctttctagccccagcccaggcggccaagcccatcctggcatcagaacatgctgagcgg
cattttgtaggcaacgtttacttagatatggaatccatttcaaataacctgtggaaatac
aaccgctatcgctacatcatgacctaccacgagaagccctggctgcccccacctctcatc
ctgctgagccacgtgggccttctcctccgccgcctgtgctgtcatcgagctcctcacgac
caagaagagggtgacgttggattaaaactctacctcagtaaggaggatctgaaaaaactt
catgattttgaggagcagtgcgtggaaaaatacttccatgagaagatggaagatgtgaat
tgtagttgtgaggaacgaatccgagtgacatcagaaagtggagttggcggctgcttcact
gaagtgggtgaagacagaataggagaagcagagactgcaattacaggtggcttttgggag
aagtattacggtgcagggaagcttggtatggggacacgggttacagagatgtacttccag
ctgaaagaaatgaatgaaaaggtgtcttttataaaggactccttactgtctttggacagc
caggtgggacacctgcaggatctctctgccctgactgtggataccctgaaagtcctttct
gctgttgacactttgcaagaggatgaggctctcctggccaagagaaagcattctacttgc
aaaaaacttccccacagctggagcaatgtcatctgtgcagaggttctaggcagcatggag
atcgctggagagaagaaataccagtattatagcatgccctcttctttgctgaggagcctg
gctggaggccggcatcccccaagagtgcagaggggggcacttcttgagattacaaacagt
aaaagagaggctacaaatgtaagaaatgaccaggaaaggcaagaaacacaaagtagtata
gtggtttctggggtgtctcctaacaggcaagcacactcaaagtatggccagtttcttctg
gtcccctctaatctaaagcgagttcctttttcagcagaaactgtcttgcctctgtccaga
ccctctgtgccagatgtgctggcaactgaacaggacatccagactgaggttcttgttcat
ctgactgggcagaccccagttgtctctgactgggcatcagtggatgaacccaaggaaaag
cacgagcctattgctcacttactggatggacaagacaaggcagagcaagtgctacccact
ttgagttgcacacctgaacccatgacaatgagctcccctctttcccaagccaagatcatg
caaactggaggtggatatgtaaactgggcattttcagaaggtgatgaaactggtgtgttt
agcatcaagaaaaagtggcaaacctgcttgccctccacttgtgacagtgattcctctcgg
agtgaacagcaccagaagcaggcccaggacagctccctatctgataactcaacaagatcg
gcccagagtagtgaatgctcagaggtgggaccatggcttcagccaaacacatccttttgg
atcaatcctctccgcagatacaggcccttcgctaggagtcatagttttagattccataag
gaggagaaattgatgaagatctgtaagattaaaaatctttcaggctcttcagaaataggg
cagggagcatgggtcaaagcgaaaatgctaaccaaagacaggagactgtcaaagaaaaag
aagaatactcaaggactccaggtgccaatcataacagtcaatgcctgctctcagagtgac
cagttgaatccagagccaggagaaaacagcatctctgaagaggagtacagcaagaactgg
ttcacagtgtccaaatttagtcacacaggtgtagaaccttacatacatcagaaaatgaaa
actaaagaaattggacaatgtgctatacaaatcagtgattacctaaagcagtctcaagag
gaaaatgcagatcactcgcttctaattaaaatcggtcttcggctctctctggaaaaagaa
gatgaactgtgcaaagtgaacacaggagaagaaataactgtctacaggttggaggagagt
tcccctttaaaccttgataaaagcatgtcctcttggtctcagcgtgggagagcggcaatg
atccaggtattgtcccgagaggagatggatgggggcctccgtaaagctatgagagtcgtc
agcacttggtctgaggatgacattctcaagccgggacaagttttcattgtcaagtccttt
cttcctgaggttgtgcggacatggcataaaatcttccaggagagcactgtgcttcatctt
tgcctcagggaaattcaacaacaaagagctgctcaaaaattgatctataccttcaaccaa
gtgaaaccacaaaccataccctacacaccaaggttcctggaagttttcttaatctactgc
cattcagccaaccagtggttgaccattgagaagtatatgacaggggagttccggaagtat
aacaacaacaatggtgatgaaatcacccccaccaacaccctggaggagctgatgttggct
ttctctcactggacctatgagtacactcggggagagctgctggttttagatttgcaagtc
tttggagagataatgcacattgtctgggtggtggaagtggagacagaggctcaccagcag
cagttaggaggaggagctctggagttgcaatgccggagtctgacttctggttccactgtg
aattag