GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:29:34

Sequence gi568815594f:150999653_151204590 : 204938 bp : 42.31% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   4873   4868    6                               1.05
 1.02 Term -  14734  14660   75  1  0   45   50   103 0.889  -0.94
 1.01 Init -  14990  14775  216  2  0   74  119   172 0.965  17.64
 1.00 Prom -  15231  15192   40                             -10.05

 2.02 PlyA -  15252  15247    6                              -3.64
 2.01 Sngl -  16059  15844  216  0  0   93   50   248 0.909  14.66
 2.00 Prom -  19463  19424   40                              -6.05

 3.00 Prom +  22090  22129   40                              -6.55
 3.01 Sngl +  27396  27695  300  2  0   53   44   267 0.716  14.34
 3.02 PlyA +  28686  28691    6                               1.05

 4.00 Prom +  35718  35757   40                              -6.85
 4.01 Init +  38664  38772  109  2  1   65   87    83 0.741   6.33
 4.02 Intr +  49646  49810  165  0  0  -69   69   315 0.692  12.91
 4.03 Intr +  56122  56256  135  2  0  101   84   127 0.878  13.22
 4.04 Term +  71081  71166   86  0  2   65   49    85 0.480  -0.86
 4.05 PlyA +  71610  71615    6                               1.05

 5.00 Prom +  86707  86746   40                              -6.95
 5.01 Init + 100001 100062   62  1  2  113   84   126 0.883  15.47
 5.02 Intr + 100833 100936  104  0  2   26  100   106 0.869   4.50
 5.03 Intr + 101323 101510  188  2  2  109   94   170 0.996  18.19
 5.04 Intr + 103219 103427  209  0  2  100   87   279 0.999  25.95
 5.05 Intr + 104525 104634  110  2  2   49  101    94 0.995   5.81
 5.06 Term + 104820 104941  122  1  2   99   39   210 0.999  14.86
 5.07 PlyA + 104965 104970    6                               1.05

 6.20 PlyA - 106059 106054    6                               1.05
 6.19 Term - 122555 122439  117  2  0  104   44   116 0.989   6.36
 6.18 Intr - 128063 127966   98  0  2   85   77    69 0.987   4.31
 6.17 Intr - 128704 128518  187  1  1   67   80    90 0.774   4.44
 6.16 Intr - 132731 132679   53  0  2   66   96    14 0.711  -2.29
 6.15 Intr - 133584 133382  203  2  2   43   78   153 0.337   7.81
 6.14 Intr - 135480 135422   59  0  2   58   97     8 0.181  -4.54
 6.13 Intr - 138210 138080  131  1  2   58   87    68 0.436   3.19
 6.12 Intr - 140213 140123   91  2  1   66  110    86 0.462   7.25
 6.11 Intr - 144398 144258  141  2  0   49   83   103 0.952   5.53
 6.10 Intr - 144636 144568   69  0  0   44   98    57 0.609   0.76
 6.09 Intr - 148534 148270  265  0  1   55   54   242 0.721  14.09
 6.08 Intr - 149909 149848   62  2  2   84  107    36 0.915   1.71
 6.07 Intr - 159700 159588  113  2  2   78  111    89 0.930   9.38
 6.06 Intr - 166044 165937  108  1  0   23   75   122 0.324   3.74
 6.05 Intr - 176022 175018 1005  1  0   42   95   854 0.370  71.28
 6.04 Intr - 177035 176873  163  2  1   92   30   131 0.548   6.33
 6.03 Intr - 177303 177165  139  2  1   66   53    74 0.622   1.25
 6.02 Intr - 179745 179703   43  1  1   70   86    63 0.484   0.68
 6.01 Init - 182052 182037   16  0  1   86   87    14 0.610   1.57
 6.00 Prom - 187651 187612   40                              -5.95

 7.00 Prom + 188199 188238   40                              -8.85
 7.01 Init + 190199 190409  211  1  1   68   57   170 0.564  10.90
 7.02 Intr + 190880 191082  203  0  2   38   23   149 0.032   1.48
 7.03 Intr + 191238 191526  289  2  1  -15   31   266 0.030   6.50
 7.04 Term + 203476 203648  173  2  2  107   38   162 0.840  10.11
 7.05 PlyA + 203755 203760    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 190880 191286  407  0  2   38   48   267 0.910  12.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_1|96_aa
MASEDNRVPSPPPTGDDGGGGGREETPTEGGALSLKPGLPIRGIRMKFAVLTGLVEVGEV
SNRDIVETVFNLWVGEQDLKAGAMWFTHTDGDFFRV

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_1|291_bp
atggctagcgaagacaatcgtgtcccttccccgccaccaacaggtgatgacgggggaggt
ggagggagagaagaaacccctactgaagggggtgcattgtctctgaaaccagggctcccc
atcaggggcatcagaatgaaatttgccgtgttgaccggtttggttgaagttggagaagta
tccaatagggatattgtagaaactgtctttaacctgtgggttggggaacaagatcttaaa
gctggagccatgtggtttactcacactgatggtgacttttttcgtgtatag

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_2|71_aa
MAAGSSFSGHTTAARPVAGRAVAVASSPLAAASAAAPVAPAAAAFAKAERRGGAGQEAME
FCAAADSRGAD

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_2|216_bp
atggcggccgggagcagcttcagtgggcacacgacagccgcgcgacccgtggcggggcga
gctgtggcagtagcatcctcaccactcgcagcagcctcagccgcggcgcccgtagcgcca
gcagcggctgcttttgcaaaggctgagcgcaggggcggggcgggccaggaagccatggag
ttctgtgcagccgcggactcccggggagcggactag

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_3|99_aa
MRMEVLESFTKKILSTEVGEMAKQYIEKSLLVPDHVITYLMMSELENRCGQHWLLDGFPR
TLGQAEALDKICEVDLVISLNIPFETLKRSSQPPLDSPS

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_3|300_bp
atgcgcatggaagtcttggaatcatttaccaagaagatcctgagcacagaagttggtgag
atggcaaagcagtatatagagaaaagtcttttggttccagaccatgtgatcacataccta
atgatgtctgagttggagaataggtgtggccagcactggctcctcgatggttttcctagg
acattaggacaagccgaagccctggacaaaatctgtgaagtggatctagtgatcagtttg
aacattccatttgaaacacttaaaagatcgtctcagccgccgttggattcaccctcctag

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_4|164_aa
MRSNQSPLVFLVITSNLQEYCPSEMIKEKPDYRRLEMGLLSGLGSPPTALAKLTVVLLED
GLPEDGLPEDGLPEDGLPQDGLPEDGLPQSSENQKQIHSTEIFTSSNSEVKYEDEIVPGA
TEKWKNSEQMIESQIISLSSGSSKITAEILLTQPCQYAEEEEHS

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_4|495_bp
atgaggtctaatcaaagcccactggtttttctggtgattacaagtaaccttcaagagtat
tgtcccagtgaaatgataaaggaaaagccagattaccgaaggcttgaaatggggctgctc
agcggcctgggctctcctccaaccgccctggccaaactcaccgtggtcctactggaggat
ggtctgccggaggatggcctgccggaggatggcctgccggaggatggcctgccgcaggat
ggcctgccggaggatggcctgccgcagtcttcagaaaaccaaaaacaaatacacagcacc
gagattttcaccagcagcaactcagaggtcaagtatgaggatgagatagttcctggggcc
acagagaaatggaaaaattctgagcagatgatagaaagtcaaatcatctctttatcctca
ggcagtagcaagataactgcagaaattcttttaactcagccatgccaatatgcagaggaa
gaagagcactcttag

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_5|264_aa
MAVGKNKRLTKGGKKGAKKKVVDPFSKKDWYDVKAPAMFNIRNIGKTLVTRTQGTKIASD
GLKGRVFEVSLADLQNDEVAFRKFKLITEDVQGKNCLTNFHGMDLTRDKMCSMVKKWQTM
IEAHVDVKTTDGYLLRLFCVGFTKKRNNQIRKTSYAQHQQVRQIRKKMMEIMTREVQTND
LKEVVNKLIPDSIGKDIEKACQSIYPLHDVFVRKVKMLKKPKFELGKLMELHGEGSSSGK
ATGDETGAKVERADGYEPPVQESV

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_5|795_bp
atggcggttggcaagaacaagcgccttacgaaaggcggcaaaaagggagccaagaagaaa
gtggttgatccattttctaagaaagattggtatgatgtgaaagcacctgctatgttcaat
ataagaaatattggaaagacgctcgtcaccaggacccaaggaaccaaaattgcatctgat
ggtctcaagggtcgtgtgtttgaagtgagtcttgctgatttgcagaatgatgaagttgca
tttagaaaattcaagctgattactgaagatgttcagggtaaaaactgcctgactaacttc
catggcatggatcttacccgtgacaaaatgtgttccatggtcaaaaaatggcagacaatg
attgaagctcacgttgatgtcaagactaccgatggttacttgcttcgtctgttctgtgtt
ggttttactaaaaaacgcaacaatcagatacggaagacctcttatgctcagcaccaacag
gtccgccaaatccggaagaagatgatggaaatcatgacccgagaggtgcagacaaatgac
ttgaaagaagtggtcaataaattgattccagacagcattggaaaagacatagaaaaggct
tgccaatctatttatcctctccatgatgtcttcgttagaaaagtaaaaatgctgaagaag
cccaagtttgaattgggaaagctcatggagcttcatggtgaaggcagtagttctggaaaa
gccactggggacgagacaggtgctaaagttgaacgagctgatggatatgaaccaccagtc
caagaatctgtttaa

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_6|1020_aa
MPRGRASRTSIQSELHRDRRRPEITIVAAEPLRPASWFPGTPPPGLGFPTSSAAGSWRPN
ELVPAELPPSYEQVIKEINQVQVNTTNNNNAAATPRHTITSATQTDFSEEIDNDLPQSNA
TLQAPLKPLQPFSAVSSGNLPTNVAPLIVFDISEEPNCPENPSATRCPVPKPRSKSNLRP
IPRDSHIKEQSQQKISPAAVGEESSPGRPQSLLDNASTSDSQAVMNIMNTEQSQNSIVSR
IKVFEGQTNIETSGLPKKPEITPRSLPPKPTVSSGKPSVAPKPAANRASGEWDSGTENRL
KVTSKEGLTPYPPLQEAGSIPVTKPELPKKPNPGLIRSVNPEIPGRGPLAESSDSGKKVP
TPAPRPLLLKKSVSSENPTYPSAPLKPVTVPPRLAGASQAKAYKSLGEGPPANPPVPVLQ
SKPLVDIDLISFDDDVLPTPSGNLAEESVGSEMVLDPFQLPAKTEPIKERAVQPAPTRKP
TVIRIPAKPGKCLHEDPQSPPPLPAEKPIGNTFSTVSGKLSNVERTRNLESNHPGQTGGF
VRVPPRLPPRPVNGKTIPTQQPPTKVPPERPPPPKLSATRRSNKKLPFNRSSSDMDLQKK
QSNLATGLSKAKSQVFKNQDPVLPPRPKPGHPLYSKYMLSVPHGIANEDIVSQNPGELSC
KRGDVLVMLKQTENNYLECQKGEDTGRVHLSQMKIITPLDEHLRSRPNPFSPPKDPSHAQ
KPVDSGAPHAVVLHDFPAEQVDDLNLTSGEIVYLLEKIDTDWYRGNCRNQIGIFPANYVK
VIIDIPEGGNGKRECVSSHCVKGSRCVARFEYIGEQKDELSFSEGEIIILKEYVNEEWAR
GEVRGRTGIFPLNFVEPVEDYPTSGANVLSTKVPLKTKKEDSGSNSQVNSLPAEWCEALH
SFTAETSDDLSFKRGDRIQILERLDSDWCRGRLQDREGIFPAVFVRPCPAEAKSMLAIVP
KGRKAKALYDFRGENEDELSFKAGDIITELESVDDDWMSGELMGKSGIFPKNYIQFLQIS

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_6|3063_bp
atgccccgtggaagagcttctcggacttctattcagagtgaacttcatcgagatagaagg
cgcccagagatcaccattgtggcagctgagccactgaggccagcctcgtggtttccagga
accccacccccaggactgggatttcctacatcatctgcagcaggctcttggaggcctaat
gagctggttcctgctgagctcccaccatcttatgaacaagttataaaagaaatcaaccaa
gttcaagttaatactacaaataataataatgctgctgctactccaaggcacactattact
tctgcaactcagactgacttttcagaagaaatagacaacgatctgcctcaaagtaatgca
acactacaggcacctctcaagcctcttcagcctttctcagcagtctcgtctggcaatctt
ccaacaaatgtggcacctttaatcgtctttgatatttctgaagaaccgaattgtccagaa
aaccccagtgctacaagatgtccagtgccaaaaccaagatcaaaaagcaacctcagacca
atacccagagattctcacattaaagagcaaagtcaacagaaaatcagcccagcagccgta
ggagaggagtcatccccaggccggccccagtctctgctggacaacgctagcacctcagac
agtcaggcagtgatgaacattatgaacacagaacaaagccaaaatagtattgtttccaga
attaaagtgtttgagggtcagacaaacatagaaacctcaggactgcccaagaaaccagaa
attactccacgttcacttcctccaaagcctactgtttcctcagggaaaccttctgtagct
cccaaaccagctgctaacagagcttctggagagtgggactctgggactgagaacagactc
aaggtgacctccaaggaaggactcaccccataccctcccctgcaagaagcgggaagcatc
ccagtaaccaaacctgaattgccaaagaaaccaaaccctggccttatacgaagtgttaat
cctgagattccgggaagagggcccctggctgagagctctgatagtgggaagaaagtgcca
actcctgccccgcggcctttgctgctgaagaaatctgtttcctcagaaaaccccacctac
ccttcagctccactgaaacctgtcactgttcctccccgactcgcaggggcatcacaagcc
aaagcatacaagtcactgggagaagggcccccagccaaccccccagttccagttctgcag
agcaagcccttggtggacatcgatctcatcagctttgatgatgatgttttgcccacccca
tcggggaacctggctgaagaatctgttggttcagagatggttctagatccctttcagctc
cctgcaaaaacagaaccaataaaagaacgagcagttcaaccagcacccaccaggaagccc
actgtaattcgaattccagccaaaccaggaaaatgtttacatgaggatccacaaagtcca
cctcctctccctgctgaaaaacctattggaaacactttcagtacagtatctggaaagctc
agtaatgttgagagaactagaaacttggaatccaaccacccaggtcaaacaggaggtttt
gtgcgagtacccccaaggttgccaccgagacctgtgaatggaaaaaccattccaactcaa
cagcctccaaccaaggtgccccctgagagaccacctcccccaaagctttctgcaaccaga
agatctaataagaaactgccttttaatcgatcctcttctgacatggatcttcagaaaaaa
caaagtaacttggcaactggactctcaaaagccaagagtcaagtttttaaaaatcaagat
ccggtgctaccccctcgtcccaaaccaggacaccctctctacagtaaatacatgctgtct
gtgcctcatggaattgccaatgaagatattgtctctcaaaaccccggagaactctcttgt
aagcgtggggatgtacttgtgatgctgaagcagacggaaaataattacttggagtgccaa
aagggagaagacactggcagagttcacctgtctcaaatgaagattatcactccacttgat
gaacatcttagaagcagaccaaaccccttttctccccctaaggatccaagccacgctcag
aagcctgttgacagtggtgctcctcatgctgtcgttcttcatgatttcccagcagagcaa
gttgatgatttgaacctcacttctggagaaattgtttatcttctggagaagatagataca
gattggtacagagggaactgtagaaaccagattggcatatttcctgccaactatgtcaaa
gtgattattgatatcccagaaggaggaaatgggaaaagagaatgtgtttcatctcattgt
gttaaaggctcaagatgtgttgctcggtttgaatatattggagagcagaaggatgagttg
agtttctcagagggagaaattattattcttaaagagtatgtgaatgaggaatgggccaga
ggagaagttcgaggcagaactgggattttccccctgaactttgtggagcctgttgaggat
tatcccacctctggtgcaaatgttttaagcacaaaggtaccactgaaaaccaaaaaagaa
gattctggctcaaactctcaggttaacagtcttccggcagaatggtgtgaagctcttcac
agttttacagcagagaccagtgatgacttatcattcaagaggggagaccggatccagatt
ctggaacgtctggattctgactggtgcaggggcagactgcaggacagggaggggatcttc
ccagcagtgtttgtgaggccctgcccagctgaggcaaaaagtatgttggccatagtaccg
aaggggaggaaggccaaagccttatatgatttccgaggggagaatgaagatgaactttcc
ttcaaggctggagatataataacagagctggaatctgtagatgatgactggatgagtgga
gaacttatgggaaaatctggaatatttcccaaaaactacatacagtttctacagatcagc
tag

>gi568815594f:150999653_151204590|GENSCAN_predicted_peptide_7|291_aa
MWKRLWNWVTGRGWNSLEGSEEDRKMWDSLELPRDLLNGFAQNADNDMDNENQAEVFSDG
DEELVGNLSKATPAVAERGQRRAWAVASEGASPKTWQLPCGVERASAQKSRIEVWEPPPR
FQGIYGNTWMSRQKFAAGLALCSWKSRRHSTPAVKAAGGEAIPCKTTGVELPKTMGTHLL
HQHDLDRFGPGVKVDRFGALRFDLPCWISDLHGDCSPFVLANFSHLEWLYLPNACHSPNT
KGLKKTGGIPMTGPPAIAASVPLTPSSPPALSQEESADTEQSLAVGIGATG

>gi568815594f:150999653_151204590|GENSCAN_predicted_CDS_7|876_bp
atgtggaagcgactttggaactgggtaacaggcaggggttggaatagtttggagggttca
gaagaagacaggaaaatgtgggacagtttggaacttcctagagacttgctgaatggcttt
gcccaaaatgctgataatgatatggataatgaaaaccaggctgaggtgttctcagatgga
gatgaggaacttgttgggaacttgagcaaagccactccagctgtggctgaaaggggacaa
cgtagagcttgggccgtggcttcagagggtgcaagccccaagacttggcagcttccatgt
ggtgttgagcgtgcaagtgcacagaagtcaagaattgaggtttgggaacctccacctaga
tttcaggggatttatggaaacacctggatgtccaggcagaagtttgctgcagggcttgca
ctgtgctcctggaaaagccgcagacactcaacgccagccgtgaaagcagctggtggggag
gctataccctgcaaaaccacaggggtggagctgcccaagaccatgggaacccacctcttg
catcagcatgacctggatcgttttggacctggcgtcaaagtagatcgttttggagcttta
agatttgacctgccctgctggatttcagacttgcatggggactgtagcccctttgttttg
gccaatttctcccatttggaatggctgtatttacccaatgcctgccacagtcctaacacc
aaaggtctgaaaaagacaggtggaattcccatgacaggacccccagcaatcgcagcatcg
gtgccactgaccccatcatcgccaccagccctttcccaggaagagtcagctgacacggaa
caatccttggcagtggggattggggctactggatga