GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:30:04 Sequence gi568815591f:151245350_151477222 : 231873 bp : 49.79% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 Intr - 322 111 212 1 2 72 109 170 0.915 15.21 1.10 Intr - 3330 3136 195 0 0 27 89 95 0.671 3.11 1.09 Intr - 8554 8466 89 2 2 96 81 70 0.878 6.79 1.08 Intr - 9150 8946 205 0 1 16 100 74 0.007 0.17 1.07 Intr - 15429 15371 59 1 2 92 100 60 0.016 6.20 1.06 Intr - 26114 25908 207 0 0 109 98 -12 0.211 0.95 1.05 Intr - 31840 31728 113 0 2 107 99 14 0.487 4.42 1.04 Intr - 45145 45043 103 2 1 15 56 82 0.027 -3.07 1.03 Intr - 46170 46065 106 0 1 34 113 58 0.073 2.69 1.02 Intr - 56610 56414 197 1 2 94 52 88 0.039 4.93 1.01 Init - 56945 56852 94 2 1 87 75 44 0.075 3.33 1.00 Prom - 66878 66839 40 -4.66 2.00 Prom + 79497 79536 40 -4.96 2.01 Init + 89080 89155 76 0 1 42 93 67 0.550 3.85 2.02 Intr + 96324 96616 293 1 2 78 72 152 0.310 9.45 2.03 Intr + 103724 103891 168 1 0 66 87 171 0.993 14.94 2.04 Intr + 106075 106133 59 0 2 84 108 29 0.932 2.18 2.05 Intr + 107463 107533 71 0 2 77 77 48 0.928 1.43 2.06 Intr + 110419 110601 183 2 0 50 65 167 0.922 10.36 2.07 Intr + 110779 110873 95 2 2 33 106 11 0.414 -2.92 2.08 Intr + 114792 114898 107 2 2 84 70 29 0.226 -0.29 2.09 Intr + 121590 121776 187 0 1 108 53 81 0.820 6.29 2.10 Intr + 122512 122619 108 0 0 53 116 63 0.883 6.08 2.11 Intr + 123386 123538 153 1 0 101 65 67 0.870 5.97 2.12 Intr + 128748 128894 147 2 0 59 64 187 0.916 13.83 2.13 Intr + 130499 130594 96 1 0 61 72 74 0.803 3.31 2.14 Intr + 131285 131462 178 1 1 94 96 203 0.999 21.19 2.15 Term + 131698 131876 179 2 2 84 36 200 0.956 12.25 2.16 PlyA + 133073 133078 6 1.05 3.19 PlyA - 133125 133120 6 -1.75 3.18 Term - 134818 134329 490 0 1 61 43 196 0.612 6.53 3.17 Intr - 135226 134928 299 1 2 37 91 240 0.607 14.87 3.16 Intr - 136397 136269 129 2 0 108 86 207 0.999 23.39 3.15 Intr - 136632 136529 104 1 2 51 96 206 0.998 17.59 3.14 Intr - 139874 139739 136 2 1 132 95 234 0.136 28.64 3.13 Intr - 146757 146617 141 0 0 47 88 75 0.501 3.95 3.12 Intr - 150847 150427 421 0 1 147 78 601 0.891 58.95 3.11 Intr - 154892 154751 142 0 1 112 76 222 0.992 22.81 3.10 Intr - 156241 156145 97 1 1 78 94 37 0.404 2.88 3.09 Intr - 157861 157746 116 2 2 66 23 85 0.209 -0.03 3.08 Intr - 163704 163608 97 0 1 107 50 69 0.906 4.58 3.07 Intr - 164038 163896 143 2 2 104 72 17 0.970 1.77 3.06 Intr - 164364 164078 287 2 2 105 69 248 0.555 21.59 3.05 Intr - 166659 166427 233 0 2 43 82 59 0.053 -2.63 3.04 Intr - 186936 186819 118 2 1 112 30 219 0.242 18.97 3.03 Intr - 190976 190831 146 2 2 -6 75 218 0.974 10.18 3.02 Intr - 192895 192647 249 1 0 114 101 401 0.992 41.63 3.01 Init - 194568 194548 21 0 0 114 113 7 0.938 5.57 3.00 Prom - 210358 210319 40 -2.96 4.04 PlyA - 210368 210363 6 1.05 4.03 Term - 214506 214381 126 0 0 48 39 134 0.511 2.78 4.02 Intr - 214869 214727 143 1 2 76 37 78 0.510 1.57 4.01 Init - 215619 215436 184 0 1 52 8 203 0.636 6.44 4.00 Prom - 217394 217355 40 -1.86 5.04 PlyA - 220686 220681 6 1.05 5.03 Term - 221862 221770 93 0 0 127 37 90 0.525 5.63 5.02 Intr - 225303 225222 82 2 1 71 92 68 0.293 5.14 5.01 Intr - 226256 226200 57 1 0 81 98 22 0.145 0.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 9054 8946 109 0 1 87 100 63 0.930 7.84 S.002 Intr + 62367 62493 127 2 1 55 119 16 0.828 1.65 S.003 Intr - 139938 139739 200 2 2 90 95 287 0.863 28.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:151245350_151477222|GENSCAN_predicted_peptide_1|527_aa MSMSREVPTFRTTLAVFVSHLHQKTPLSSKADLSVHSFILLANTYCAPTVCEVLGMLQQS NGHQSPLTEKTYSKEERGLAVMTGSAGVDHGLGQGFQQADLYRLHYPSSLIFWLLVGFAN AGPSRRLEVWRKGQSSDMPFGVPMRINVKNTHKALNTVTATARDRQIRSLRLPRSGLQRR RRRAEPSAEQGAGGRAPGRGPGGAGPEPMEKQAQASVNLSARKFKLKRGLNLRPKCRRQV LLPNLGLLTWAWEQKEMICVLKQTRLEEECWPQGPVGSSKFSKPSTKWHQEGKHFQRQRL EGRQQHPHVTDEDRAPGRKRHSPGHMARHEQIPSRPEATRLPPRSLAANIWMQPASVERW EGPYSQQVRALPNSGSLDVALLPASRATPWKKCERAQQDSEGRVGGKKKAEKGKKEEERE RVRGAAEPTPMAADEVAGGARKATKSKLFEFLVHGVRPGMPSGARMPHQGAPMGPPGSPY MGSPAVRPGLAPAGMEPARKRAAPPPGQSQAQSQGQPVPTAPARSRS >gi568815591f:151245350_151477222|GENSCAN_predicted_CDS_1|1581_bp atgagcatgagtagagaggtgcccacgttccggaccactctcgctgtatttgtctcccat ctccatcagaagactccgctgagctccaaggcagatctttctgttcactcatttatcctc ttggcaaatacgtattgtgcacctactgtgtgcgaggtgctagggatgctgcagcaatcc aacgggcaccaatctcccctcacagagaagacatactccaaagaggaaaggggcctcgct gtgatgacaggcagtgcaggtgtagaccatggccttggacaggggttccaacaagctgac ctctacagactgcattaccccagctctcttatcttctggcttctggttggatttgccaat gcaggccccagcagaagactggaggtgtggaggaaagggcaatcatcagacatgcctttt ggagtgcctatgaggatcaacgtaaagaacacacataaagcactcaacacagtgacagcc acagccagggatcgacaaatacgctccctgcgcctcccccgctccgggctgcagcggcgc aggcgccgggccgagccgagcgccgagcagggagcgggcggccgcgctccgggccggggt cccgggggagcaggtcctgaacccatggaaaaacaggcacaggcatcagttaacttaagt gcaaggaaattcaagctgaagagaggcttgaacctgaggcccaagtgtagaaggcaggtc ctactccctaacctgggtttgcttacctgggcatgggaacagaaggagatgatctgtgtc ctaaaacagactcgtctagaagaggaatgctggccacaggggccagttggatcctccaag ttctccaaacctagcaccaagtggcaccaggaaggcaagcatttccagaggcagaggttg gagggaaggcagcagcaccctcatgtcacagatgaggacagagctccagggaggaaacgg cattccccgggccacatggcccgacatgagcagatcccaagtcgaccagaagccacaagg ctgccccccagatccctggcagccaacatctggatgcagccagcatctgtagagagatgg gaagggccatacagccagcaggtcagggcactgcccaattcagggtccctggacgtggcc ctattgcccgccagcagagccacaccctggaagaagtgtgagagagcccagcaggactca gaggggagagttggaggaaaaaaaaaggcagaaaagggaaagaaagaggaagagagagag agagtgagaggagccgctgagcccaccccgatggccgcggacgaagttgccggaggggcg cgcaaagccacgaaaagcaaactttttgagtttctggtccatggggtgcgccccgggatg ccgtctggagcccggatgccccaccagggggcgcccatgggccccccgggctccccgtac atgggcagccccgccgtgcgacccggcctggcccccgcgggcatggagcccgcccgcaag cgagcagcgcccccgcccgggcagagccaggcacagagccagggccagccggtgcccacc gcccccgcgcggagccgcagn >gi568815591f:151245350_151477222|GENSCAN_predicted_peptide_2|699_aa MVVWDADTQQVIPNGIQLAGLDKSHSGFALAPPTTLFPSGGGGGGAKATAAAGAGLASPG MKTNGGRCRIRALCWSRREWRGAGEDTAAECPRPQPQQHCLAPRFPVRLGTSPGQGWSGR GAGDLAKQYSDRLECCENEVEKVIEEIRCKAIERGTGNDNYRTTGIATIEVFLPPRLKKD RKNLLETRLHITGRELRSKIAETFGLQENYIKIVINKKQLQLGKTLEEQGVAHNVKAMVL ELKQSEEDARKNFQLEEEEQNEAKLKEKQIQRTKRGLEILAKRAAETVVDPEMTPYLDIA NQTGRSIRIPPSERKALMLAMGYHEKGRAFLKRKEYGIALPCLLDADKYFCECCRELLDT VDNYAVLQLDIVWCYFRLEQLECLDDAEKKLNLAQKCFKNCYGENHQRLVHIKGNCGKEK VLFLRLYLLQGIRNYHSGNDVEAYEYLNKARQLFKELYIDPSKVDNLLQLGFTAQEARLG LRACDGNVDHAATHITNRREELAQIRKEEKEKKRRRLENIRFLKGMGYSTHAAQQVLHAA SGNLDEALKILLSNPQMWWLNDSNPETDNRQESPSQENIDRLVYMGFDALVAEAALRVFR GNVQLAAQTLAHNGGSLPPELPLSPEDSLSPPATSPSDSAGTSSASTDEDMETEAVNEIL EDIPEHEEDYLDSTLEDEEIIIAEYLSYVENRKSATKKN >gi568815591f:151245350_151477222|GENSCAN_predicted_CDS_2|2100_bp atggttgtatgggatgcagacacacagcaggttattcccaatggaatacagcttgctgga ctggataaaagccactctggcttcgccttggccccgcccacaaccctctttccaagcggc ggcggcggcggcggcgcgaaggcgacagcggcggcgggggcggggctggcctcacccgga atgaaaacaaacggcggccgctgccgcatccgggcactctgctggtcgcggcgggagtgg cgtggcgcaggtgaggacacggcggccgagtgtcctcgaccccagcctcagcagcactgc ttggcgccccggttcccggtccgactgggcacctctcctggccaggggtggagcggccgc ggggcgggggaccttgctaagcagtactctgacagactagaatgctgtgaaaatgaagta gaaaaggtaatagaagaaatacgttgcaaggcaattgagcgtggaacaggaaatgacaat tatagaacaacgggaattgctacaatcgaggtgtttttaccaccaagactaaaaaaagat aggaaaaacttgttggagacccgattgcacatcactggcagagaactgaggtccaaaata gctgaaacctttggacttcaagaaaattatatcaaaattgtcataaataagaagcaacta caactagggaaaacccttgaagaacaaggcgtggctcacaatgtgaaagcgatggtgctt gaactaaaacaatctgaagaggacgcgaggaaaaacttccagttagaggaagaggagcaa aatgaggccaaactcaaagaaaaacaaattcagaggaccaagagaggactagaaatactg gcaaagagagcagcagagacagtggtggatccagaaatgacaccgtacttagacatagct aaccagacaggcagatcaatcagaattcccccatcagaaagaaaagcccttatgttagct atgggatatcatgagaagggcagagctttcctgaaaagaaaagaatatggaatagccttg ccatgtctgttggacgctgacaaatatttctgtgagtgttgcagagagctgctggacaca gtggataactacgccgtcctccagctggatatagtgtggtgttacttccgcctggaacag ctggaatgccttgatgatgcagaaaaaaaattaaacttggcccagaaatgctttaaaaat tgttacggagaaaatcatcagagactggtccacataaaaggaaattgtgggaaagagaag gtactgtttctaagactctacttacttcaagggatccgaaactatcacagtggaaatgat gtagaggcttatgagtatcttaacaaggcacgtcagctctttaaagagctatatattgat ccatcaaaagtggacaatttgttgcagttggggtttactgcccaggaagcccggcttggc ctgagggcgtgtgatgggaacgtggatcatgcggccactcatattaccaaccgcagagag gaactggcccaaataaggaaggaggaaaaagagaagaaaagacgccgcctcgagaacatc aggtttctgaaagggatgggctactccacgcacgcggcccagcaggtactccacgcagcc agcgggaacttggatgaggccctgaagattctgctcagcaatcctcagatgtggtggtta aatgattccaatcctgaaaccgacaaccgtcaagaaagtccttcccaggaaaacattgac cgattggtgtacatgggttttgatgcactcgtggccgaagctgcgctgagagtgttcaga ggcaacgtccagctggccgcccagacccttgctcacaacggaggaagcctgcctcccgag ctgccgctgtcgccagaagactctttgtccccgccagccacgtccccttctgactccgca ggaacctctagtgcctcaacagacgaagacatggagacagaggccgtcaatgagatactg gaagacattccagagcatgaggaagactatcttgactcaactctggaagatgaagaaatt attattgcagagtacctatcctatgtagaaaataggaagtcagcaacaaagaaaaactaa >gi568815591f:151245350_151477222|GENSCAN_predicted_peptide_3|1122_aa MAQRSGKITLYEGKHFTGQKLEVFGDCDNFQDRGFMNRVNSIHVESGAWVCFNHPDFRGQ QFILEHGDYPDFFRWNSHSDHMGSCRPVGMHGEHFRLEIFEGCNFTGQCLEFLEDSPFLQ SRGWVKNCVNTIKVYGDGAWVLYEEPNYHGRMYVVESGDFRSFSDWEAHSARVQSLHKIW GQFPRGESVIYDLQEKREDRLAEDICTHSSTVQTQPPEMAGSSPPGGHFVPNRRRRGGGG GEAKNKKSINLERGSRSRAPRSPRWPRAAVVRSRRADPGAPLPSGAQLLLVPRPASGMGG GGSALRVCADHRGGINWLSLSPDGQRLLTGSEDGTARLWSTADGQCCALLQAHRPQTADP SSCARRPLSRSPAPAHPGTLAEERLSGPSIWLLSARQGRVPCGLFVANCALLLAFPVDDS HSNSRRLLRASDEKHPGKKIPQGSKKQNLASLLTKSYTESTQVKRCVAIACKENSEHVPF PLRLPLSSVCATSREAGIVPIPGHESYVTFCQLEDEAAFTCSADCTIRRWDVLTGQCLQV YRGHTSIVNRILVANNQLFSSSYDRTARVWSVDKGQMSREFRGHRNCVLTLAYSAPWDLP STPCAEEAAAGGLLVTGSTDGTAKVWQVASGCCHQTLRGHTGAVLCLVLDTPGHTAFTGS TDATIRAWDILSGEQLRVFREHRGSVICLELGSTLPCPERLKGLAEGRQHTLLQKEQLAL KAAKRLMVWGSAQVVTKLVNRLVYSGSADRTVKCWLADTGECVRTFTAHRRNVSALKYHA GTLFTGSGDACARAFDAQSGELRRVFRGHTFIINCIQVHGQVLYTASHDGALRLWDVRGL RGAPRPPPPMRSLSRLFSNKSLDGATFCIHVPRLARVPGEKLQVTVSAVESSSSRCREKH LLQEQPLQVEQPLQVEQPFQVEQHALHSLNAEDTSSETAGAARDHLLGDGRVRPRARMNR KMGLLGGKQIGAGEPAETAPCTVLRVRLKHTQTSGVARSCCVPEPLGPVAATRGIPTRED LSIVLKGRFRWRRPENRLECCLRAVLPAGPEPAQTFTLTPACRRHCCLVMLRSGLSSPSL TRQLQSPPYLPTAENGACIKPSRETQWGMDPKHGRCTHFAFL >gi568815591f:151245350_151477222|GENSCAN_predicted_CDS_3|3369_bp atggcgcagcgctcggggaagatcactctctatgaaggcaagcacttcacagggcagaag ctggaggtcttcggggactgtgacaacttccaggaccggggctttatgaaccgagtgaac tccatccacgtggagagcggagcctgggtctgcttcaatcaccccgacttccggggccag cagttcatcttggagcacggcgactaccccgacttcttccgctggaacagccacagtgac cacatgggctcctgtcggcctgtaggaatgcacggagaacatttccgcctagaaatcttc gagggttgcaacttcacgggccagtgcctggagttcctggaggacagccccttcctccag agcaggggctgggtcaagaactgtgtgaacaccatcaaggtgtacggggacggagcgtgg gtcctgtatgaggagcccaactaccacggccgcatgtacgtggtggagagtggcgacttc cgcagcttctcggactgggaggcccacagcgcacgagtgcagtcgctccacaagatctgg gggcaatttcccagaggagagtctgtaatctacgacctgcaagagaaaagggaagacagg ctggctgaggacatctgcacccacagctctacagtccagactcaacccccagagatggct ggctcatccccaccagggggccactttgtgccaaaccggagaaggaggggtgggggaggt ggagaagccaagaacaaaaaatctatcaacctggaaagaggaagtaggagtcgggctccc cgttccccgcgatggccgcgggccgcagtcgtacggagccgccgcgcggaccccggggcg cccctccctagcggggcgcagctccttcttgttccccgccctgccagcgggatggggggc ggcgggtcggccctgagggtctgcgccgaccaccgcgggggcatcaactggctgagcctg agccccgacgggcagcgcctgctgacgggcagcgaggacggcacggcccggctctggagc accgcggacggccagtgctgcgcgctcctgcaagctcaccggccccagaccgcggacccc agctcctgcgcccgcagacccctttctagaagtccagcccccgctcatcccggcacacta gcggaagagcgcttgtctgggcccagcatctggctgttgagcgcacggcagggcagggtg ccttgtggcctctttgtggccaattgtgctcttctgctggcatttcctgttgacgactct cacagcaacagtagacgcttgttaagggccagcgatgaaaaacatccaggaaaaaaaatt ccacaaggctccaaaaagcaaaatttggcttccctgctcaccaagtcctacactgaatcc acacaagtgaagcgatgtgtagccatcgcatgtaaggaaaatagcgaacatgtgccattc cctctgcggctgcctctgtcctcagtctgcgctacatcccgtgaggcaggcattgtcccc atcccaggacatgaaagctatgtgaccttctgccagctggaggatgaggctgccttcaca tgcagcgccgactgcaccatcaggaggtgggacgtgctgaccgggcagtgtctgcaggtg taccgaggacacacgtccatcgtgaacaggatcctggttgccaacaaccagctcttcagc agctcctatgaccggacagctcgggtctggagtgtggacaaggggcagatgtcccgggag ttccggggccaccgcaactgcgtgctgaccctagcctactctgccccgtgggacctcccc agcactccctgcgcggaggaggccgcggccggggggcttctggtgaccggcagcacagat ggcacagccaaggtgtggcaggtggccagcggctgctgccaccagacgctgcggggccac acgggtgcagtgctgtgcctagtgctagacacgcccggccacacggccttcacaggcagc accgacgccaccatccgtgcctgggacatcctgagtggggagcagctgcgggtgttccgg gagcaccggggctccgtcatctgtctggagctgggcagcaccttgccctgccctgagcgt ctgaaggggctggcagaaggaaggcagcacacacttttgcagaaagagcagctggcactg aaggcagccaaacgtctgatggtttggggctctgctcaggtagtcactaagctggtgaac cgactcgtgtactctggcagcgcggacaggaccgtcaagtgctggctggcagacacaggg gagtgtgtgcgcacgttcacggcccacagacgcaacgtgagcgccctcaagtaccacgcg ggcaccttgttcacgggcagcggggacgcttgcgcccgggccttcgacgcgcagtctgga gagctgcggagggtgttccggggccacacattcatcatcaactgcatccaggtgcacggc caggtgctctacaccgcctcgcacgacggcgccctgcgcctctgggacgtgcgcgggctc cgaggtgccccgcggccccctccgcccatgcgcagcctctcgcggctcttcagcaacaag tcccttgatggtgccactttctgcatccatgtgccccggttggcccgggtgccaggagag aagctgcaggtgacagtgtctgccgtggagagctcatcatctcgctgcagagagaagcac ctgctgcaggagcagccccttcaagtggagcagccccttcaagtggagcagccctttcag gtagagcagcatgcgcttcatagtcttaatgcagaagatacttcatcagagacggccggg gccgccagggatcacctgctaggtgatgggcgggtccgccctcgggcgcggatgaacagg aaaatgggcctgttgggaggcaagcagatcggtgcaggagaacccgcagaaacggcacca tgcactgttctccgggtgagacttaagcacacacagacctcgggggtggccaggtcttgc tgcgtcccggagccattggggcctgtggctgccacaaggggcatacctaccagagaggac ctttccatcgtgctgaagggacggttccggtggcgcaggccagagaaccgccttgagtgc tgcctgcgggcagtgctgccggccggccctgagcctgcacagacgttcacactgactcct gcttgccgaagacactgctgcttggtgatgctgcgctctggcctgagcagcccctcactg acacgtcagctccagtccccaccttatctccccacagcagagaatggagcatgcatcaag ccatccagagaaacccagtggggcatggacccaaaacacgggcgctgcactcactttgca tttttatag >gi568815591f:151245350_151477222|GENSCAN_predicted_peptide_4|150_aa MIVYLQQQSVLWVAPESCSAAILFLGYTSKAIRNTTTYPSKRVYQIREKIFHSSLAQGIK NNQQGIQVHQHRTLWLPYGARLTRVICAELYPKRLKQEDSRELQLEQRKSLIAEAPSGAG HLHVFQVPGTLLLLLLRAQGLHFEKHWLGQ >gi568815591f:151245350_151477222|GENSCAN_predicted_CDS_4|453_bp atgattgtgtatctccagcagcagtcggtgctgtgggtggctccggaatcctgcagtgct gccatcttattcctgggctacacatccaaagcaatacggaacaccacgacttaccccagc aaacgagtttaccaaatccgagaaaaaatatttcattccagtctggcccaaggcatcaag aacaatcagcaaggaattcaagtacatcagcacaggactctgtggctgccttacggggcg cggttgacgagggtgatatgtgctgaactttaccctaaaaggctcaagcaggaggacagc agggagttacaactggagcagcggaagtccctgattgcagaggccccgtcaggtgcagga catctgcatgtcttccaagttccagggacgctgctgctgctgctgctgcgagctcagggg ctgcattttgagaagcactggcttggacaataa >gi568815591f:151245350_151477222|GENSCAN_predicted_peptide_5|77_aa XFEVIKVIHGKLLDMVGKVQVISYEEGKALAESWNAAFLESSAKENQTAVDVFRRIILEA EKMDGAASQGKSSCSVM >gi568815591f:151245350_151477222|GENSCAN_predicted_CDS_5|234_bp nnttttgaagtgattaaagttatccatggcaaattgttggatatggtggggaaagtacag gtgatcagttatgaagaagggaaagctttggcagaatcttggaatgcagcttttttggaa tcttctgctaaagaaaatcagactgctgtggatgtttttcgaaggataattttggaggca gaaaaaatggacggggcagcttcacaaggcaagtcttcatgctcggtgatgtga