GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:52:55

Sequence gi568815596f:202841903_203083621 : 241719 bp : 39.27% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    547    542    6                               1.05
 1.02 Term -   2685   2564  122  1  2  101   43    99 0.744   4.36
 1.01 Init -   3142   3058   85  1  1   34  101    66 0.560   3.53
 1.00 Prom -   5814   5775   40                              -6.05

 2.04 PlyA -   6422   6417    6                              -0.45
 2.03 Term -   8155   7983  173  2  2   82   51   137 0.090   6.41
 2.02 Intr -  29871  29717  155  2  2   55  119   195 0.996  18.09
 2.01 Init -  29974  29904   71  1  2   62   24   147 0.986   4.27
 2.00 Prom -  33850  33811   40                              -4.15

 3.13 PlyA -  33943  33938    6                               1.05
 3.12 Term -  39035  38958   78  2  0   88   48    50 0.841  -2.22
 3.11 Intr -  40881  40809   73  2  1   66   78   113 0.985   6.59
 3.10 Intr -  41839  41707  133  2  1   96   94   118 0.999  11.98
 3.09 Intr -  42401  42296  106  2  1  121  113    -9 0.999   3.77
 3.08 Intr -  42633  42493  141  0  0   92   91    93 0.975   9.63
 3.07 Intr -  54317  54163  155  0  2  109  106    20 0.860   4.77
 3.06 Intr -  55513  55398  116  0  2   80   72   134 0.952  10.17
 3.05 Intr -  57735  57629  107  0  2   68   27    85 0.931  -1.51
 3.04 Intr -  59217  59123   95  1  2   70   89    91 0.880   6.16
 3.03 Intr -  66057  65963   95  2  2   99   64   127 0.659  10.09
 3.02 Intr -  69643  69534  110  1  2   13  100    76 0.232  -0.54
 3.01 Init -  70716  70615  102  0  0   59   36   136 0.171   5.81
 3.00 Prom -  76126  76087   40                              -8.15

 4.00 Prom +  76226  76265   40                              -6.45
 4.01 Sngl +  82798  83280  483  0  0   68   36   429 0.736  31.52
 4.02 PlyA +  84466  84471    6                               1.05

 5.00 Prom +  85608  85647   40                              -3.55
 5.01 Init + 100001 100114  114  1  0   84   60    93 0.615   6.36
 5.02 Intr + 100838 101065  228  1  0   33  101   128 0.667   5.74
 5.03 Intr + 110657 110777  121  1  1   52   84   123 0.530   7.45
 5.04 Intr + 112103 112232  130  0  1   65  100   127 0.989  10.43
 5.05 Intr + 113772 113856   85  0  1   77  105    37 0.986   3.20
 5.06 Intr + 119335 119524  190  0  1   43   80   138 0.965   6.74
 5.07 Intr + 125076 125196  121  1  1   80  111    59 0.958   6.03
 5.08 Intr + 128017 128160  144  1  0   48   52   148 0.969   5.68
 5.09 Intr + 129603 129836  234  0  0   85   82    96 0.796   4.58
 5.10 Intr + 132432 132594  163  0  1   86   72    28 0.134   0.06
 5.11 Intr + 140170 140539  370  0  1   47   83   261 0.059  15.25
 5.12 Intr + 144208 144236   29  2  2  107  103   -21 0.088  -1.78
 5.13 Term + 164764 164964  201  0  0   60   43   146 0.971   3.71
 5.14 PlyA + 166761 166766    6                               1.05

 6.00 Prom + 168332 168371   40                              -5.75
 6.01 Init + 172659 172806  148  2  1   77   85    88 0.744   5.98
 6.02 Term + 172849 172910   62  2  2   62   38   124 0.459   1.79
 6.03 PlyA + 174409 174414    6                               1.05

 7.00 Prom + 196496 196535   40                              -6.65
 7.01 Init + 198206 198795  590  1  2   76   -8   593 0.041  43.24
 7.02 Intr + 207912 208073  162  0  0  106   70    74 0.287   5.57
 7.03 Intr + 214525 214606   82  1  1   87   58    33 0.085  -1.08
 7.04 Intr + 215424 215551  128  2  2   70   75   100 0.060   5.46
 7.05 Intr + 216746 216929  184  2  1   -8   59   129 0.027  -0.73
 7.06 Intr + 241317 241623  307  2  1   69   80   227 0.040  15.10

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 140374 140539  166  0  1   97   83   232 0.876  23.44
S.002 Sngl + 198206 198817  612  1  0   76   32   591 0.903  48.24
S.003 Init + 241350 241623  274  2  1   81   80   212 0.922  16.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_1|68_aa
MTYRIRERLEDAMLLALKMEEGGPHARKWKPLVIQLPLTGLLNLILVLGYEDYLALTVSY
IQKLDHCQ
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_1|207_bp
atgacatatagaattagagaaagattagaagatgctatgctgctggccttaaagatggag
gaagggggaccacatgccaggaaatggaagcctctggtgattcagttacctctgactggc
ttgttgaatttaattttagtgctcggttatgaagattatttggctctcacagtaagctat
atccagaaacttgaccattgtcagtag
>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_2|132_aa
MLRSRVGPVAGSGRRPRGANADASGSLTPPRTGAEGHWLPEGRESARGSRAVAGTGAGTD
HRTAERDPRRRKSGAGPSSAGLLQFAGGLLYNLFAWVSPAEATEQQRLLPVPSSESFVPE
GHQPDASWNSPV
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_2|399_bp
atgctccgatcccgagtcgggccggtggcggggtcaggccggcgcccgaggggggcgaac
gcggacgccagcgggagcctgacacctccacgcactggtgcggaggggcactggctgccg
gagggccgggagtcggcgcggggctcgcgggcggtggccggaacgggtgccgggacggat
caccgtacggccgaacgcgacccgaggaggcggaagagcggcgccggcccctcttctgca
ggtctgctacagtttgctggaggtctactctataacctgtttgcctgggtatcaccagca
gaggctacagaacagcaacgattgctgcctgttccttcctctgaaagctttgttccagag
gggcaccagcctgatgccagctggaactctcctgtatga
>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_3|436_aa
MWDPEKAPKLGAEGAASDRLRLGAEAGSCSELRRAGASCCGSANPYVSVGKSCVLLAMAQ
LQTRFYTDNKKYAVDDVPFSIPAASEIADLSNIINKLLKDKNEFHKHVEFDFLIKGQFLR
MPLDKHMEMENISSEEVVEIEYVEKYTAPQPEQCMFHDDWISSIKGAEEWILTGSYDKTS
RIWSLEGKSIMTIVGHTDVVKDVAWVKKDSLSCLLLSASMDQTILLWEWNVERNKVKALH
CCRGHAGSVDSIAVDGSGTKTPIVTLSGHMEAVSSVLWSDAEEICSASWDHTIRVWDVES
GSLKSTLTGNKVFNCISYSPLCKRLASGSTDRHIRLWDPRTKDGSLVSLSLTSHTGWVTS
VKWSPTHEQQLISGSLDNIVKLWDTRSCKAPLYDLAAHEDKVLSVDWTDTGLLLSGGADN
KLYSYRYSPTTSHVGA
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_3|1311_bp
atgtgggaccccgaaaaggcccccaagcttggcgcggaaggcgccgccagtgaccggctg
cggctgggggcggaggccggcagttgctcggagctccggcgggcaggagcttcgtgttgt
gggtctgctaacccgtacgtttccgtgggcaagtcgtgtgtactcctcgccatggctcag
ctccaaacacgcttctacactgataacaagaaatatgccgtagatgatgttcccttctca
atccctgctgcctctgaaattgccgaccttagtaacatcatcaataaactactaaaggac
aaaaatgagttccacaaacatgtggagtttgatttccttattaagggccagtttctgcga
atgcccttggacaaacacatggaaatggagaacatctcatcagaagaagttgtggaaata
gaatacgtggagaagtatactgcaccccagccagagcaatgcatgttccatgatgactgg
atcagttcaattaaaggggcagaggaatggatcttgactggttcttatgataagacttct
cggatctggtccttggaaggaaagtcaataatgacaattgtgggacatacggatgttgta
aaagatgtggcctgggtgaaaaaagatagtttgtcctgcttattattgagtgcttctatg
gatcagactattctcttatgggagtggaatgtagagagaaacaaagtgaaagccctacac
tgctgtagaggtcatgctggaagtgtagattctatagctgttgatggctcaggaactaaa
actcccatagtgaccctctctggccacatggaggcagtttcctcagttctgtggtcagat
gctgaagaaatctgcagtgcatcttgggaccatacaattagagtgtgggatgttgagtct
ggcagtcttaagtcaactttgacaggaaataaagtgtttaattgtatttcctattctcca
ctttgtaaacgtttagcatctggaagcacagataggcatatcagactgtgggatccccga
actaaagatggttctttggtgtcgctgtccctaacgtcacatactggttgggtgacatca
gtaaaatggtctcctacccatgaacagcagctgatttcaggatctttagataacattgtt
aagctgtgggatacaagaagttgtaaggctcctctctatgatctggctgctcatgaagac
aaagttctgagtgtagactggacagacacagggctacttctgagtggaggagcagacaat
aaattgtattcctacagatattcacctaccacttcccatgttggggcatga
>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_4|160_aa
MGQMSALTPRTSPGWATAAASGVAWTLAWVWREAMPGLVVWRSITAATVTQSLPSPLKLD
VHTNTQAVCTPEKLQIKTLNKFAPFTEKVPFLEQQNKILETKWILLQQQKVAWSNMDSMF
ESYINNLKQQLDTLSQEQLKLEAELGNMERPVEDYQKCEV
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_4|483_bp
atgggccagatgtctgcattaactcctagaacttctcctggatgggcaacagcagcagct
tccggggtagcctggacactggcatgggtctggagggaggctatgccaggtctggtggta
tggaggagcatcacagctgccacagtgacccagagcctcccgagtccccttaagctggat
gtgcacaccaacacccaggctgtatgcaccccggagaaattgcagatcaagaccctaaac
aagtttgcccccttcactgagaaggtacccttcctggagcagcagaacaagatactggag
accaagtggatccttttgcagcagcaaaaagtagcttggagcaacatggacagcatgttt
gagagctacatcaacaacctaaagcagcaactggacacactgagccaggagcagctgaag
ctggaggcagaacttggcaacatggagagaccagtggaggactatcaaaagtgtgaagtt
taa
>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_5|709_aa
MEQSNDSLRVNHNDGEESKTSAQVFEVWIQLGTFFHPRHLICMDSRDSSFGQNDSPTVLP
ITTREANNSLISQNIPGPLTQTQTLSAEQFHLVDQNGQAIQYELQSLGESNAQMMIVASP
TENGQVLRVIPPTQTGMAQVIIPQGQLVDVNSPRDVPEEKPSNRNLPTVRVDTLADNTSN
YILHPQTSFPLPKKSVTGMLEEPLLGPLQPLSSNTPIWACRLRSCEKIGDSYRGYCVSET
ELESVLTFHKQQTQSVWGTRQSPSPAKPATRLMWKSQYVPYDGIPFVNAGSRAVVMECQY
GPRRKGFQLKKVSEQESRSCQLYKATCPARIYIKKVQKFPEYRVPTDPKIDKKIIRMEQE
KAFNMLKKNLVDAGGVLRWYVQLPTQQAHQYHELETPCLTLSPSPFPVSSLEEEETAVRD
ENCALPSRLHPQVAHKIQELVSQGIEQVYAVRKQLRKFVERELFKPDEVPERHNLSFFPT
VNDIKNHIHEVQKSLRNGDTVYNSEIIPATGLQLQPRYTSPDESPAVVSVNNQPSSSPSG
LLDTIGSAVMNNNSLLLGQSHSLQRDTCLTQNNSTASTMGNLPEPDQNLVAMDELVEVGD
VEDTGNLEGTVHRILLGDVQTIPIQIIDNHSALSRNWSYQYFHISKYVRQKLIELQGEMD
ESTIIVGDLSTPLLYKRTDPSSRHKDIVELNSNIDICRLLIQQQQNTHS
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_5|2130_bp
atggaacaatctaatgattcattaagagtcaaccataatgacggtgaagagtcaaaaacc
agtgctcaagtatttgaggtatggattcagctgggaacctttttccatcctaggcatcta
atctgtatggactccagggattcttcctttggacaaaatgattctcctacagttttgccc
atcactactcgtgaagcaaataattcactcatatcacagaatataccagggcccctgact
cagacacagactctttctgcagagcaattccatctagtggaccaaaatgggcaggctatt
caatatgaacttcagtcattgggggaatccaatgcacaaatgatgatcgttgccagccca
acagaaaatggacaggtacttcgtgtaattccacctacccagacaggaatggcacaagtg
attatacctcaggggcaacttgtggatgtgaatagtcctcgggatgtccctgaagagaaa
cccagtaacagaaacttaccaactgtaagagtggatactctagcagacaataccagcaat
tacattcttcatcctcaaacatccttcccattgcccaaaaagtcagtgaccggaatgctg
gaagaaccccttctggggcctcttcagccactttcttctaatacacctatatgggcctgc
cgtcttaggagctgtgagaaaattggagattcataccgtggctactgtgtaagtgagact
gaattagaaagtgtcctaacatttcacaagcagcaaacacagagtgtttgggggacccgt
cagtctccaagcccagccaagcctgctacacgcttgatgtggaaatcccagtatgttcca
tatgatggaatcccatttgttaatgcagggagtagagctgtggtaatggagtgtcagtat
gggccaagaagaaaaggtttccagttaaaaaaagtcagtgagcaggaaagcaggtcttgt
cagctctacaaagccacttgtccagctcggatttacattaaaaaggtacagaagtttcct
gaatatagagttcctacagaccccaaaattgacaagaaaattatcagaatggagcaggag
aaagcttttaacatgctaaagaagaacttggtagatgctggtggtgttcttaggtggtat
gtacagttacctacacagcaagctcatcagtatcatgaattagagactccctgcctcact
ttgtcaccttctccttttcctgtgtcttctcttgaagaagaggaaactgcagttagagat
gagaattgtgcattaccctcacgtttacatcctcaagtagcacataagattcaagaatta
gtatcacagggaatagaacaagtgtatgcagtaaggaaacagctaagaaaatttgtggaa
agggaactgttcaaacccgatgaggtacctgaaagacataatttatctttttttccaact
gtaaatgatataaaaaatcacatccatgaggtacagaaatccttgagaaatggagatacg
gtatataactcagagattattccagcaacgggtttgcagttacaaccaaggtacacctct
cctgatgaatcaccagctgtggtatcagtaaataaccagccgtcctctagtccttcagga
cttctggatacaataggaagtgctgtaatgaataataattctctactgcttggtcaaagt
catagccttcaaagagatacatgcttaacccaaaacaatagtactgcctccaccatgggt
aaccttccagaaccagatcaaaatctagttgcaatggacgagctggtagaagttggagat
gttgaggatacagggaatctggaaggaactgttcatcggattctgttgggagatgtgcag
actattccaatacagattatagacaaccactcagctcttagtagaaactggagctatcaa
tattttcatatatcaaaatatgtgaggcaaaaactgatagaactccaaggagaaatggat
gaatccactattatagttggagacttaagtacccctctattatataaacggacagatcca
tcaagcagacataaggacatagttgaattgaacagtaacatcgacatctgtcgactactt
atccaacaacagcagaatacacattcttaa
>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_6|69_aa
MTPSAKSGAGVDVSTSALLHVQLREGSFQCRGRRPPRDGQPEPVDCGCAAYATIEQLPPN
GWVNPQRPK
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_6|210_bp
atgacgccatcagcgaaaagcggggcgggcgtagacgtcagcacgtcagccctcctccat
gtccagctgagggaaggctcgtttcagtgccgcggccggcgcccgccaagggatgggcag
ccagagcctgtcgactgcggctgcgcagcctacgcgaccattgaacagctgccgcccaac
ggctgggtaaatcctcagcggccgaaatga
>gi568815596f:202841903_203083621|GENSCAN_predicted_peptide_7|485_aa
MAEQEQRKIHLVPENLLKKRKAYQALKATQAKQALLAKKEQRKGKRLRFKRLESFLHDSW
RQKCDKVHLRRLEMRPHALELPDKHAMAFVVHIKRINGMSLPVERTVVRLRLKKMFSGVF
VKVNPQNLKMLLIVETYVTWGFPNLKSVRELTWKHEQAKVKNKTIPLTDNTVIEEHLEKF
GDICLEDLIHEIAFPGKVDDMPPGISLLPDNILQVLRIQLLQCVQKMADGLEEQQQALSI
LLVKFFIILCRNLSNVEEIGTCSYINYVITMTTLYIQQLKSKKKEKEMADQTCIEEFVIH
ALAFCESLYDPYRNWRHRISGILAGVMLMSIEKKSNVEPVVALFLPRFIWYPLLGSPKGR
QLEKESLHDACRSITKQREYRRRNALQAISPATMEVLMRVLADCDSWEDGDPEEVGRKAE
LTLKCLTEVVHILLSSNSDQRQVETSTILENYFKLLNSDHSALPNQRRSRQWENRFIALQ
IKMLX
>gi568815596f:202841903_203083621|GENSCAN_predicted_CDS_7|1455_bp
atggcagagcaagagcaaagaaaaatccatttggttccagaaaatctcctgaaaaagagg
aaggcttatcaagccctcaaagccactcaggcaaagcaggcacttttggcaaagaaggag
cagaggaaaggaaaaaggctcaggtttaagcgactggaatcattcctacatgattcctgg
aggcagaaatgtgacaaggtgcatctcagacgactagaaatgagacctcatgccttggaa
ttgccagataaacatgccatggcctttgttgtacacatcaaaaggattaatggcatgagt
ttaccggtggagagaaccgttgtaagacttcgcctgaagaaaatgtttagtggtgtcttt
gtaaaagtcaacccccagaacctaaaaatgctgcttatagtggaaacttatgtgacctgg
ggatttccaaatctgaagtctgtgcgggaactcacttggaaacatgaacaagccaaggtc
aagaataagaccatccctctgacagacaacacagtgattgaggagcacctggagaagttt
ggtgacatttgcttggaagacctcattcatgaaattgccttcccagggaaggtagatgat
atgcctccaggaatatctctgcttcctgataatattctgcaggttctgaggatccagctt
ctacagtgtgttcagaaaatggcagatgggttagaggaacaacagcaagccttgtcaatt
ttgcttgtcaagttcttcattattctttgcagaaatctatcaaatgtggaagaaattggg
acttgctcgtacattaattatgtcatcaccatgacaacactctatattcagcaattaaaa
agcaaaaaaaaagagaaggaaatggcagatcagacatgtattgaagaatttgtgatccac
gcattggcattttgtgaaagcttatatgatccatatcggaattggagacatagaatttca
ggaattcttgctggagtgatgctgatgagtatagagaaaaagagcaatgtggaaccagtt
gttgctctctttcttccacgcttcatctggtaccctctattagggagcccaaaagggagg
cagctggaaaaagaaagtttgcatgatgcctgccgcagcatcacaaagcagagagaatat
agaaggaggaatgctttgcaagcaatttctccagccactatggaagttcttatgcgagta
ttggcagattgtgattcctgggaggatggagatcctgaagaagtgggtaggaaggcagaa
ctaactctgaagtgccttacagaagtggtacatatccttctcagtagcaactctgatcag
cgtcaagtggaaaccagtactattctggagaactattttaaattgctaaattcagatcat
tcagctttacctaatcaaaggaggtccagacagtgggaaaaccgatttattgctctacag
atcaaaatgctgann