GENSCAN 1.0	Date run: 7-Nov-116	Time: 17:14:52

Sequence gi568815582f:24440627_24672244 : 231618 bp : 43.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9125   9185   61  1  1   86  103    50 0.467   7.71
 1.02 Term +  15247  15353  107  2  2  121   46    24 0.241   0.07
 1.03 PlyA +  17064  17069    6                               1.05

 2.07 PlyA -  17100  17095    6                               1.05
 2.06 Term -  26271  26237   35  1  2  136   48    19 0.175   0.45
 2.05 Intr -  29955  29811  145  0  1   46   67    69 0.113   0.46
 2.04 Intr -  54506  54358  149  0  2   90  119   -33 0.093  -0.05
 2.03 Intr -  54752  54716   37  2  1  106   80    13 0.094   0.14
 2.02 Intr -  58803  58723   81  0  0  113   85   -19 0.043   0.13
 2.01 Init -  76411  76313   99  1  0   90   75   108 0.742   9.96
 2.00 Prom -  83905  83866   40                              -4.56

 3.00 Prom +  84099  84138   40                              -2.96
 3.01 Init + 100001 100166  166  1  1   97  116   282 0.942  31.69
 3.02 Intr + 105537 105636  100  1  1   81   92    43 0.433   3.17
 3.03 Intr + 108319 108355   37  1  1  103   39    -9 0.233  -6.04
 3.04 Intr + 114989 115077   89  1  2   46  108   130 0.888   9.57
 3.05 Intr + 115195 115291   97  1  1   79   98    46 0.997   4.71
 3.06 Intr + 115682 115821  140  1  2  100   85    67 0.988   6.86
 3.07 Intr + 118879 119051  173  1  2   67   64   135 0.991   8.59
 3.08 Intr + 121466 121535   70  0  1   77  111    11 0.514   0.54
 3.09 Intr + 122573 122669   97  2  1   60   94    26 0.401   0.41
 3.10 Intr + 122797 122875   79  0  1   56  105    22 0.406  -0.18
 3.11 Intr + 122984 123038   55  0  1  119   79     0 0.576   0.24
 3.12 Intr + 127166 127267  102  2  0   89   92    52 0.969   4.99
 3.13 Intr + 128119 129873 1755  1  0  114   70   775 0.820  65.62
 3.14 Term + 130250 131819 1570  2  1   86   42  1101 0.998  95.13
 3.15 PlyA + 132210 132215    6                               1.05

 4.03 PlyA - 132713 132708    6                               1.05
 4.02 Term - 163032 162946   87  0  0   33   47   116 0.160  -0.34
 4.01 Init - 183679 183461  219  1  0   71   67   133 0.557   8.13

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:24440627_24672244|GENSCAN_predicted_peptide_1|55_aa
MVEEEENTFFFTWWQQGEVPGSTSTLTCLDPESPPSQTLNLARSPDPSTRLFPLE

>gi568815582f:24440627_24672244|GENSCAN_predicted_CDS_1|168_bp
atggtggaagaagaagaaaacacgttcttcttcacatggtggcagcaaggagaagtgcca
ggaagcacaagcactctaacctgcctagacccagaatctcctccttctcaaactctgaac
cttgcacggagcccagaccccagtaccagactgttccccctggaatag

>gi568815582f:24440627_24672244|GENSCAN_predicted_peptide_2|181_aa
MDTRSGNKMKICLQYHRYDYKAGLAVREAQQDQEKLSWVCLIPACQLRALLFRNSTTLAL
VYSKEPTTSHRPEERAFPFLSVKPLLLNSLLACPRPRFPWRKTTNLRHLPQTTKVLHKVM
QLGEEEEEEEGLVSLSQGWQRQKKIQVEVDPSSSNPCCSGVNWVNCVVKGNQLLWGKPAV
M

>gi568815582f:24440627_24672244|GENSCAN_predicted_CDS_2|546_bp
atggacacacgctcggggaacaagatgaagatctgccttcaatatcaccgttatgattac
aaggctggactggccgtcagagaagcacaacaggaccaggagaaattgagttgggtgtgt
ctaattccagcttgccagctgagggcgcttctgttccgtaattccaccaccctggccttg
gtgtacagtaaggaaccgacaacatcacaccggccagaagagagagcttttccctttctc
tctgttaaacctcttctcttaaactcacttcttgcgtgtcctcgtcctcgatttccttgg
cgtaagacgacgaacctcaggcatttaccccagacaacgaaggtgcttcataaggtcatg
cagctgggtgaagaggaggaggaagaggaggggttggtctcactgtctcaagggtggcag
aggcagaagaaaatccaggtagaagtggacccaagcagttcaaatccatgctgttcaggg
gtcaactgggtcaactgtgtagttaaaggtaatcagttgctctgggggaagccagctgtc
atgtga

>gi568815582f:24440627_24672244|GENSCAN_predicted_peptide_3|1509_aa
MSCVHYKFSSKLNYDTVTFDGLHISLCDLKKQIMGREKLKAADCDLQITNAQTKEEYTDD
NALIPKNSSVIVRRIPIGGVKSTSKTYVISRTEPAMATTKATANLAEANASEEDKIKAMM
SQSGHEYDPINYMKKPLGPPPPSYTCFRCGKPGHYIKNCPTNGDKNFESGPRIKKSTGIP
RSFMMEVKDPNMKGAMLTNTGKYAIPTIDAEAYAIGKKEKPPFLPEEPSSSSEEDDPIPD
ELLCLICKDIMTDAVVIPCCGNSYCDESPVPDITATVSISVHSEKSDGPFRDSDNKILPA
AALASEHSKGTSSIAITALMEEKGYQVPVLGTPSLLGQSLLHGQLIPTTGPVRINTARPG
GGRPGWEQEKKKSKLDEFTNDFAKELMEYKKIQKERRRSFSRSKSPYSGSSYSRSSYTYS
KSRSGSTRSRSYSRSFSRSHSRSYSRSPPYPRRGRGKSRNYRSRSRSHGYHRSRSRSPPY
RRYHSRSRSPQAFRGQSPNKRNVPQGETEREYFNRYREVPPPYDMKAYYGRSVDFRDPFE
KERYREWERKYREWYEKYYKGYAAGAQPRPSANRENFSPERFLPLNIRNSPFTRGRREDY
VGGQSHRSRNIGSNYPEKLSARDGHNQKDNTKSKEKESENAPGDGKGNKHKKHRKRRKGE
ESEGFLNPELLETSRKSREPTGVEENKTDSLFVLPSRDDATPVRDEPMDAESITFKSVSE
KDKRERDKPKAKGDKTKRKNDGSAVSKKENIVKPAKGPQEKVDGERERSPRSEPPIKKAK
EETPKTDNTKSSSSSQKDEKITGTPRKAHSKSAKEHQETKPVKEEKVKKDYSKDVKSEKL
TTKEEKAKKPNEKNKPLDNKGEKRKRKTEEKGVDKDFESSSMKISKLEVTEIVKPSPKRK
MEPDTEKMDRTPEKDKISLSAPAKKIKLNRETGKKIGSTENISNTKEPSEKLESTSSKVK
QEKVKGKVRRKVTGTEGSSSTLVDYTSTSSTGGSPVRKSEEKTDTKRTVIKTMEEYNNDN
TAPAEDVIIMIQVPQSKWDKDDFESEEEDVKSTQPISSVGKPASVIKNVSTKPSNIVKYP
EKESEPSEKIQKFTKDVSHEIIQHEVKSSKNSASSEKGKTKDRDYSVLEKENPEKRKNST
QPEKESNLDRLNEQGNFKSLSQSSKEARTSDKHDSTRASSNKDFTPNRDKKTDYDTREYS
SSKRRDEKNELTRRKDSPSRNKDSASGQKNKPREERDLPKKGTGDSKKSNSSPSRDRKPH
DHKATYDTKRPNEETKSVDKNPCKDREKHVLEARNNKESSGNKLLYILNPPETQVEKEQI
TGQIDKSTVKPKPQLSHSSRLSSDLTRETDEAAFEPDYNESDSESNVSVKEEESSGNISK
DLKDKIVEKAKESLDTAAVVQVGISRNQSHSSPSVSPSRSHSPSGSQTRSHSSSASSAES
QDSKKKKKKKEKKKHKKHKKHKKHKKHAGTEVELEKSQKHKHKKKKSKKNKDKEKEKEKD
DQKVKSVTV

>gi568815582f:24440627_24672244|GENSCAN_predicted_CDS_3|4530_bp
atgtcctgtgtgcattataaattttcctctaaactcaactatgataccgtcacctttgat
gggctccacatctccctctgcgacttaaagaagcagattatggggagagagaagctgaaa
gctgccgactgcgacctgcagatcaccaatgcgcagacgaaagaagaatatactgatgat
aatgctctgattcctaagaattcttctgtaattgttagaagaattcctattggaggtgtt
aaatctacaagcaagacatatgttataagtcgaactgaaccagcgatggcaactacaaaa
gcaactgccaatctggctgaagccaatgcttctgaagaagataaaattaaagcaatgatg
tcgcaatctggccatgaatacgacccaatcaattacatgaagaaacctctaggtccacca
cctccatcttacacgtgtttccgttgtggtaaacctggacattatattaagaattgccca
acaaatggggataaaaactttgaatctggtcctaggattaaaaagagcactggaattccc
agaagtttcatgatggaagtgaaagatcctaatatgaaaggtgcaatgcttaccaacact
ggaaaatatgcaataccaactatagatgcagaagcatatgcaattgggaagaaagagaaa
cctcccttcttaccagaggagccatcttcttcctcagaagaagatgatcctatcccagat
gaattgttgtgtctcatctgcaaggatattatgactgatgctgttgtgattccctgctgt
ggaaacagttactgtgatgaatctcctgtacctgatataactgcaacagtatccatatca
gttcattcagaaaaatcagatggaccttttcgggattctgataataaaatattgccagct
gcagctcttgcatcagagcactcaaagggaacctcctcaattgcaattaccgctcttatg
gaagagaagggttaccaggtgcctgttcttggaaccccatctttgcttggacagtcatta
ttgcatggacagttgatccccacaactggtccagtaagaataaatactgctcgtccaggt
ggtggtcgaccaggctgggaacaggaaaagaaaaagtccaagctagatgagtttacaaat
gattttgctaaggaattgatggaatacaaaaagattcaaaaggagcgtaggcgctcattt
tccaggtctaaatctccctatagtggttcttcgtattcaagaagttcatatacttattct
aaatcaagatctggttcaacacgttcacgctcttattctcgatcattcagccgctcacat
tctcgttcctattcacggtcacctccataccccagaagaggcagaggcaagagccgcaat
taccgttcacggtctagatctcatggatatcatcgatctaggtcaaggtcacccccttac
agacgctatcattcacgatcaagatctcctcaagcgtttaggggacagtctcctaataaa
cgtaatgtacctcaaggggaaacagaacgtgaatattttaatagatacagagaagttcca
ccaccatatgacatgaaagcatattatgggagaagtgttgactttagagacccatttgaa
aaagaacgctaccgagaatgggagagaaaatatagagagtggtatgaaaaatattataaa
ggttatgctgctggagcacagcctagaccctcagcaaatagagagaacttttctccagag
agatttttgccacttaacatcaggaattctcccttcacaagaggccgcagagaagactat
gttggtgggcaaagtcatagaagtcgaaacataggtagcaactatccagaaaagctttca
gcaagagatggtcacaatcagaaggataatacaaagtcaaaagagaaggagagtgaaaac
gctccaggagatggtaaaggaaataagcataagaaacacagaaaaagaagaaaaggggag
gaaagtgagggttttctgaacccagagttattagagacttctaggaaatcaagagaacct
acaggtgttgaagaaaataaaacagactcattgtttgttctcccaagtagagatgatgcc
acacctgttagagatgaaccaatggatgcagaatcaatcacttttaaatcagtgtctgaa
aaagacaagagagaaagggataaaccaaaagcaaagggtgataaaaccaaacggaagaat
gatggatctgctgtgtccaaaaaagaaaatattgtaaaacctgctaaaggaccccaagaa
aaagtagatggagaacgtgagagatctcctcgatctgaacctccaattaaaaaagccaaa
gaggagactccgaagactgacaatactaaatcatcatcttcctctcagaaggatgaaaaa
atcactggaacccccagaaaagctcactctaaatcagcaaaagaacaccaagaaacaaaa
ccagtcaaagaggaaaaagtgaagaaggactattccaaagatgtcaaatcagaaaagcta
acaactaaggaagaaaaggccaagaagcctaatgagaaaaacaaaccacttgataataag
ggagaaaaaagaaaaagaaaaactgaagaaaaaggcgtagataaagattttgagtcttct
tcaatgaaaatctcgaaactagaagtgactgaaatagtgaaaccatcaccaaagcgcaaa
atggaacctgatactgaaaaaatggataggacccctgaaaaggacaaaatttctttaagt
gcgccagccaaaaaaatcaaactcaacagagaaactgggaagaaaattggaagtacagaa
aatatatcaaacacaaaagaaccctctgaaaaattggagtcaacatctagcaaagttaaa
caagaaaaagtcaaaggaaaggtcagacgaaaagtgactggaactgaaggatccagctca
actctggtggattacaccagtacgagctcaactggaggcagtcctgtgcggaaatctgaa
gaaaaaacagatacaaagcgaactgtgattaaaacgatggaagaatataataatgacaat
accgcgccagctgaagatgttatcattatgattcaggttcctcaatccaaatgggataaa
gatgactttgaatctgaagaagaagatgttaaatccacacagcctatatcaagtgtagga
aaacctgctagtgttataaaaaatgttagtacaaagccatcaaatatagtcaagtatcct
gagaaagaaagtgagccatccgagaaaattcagaaattcaccaaggacgtgagccatgaa
atcatacaacatgaggttaaaagttcaaaaaactctgcatctagtgaaaaagggaaaacc
aaagatcgagattattcagtgttggaaaaggagaaccctgaaaagaggaagaacagcact
cagccagagaaagagagtaatttggaccgtctgaatgaacaaggaaattttaaaagtctg
tctcaatcttccaaagaggctagaacgtcagataaacatgattccactcgtgcttcctca
aataaagacttcactcccaatagagacaaaaaaactgactatgacaccagagagtattca
agttccaaacgtagagatgaaaagaatgaattaacaagacgaaaagactctccttctcgg
aataaagattctgcatctggacagaaaaataaaccaagggaagagagagatttgcctaaa
aaaggaacaggagattccaaaaaaagtaattctagtccctcaagagacagaaaacctcat
gatcacaaagccacttatgatactaaacggccaaatgaagagacaaaatctgtagataaa
aatccttgtaaggatcgtgagaagcatgtattagaagcaaggaacaataaagagtcaagt
ggcaataaactactttatatacttaacccaccagagacacaggttgaaaaagagcaaatt
actgggcaaattgacaagagtactgtcaagcctaaaccccagttaagtcattcctctaga
ctttcctctgacttaactagagaaactgatgaagctgcttttgaaccagactataatgaa
agtgacagtgaaagtaatgtttctgtaaaagaagaggaatcttcaggaaacatttctaag
gacctgaaagataaaatagtggagaaagcaaaagagagcctggacacagcagcagttgtc
caggtgggcataagcaggaatcagagccacagcagccccagcgtcagccccagcagaagc
cacagtccttctggaagccagacccgaagccacagtagcagtgccagctcagcagaaagt
caggacagcaagaagaagaagaaaaagaaggaaaagaaaaaacacaagaaacataaaaag
cataagaagcataagaaacatgcaggcactgaagtggaattggaaaaaagccaaaaacac
aaacacaagaaaaagaagtcaaagaagaacaaagataaagagaaggagaaggagaaagat
gaccaaaaagtgaaatctgtcactgtgtaa

>gi568815582f:24440627_24672244|GENSCAN_predicted_peptide_4|101_aa
MATCPAVTQWDVSKCDESRDLESTCTFGFACSCSSVCVVRNVWASLLTDVTHGMEWTYPE
LSQLANSQLMPRQPGRQRETLSQNNNNNNNNDNNNNNNNSI

>gi568815582f:24440627_24672244|GENSCAN_predicted_CDS_4|306_bp
atggccacgtgtcctgctgtgacccaatgggatgtcagcaaatgtgacgagagcagagat
ttggaatccacgtgcacatttgggtttgcgtgctcttgctcctctgtctgtgtggtaaga
aacgtgtgggcgagcctgctgacggatgtgacccatggaatggagtggacttaccccgag
ttgtcccagctagccaacagccagctgatgcccagacagcctgggcgacagagagagact
ttgtcccaaaacaacaacaacaacaacaacaacgacaacaacaacaacaacaacaacagt
atttaa