GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:56:34 Sequence gi568815586f:119094850_119295391 : 200542 bp : 44.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7387 7533 147 1 0 111 94 56 0.118 7.85 1.02 Intr + 19429 19515 87 1 0 109 94 85 0.977 10.29 1.03 Intr + 22088 22159 72 2 0 93 72 80 0.754 5.42 1.04 Intr + 24856 25062 207 1 0 -35 82 160 0.420 1.29 1.05 Intr + 25401 25427 27 0 0 142 110 13 0.786 6.13 1.06 Intr + 30532 30630 99 1 0 111 91 32 0.716 5.03 1.07 Intr + 35829 35985 157 0 1 98 86 52 0.158 6.01 1.08 Intr + 50442 50836 395 2 2 133 86 151 0.167 12.85 1.09 Intr + 56168 56371 204 2 0 84 113 61 0.046 6.52 1.10 Intr + 58711 58800 90 1 0 90 86 43 0.046 3.41 1.11 Intr + 59394 59534 141 0 0 106 94 123 0.997 14.17 1.12 Term + 61646 61949 304 2 1 128 42 239 0.994 17.94 1.13 PlyA + 63323 63328 6 1.05 2.00 Prom + 72565 72604 40 -4.26 2.01 Init + 84464 84830 367 1 1 121 119 166 0.970 20.19 2.02 Intr + 92176 92239 64 2 1 123 100 68 0.835 9.28 2.03 Term + 98850 99009 160 0 1 76 50 142 0.720 6.61 2.04 PlyA + 99294 99299 6 -3.44 3.00 Prom + 99536 99575 40 -5.76 3.01 Init + 100001 100713 713 1 2 97 -2 576 0.469 43.88 3.02 Intr + 118828 118892 65 1 2 93 96 30 0.061 2.66 3.03 Intr + 122897 122995 99 0 0 84 93 4 0.062 0.58 3.04 Intr + 137867 138003 137 0 2 68 85 55 0.289 3.49 3.05 Term + 138894 138947 54 2 0 117 37 37 0.274 -0.94 3.06 PlyA + 141510 141515 6 1.05 4.00 Prom + 142162 142201 40 -2.96 4.01 Init + 157756 157867 112 0 1 71 68 81 0.132 4.86 4.02 Intr + 170392 170519 128 2 2 87 77 21 0.010 1.30 4.03 Intr + 172409 172541 133 1 1 115 55 33 0.037 2.92 4.04 Intr + 191694 191838 145 1 1 55 109 115 0.054 9.64 4.05 Intr + 196385 196574 190 2 1 75 101 52 0.023 4.79 4.06 Intr + 199166 199199 34 1 1 83 113 3 0.008 0.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 11744 11259 486 2 0 60 39 210 0.930 9.37 S.002 Init + 55881 55928 48 2 0 91 80 30 0.865 3.47 S.003 Intr + 56168 56335 168 1 0 84 86 133 0.940 12.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:119094850_119295391|GENSCAN_predicted_peptide_1|643_aa XTEPQNNPVVPAQDGPSEKLGQHLATEPLGTNSWERDKTCRELGATRGHSASHDKDLTPP PSSRGKKKKKKSTRKKRRRSSSYSPSPVKKKKKKSSKKHKRRSKSSTCGSWLSHDDDGDD EDEEEEECTTREREDEMENGGGERQKTLSTFSLSSLIQDRFTDSFGGGLEHRSFSKKRRH RSRSRPRKSHRHRHHRCPSRSQSSESRPSSCESRHRGRSPEEGQKSRRRHSRRCSKTLCK DSPEAQSSRPPSQPLQMLGYLSARGVCIMTVNISWLFRNSPWAQRSAVSNGSFCFQITGS GSAADLFTKTASPLTTSRGRSQEYDSGNDTSSPPSTQTSSARSRGQEKGSPSGGLSKSRE LNSGNTSDSGNSFTTSSPQNKGAMLENLSPTSRGRESRGFQSPCLECAEVKKSSLVPSTA RSSPMKGCSRSSSYASTRSSSHSSRSPNPRASPRYTQSRSTSSEKSYSSKSGKRSPPSRS SRSRRSPSYSRYSPSRERDPKYSEKDSQQRERERARRRRRSYSPMRKRRRDSPSHLEARR ITSARKRPIPYYRPSPSSSGSLSSTSSWYSSSSSRSASRSYSRSRSRSRSRRRSRTRTSS SSSSRSPSPGSRSRSRSRSRSRSRSRSQSRSYSSADSYSSTRR >gi568815586f:119094850_119295391|GENSCAN_predicted_CDS_1|1932_bp nngacagagccccagaataaccccgttgtcccagctcaggatggaccctcagaaaagctg ggtcagcatctggccaccgagcccttgggcaccaacagttgggagagagacaagacctgt cgggaactgggtgccaccagaggacacagtgcctctcatgacaaagacttgacaccacca ccttcctccaggggaaagaagaaaaagaagaaatccactcggaagaagagaaggaggtcc tcatcctatagcccatcgcctgtcaagaaaaagaagaagaaaagttccaagaaacacaag cgacgcagtaagagttccacatgtggaagctggctgtctcatgatgatgacggtgatgat gaggatgaggaggaggaggaatgcacaacaagggagagagaagatgagatggagaatgga ggaggggaaaggcagaagactctctctacgttctccttatccagcctcatccaggacaga tttactgacagttttgggggaggattggagcaccggtcattctccaagaagagaaggcac agatctcgaagccggccccgaaagtctcaccgccaccgccatcaccgctgcccctcgcgg tcccagagctcggagtcccgcccctcaagctgtgagagcaggcaccgcggccggtcccct gaggaagggcagaagtcccgccgaaggcactcccgccgctgctccaagaccctctgcaag gacagccctgaggcccagtccagtcgcccgcccagtcaacccctccagatgcttggctac ctgtcagccaggggtgtatgcatcatgacggtaaacatttcttggttatttagaaacagc ccatgggcccagcgctcagcagtgtccaacgggtctttttgctttcagatcactgggtcg gggtctgctgctgacctctttaccaaaacagccagcccgctcaccacctcgcgaggacgt tcccaggagtacgactcaggaaatgacacgtcctcgccaccctccacgcaaaccagctca gccaggtctcggggccaggagaaggggagccccagtgggggcttgagcaagagccgggag ctcaacagtggcaacacctctgattcagggaactccttcaccacctcctcaccccagaac aagggggccatgttggagaatctctcccccaccagcaggggcagagagtcaaggggattt cagtcaccgtgtctggaatgtgccgaagtgaagaagtccagtttggtcccatccacagcc cggagctcacccatgaaagggtgttcccgcagctcctcctatgccagcacccgatcctcc agtcactcgtcccgatccccaaatcccagggcttcccccaggtacacccaaagccgatcc acctcttctgaaaaaagctattcctccaagtctggcaagaggagcccgcccagcagaagc tctaggtcccgccgcagccctagctactcccgctacagccccagcagggagcgggatccc aaatacagtgagaaggactcgcagcagcgggagcgcgagcgagcgcgtcggagacgtcgg tcctactcgcctatgagaaagcgccggagagactccccgagccacctggaggcccggagg ataaccagtgcccggaaacgccccatcccctactatcggcccagcccctcctcatccggc agcctcagcagcacctcctcctggtacagcagcagcagtagccgctcggccagccgcagc tactcccggagccggagtcggagccggagccggagacggagccggacccgcacgagcagc agctctagctcccgcagccctagtccgggctcccgcagccggagccggagcaggagccgg agccggagccggagcaggagccagagccggagctacagctcagcagacagctactccagc acgaggcgctaa >gi568815586f:119094850_119295391|GENSCAN_predicted_peptide_2|196_aa MADGQMPFSCHYPSRLRRDPFRDSPLSSRLLDDGFGMDPFPDDLTASWPDWALPRLSSAW PGTLRSGMVPRGPTATARFGVPAEGRTPPPFPGEPWKVCVNVHSFKPEELMVKTKDGYVE VSGKHEEKQQEGGIVSKNFTKKIQLPAEVDPVTVFASLSPEGLLIIEAPQVPPYSTFGES SFNNELPQDSQEVTCT >gi568815586f:119094850_119295391|GENSCAN_predicted_CDS_2|591_bp atggctgacggtcagatgcccttctcctgccactacccaagccgcctgcgccgagacccc ttccgggactctcccctctcctctcgcctgctggatgatggctttggcatggaccccttc ccagacgacttgacagcctcttggcccgactgggctctgcctcgtctctcctccgcctgg ccaggcaccctaaggtcgggcatggtgccccggggccccactgccaccgccaggtttggg gtgcctgccgagggcaggacccccccacccttccctggggagccctggaaagtgtgtgtg aatgtgcacagcttcaagccagaggagttgatggtgaagaccaaagatggatacgtggag gtgtctggcaaacatgaagagaaacagcaagaaggtggcattgtttctaagaacttcaca aagaaaatccagcttcctgcagaggtggatcctgtgacagtatttgcctcactttcccca gagggtctgctgatcatcgaagctccccaggtccctccttactcaacatttggagagagc agtttcaacaacgagcttccccaggacagccaggaagtcacctgtacctga >gi568815586f:119094850_119295391|GENSCAN_predicted_peptide_3|355_aa MPKGGRKGGHKGWARQYRSPEEIDVQLQAEKQKAREEEEQKEGGDGAAGDPKKEKKSLDS DESEDEEDDYQQNRKGVEGLFNIENPNQVAQTTKKVTQLYLDGPKELSRREREEIEKQKA KERYMKMHLAGKTEQAKADLARWPSSGNSGRRLPGRRKRKGKQKMMPHCQENECSRSPCI SNRDPWEEMPGTWAALPGPLLCLAHPVPWRRRNSPSWPGAPHGLGPPLHFGTEIVWGMEH NTTGSKLKQTRDEGTSKQAAISSYTRGSQDRLHNHQCTCHWIIHTYFPPLPDSHPHGFHV VLALSHFMSVITYQGTALLNVGSASNAYISSTTNRHSHNFEFELFYLWLLCRGKE >gi568815586f:119094850_119295391|GENSCAN_predicted_CDS_3|1068_bp atgcctaaaggagggagaaagggaggccacaaaggctgggcgaggcagtataggagccct gaggagatcgacgtacagctgcaggctgagaagcagaaggccagggaagaagaggagcaa aaagaaggtggagatggggctgcaggtgaccccaaaaaggagaagaaatctctagactca gatgagagtgaggatgaagaagatgactaccagcaaaaccgcaaaggcgttgaagggctc ttcaacatcgagaaccccaaccaggtggcacagacaaccaaaaaggtcacacaactgtat ctggatgggccaaaggagctttcgaggagagaacgagaagagattgagaagcagaaggca aaagagcgttacatgaaaatgcacttggccgggaagacagagcaagccaaggctgacctg gcccgctggccatcatccggaaacagcgggaggaggctgcccggaagaaggaagaggaaa ggaaagcaaaagatgatgccacattgtcaggaaaacgaatgcagtcgctctccctgtata agtaaccgcgacccatgggaggagatgccggggacctgggccgcgctgccaggacctctg ctgtgtctcgcccaccctgtgccctggcgccgccgcaacagcccctcgtggccaggagcc ccccatggcctggggcctcctcttcattttggcacagaaattgtttgggggatggaacat aatacaactggatctaagcttaagcaaacacgtgatgagggaacaagtaaacaggcagct atctcctcttatacccggggctcccaagacagactgcataaccatcaatgcacctgtcac tggatcatccacacctacttccctcccttaccagactctcatcctcatgggttccatgtt gttcttgctctctcccatttcatgtccgtaatcacttaccaaggcacagccctgctgaat gtaggttcagcatccaatgcttacataagcagcaccaccaaccggcacagtcacaacttt gagtttgaactgttctacctctggcttctttgtcgtggaaaagaataa >gi568815586f:119094850_119295391|GENSCAN_predicted_peptide_4|248_aa MDIAVITVSSVVTVSSLQVTSLVEDMDPEEENRTAGCLIIIASKPQRDTISCQSEWLASK TKKITDVDEVAEKREGLYLLVRMLCIPAAPAPAVAKRSQGTAQAIASEGASPKSWQLSCG VGPAGITSKVMGDAENFTHGQRKLQLIKALPMDSGENSEGAFSHSTESRLFTRSAAALDS HRSTNPIVNCTCEGSRLHAPYENLVPDDLSPSPITPRSDCLVVGKQTQGSHGLCIMDLTQ GLGNTTEX >gi568815586f:119094850_119295391|GENSCAN_predicted_CDS_4|744_bp atggacattgctgttatcacagtgagctcagtggtgacagtgtcatcactccaagtgact tcactggttgaagacatggatccagaagaggagaacaggacagccggatgcttgataata attgcctcaaaaccacaacgagataccatctcatgccagtcagaatggctagcatcaaaa actaaaaaaataacagatgttgatgaggttgcagagaaaagagaaggcttatacctgttg gtcaggatgctctgcatcccagctgctccagctccagctgtggctaaaaggagccaaggt acagctcaggccattgcttcagagggtgcaagccccaagtcttggcagctttcatgtggt gttgggcctgcaggaatcaccagcaaagtgatgggggatgctgaaaattttacccacggg cagaggaagcttcaactcatcaaagccttgccaatggacagtggagaaaacagtgagggt gccttctcacactcaacagagtcccgcttattcaccagatcagccgcagcattagattct catagaagcacaaaccctattgtgaactgcacatgcgagggatctaggttgcatgctcct tatgagaatctagtgccggatgatctgtcaccatctcccatcacccccagatcagactgt ctagttgtaggaaaacaaactcagggctcccacggactctgcattatggacctaacccaa ggcctgggtaatacaacagaagnn