GENSCAN 1.0    Date run: 7-Nov-116    Time: 02:37:23

Sequence gi568815586f:119079313_119293855 : 214543 bp : 44.85% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  10399  10551  153  0  0  -41   48   261 0.796   7.12
 1.02 PlyA +  12712  12717    6                               1.05

 2.00 Prom +  14779  14818   40                              -5.26
 2.01 Init +  21493  21500    8  0  2   68   86     0 0.117  -1.72
 2.02 Intr +  22924  23070  147  1  0  111   94    56 0.058   7.85
 2.03 Intr +  34966  35052   87  1  0  109   94    85 0.977  10.29
 2.04 Intr +  37625  37696   72  2  0   93   72    80 0.754   5.42
 2.05 Intr +  40393  40599  207  1  0  -35   82   160 0.420   1.29
 2.06 Intr +  40938  40964   27  0  0  142  110    13 0.786   6.13
 2.07 Intr +  46069  46167   99  1  0  111   91    32 0.716   5.03
 2.08 Intr +  51366  51522  157  0  1   98   86    52 0.158   6.01
 2.09 Intr +  65979  66373  395  2  2  133   86   151 0.167  12.85
 2.10 Intr +  71705  71908  204  2  0   84  113    61 0.046   6.52
 2.11 Intr +  74248  74337   90  1  0   90   86    43 0.046   3.41
 2.12 Intr +  74931  75071  141  0  0  106   94   123 0.997  14.17
 2.13 Term +  77183  77486  304  2  1  128   42   239 0.994  17.94
 2.14 PlyA +  78860  78865    6                               1.05

 3.00 Prom +  88102  88141   40                              -4.26
 3.01 Init + 100001 100367  367  1  1  121  119   166 0.970  20.19
 3.02 Intr + 107713 107776   64  2  1  123  100    68 0.835   9.28
 3.03 Term + 114387 114546  160  0  1   76   50   142 0.720   6.61
 3.04 PlyA + 114831 114836    6                              -3.44

 4.00 Prom + 115073 115112   40                              -5.76
 4.01 Init + 115538 116250  713  1  2   97   -2   576 0.469  43.88
 4.02 Intr + 134365 134429   65  1  2   93   96    30 0.061   2.66
 4.03 Intr + 138434 138532   99  0  0   84   93     4 0.062   0.58
 4.04 Intr + 153404 153540  137  0  2   68   85    55 0.289   3.49
 4.05 Term + 154431 154484   54  2  0  117   37    37 0.274  -0.94
 4.06 PlyA + 157047 157052    6                               1.05

 5.00 Prom + 157699 157738   40                              -2.96
 5.01 Init + 173293 173404  112  0  1   71   68    81 0.132   4.86
 5.02 Intr + 185929 186056  128  2  2   87   77    21 0.010   1.30
 5.03 Intr + 187946 188078  133  1  1  115   55    33 0.036   2.92
 5.04 Intr + 207231 207375  145  1  1   55  109   115 0.040   9.64
 5.05 Intr + 211922 212111  190  2  1   75  101    52 0.011   4.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  27281  26796  486  2  0   60   39   210 0.931   9.37
S.002 Init +  71418  71465   48  2  0   91   80    30 0.865   3.47
S.003 Intr +  71705  71872  168  1  0   84   86   133 0.940  12.84
S.004 Intr - 205548 205460   89  2  2  116  110    42 0.840   8.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:119079313_119293855|GENSCAN_predicted_peptide_1|50_aa
TTLSIHPAIILNIIIIIIILTIITMTIIISNIITIITITIIIIKHILNIY

>gi568815586f:119079313_119293855|GENSCAN_predicted_CDS_1|153_bp
accaccctctctattcatcctgccatcatcctcaatatcatcatcatcatcatcatcctc
accattattaccatgactatcatcatcagtaacattatcaccatcatcaccatcaccatc
atcatcatcaaacatatattgaacatttattag

>gi568815586f:119079313_119293855|GENSCAN_predicted_peptide_2|645_aa
MERTEPQNNPVVPAQDGPSEKLGQHLATEPLGTNSWERDKTCRELGATRGHSASHDKDLT
PPPSSRGKKKKKKSTRKKRRRSSSYSPSPVKKKKKKSSKKHKRRSKSSTCGSWLSHDDDG
DDEDEEEEECTTREREDEMENGGGERQKTLSTFSLSSLIQDRFTDSFGGGLEHRSFSKKR
RHRSRSRPRKSHRHRHHRCPSRSQSSESRPSSCESRHRGRSPEEGQKSRRRHSRRCSKTL
CKDSPEAQSSRPPSQPLQMLGYLSARGVCIMTVNISWLFRNSPWAQRSAVSNGSFCFQIT
GSGSAADLFTKTASPLTTSRGRSQEYDSGNDTSSPPSTQTSSARSRGQEKGSPSGGLSKS
RELNSGNTSDSGNSFTTSSPQNKGAMLENLSPTSRGRESRGFQSPCLECAEVKKSSLVPS
TARSSPMKGCSRSSSYASTRSSSHSSRSPNPRASPRYTQSRSTSSEKSYSSKSGKRSPPS
RSSRSRRSPSYSRYSPSRERDPKYSEKDSQQRERERARRRRRSYSPMRKRRRDSPSHLEA
RRITSARKRPIPYYRPSPSSSGSLSSTSSWYSSSSSRSASRSYSRSRSRSRSRRRSRTRT
SSSSSSRSPSPGSRSRSRSRSRSRSRSRSQSRSYSSADSYSSTRR

>gi568815586f:119079313_119293855|GENSCAN_predicted_CDS_2|1938_bp
atggaaaggacagagccccagaataaccccgttgtcccagctcaggatggaccctcagaa
aagctgggtcagcatctggccaccgagcccttgggcaccaacagttgggagagagacaag
acctgtcgggaactgggtgccaccagaggacacagtgcctctcatgacaaagacttgaca
ccaccaccttcctccaggggaaagaagaaaaagaagaaatccactcggaagaagagaagg
aggtcctcatcctatagcccatcgcctgtcaagaaaaagaagaagaaaagttccaagaaa
cacaagcgacgcagtaagagttccacatgtggaagctggctgtctcatgatgatgacggt
gatgatgaggatgaggaggaggaggaatgcacaacaagggagagagaagatgagatggag
aatggaggaggggaaaggcagaagactctctctacgttctccttatccagcctcatccag
gacagatttactgacagttttgggggaggattggagcaccggtcattctccaagaagaga
aggcacagatctcgaagccggccccgaaagtctcaccgccaccgccatcaccgctgcccc
tcgcggtcccagagctcggagtcccgcccctcaagctgtgagagcaggcaccgcggccgg
tcccctgaggaagggcagaagtcccgccgaaggcactcccgccgctgctccaagaccctc
tgcaaggacagccctgaggcccagtccagtcgcccgcccagtcaacccctccagatgctt
ggctacctgtcagccaggggtgtatgcatcatgacggtaaacatttcttggttatttaga
aacagcccatgggcccagcgctcagcagtgtccaacgggtctttttgctttcagatcact
gggtcggggtctgctgctgacctctttaccaaaacagccagcccgctcaccacctcgcga
ggacgttcccaggagtacgactcaggaaatgacacgtcctcgccaccctccacgcaaacc
agctcagccaggtctcggggccaggagaaggggagccccagtgggggcttgagcaagagc
cgggagctcaacagtggcaacacctctgattcagggaactccttcaccacctcctcaccc
cagaacaagggggccatgttggagaatctctcccccaccagcaggggcagagagtcaagg
ggatttcagtcaccgtgtctggaatgtgccgaagtgaagaagtccagtttggtcccatcc
acagcccggagctcacccatgaaagggtgttcccgcagctcctcctatgccagcacccga
tcctccagtcactcgtcccgatccccaaatcccagggcttcccccaggtacacccaaagc
cgatccacctcttctgaaaaaagctattcctccaagtctggcaagaggagcccgcccagc
agaagctctaggtcccgccgcagccctagctactcccgctacagccccagcagggagcgg
gatcccaaatacagtgagaaggactcgcagcagcgggagcgcgagcgagcgcgtcggaga
cgtcggtcctactcgcctatgagaaagcgccggagagactccccgagccacctggaggcc
cggaggataaccagtgcccggaaacgccccatcccctactatcggcccagcccctcctca
tccggcagcctcagcagcacctcctcctggtacagcagcagcagtagccgctcggccagc
cgcagctactcccggagccggagtcggagccggagccggagacggagccggacccgcacg
agcagcagctctagctcccgcagccctagtccgggctcccgcagccggagccggagcagg
agccggagccggagccggagcaggagccagagccggagctacagctcagcagacagctac
tccagcacgaggcgctaa

>gi568815586f:119079313_119293855|GENSCAN_predicted_peptide_3|196_aa
MADGQMPFSCHYPSRLRRDPFRDSPLSSRLLDDGFGMDPFPDDLTASWPDWALPRLSSAW
PGTLRSGMVPRGPTATARFGVPAEGRTPPPFPGEPWKVCVNVHSFKPEELMVKTKDGYVE
VSGKHEEKQQEGGIVSKNFTKKIQLPAEVDPVTVFASLSPEGLLIIEAPQVPPYSTFGES
SFNNELPQDSQEVTCT

>gi568815586f:119079313_119293855|GENSCAN_predicted_CDS_3|591_bp
atggctgacggtcagatgcccttctcctgccactacccaagccgcctgcgccgagacccc
ttccgggactctcccctctcctctcgcctgctggatgatggctttggcatggaccccttc
ccagacgacttgacagcctcttggcccgactgggctctgcctcgtctctcctccgcctgg
ccaggcaccctaaggtcgggcatggtgccccggggccccactgccaccgccaggtttggg
gtgcctgccgagggcaggacccccccacccttccctggggagccctggaaagtgtgtgtg
aatgtgcacagcttcaagccagaggagttgatggtgaagaccaaagatggatacgtggag
gtgtctggcaaacatgaagagaaacagcaagaaggtggcattgtttctaagaacttcaca
aagaaaatccagcttcctgcagaggtggatcctgtgacagtatttgcctcactttcccca
gagggtctgctgatcatcgaagctccccaggtccctccttactcaacatttggagagagc
agtttcaacaacgagcttccccaggacagccaggaagtcacctgtacctga

>gi568815586f:119079313_119293855|GENSCAN_predicted_peptide_4|355_aa
MPKGGRKGGHKGWARQYRSPEEIDVQLQAEKQKAREEEEQKEGGDGAAGDPKKEKKSLDS
DESEDEEDDYQQNRKGVEGLFNIENPNQVAQTTKKVTQLYLDGPKELSRREREEIEKQKA
KERYMKMHLAGKTEQAKADLARWPSSGNSGRRLPGRRKRKGKQKMMPHCQENECSRSPCI
SNRDPWEEMPGTWAALPGPLLCLAHPVPWRRRNSPSWPGAPHGLGPPLHFGTEIVWGMEH
NTTGSKLKQTRDEGTSKQAAISSYTRGSQDRLHNHQCTCHWIIHTYFPPLPDSHPHGFHV
VLALSHFMSVITYQGTALLNVGSASNAYISSTTNRHSHNFEFELFYLWLLCRGKE

>gi568815586f:119079313_119293855|GENSCAN_predicted_CDS_4|1068_bp
atgcctaaaggagggagaaagggaggccacaaaggctgggcgaggcagtataggagccct
gaggagatcgacgtacagctgcaggctgagaagcagaaggccagggaagaagaggagcaa
aaagaaggtggagatggggctgcaggtgaccccaaaaaggagaagaaatctctagactca
gatgagagtgaggatgaagaagatgactaccagcaaaaccgcaaaggcgttgaagggctc
ttcaacatcgagaaccccaaccaggtggcacagacaaccaaaaaggtcacacaactgtat
ctggatgggccaaaggagctttcgaggagagaacgagaagagattgagaagcagaaggca
aaagagcgttacatgaaaatgcacttggccgggaagacagagcaagccaaggctgacctg
gcccgctggccatcatccggaaacagcgggaggaggctgcccggaagaaggaagaggaaa
ggaaagcaaaagatgatgccacattgtcaggaaaacgaatgcagtcgctctccctgtata
agtaaccgcgacccatgggaggagatgccggggacctgggccgcgctgccaggacctctg
ctgtgtctcgcccaccctgtgccctggcgccgccgcaacagcccctcgtggccaggagcc
ccccatggcctggggcctcctcttcattttggcacagaaattgtttgggggatggaacat
aatacaactggatctaagcttaagcaaacacgtgatgagggaacaagtaaacaggcagct
atctcctcttatacccggggctcccaagacagactgcataaccatcaatgcacctgtcac
tggatcatccacacctacttccctcccttaccagactctcatcctcatgggttccatgtt
gttcttgctctctcccatttcatgtccgtaatcacttaccaaggcacagccctgctgaat
gtaggttcagcatccaatgcttacataagcagcaccaccaaccggcacagtcacaacttt
gagtttgaactgttctacctctggcttctttgtcgtggaaaagaataa

>gi568815586f:119079313_119293855|GENSCAN_predicted_peptide_5|236_aa
MDIAVITVSSVVTVSSLQVTSLVEDMDPEEENRTAGCLIIIASKPQRDTISCQSEWLASK
TKKITDVDEVAEKREGLYLLVRMLCIPAAPAPAVAKRSQGTAQAIASEGASPKSWQLSCG
VGPAGITSKVMGDAENFTHGQRKLQLIKALPMDSGENSEGAFSHSTESRLFTRSAAALDS
HRSTNPIVNCTCEGSRLHAPYENLVPDDLSPSPITPRSDCLVVGKQTQGSHGLCIM

>gi568815586f:119079313_119293855|GENSCAN_predicted_CDS_5|708_bp
atggacattgctgttatcacagtgagctcagtggtgacagtgtcatcactccaagtgact
tcactggttgaagacatggatccagaagaggagaacaggacagccggatgcttgataata
attgcctcaaaaccacaacgagataccatctcatgccagtcagaatggctagcatcaaaa
actaaaaaaataacagatgttgatgaggttgcagagaaaagagaaggcttatacctgttg
gtcaggatgctctgcatcccagctgctccagctccagctgtggctaaaaggagccaaggt
acagctcaggccattgcttcagagggtgcaagccccaagtcttggcagctttcatgtggt
gttgggcctgcaggaatcaccagcaaagtgatgggggatgctgaaaattttacccacggg
cagaggaagcttcaactcatcaaagccttgccaatggacagtggagaaaacagtgagggt
gccttctcacactcaacagagtcccgcttattcaccagatcagccgcagcattagattct
catagaagcacaaaccctattgtgaactgcacatgcgagggatctaggttgcatgctcct
tatgagaatctagtgccggatgatctgtcaccatctcccatcacccccagatcagactgt
ctagttgtaggaaaacaaactcagggctcccacggactctgcattatg