GENSCAN 1.0    Date run: 5-Nov-116    Time: 02:45:19

Sequence gi568815592r:22187405_22394586 : 207182 bp : 38.56% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5238   5288   51  2  0   32  106    61 0.450   3.51
 1.02 Intr +   6853   6938   86  0  2   72   89    37 0.039  -0.10
 1.03 Intr +  27717  27902  186  0  0   38   49   196 0.203   8.58
 1.04 Intr +  33601  33744  144  1  0   91   61    60 0.087   2.08
 1.05 Intr +  41934  42046  113  0  2   90   64    38 0.077   0.70
 1.06 Intr +  43043  43105   63  0  0   78   60    84 0.112   2.37
 1.07 Intr +  72932  73094  163  0  1  112   68    83 0.050   6.81
 1.08 Intr +  85906  85969   64  1  1   53  119    35 0.038   0.90
 1.09 Term +  88768  88941  174  0  0   36   45   119 0.038  -0.82
 1.10 PlyA +  93643  93648    6                               1.05

 2.07 PlyA -  93658  93653    6                               1.05
 2.06 Term - 100189  99998  192  1  0   61   46   160 0.657   5.54
 2.05 Intr - 102949 102770  180  1  0   48   99   193 0.999  15.54
 2.04 Intr - 105241 105134  108  1  0   96   82   117 0.990  11.46
 2.03 Intr - 107180 107005  176  0  2   32   88   173 0.412  10.44
 2.02 Intr - 118860 118824   37  0  1   53  116    31 0.133  -0.68
 2.01 Init - 126212 125877  336  2  0   61    8   242 0.136  10.83
 2.00 Prom - 128575 128536   40                              -5.45

 3.02 PlyA - 128857 128852    6                               1.05
 3.01 Sngl - 131448 130906  543  0  0   58   37   267 0.870  12.44
 3.00 Prom - 136578 136539   40                              -4.15

 4.06 PlyA - 137425 137420    6                               1.05
 4.05 Term - 139642 139560   83  2  2   79   38    82 0.704  -0.82
 4.04 Intr - 141528 141421  108  1  0   73   73   129 0.293   9.24
 4.03 Intr - 168157 167923  235  1  1   -1    5   285 0.009   7.64
 4.02 Intr - 181453 181334  120  1  0   81   27    80 0.046   0.97
 4.01 Init - 182847 182806   42  0  0   47   69    68 0.092   1.27
 4.00 Prom - 189251 189212   40                              -1.45

 5.04 PlyA - 189497 189492    6                               1.05
 5.03 Term - 190295 189982  314  0  2   81   53   159 0.927   5.88
 5.02 Intr - 191919 191791  129  1  0   72   74   122 0.914   8.95
 5.01 Init - 196902 196749  154  0  1   74   47    72 0.881   1.89
 5.00 Prom - 197079 197040   40                              -3.65

 6.02 PlyA - 198326 198321    6                               1.05
 6.01 Term - 201601 201247  355  0  1   52   42   316 0.390  16.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 158865 158956   92  1  2  129   48    73 0.916   4.30

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:22187405_22394586|GENSCAN_predicted_peptide_1|347_aa
MYYRSLKYKTVKSIIIKSPSYKELYYWYNHHGEEVGMWGRDDRNREVYSFCIYVSTAKQP
QAGPSGGIPEEGNVITDDSSMQVITPEDLLLGQDVEVEDSNIDDPDPVPSRGLASPAYPA
FGDLASMFPLLMEGPWQDFAFQCHYYVSRLALNPTHLMCPPKHHTVPFCLLTRMPKKQLL
SGACIRLTLLMLTDLLDQKHYTGAQKSVRFAKCPVRFTRSQDKMLFDNGLEERVKEWFRA
NSRKNNLIHTWRRTQELNQMGLWCLGRKFANHWLNSSGSQTPGEPGGSSLESGCTAGDEQ
QASTITWGPPPVKSAGALGSHWSANPIVNCTCEESRLHTPYENPTNA

>gi568815592r:22187405_22394586|GENSCAN_predicted_CDS_1|1044_bp
atgtattaccgctcattaaagtacaaaaccgttaagtccatcattatcaaaagcccctca
tataaggaactgtattattggtataatcatcatggtgaagaagttggtatgtgggggaga
gatgacagaaacagagaagtgtactccttctgcatctatgtgtcaactgcaaaacaacct
caggcaggtccttcaggaggtattccagaggaaggcaatgtaatcacagatgacagctct
atgcaggttatcacccctgaagaccttctactgggacaagatgtggaggtagaagacagt
aatattgatgatcctgaccctgtccccagtaggggccttgcctctccagcgtaccctgct
tttggagatcttgcctccatgtttccattattgatggagggaccgtggcaggactttgct
ttccaatgtcactattatgtcagcaggctagccctgaaccccacacatctcatgtgtccc
ccaaaacatcatactgtgccattttgtctcttaacacgcatgcccaagaaacagcttctc
tctggggcctgcattcgattaacactcttaatgttaacagaccttcttgatcagaaacat
tacactggagcccagaaatctgtgcgttttgccaaatgtccagtcagattcactcgatcc
caagacaagatgctctttgataatggactagaggaaagggtgaaagaatggtttagagcc
aattcaagaaagaacaatctgattcacacttggaggaggacacaggagctgaatcagatg
gggctgtggtgcctggggaggaaatttgccaaccattggctaaactcaagtggaagccag
acgccaggggagccaggtggatctagtctggaatcgggctgcacagcaggagatgagcag
caggcaagcactatcacctggggtccacctcctgtcaaatcagcaggggcattaggttct
cattggagcgcaaaccctattgtgaattgcacatgcgaggaatctaggttgcacactcct
tatgagaatccaactaatgcctga

>gi568815592r:22187405_22394586|GENSCAN_predicted_peptide_2|342_aa
MWGWSTPHGVPTGALPSGAVERRAPSSRPQNCRSTGSLHPVPGKATDTQHQPVRAVEKSE
FCKAKGVELPKALGAHLSHQCALDVVQGVKGEYFGALRCNDYPAGFQTSMGSDKGRENEQ
ESSKGSLLLLLVSNLLLCQSVAPLPICPGGAARCQVTLRDLFDRAVVLSHYIHNLSSEMF
SEFDKRYTHGRGFITKAINSCHTSSLATPEDKEQAQQMNQKDFLSLIVSILRSWNEPLYH
LVTEVRGMQEAPEAILSKAVEIEEQTKRLLEGMELIVSQVHPETKENEIYPVWSGLPSLQ
MADEESRLSAYYNLLHCLRRDSHKIDNYLKLLKCRIIHNNNC

>gi568815592r:22187405_22394586|GENSCAN_predicted_CDS_2|1029_bp
atgtggggttggagcaccccacatggagtccccactggggcactgcctagtggagctgtg
gaaagaagagcaccatcttccagaccccagaattgtagatctactggcagcttacaccct
gtgcctggaaaagccacagacactcaacaccaacctgtgagagcagtggagaagtctgaa
ttctgtaaagccaaaggagtggagctgcccaaggccttgggagcccacctctcccaccag
tgtgccctggatgtggtacaaggagtcaaaggagaatattttggggctttaagatgtaat
gactaccctgctgggtttcaaactagcatggggtctgataaaggcagagaaaatgagcaa
gaaagctctaaagggtccctcctgctgctgctggtgtcaaacctgctcctgtgccagagc
gtggcccccttgcccatctgtcccggcggggctgcccgatgccaggtgacccttcgagac
ctgtttgaccgcgccgtcgtcctgtcccactacatccataacctctcctcagaaatgttc
agcgaattcgataaacggtatacccatggccgggggttcattaccaaggccatcaacagc
tgccacacttcttcccttgccacccccgaagacaaggagcaagcccaacagatgaatcaa
aaagactttctgagcctgatagtcagcatattgcgatcctggaatgagcctctgtatcat
ctggtcacggaagtacgtggtatgcaagaagccccggaggctatcctatccaaagctgta
gagattgaggagcaaaccaaacggcttctagagggcatggagctgatagtcagccaggtt
catcctgaaaccaaagaaaatgagatctaccctgtctggtcgggacttccatccctgcag
atggctgatgaagagtctcgcctttctgcttattataacctgctccactgcctacgcagg
gattcacataaaatcgacaattatctcaagctcctgaagtgccgaatcatccacaacaac
aactgctaa

>gi568815592r:22187405_22394586|GENSCAN_predicted_peptide_3|180_aa
MGAPLWAGRGRSRLPLLAGRCGGRGAGGNRGCARCSQASASSRWACLGKPRTQSSCQPQA
MRGLAPGPAAVEGAPVPQHCRPATPCWNSHRASLASPRGRARDLQPAMPGPPPPVGSPAA
SPMGTTPCCMAPGPIHRPRAEMCGRMAQDCQAAPRGQAQDPLGKASWAPELGGVLENFNV

>gi568815592r:22187405_22394586|GENSCAN_predicted_CDS_3|543_bp
atgggagcccctctctgggctggccgaggccggagccggctccctctgcttgcggggagg
tgtggaggtagaggcgctggcgggaatcgaggctgtgcgcggtgctcgcaggccagtgcc
agttccaggtgggcgtgcctcggcaagccccgcactcagagcagctgccagccccaggca
atgaggggcttagcacctgggccagcagctgtggagggtgccccagtcccccagcactgc
cggcccgccacgccgtgctggaattctcaccgagcctcactcgcctccccccggggcagg
gctcgggacctgcagcctgccatgcctgggcccccacccccggtgggctcccctgcagcc
tccccaatgggcaccaccccctgctgcatggcgcctggtcctatccaccgcccaagggct
gagatgtgtgggcgcatggctcaggactgtcaggcagctccccggggacaggcccaggat
ccactaggcaaagctagctgggctcctgagttgggtggggttttggagaacttcaatgtc
tag

>gi568815592r:22187405_22394586|GENSCAN_predicted_peptide_4|195_aa
MNVDDVLRFLSYTESCQPTGPQSLKCLEKVSLQLTVTVYEVHSWSRRAHRTKRMVCSFTP
EASETTNPPGGTNNSRCAALRAVTLTVKLCSFNPEPARPRTHQKEETANISEHQKEQTPD
TPPLRTVNTHREGQSAEMQIAVMGNCPNANGNKCGQCRIVYYDVEIAYTSGKPSQEVDGR
KETEDSEFIPLVPSC

>gi568815592r:22187405_22394586|GENSCAN_predicted_CDS_4|588_bp
atgaatgtggatgatgttctgcggtttctcagttacactgagtcctgccagcccactggg
ccccaatccctcaaatgtctggagaaagtttccctccagctgacagtgaccgtgtatgaa
gtgcactcatggtccagaagggcccatcggacaaagaggatggtctgcagctttactcct
gaagccagtgagaccacgaacccaccaggaggaacgaacaactccagatgcgctgcctta
agagctgtaacactcactgtgaagctctgcagcttcaatcctgagccagcaagaccacga
acccaccagaaggaagaaactgcgaacatatccgaacatcagaaggaacaaactccagac
acgccacctttaagaactgttaacactcaccgcgagggacagagtgcagaaatgcaaata
gcagtgatgggaaattgccccaatgcaaatgggaataagtgtggccagtgtcgcattgtc
tattacgatgtagaaattgcctatacaagtgggaaaccctcacaggaggtggatggaagg
aaggagacagaagacagcgaatttattcccctggttccatcctgttaa

>gi568815592r:22187405_22394586|GENSCAN_predicted_peptide_5|198_aa
MNEAGSHHPWQTNTGTENQTPHVLTHKWGLNIENTWTQRGTTHTRACCGMERAESLIAKK
EGIRENEEAPLYRDRGSGAPKPKEEVPTCHGYQPELEKTTLNFIWNQIRARVSKTILSKK
NKAGGITLPDFKLYYKATVTKTAWYWYQNRYVDQWNTTEASEITPHIYNHLIFDKRDTNK
QWGKILYLINGVGKTGQP

>gi568815592r:22187405_22394586|GENSCAN_predicted_CDS_5|597_bp
atgaatgaagctggaagccatcatccttggcaaacgaacacaggaacagaaaaccaaaca
ccgcatgttctcactcataagtgggggttgaacattgagaacacatggacacagagggga
acaacacacaccagggcctgttgcgggatggagcgagcagagagtttaatagccaagaag
gaagggataagagaaaatgaagaagctcccctgtacagagacagagggagtggggctcca
aagccgaaagaggaggtccccacctgccatgggtaccagccagaattggaaaaaaccact
ttaaacttcatatggaaccaaattagagcccgcgtatccaagacaatcctaagcaaaaag
aacaaagctggaggcatcacgctacctgacttcaaactctactacaaggctacagtaacc
aaaacagcatggtactggtaccaaaacagatacgtagaccaatggaacacaacagaggca
tcagaaataacaccacacatctacaaccatctgatctttgacaaacgtgacacaaacaag
caatggggaaagatcctctatttaataaatggtgttgggaaaactggccagccatga

>gi568815592r:22187405_22394586|GENSCAN_predicted_peptide_6|118_aa
XKGTEQDEDEMDDLTEVGFKRWVIKNSAELKEHVLTQCKEAKNLDKRLEELLTRITSLGR
NINDLMELKNTAQELHEAYTSINSRIDQAEENISDFEGHLAEIKHRDKIREKRMKGNK

>gi568815592r:22187405_22394586|GENSCAN_predicted_CDS_6|357_bp
nncaagggcacagaacaggatgaggatgaaatggatgacttgacagaagtaggtttcaaa
agatgggtaataaaaaattctgctgagctaaaggaacatgttctaacccaatgcaaagaa
gctaagaaccttgataaaaggttagaggagctgctaactagaataaccagtttagggagg
aacataaatgacctgatggagctgaaaaacacagcacaagaacttcatgaagcgtacaca
agtattaatagtcgaatcgaccaagcagaagaaaatatatcagactttgaaggccacctt
gctgaaataaagcacagagacaagattagagaaaaaagaatgaaagggaacaaataa